convert scans (images) of pages of a book to text ?

Started by RemiD, September 21, 2019, 12:26:49

Previous topic - Next topic

RemiD

hi,

i have found a scan of the pages of an old book (pdf) on the web but i can't select the text (it is not protected, i have checked), it seems like scan images in a pdf...

any idea how i can convert these images to (selectable) text ?

thanks

Matty

You will need software that does optical character recognition.

It does exist, I've used free software at work.

But you will likely have to correct a lot of it.

Amon


RemiD

ok, i was aware of "ocr", will take a look. i was wondering if there was another solution... thanks

Derron

You could even translate via your "translate" app if you have a android phone with google-apps. Open the picture there and get the text translated (and selectable). Might be of interest too.


bye
Ron

RemiD


RemiD

so... OCR kinda works, but only if the background of the page is white with uniform lighting and no imperfections... but in my case this does not help.  ;D
anyway, it was interesting to try it.

Derron

Did you try google translate? I used it for signs in Czech on a camping place - random dusk time picture. And it translated well. Used it on menu cards ... worked well. Might work better than traditional OCR.


bye
Ron