Incidentally, I noticed that if you try to use tesseract on an image taken from ... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

Estragon on Nov 26, 2010 | parent | context | favorite | on: OCR by uploading images to Google Docs

Incidentally, I noticed that if you try to use tesseract on an image taken from a Google Books page, you get terrible OCR accuracy. Anyone know why that is?

zzleeper on Nov 26, 2010 [–]

I recall that on some google-scanned books, there was some metadata from abbyy finereader. So that may be why.

Also, tesseract often needs to be configured.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact