Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Estragon
on Nov 26, 2010
|
parent
|
context
|
favorite
| on:
OCR by uploading images to Google Docs
Incidentally, I noticed that if you try to use tesseract on an image taken from a Google Books page, you get terrible OCR accuracy. Anyone know why that is?
zzleeper
on Nov 26, 2010
[–]
I recall that on some google-scanned books, there was some metadata from abbyy finereader. So that may be why.
Also, tesseract often needs to be configured.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: