Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
devmor
10 months ago
|
parent
|
context
|
favorite
| on:
Ingesting PDFs and why Gemini 2.0 changes everythi...
If OCR is a solution designed to recognize documents and it does not recognize all documents, then it is an imperfect solution.
That is not to say there
is
a perfect solution, but it is still the fault of the solution.
nnurmanov
10 months ago
[–]
E.g. oftentimes there is l and I (capital I), this may be an issue for OCR. The perfect case is when there is a PDF document and data embedded as XML data, but unfortunately it is not the case.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
That is not to say there is a perfect solution, but it is still the fault of the solution.