I think very soon a new model will destroy whatever startups and services are bu...

layer8 · 2025-02-05T21:02:42 1738789362

Extracting plain text isn’t that much of a problem, relatively speaking. Things get challenging when you have to interpret more complex elements like nested lists, tables, sidebars, footnotes/endnotes, cross-references, images and diagrams.

visarga · 2025-02-06T05:33:18 1738819998

OCR is not 100% accurate either. Reading order is also fragile: it might recognize every word correctly but still mess up the line structure.
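To make the reading-order point concrete, here is a minimal sketch, assuming the OCR engine returns word boxes with a top-left position and a height. The `Word` type, the `naive_reading_order` helper and the coordinates are made up for illustration and are not any particular engine's output format. It groups words into lines by vertical position and sorts each line left to right, which is roughly what a naive pipeline does:

```python
from typing import NamedTuple


class Word(NamedTuple):
    """Hypothetical OCR word box: text plus top-left corner and height."""
    text: str
    x: float
    y: float
    h: float


def naive_reading_order(words, line_tol=0.5):
    """Group words into lines by vertical position, then read left to right.

    A word joins the current line if its top edge is within
    line_tol * height of the first word on that line.
    """
    lines = []
    for w in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(w.y - lines[-1][0].y) <= line_tol * w.h:
            lines[-1].append(w)
        else:
            lines.append([w])
    return [" ".join(w.text for w in sorted(line, key=lambda w: w.x))
            for line in lines]


# A made-up two-column page: every word is recognized correctly.
words = [
    Word("Left", 0, 0, 10), Word("one", 40, 0, 10),
    Word("Right", 200, 0, 10), Word("one", 240, 0, 10),
    Word("Left", 0, 20, 10), Word("two", 40, 20, 10),
    Word("Right", 200, 20, 10), Word("two", 240, 20, 10),
]

for line in naive_reading_order(words):
    print(line)
```

On this toy input the two columns get interleaved ("Left one Right one", then "Left two Right two"), even though a human would read the left column top to bottom first. Fixing that needs some form of layout analysis such as column detection, which this sketch deliberately leaves out.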

depr · 2025-02-05T20:18:16 1738786696

I think Azure Document Intelligence, Google Document AI and Amazon Textract are among the best services, if not the best, though, and they offer these models.

nnurmanov · 2025-02-06T06:00:11 1738821611

I have not tested Azure Document Intelligence or Google Document AI, but AWS Textract, LlamaParse, Unstructured and Omni made it onto my shortlist. I have not tested Docling either, as I could not install it on my Windows laptop.