Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I have some pretty good experiences with PaddleOCR but you may refer to this Chinese and badly documented ones.

For our use case PaddleOCR + LLM has been quite nice combo.




Yes, that's one of the ones I tried. It seemed to be more designed for things like receipts and menus rather than books. But in any case, I found it hard to set up and use (and it's likely slow on the CPU compared to Tesseract, which despite its low accuracy, is at least very fast on CPU).




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: