Hacker Newsnew | past | comments | ask | show | jobs | submit | ntkris's commentslogin

This is awesome. Have you seen / heard of any benchmarks where the data is actually a structured JSON vs. markdown?


Yes, we needed to solve the problem for our other product (https://kili.so). We spent a lot of time getting accuracy up for dense and multi-page invoices. Then realised other teams have this need as well so decided to ship the API.

On the accuracy point, given our work so far we believe we are best in class in terms of accuracy for document extraction. We've also set up a system of evaluations internally that allow us to keep iterating and improving (hence us mentioning that we want to continue working on it).


Not off topic at all!

I can only speak to our experience. Once you get under the hood, you find that this is a hard problem to solve.

There are also a lot of workflows that involve documents in every sector and every function. In other words, the opportunity is massive.

For our product, our customers are either internal engineering teams or folks building products that require document extraction but don’t want to invest time in it.


Are you just looking to meet other folks building or have a specific goal in mind (e.g. finding a cofounder)? I can recommend accordingly


I'm solo technical founder and currently bootstrapping, not actively looking for a cofounder but looking for like-minded individuals who want to bounce ideas off each other, whether that be technical or non-technical. In my case specifically I have a consumer facing app which is live and I'm in the mystical stage of nailing down PMF.


Not yet, but we're open to creating one


We're focussed on company operations (accounting, procurement etc).

The tool is totally self serve and does allow you to set up, upload and access documents.

We clearly need to call that out more so will add this to the landing page


Great feedback, we will make this more clear


We are seeing a bunch of interest for scraping related use cases and document processing (eg. automatically processing invoices that are emailed in)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: