When I was putting together the pitchdeck for our startup I wanted to search for slides to learn from - but I was looking for specific sections or types of startups for slide decks. I had to open tens of decks and scroll through them which sucked. So I decided to make a tool that would allow me to search inside the decks more easily. Happy to answer questions
Nice project for looking for pitchdeck references. Thanks for building and sharing it. I am curious about the tech behind it - are you doing OCR on images? The search is very responsive - it's definitely not elastic search, curious what index/search system are you using?
Glad it helps! There are 4 key steps that I took:
- Upscaling (using Upscayl[0])
- OCR (using tesseract[1])
- Indexing (using Algolia[2])
- Scaling the processing and running on AWS (Klotho[3] - our startup)