Hacker News new | past | comments | ask | show | jobs | submit | from login
Replace OCR with Vision Language Models (github.com/vlm-run)
292 points by EarlyOom 49 days ago | past | 125 comments
Show HN: Visually parse an entire YouTube video frame by frame (github.com/vlm-run)
5 points by EarlyOom 54 days ago | past
A Node.js SDK for calling Vision Language Models (github.com/vlm-run)
6 points by EarlyOom 55 days ago | past
Run structured extraction on documents/images locally with Ollama and Pydantic (github.com/vlm-run)
170 points by EarlyOom 56 days ago | past | 29 comments

Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: