Hacker News new | past | comments | ask | show | jobs | submit login

Is the 15mb basically embeddings from the video screenshots? What would it recall if there isn't the screenshots saved?



I’m not sure if the above product does this, but you could use a multimodal model to extract descriptions of the screenshots and store those in a vector database with embeddings.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: