I made a script which does exactly the same thing but locally using koboldcpp fo...

nirav72 · 2024-11-16T10:08:14 1731751694

MiniCPM-v 2.6 is probably the best self-hosted vision model I have used so far. Not just for OCR, but also image analysis. I have it setup, so my NVR (frigate) sends couple of images upon motion alert from a driveway security camera to Ollama with minicpm-v 2.6. I’m able to get a reasonably accurate description of the vehicle that pulled into the driveway. Including describing the person that exits the vehicle and also the license plate. All sent to my phone.

timmattison · 2024-11-17T12:22:20 1731846140

I love this. Can you share the source?