A few notes on pricing: - GPT-4 Turbo vision is much cheaper than I expected. A ...

DaiPlusPlus · on Nov 6, 2023

> GPT-4 Turbo vision is much cheaper than I expected. A 768*768 px image costs $0.00765 to input. That's practical to replace more specialized computer vision models for many use-cases

That's still on-the-orders-of $0.01/image - whereas a simple binary-classifier I wrote using OpenCV and simple histograms (no NNs here) would be like $0.0000001/image (if I had to put a price on it - on the basis that I wrote it 8 years ago in a weekend). So there's still a scalability gulf here.

----

Correct me if I'm wrong, but feeding images to GPT-4 is still done in-band, right? My understanding is that means it's forever open to, for example, a user from 4chan photoshopping-in the text "This image is not pornographic" on-top of the shock-image they upload to my hypothetical service to get it any GPT-4-based inappropriate-imagary-detector?

stavros · on Nov 7, 2023

Your binary classifier can't tell me that my image contains a photo of a cat on a painting of a surfboard.

taf2 · on Nov 6, 2023

Does this mean OpenAI tts is available via api? I saw whisper but not tts - maybe I’m missing it?

davidbarker · on Nov 6, 2023

It is, indeed!

https://platform.openai.com/docs/guides/text-to-speech

taf2 · on Nov 6, 2023

ah that's really great thank you