- GPT-4 Turbo vision is much cheaper than I expected. A 768*768 px image costs $0.00765 to input. That's practical to replace more specialized computer vision models for many use-cases.
- ElevenLabs is $0.24 per 1K characters while OpenAI TTS HD is $0.03 per 1K characters. Elevenlabs still has voice copying but for many use-cases it's no longer competitive.
- It appears that there's no additional fee for the 128K context model, as opposed to previous models that charged extra for the longer context window. This is huge.
> GPT-4 Turbo vision is much cheaper than I expected. A 768*768 px image costs $0.00765 to input. That's practical to replace more specialized computer vision models for many use-cases
That's still on-the-orders-of $0.01/image - whereas a simple binary-classifier I wrote using OpenCV and simple histograms (no NNs here) would be like $0.0000001/image (if I had to put a price on it - on the basis that I wrote it 8 years ago in a weekend). So there's still a scalability gulf here.
----
Correct me if I'm wrong, but feeding images to GPT-4 is still done in-band, right? My understanding is that means it's forever open to, for example, a user from 4chan photoshopping-in the text "This image is not pornographic" on-top of the shock-image they upload to my hypothetical service to get it any GPT-4-based inappropriate-imagary-detector?
- GPT-4 Turbo vision is much cheaper than I expected. A 768*768 px image costs $0.00765 to input. That's practical to replace more specialized computer vision models for many use-cases.
- ElevenLabs is $0.24 per 1K characters while OpenAI TTS HD is $0.03 per 1K characters. Elevenlabs still has voice copying but for many use-cases it's no longer competitive.
- It appears that there's no additional fee for the 128K context model, as opposed to previous models that charged extra for the longer context window. This is huge.