I was wondering about this one too... > At best, they say, Orion performs better...

overgard · 2024-12-23T00:45:41 1734914741

Well, those server farms don't pay for themselves.

maxrmk · 2024-12-23T01:29:00 1734917340

sure, but once it's trained there isn't a running maintenance cost

wongarsu · 2024-12-23T01:37:49 1734917869

If you offer an API you need to dedicate servers to it that keep the model loaded in GPU memory. Unless you don't care about latency at all.

Though I wouldn't be surprised if the bigger reason is the PR cost of releasing with an exciting name but unexciting results. The press would immediately declare the end of the AI growth curve

wavemode · 2024-12-23T01:59:19 1734919159

Of course running inference costs money. You think GPUs are free?

bhouston · 2024-12-23T01:37:11 1734917831

Well if it takes a ton of memory/compute for inference because of its size, it may be cost prohibitive to run compared to the ROI it generates?

overgard · 2024-12-23T07:39:15 1734939555

There definitely is, storage, machines at the ready, data centers, etc. Also OpenAI basically loses money every time you interact with ChatGPT https://www.wheresyoured.at/subprimeai/