Hacker News new | past | comments | ask | show | jobs | submit login

Come on, you can do the critical thinking here to understand why these companies would want the best in class (open/closed) weight LLMs.



then why would they cheat?


I didn't see evidence of cheating in the article. Having a slightly differently tuned version of 4 is not the most dastardly thing that can be done. Everything else is insinuation.


Well we'll see if they suffer consequences of this and they cheated too hard, but being perceived as best in class is arguably worth even more than being the best in class, especially if differences in performance are hard to perceive anecdotally.

The goal is long term control over a technology's marketshare, as winner take all dynamics are in play here.


they're all cheating, see grok


Are you referring to this [1]?

> Critics have pointed out that xAI’s approach involves running Grok 3 multiple times and cherry-picking the best output while comparing it against single runs of competitor models.

[1] https://medium.com/@cognidownunder/the-hype-machine-gpt-4-5-...




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: