“Objective benchmarks are useless, let’s argue about which one works better for ... | Hacker News

whynotminot 6 months ago | on: OpenAI O3 breakthrough high score on ARC-AGI-PUB

“Objective benchmarks are useless, let’s argue about which one works better for me personally.”

csomar 6 months ago

Yes. My benchmarks and their benchmarks together mean AGI. Their benchmarks alone mean over-fitted.

whynotminot 6 months ago

OK, so what if we get different results for our own personal benchmarks/use cases?

(See why objective benchmarks exist?)

bakugo 6 months ago

Yes, "objective" benchmarks can be gamed; real-life tasks cannot.
