Hacker News
whynotminot 6 months ago | on: OpenAI O3 breakthrough high score on ARC-AGI-PUB
“Objective benchmarks are useless, let’s argue about which one works better for me personally.”
csomar 6 months ago
Yes. Passing my benchmarks and their benchmarks means AGI. Passing only their benchmarks means over-fitted.
whynotminot 6 months ago
OK, so what if we get different results on our own personal benchmarks and use cases?
(See why objective benchmarks exist?)
bakugo 6 months ago
Yes, "objective" benchmarks can be gamed, real-life tasks cannot.