Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
jack_pp
4 months ago
|
parent
|
context
|
favorite
| on:
30% drop in O1-preview accuracy when Putnam proble...
It's not p-hacking, he's right. You're both right. First test the same prompt on different versions then the ones that got it right go to the next round, variations on the prompt
Join us for
AI Startup School
this June 16-17 in San Francisco!
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: