Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
csomar
on Dec 21, 2024
|
parent
|
context
|
favorite
| on:
OpenAI O3 breakthrough high score on ARC-AGI-PUB
Models are predictable at 0 temperatures. They might have tested the output beforehand.
fzzzy
on Dec 21, 2024
[–]
Models in practice haven't been deterministic at 0 temperature, although nobody knows exactly why. Either hardware or software bugs.
Jensson
on Dec 21, 2024
|
parent
[–]
We know exactly why, it is because floating point operations aren't associative but the GPU scheduler assumes they are, and the scheduler isn't deterministic. Running the model strictly hurts performance so they don't do that.
fzzzy
on Dec 22, 2024
|
root
|
parent
[–]
Cool, thanks a lot for the explanation. Makes sense.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: