Hacker News new | past | comments | ask | show | jobs | submit login

> while using this implicit knowledge and feedback it gets from running the program.

What feedback, and what program, are you referring to?




Basically solutions that were doing well in arc just threw thousands of ideas at the wall and picked the ones that stuck. They were literally generating thousands of python programs, running them and checking if any produced the correct output when fed with data from examples.

This o3 doesn't need to run python. It itself executes programs written in tokens inside it's own context window which is wildly inefficient but gives better results and is potentially more general.


So basically it's a massively inefficient trial-and-error leetcode solver which only works because it throws incredible amounts of compute at the problem.

This is hilarious.


Previous best specialized ARC solver was exactly that.

This o3 thing might be a bit different because it's just chain of thought llm that can do many other things as well.

It's not uncommon for people to have a handful of wrong ideas before they stumble upon a correct solution either.


I assume that o3 can run Python scripts and observe the outputs.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: