Hacker News new | past | comments | ask | show | jobs | submit login

Every solvable problem is solved by using known information, patterns, context, etc. We (and LLMs) are using some model of the universe and trying to coordinate it in a way that will help us solve some task. The difference between us and "old" LLMs is that we can generate new information/patterns/etc., immediately add it to our model, and use it to solve more complex problems. New LLMs such as o1-o3 are also capable of thinking over and over again and producing new information (in the current context) and trying to apply it to the current task that might not be solvable with just the information that it was trained on.

(This is my understanding, I’m not ml engineer)




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: