
Fundamentally, the pre-trained model would need to learn a "world model" to predict well across distinct domains. This should be possible, setting aside compute requirements and the exact architecture.

After all, the physical world (down to the subatomic level) is governed by physical laws. Ilya Sutskever from OpenAI has stated that next-token prediction might be enough to learn a world model (see [1]). That would imply that a model can learn a "world model" indirectly through text, which is even more unrealistic than learning one directly through pre-training on time-series data.

[1] https://www.youtube.com/watch?v=YEUclZdj_Sc
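To make that "direct" route concrete, here is a minimal sketch of the same next-token objective applied to a numeric series: predict the next value from a fixed context window. This is purely illustrative (toy data, hypothetical model, PyTorch), not any particular released system:

    import torch
    import torch.nn as nn

    # toy series: a sine wave, shape (512, 1)
    series = torch.sin(torch.linspace(0, 20, 512)).unsqueeze(-1)

    context_len = 32
    model = nn.Sequential(
        nn.Flatten(),                 # (batch, 32, 1) -> (batch, 32)
        nn.Linear(context_len, 64),
        nn.ReLU(),
        nn.Linear(64, 1),
    )
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()

    for step in range(200):
        # sample random windows; the target is the value right after each window
        idx = torch.randint(0, len(series) - context_len - 1, (16,))
        x = torch.stack([series[i:i + context_len] for i in idx])
        y = torch.stack([series[i + context_len] for i in idx])
        loss = loss_fn(model(x), y)
        opt.zero_grad()
        loss.backward()
        opt.step()

Whether minimizing that loss at scale amounts to learning a "world model" is exactly the question under debate.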




But the data-generating process could be literally anything. We are not constrained by physics in any real sense if we are predicting financial markets, occurrences of a certain build error, or termite behavior.


Sure, there are limits. Not everything is predictable, not even in physics. But that is also not the point of such a model. The goal is to forecast across a broad range of use cases that do have underlying laws. Similar to LLMs, such models could also be fine-tuned (see the sketch below).
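A hedged sketch of what that fine-tuning could look like, assuming the usual recipe: freeze a pretrained forecasting backbone and train only a small head on domain data. The `backbone` here is a hypothetical stand-in, not an actual released model:

    import torch
    import torch.nn as nn

    # stand-in for a pretrained time-series backbone
    backbone = nn.GRU(input_size=1, hidden_size=64, batch_first=True)
    for p in backbone.parameters():
        p.requires_grad = False      # keep pretrained weights fixed

    head = nn.Linear(64, 1)          # only this part is trained on the new domain
    opt = torch.optim.Adam(head.parameters(), lr=1e-3)

    def forecast(x):                 # x: (batch, time, 1)
        out, _ = backbone(x)
        return head(out[:, -1])      # predict the next value from the last state

    # one dummy fine-tuning step on domain-specific data
    x, y = torch.randn(8, 32, 1), torch.randn(8, 1)
    loss = nn.MSELoss()(forecast(x), y)
    loss.backward()
    opt.step()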


"predicting the next token well means that you understand the underlying reality that led to the creation of that token"

People on the AI-hype side of things tend to believe this, but I really fundamentally don't.

It's become a philosophical debate at this point (what does it mean to "understand" something, etc.).



