Probably drawing an analogy to how causally pretrained models go through stages of understanding language: words -> grammar -> meaning. Gwern mentions this experience when training character-level RNNs. https://gwern.net/scaling-hypothesis#why-does-pretraining-wo...
Totally. Also basic temporal scales or cyclic properties. It's kind of mind-blowing that the shape of most recorded human patterns is reducible in this way.