It's interesting that there are no reasoning models yet, 2.5 months after DeepSe...

cheptsov · 2025-04-05T19:42:17 1743882137

https://www.llama.com/llama4-reasoning-is-coming/

jlpom · 2025-04-05T20:20:45 1743884445

The page is blank for now.

sroussey · 2025-04-05T20:43:59 1743885839

Yeah, it is listed here:

https://www.llama.com/llama4/

And going to that page just says coming soon.

voxgen · 2025-04-06T08:52:50 1743929570

> It's interesting that there are no reasoning models yet

This may be merely a naming distinction, leaving the name open for a future release based on their recent research such as coconut[1]. They did RL post-training, and when fed logic problems it appears to do significant amounts of step-by-step thinking[2]. It seems it just doesn't wrap it in <thinking> tags.

[1] https://arxiv.org/abs/2412.06769 "Training Large Language Models to Reason in a Continuous Latent Space" [2] https://www.youtube.com/watch?v=12lAM-xPvu8 (skip through this - it's recorded in real time)

azinman2 · 2025-04-06T18:16:25 1743963385

But if the final result is of high enough quality, who cares about reasoning? It’s a trick to get the quality higher, at the cost of tokens and latency.

whimsicalism · 2025-04-06T18:20:24 1743963624

reasoning is giving the option to trade $ for additional performance, seems like you would always desire this optionality for any model