So, basically, it's chain of thought as a service?

Not a model, per se, but a service that chains multiple model requests behind the scenes?




Who knows? Certainly not the public.

It might be a finetuned model that works better in such a setting.


The linked blog post explains that it is fine-tuned through some reinforcement learning process. It doesn’t go into details, but they do claim it’s not just the base model with chain of thought; there’s some fine-tuning going on.



