Hacker News new | past | comments | ask | show | jobs | submit login

For example, the model named "deepseek-r1:8b" by ollama is not a deepseek r1 model. It is actually a fine tune of Meta's Llama 8b, fine tuned on data generated by deepseek r1.





Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: