Hacker News new | past | comments | ask | show | jobs | submit login

They aren’t, there is a 1.58 version of deepseek that’s like 200gb instead of 700





That's not a real BitNet, it's just a post-training quantisation, and its performance suffers compared to if it was trained from scratch at 1.58 bits.



Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: