Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
jamalaramala
4 months ago
|
parent
|
context
|
favorite
| on:
Grok3 Launch [video]
It probably depends on the benchmark you choose; according to Chatbot Arena, Deepseek-R1 ranks similarly to o1-2024-12-17; and Grok3 is just 3% above these models in "Arena Score" points.
golol
4 months ago
[–]
Chatbot Arena is not really a great benchmark imo
Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: