Hacker News new | past | comments | ask | show | jobs | submit login

Same. I am eager to run my code editing benchmark [1] against it, to compare it with gpt-4-0314 and gpt-4-0613.

Edit: Ha, I just re-read the announcement [2] and it says 1pm in the 5th sentence:

  We’ll begin rolling out new features to OpenAI customers starting at 1pm PT today.

[1] https://aider.chat/docs/benchmarks.html

[2] https://openai.com/blog/new-models-and-developer-products-an...




I've been able to generate some preliminary code editing evaluations. OpenAI is enforcing very low rate limits on the new GPT-4 model. I will update the results as quickly my rate limit allows.

https://news.ycombinator.com/item?id=38172621

Also, aider now supports these new models, including `gpt-4-1106-preview` with the massive 128k context window.

https://github.com/paul-gauthier/aider/releases/tag/v0.17.0


I'm also eager for you to run your code editing benchmark against it. :)


Hey. Would really love to know the results of your benchmark testing.


"begin".

Other comments says this can take days to get to everyone.


Good find - Looks like I now have access!




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: