Hacker News new | past | comments | ask | show | jobs | submit login

They're chilling it out together with Nethack in the Club for AI Benchmarks yet to be Beaten.

Interestingly, Bongard problems do not have a private test set, unlike ARC-AGI. Can that be because they don't need it? Is it possible that Bongard Problems are a true test of (visual) reasoning that requires intelligence to be solved?

Ooooh! Frisson of excitement!

But I guess it's just that nobody remembers them and so nobody has seriously tried to solve them with Big Data stuff.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: