They're chilling it out together with Nethack in the Club for AI Benchmarks yet ...

They're chilling it out together with Nethack in the Club for AI Benchmarks yet to be Beaten.

Interestingly, Bongard problems do not have a private test set, unlike ARC-AGI. Can that be because they don't need it? Is it possible that Bongard Problems are a true test of (visual) reasoning that requires intelligence to be solved?

Ooooh! Frisson of excitement!

But I guess it's just that nobody remembers them and so nobody has seriously tried to solve them with Big Data stuff.