Alpha zero is a generalized AlphaGo, so they are headed in the right direction! ...

ForHackernews · on Jan 21, 2019

Yeah maybe. How do you define "win" in terms of general intelligence? Outsmart all top human experts at their field?

iheartpotatoes · on Jan 21, 2019

Impossible to answer: who "won" politics? Democrats or republicans?

Eventually we go from a black-and-white rules (hell, even Go is kind of a negotiation), to using economics as a tool to determine what option satisfies all parties. I don't think humans have solved that yet, but like radiology, perhaps AI can make better compromises?

wetpaws · on Jan 21, 2019

Whatever the reward function dictates would be a win :)

ForHackernews · on Jan 21, 2019

Right. So we've just successfully moved the goalposts to "write a reward function that objectively evaluates g"