
The Google paper uses memory. You can't remember a never-ending stream of 2D random noise, because there are limitless variations.



They don't remember all the previous scenes, just the ones that are "novel" enough.

But how do we decide whether the agent is seeing the same thing as an existing memory? Checking for an exact match could be meaningless: in a realistic environment, the agent rarely sees exactly the same thing twice. For example, even if the agent returned to exactly the same room, it would still see this room under a different angle compared to its memories.

Instead of checking for an exact match in memory, we use a deep neural network that is trained to measure how similar two experiences are.
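
To make the quoted idea concrete, here is a rough sketch of a memory that only stores observations that look novel compared to what it already holds. It is not the paper's implementation: the class name EpisodicMemory, the threshold and capacity values, and the drop-the-oldest eviction rule are all made up for illustration, and plain cosine similarity stands in for the trained similarity network.

```python
import math


def cosine_similarity(a, b):
    """Stand-in for the learned comparator (assumption: the paper trains a network instead)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class EpisodicMemory:
    """Hypothetical memory that keeps only observations judged novel enough."""

    def __init__(self, similarity=cosine_similarity, threshold=0.9, capacity=200):
        self.similarity = similarity  # any function returning a similarity score
        self.threshold = threshold    # illustrative value, not taken from the paper
        self.capacity = capacity      # illustrative value, not taken from the paper
        self.items = []

    def is_novel(self, observation):
        # Novel means: less similar than the threshold to every stored memory.
        return all(self.similarity(observation, m) < self.threshold for m in self.items)

    def observe(self, observation):
        """Store the observation if it is novel; return True when it was stored."""
        if not self.is_novel(observation):
            return False
        if len(self.items) >= self.capacity:
            self.items.pop(0)  # assumed eviction rule: drop the oldest memory
        self.items.append(observation)
        return True


memory = EpisodicMemory()
first = memory.observe([1.0, 0.0])     # empty memory, so this is stored
again = memory.observe([0.99, 0.05])   # nearly the same direction, so it is skipped
other = memory.observe([0.0, 1.0])     # very different, so this is stored
```

The second observation plays the role of the same room seen from a slightly different angle: it is not an exact match, but it is similar enough to an existing memory that it is not stored again.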


OK, though that still depends on there being a limited number of TV shows.

If there is an unlimited number of shows and the agent walks right up to the TV, I think it's still trapped.




