Hacker News new | past | comments | ask | show | jobs | submit login

I find the python code very clear, but I would prefer to see a real life interesting application that doesn't require a lot of computation. In a post in wildml there is an example of using NLP and deep learning for a simple task but after 22 hours of computation the final result is a little disappointing to say the least.

I like to read wildml.com and fastml.com blogs, but I would like to find more simple applications that shows real value without using lots of resources. Perhaps there is a subfield of RL where using some kind of proper human intelligence one can hope to beat those giants provided of unlimited computational and financial resources




I gave a talk a PyConSG this year[1], which included a demonstration of training a Reinforcement Learning model on a 'Bubble Breaker' game. There's also more detail available[2].

The Jupyter notebook is included in the GitHub repo[3], and includes a 'scaled down version' that takes ~5mins to train on a MacBook's CPU. There's also a downloadable 'full scale' model that was trained in ~7hours on a Titan X. It plays the game (on average) better than me...

[1] http://blog.mdda.net/ai/2016/06/23/workshop-at-pycon-sg-2016 (has slides, and YouTube link) [2] http://redcatlabs.com/2016-07-30_FifthElephant-DeepLearning-... [3] https://github.com/mdda/deep-learning-workshop : have a look at notebooks/7-Reinforcement-Learning.ipynb


Most RL algorithms are polynomial time or worse, and they use large datasets. Computation is always going to be an issue which is why most successful implementations are around simplifying models and datasets.

If you can figure out a way of making RL better than polynomial time there is at least a Turing Award for you.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: