Hacker News new | past | comments | ask | show | jobs | submit login
Reinforcement Learning – A Reference (jakubhalmes.substack.com)
108 points by jac08h 3 days ago | hide | past | favorite | 3 comments





While studying for an RL course, I created a reference for several algorithms with a brief description of what limitations they solve. Example:

Problem: SARSA pushes q-values towards the current policy, but ideally we'd want optimal values. Solution: Use the best action in TD-target calculation -> Q-learning

Perhaps someone else will find it helpful!


Very cool write-up! I also took the course this semester. What a coincidence.

Only wish you publicised it before the exam haha :-)

492982


Haha, cool, thank you! I had some notes ready but didn't get around to finishing it sooner. Besides, I'm sure the course slides were much better material for exam prep ;)



Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: