Related: TFIDF
[1708.07902v2] Deep Learning for Video Game Playing
[1811.12560v2] An Introduction to Deep Reinforcement Learning
[1811.12560] An Introduction to Deep Reinforcement Learning
[1701.08878] Deep Reinforcement Learning for Robotic Manipulation-The state of the art
[1703.01988] Neural Episodic Control
[1708.07902] Deep Learning for Video Game Playing
[1810.06746] Using Deep Reinforcement Learning for the Continuous Control of Robotic Arms
[1810.06339] Deep Reinforcement Learning
[1708.05866] A Brief Survey of Deep Reinforcement Learning
[1708.05866v2] A Brief Survey of Deep Reinforcement Learning
Mentions
[1606.01868] Unifying Count-Based Exploration and Intrinsic Motivation
[1410.5401] Neural Turing Machines
[1507.06527] Deep Recurrent Q-Learning for Partially Observable MDPs
[1605.05359] Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering
[1506.00019] A Critical Review of Recurrent Neural Networks for Sequence Learning
[1511.09249] On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models
[1604.07255] A Deep Hierarchical Approach to Lifelong Learning in Minecraft
[1410.3916] Memory Networks
Related: Semantic Math
[1907.08823] Potential-Based Advice for Stochastic Policy Learning
[1804.06459] On Learning Intrinsic Rewards for Policy Gradient Methods
[1705.04862] Efficient Parallel Methods for Deep Reinforcement Learning
[1812.06502] A Logarithmic Barrier Method For Proximal Policy Optimization
[1902.03633] Diverse Exploration via Conjugate Policies for Policy Gradient Methods
[1807.00442] Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization