Introduction
Documentation
Table
The table below was generated by AI (AIGC), so treat it with caution; the linked web pages are jumbled.
Category | Name | Author | Description | Link |
---|---|---|---|---|
Algorithms | SARSA | G. A. Rummery & M. Niranjan | An on-policy Temporal Difference (TD) control algorithm that updates its action-value estimates using the action actually taken in the next state. | 1 |
Algorithms | Q-Learning | Christopher J. C. H. Watkins & Peter Dayan | A model-free reinforcement learning algorithm that learns to predict the expected return for taking a particular action in a particular state. | 2 |
Algorithms | Deep Q-Networks (DQN) | Volodymyr Mnih et al. (DeepMind) | A Q-learning variant that approximates action values with a deep neural network, first demonstrated by learning to play Atari games directly from raw pixels. | 3 |
Algorithms | Deep Deterministic Policy Gradient (DDPG) | Timothy P. Lillicrap et al. (DeepMind) | An off-policy actor-critic algorithm for continuous control that learns a deterministic policy network alongside a Q-value critic. | 7 |
Algorithms | Proximal Policy Optimization (PPO) | John Schulman et al. (OpenAI) | A model-free policy gradient algorithm that approximates a trust-region update, typically through a clipped surrogate objective. | 8 |
Algorithms | Advantage Actor-Critic (A2C) | Volodymyr Mnih et al. (introduced as the asynchronous variant, A3C) | An actor-critic algorithm that combines policy gradient and value-based methods by using a learned value function to estimate advantages; A2C is the synchronous variant of A3C. | 9 |
Frameworks | Gym | OpenAI | A Python library that provides a common environment interface for developing and comparing reinforcement learning algorithms; now maintained as Gymnasium by the Farama Foundation. | 4 |
Frameworks | Ray | UC Berkeley RISELab (Philipp Moritz, Robert Nishihara et al.) | A distributed computing framework for Python; its RLlib library supports training reinforcement learning models at scale. | 5 |
Books | Reinforcement Learning | Richard S. Sutton & Andrew G. Barto | A comprehensive book that covers the basics and applications of reinforcement learning. | 6 |
Books | Deep Learning | Ian Goodfellow, Yoshua Bengio & Aaron Courville | A comprehensive book on deep learning, covering the neural network foundations that deep reinforcement learning builds on. | 10 |
Journals | Journal of Machine Learning Research | Various Authors | A leading journal for machine learning research, including reinforcement learning topics. | 11 |
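
The SARSA and Q-Learning rows differ only in how the TD target is formed, which a short sketch can make concrete. The function names, the NumPy Q-table layout (`Q[state, action]`), and the default `alpha` and `gamma` values below are illustrative assumptions rather than anything taken from the linked resources, and the sketch assumes `s_next` is a non-terminal state.

```python
import numpy as np


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.99):
    """Off-policy TD update: bootstrap from the greedy action in s_next."""
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.99):
    """On-policy TD update: bootstrap from the action actually taken in s_next."""
    td_target = r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q


# Hypothetical 2-state, 2-action table used only for illustration.
Q = np.zeros((2, 2))
Q[1] = [1.0, 2.0]

# Same transition (s=0, a=0, r=1.0, s_next=1); only the bootstrap term differs.
Q_ql = q_learning_update(Q.copy(), 0, 0, 1.0, 1, alpha=0.5, gamma=1.0)
Q_sarsa = sarsa_update(Q.copy(), 0, 0, 1.0, 1, a_next=0, alpha=0.5, gamma=1.0)
```

When the next action chosen by the behavior policy is not the greedy one, the two updates diverge, which is the practical meaning of on-policy versus off-policy in the table.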