| 11842281 |
Reinforcement learning with auxiliary tasks |
Volodymyr Mnih, Wojciech Czarnecki, Maxwell Elliot Jaderberg, Tom Schaul, Koray Kavukcuoglu |
2023-12-12 |
| 11836620 |
Meta-gradient updates for training return functions for reinforcement learning systems |
Zhongwen Xu, Hado Philip van Hasselt |
2023-12-05 |
| 11836625 |
Training action selection neural networks using look-ahead search |
Karen Simonyan, Julian Schrittwieser |
2023-12-05 |
| 11803750 |
Continuous control with deep reinforcement learning |
Timothy Paul Lillicrap, Jonathan James Hunt, Alexander Pritzel, Nicolas Manfred Otto Heess, Tom Erez +2 more |
2023-10-31 |
| 11783182 |
Asynchronous deep reinforcement learning |
Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, Koray Kavukcuoglu |
2023-10-10 |
| 11651208 |
Training action selection neural networks using a differentiable credit function |
Zhongwen Xu, Hado Phillip van Hasselt, Joseph Varughese Modayil, Andre da Motta Salles Barreto |
2023-05-16 |
| 11627165 |
Multi-agent reinforcement learning with matchmaking policies |
Oriol Vinyals, Maxwell Elliot Jaderberg |
2023-04-11 |
| 11568250 |
Training neural networks using a prioritized experience memory |
Tom Schaul, John Quan |
2023-01-31 |