| 12147899 | Training action selection neural networks using look-ahead search | Karen Simonyan, Julian Schrittwieser | 2024-11-19 | $89,094,000 |
| 12141677 | Environment prediction using reinforcement learning | Tom Schaul, Matteo Hessel, Hado Philip van Hasselt | 2024-11-12 | $95,442,000 |
| 12086714 | Training neural networks using a prioritized experience memory | Tom Schaul, John Quan | 2024-09-10 | $77,150,000 |
| 12067491 | Multi-agent reinforcement learning with matchmaking policies | Oriol Vinyals, Maxwell Elliot Jaderberg | 2024-08-20 | $112,098,000 |
| 12020155 | Reinforcement learning using baseline and policy neural networks | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, Koray Kavukcuoglu | 2024-06-25 | $162,704,000 |
| 11842281 | Reinforcement learning with auxiliary tasks | Volodymyr Mnih, Wojciech Czarnecki, Maxwell Elliot Jaderberg, Tom Schaul, Koray Kavukcuoglu | 2023-12-12 | $79,228,000 |
| 11836625 | Training action selection neural networks using look-ahead search | Karen Simonyan, Julian Schrittwieser | 2023-12-05 | $99,298,000 |
| 11836620 | Meta-gradient updates for training return functions for reinforcement learning systems | Zhongwen Xu, Hado Philip van Hasselt | 2023-12-05 | $99,298,000 |
| 11803750 | Continuous control with deep reinforcement learning | Timothy Paul Lillicrap, Jonathan James Hunt, Alexander Pritzel, Nicolas Manfred Otto Heess, Tom Erez +2 more | 2023-10-31 | $122,489,000 |
| 11783182 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, Koray Kavukcuoglu | 2023-10-10 | $80,003,000 |
| 11651208 | Training action selection neural networks using a differentiable credit function | Zhongwen Xu, Hado Philip van Hasselt, Joseph Varughese Modayil, Andre da Motta Salles Barreto | 2023-05-16 | $119,668,000 |
| 11627165 | Multi-agent reinforcement learning with matchmaking policies | Oriol Vinyals, Maxwell Elliot Jaderberg | 2023-04-11 | $98,654,000 |
| 11568250 | Training neural networks using a prioritized experience memory | Tom Schaul, John Quan | 2023-01-31 | $74,974,000 |
| 11507827 | Distributed training of reinforcement learning systems | Praveen Srinivasan, Rory Fearon, Cagdas Alcicek, Arun Sarath Nair, Samuel Blackwell +5 more | 2022-11-22 | $91,879,000 |
| 11449750 | Training action selection neural networks using look-ahead search | Karen Simonyan, Julian Schrittwieser | 2022-09-20 | $85,272,000 |
| 11334792 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, Koray Kavukcuoglu | 2022-05-17 | $65,297,000 |
| 10956820 | Reinforcement learning with auxiliary tasks | Volodymyr Mnih, Wojciech Czarnecki, Maxwell Elliot Jaderberg, Tom Schaul, Koray Kavukcuoglu | 2021-03-23 | $68,942,000 |
| 10936946 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, Koray Kavukcuoglu | 2021-03-02 | $86,389,000 |
| 10867242 | Selecting actions to be performed by a reinforcement learning agent using tree search | Thore Graepel, Shih-Chieh Huang, Arthur Clement Guez, Laurent Sifre, Ilya Sutskever +1 more | 2020-12-15 | $58,660,000 |
| 10860926 | Meta-gradient updates for training return functions for reinforcement learning systems | Zhongwen Xu, Hado Philip van Hasselt | 2020-12-08 | $58,530,000 |
| 10776692 | Continuous control with deep reinforcement learning | Timothy Paul Lillicrap, Jonathan James Hunt, Alexander Pritzel, Nicolas Manfred Otto Heess, Tom Erez +2 more | 2020-09-15 | $40,632,000 |
| 10733501 | Environment prediction using reinforcement learning | Tom Schaul, Matteo Hessel, Hado Philip van Hasselt | 2020-08-04 | $33,918,000 |
| 10650310 | Training neural networks using a prioritized experience memory | Tom Schaul, John Quan | 2020-05-12 | $39,282,000 |
| 10628733 | Selecting reinforcement learning actions using goals and observations | Tom Schaul, Daniel George Horgan, Karol Gregor | 2020-04-21 | $38,833,000 |
| 10445641 | Distributed training of reinforcement learning systems | Praveen Srinivasan, Rory Fearon, Cagdas Alcicek, Arun Sarath Nair, Samuel Blackwell +5 more | 2019-10-15 | $31,201,000 |