| 12299574 | Distributed training using actor-critic reinforcement learning with off-policy correction factors | Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more | 2025-05-13 |
| 12299575 | Augmenting neural networks with external memory | Alexander Benjamin Graves, Ivo Danihelka, Malcolm Kevin Campbell Reynolds, Gregory Duncan Wayne | 2025-05-13 |
| 12020155 | Reinforcement learning using baseline and policy neural networks | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, David Silver, Koray Kavukcuoglu | 2024-06-25 |
| 11907821 | Population-based training of machine learning models | Ang Li, Valentin Clement Dalibard, David Budden, Ola Spyra, Maxwell Elliot Jaderberg +3 more | 2024-02-20 |
| 11868894 | Distributed training using actor-critic reinforcement learning with off-policy correction factors | Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more | 2024-01-09 |
| 11783182 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, David Silver, Koray Kavukcuoglu | 2023-10-10 |
| 11593646 | Distributed training using actor-critic reinforcement learning with off-policy correction factors | Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more | 2023-02-28 |
| 11334792 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, David Silver, Koray Kavukcuoglu | 2022-05-17 |
| 11151443 | Augmenting neural networks with sparsely-accessed external memory | Ivo Danihelka, Gregory Duncan Wayne, Fu-Min Wang, Edward Thomas Grefenstette, Jack William Rae +3 more | 2021-10-19 |
| 10936946 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, David Silver, Koray Kavukcuoglu | 2021-03-02 |
| 10832134 | Augmenting neural networks with external memory | Alexander Benjamin Graves, Ivo Danihelka, Malcolm Kevin Campbell Reynolds, Gregory Duncan Wayne | 2020-11-10 |
| 10346741 | Asynchronous deep reinforcement learning | Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, David Silver, Koray Kavukcuoglu | 2019-07-09 |