| 12299574 |
Distributed training using actor-critic reinforcement learning with off-policy correction factors |
Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more |
2025-05-13 |
| 11868894 |
Distributed training using actor-critic reinforcement learning with off-policy correction factors |
Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more |
2024-01-09 |
| 11842261 |
Deep reinforcement learning with fast updating recurrent neural networks and slow updating recurrent neural networks |
Wojciech Czarnecki, Maxwell Elliot Jaderberg |
2023-12-12 |
| 11593646 |
Distributed training using actor-critic reinforcement learning with off-policy correction factors |
Hubert Josef Soyer, Lasse Espeholt, Karen Simonyan, Yotam Doron, Vlad Firoiu +5 more |
2023-02-28 |
| 10872293 |
Deep reinforcement learning with fast updating recurrent neural networks and slow updating recurrent neural networks |
Wojciech Czarnecki, Maxwell Elliot Jaderberg |
2020-12-22 |