| 12271823 | Training machine learning models by determining update rules using neural networks | Misha Man Ray Denil, Marcin Andrychowicz, Joao Ferdinando Gomes de Freitas, Sergio Gomez Colmenarejo, Matthew William Hoffman +1 more | 2025-04-08 |
| 12154029 | Continual reinforcement learning with a multi-task agent | Matteo Hessel, Hado Philip van Hasselt, Daniel J. Mankowitz | 2024-11-26 |
| 12141677 | Environment prediction using reinforcement learning | David Silver, Matteo Hessel, Hado Philip van Hasselt | 2024-11-12 |
| 12086714 | Training neural networks using a prioritized experience memory | John Quan, David Silver | 2024-09-10 |
| 12061964 | Modulating agent behavior to optimize learning progress | Diana Luiza Borsa, Fengning Ding, David Szepesvari, Georg Ostrovski, Simon Osindero +1 more | 2024-08-13 |
| 11842281 | Reinforcement learning with auxiliary tasks | Volodymyr Mnih, Wojciech Czarnecki, Maxwell Elliot Jaderberg, David Silver, Koray Kavukcuoglu | 2023-12-12 |
| 11676035 | Learning non-differentiable weights of neural networks using evolutionary strategies | Karel Lenc, Karen Simonyan, Erich Konrad Elsen | 2023-06-13 |
| 11615310 | Training machine learning models by determining update rules using recurrent neural networks | Misha Man Ray Denil, Marcin Andrychowicz, Joao Ferdinando Gomes de Freitas, Sergio Gomez Colmenarejo, Matthew William Hoffman +1 more | 2023-03-28 |
| 11568250 | Training neural networks using a prioritized experience memory | John Quan, David Silver | 2023-01-31 |
| 10956820 | Reinforcement learning with auxiliary tasks | Volodymyr Mnih, Wojciech Czarnecki, Maxwell Elliot Jaderberg, David Silver, Koray Kavukcuoglu | 2021-03-23 |
| 10733501 | Environment prediction using reinforcement learning | David Silver, Matteo Hessel, Hado Philip van Hasselt | 2020-08-04 |
| 10650310 | Training neural networks using a prioritized experience memory | John Quan, David Silver | 2020-05-12 |
| 10628733 | Selecting reinforcement learning actions using goals and observations | Daniel George Horgan, Karol Gregor, David Silver | 2020-04-21 |
| 10282662 | Training neural networks using a prioritized experience memory | John Quan, David Silver | 2019-05-07 |
| 10055687 | Method for creating predictive knowledge structures from experience in an artificial agent | Mark Ring | 2018-08-21 |