| 12175737 |
Reinforcement learning for active sequence processing |
Viorica Patraucean, Joao Carreira, Volodymyr Mnih, Simon Osindero |
2024-12-24 |
| 11977983 |
Noisy neural network layers with noise parameters |
Mohammad Gheshlaghi Azar, Meire Fortunato, Olivier Claude Pietquin, Jacob Lee Menick, Volodymyr Mnih +2 more |
2024-05-07 |
| 11886997 |
Training action selection neural networks using apprenticeship |
Olivier Claude Pietquin, Martin Riedmiller, Wang Fumin, Mel Vecerik, Todd Hester +4 more |
2024-01-30 |
| 11868882 |
Training action selection neural networks using apprenticeship |
Olivier Claude Pietquin, Martin Riedmiller, Wang Fumin, Mel Vecerik, Todd Hester +4 more |
2024-01-09 |
| 11714990 |
Jointly learning exploratory and non-exploratory action selection policies |
Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Guo, Steven James Kapturowski +2 more |
2023-08-01 |
| 10839293 |
Noisy neural network layers with noise parameters |
Mohammad Gheshlaghi Azar, Meire Fortunato, Olivier Claude Pietquin, Jacob Lee Menick, Volodymyr Mnih +2 more |
2020-11-17 |