| 12423571 |
Training actor-critic algorithms in laboratory settings |
Piyush Khandelwal, Peter R. Wurman |
2025-09-23 |
| 12354027 |
Method and system for an intelligent artificial agent |
Mark Ring, Satinder Singh Baveja, Peter Stone, Samuel Barrett, Roberto Capobianco +3 more |
2025-07-08 |
| 12277194 |
Task prioritized experience replay algorithm for reinforcement learning |
Varun Kompella, Peter R. Wurman, Peter Stone |
2025-04-15 |
| 12217156 |
Computing temporal convolution networks in real time |
Piyush Khandelwal, Peter R. Wurman, Fabrizio Santini |
2025-02-04 |
| 12153385 |
Methods and systems to adapt PID coefficients through reinforcement learning |
Samuel Barrett, Varun Kompella, Peter R. Wurman, Goker Erdogan, Fabrizio Santini |
2024-11-26 |
| 12017148 |
User interface for operating artificial intelligence experiments |
Rory Douglas, Dion Whitehead, Leon Barrett, Piyush Khandelwal, Thomas J. Walsh +4 more |
2024-06-25 |
| 11816591 |
Reinforcement learning through a double actor critic algorithm |
— |
2023-11-14 |
| 11443229 |
Method and system for continual learning in an intelligent artificial agent |
Mark Ring, Satinder Singh Baveja, Roberto Capobianco, Varun Kompella, Kaushik Subramanian |
2022-09-13 |