| 12430515 | Machine-learned language models which generate intermediate textual analysis in service of contextual text generation | Daniel De Freitas Adiwardana | 2025-09-30 |
| 12393840 | Granular neural network architecture search over low-level primitives | David Richard So, Quoc V. Le, Hanxiao Liu, Wojciech Andrzej Manke, Zihang Dai | 2025-08-19 |
| 12373688 | Granular neural network architecture search over low-level primitives | David Richard So, Quoc V. Le, Hanxiao Liu, Wojciech Andrzej Manke, Zihang Dai | 2025-07-29 |
| 12353991 | Fast decoding in sequence models using discrete latent variables | Lukasz Mieczyslaw Kaiser, Aurko Roy, Ashish Teku Vaswani, Niki J. Parmar, Samuel Bengio +1 more | 2025-07-08 |
| 12354005 | Attention-based decoder-only sequence transduction neural networks | Lukasz Mieczyslaw Kaiser, Etienne Pot, Mohammad Saleh, Ben Goodrich, Peter J. Liu +1 more | 2025-07-08 |
| 12299573 | Attention-based decoder-only sequence transduction neural networks | Lukasz Mieczyslaw Kaiser, Etienne Pot, Mohammad Saleh, Ben Goodrich, Peter J. Liu +1 more | 2025-05-13 |
| 12299572 | Attention-based decoder-only sequence transduction neural networks | Lukasz Mieczyslaw Kaiser, Etienne Pot, Mohammad Saleh, Ben Goodrich, Peter J. Liu +1 more | 2025-05-13 |
| 12271817 | Attention-based decoder-only sequence transduction neural networks | Lukasz Mieczyslaw Kaiser, Etienne Pot, Mohammad Saleh, Ben Goodrich, Peter J. Liu +1 more | 2025-04-08 |
| 12265903 | Distributing tensor computations across computing devices | — | 2025-04-01 |
| 12254411 | Attention neural networks with linear units | — | 2025-03-18 |
| 12217173 | Attention-based sequence transduction neural networks | Aidan Nicholas Gomez, Lukasz Mieczyslaw Kaiser, Jakob D. Uszkoreit, Llion Owen Jones, Niki J. Parmar +2 more | 2025-02-04 |