| 12277400 |
Multimedia content management for large language model(s) and/or other generative model(s) |
Sanil Jain, Wei Yu, Ágoston Weisz, Michael Andrew Goodman, Diana Avram +10 more |
2025-04-15 |
| 12242948 |
Systems and methods for routing within multitask mixture-of-experts models |
Yanping Huang, Dmitry Lepikhin, Maxim Krikun, Orhan Firat, Ankur Bapna +1 more |
2025-03-04 |
| 12210845 |
Contrastive pre-training for language tasks |
Quoc V. Le, Kevin Stefan Clark |
2025-01-28 |
| 12118064 |
Training machine learning models using unsupervised data augmentation |
Quoc V. Le, Qizhe Xie, Zihang Dai |
2024-10-15 |
| 11947923 |
Multimedia content management for large language model(s) and/or other generative model(s) |
Sanil Jain, Wei Yu, Ágoston Weisz, Michael Andrew Goodman, Diana Avram +10 more |
2024-04-02 |
| 11922281 |
Training machine learning models using teacher annealing |
Quoc V. Le, Kevin Stefan Clark |
2024-03-05 |
| 11914969 |
Contrastive pre-training for language tasks |
Quoc V. Le, Kevin Stefan Clark |
2024-02-27 |
| 11907674 |
Generating multi-modal response(s) through utilization of large language model(s) |
Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Yifeng Lu, Igor Petrovski +11 more |
2024-02-20 |
| 11501168 |
Learning longer-term dependencies in neural network using auxiliary losses |
Andrew M. Dai, Quoc V. Le, Hoang Trieu Trinh |
2022-11-15 |
| 11488067 |
Training machine learning models using teacher annealing |
Quoc V. Le, Kevin Stefan Clark |
2022-11-01 |
| 11481609 |
Computationally efficient expressive output layers for neural networks |
Quoc V. Le, Zhilin Yang |
2022-10-25 |
| 11449684 |
Contrastive pre-training for language tasks |
Quoc V. Le, Kevin Stefan Clark |
2022-09-20 |
| 11080589 |
Sequence processing using online attention |
Ron J. Weiss, Peter J. Liu, Colin Abraham Raffel, Douglas Eck |
2021-08-03 |