| 12482453 |
Training for long-form speech recognition |
Zhanming Lu, Thibault Doutre, Yanwei Pan, Liangliang Cao, Rohit Prakash Prabhavalkar +1 more |
2025-11-25 |
|
| 12444408 |
Two-pass end to end speech recognition |
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prakash Prabhavalkar +6 more |
2025-10-14 |
|
| 12417770 |
Unified cascaded encoder ASR model for dynamic model sizes |
Shaojin Ding, Yanzhang He, Xin Wang, Weiran Wang, Tara N. Sainath +8 more |
2025-09-16 |
|
| 12412566 |
Lookup-table recurrent language model |
Ronny Huang, Tara N. Sainath, Shankar Kumar |
2025-09-09 |
|
| 12406147 |
Dialog management for large language model-based (LLM-based) dialogs |
Martin Baeuml, Alexander Bailey, Jonas Bragagnolo, Florent D'Halluin |
2025-09-02 |
|
| 12361927 |
Emitting word timings with end-to-end models |
Tara N. Sainath, Basilio Garcia Castillo, David Rybach, Ruoming Pang |
2025-07-15 |
|
| 12354598 |
Rare word recognition with LM-aware MWER training |
Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prakash Prabhavalkar +7 more |
2025-07-08 |
|
| 12354595 |
Deliberation by text-only and semi-supervised training |
Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prakash Prabhavalkar, Sepand Mavandadi +1 more |
2025-07-08 |
|
| 12354597 |
Disfluency detection models for natural conversational voice systems |
Shuo-yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang |
2025-07-08 |
|
| 12340799 |
Identifying and correcting automatic speech recognition (ASR) misrecognitions in a decentralized manner |
Rajiv Mathews, Rohit Prakash Prabhavalkar, Giovanni Motta, Mingqing Chen, Lillian Zhou +3 more |
2025-06-24 |
|
| 12315497 |
Intended query detection using E2E modeling for continued conversation |
Shuo-yiin Chang, Guru Prakash Arumugam, Zelin Wu, Tara N. Sainath, Bo Li +4 more |
2025-05-27 |
|
| 12190869 |
Optimizing inference performance for conformer |
Tara N. Sainath, Rami Botros, Anmol Gulati, Krzysztof Marcin Choromanski, Ruoming Pang +2 more |
2025-01-07 |
|
| 12183322 |
Language agnostic multilingual end-to-end streaming on-device ASR system |
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-yiin Chang, Qiumin Xu +6 more |
2024-12-31 |
$118,851,000 |
| 12154581 |
Cascaded encoders for simplified streaming and non-streaming ASR |
Arun Narayanan, Tara N. Sainath, Chung-Cheng Chiu, Ruoming Pang, Rohit Prakash Prabhavalkar +2 more |
2024-11-26 |
$103,833,000 |
| 12126845 |
Ephemeral learning of machine learning model(s) |
Francoise Beaufays, Khe Chai Sim, Oren Litvin |
2024-10-22 |
$118,222,000 |
| 12118988 |
Transducer-based streaming deliberation for cascaded encoders |
Ke Hu, Tara N. Sainath, Arun Narayanan, Ruoming Pang |
2024-10-15 |
$100,699,000 |
| 12051416 |
Methods and systems for reducing latency in automated assistant interactions |
Lior Alon, Rafael Goldfarb, Dekel Auster, Dan Rasin, Michael Andrew Goodman +3 more |
2024-07-30 |
$89,515,000 |
| 12051404 |
Efficient streaming non-recurrent on-device end-to-end model |
Tara N. Sainath, Arun Narayanan, Rami Botros, Yanzhang He, Ehsan Variani +3 more |
2024-07-30 |
$89,515,000 |
| 12027154 |
Emitting word timings with end-to-end models |
Tara N. Sainath, Basilio Garcia Castillo, David Rybach, Ruoming Pang |
2024-07-02 |
$130,174,000 |
| 12020703 |
Enabling natural conversations with soft endpointing for an automated assistant |
Jaclyn Konzelmann, Jonathan Bloom, Johan Schalkwyk, Joseph Smarr |
2024-06-25 |
$162,704,000 |
| 11763813 |
Methods and systems for reducing latency in automated assistant interactions |
Lior Alon, Rafael Goldfarb, Dekel Auster, Dan Rasin, Michael Andrew Goodman +3 more |
2023-09-19 |
$79,588,000 |
| 11715458 |
Efficient streaming non-recurrent on-device end-to-end model |
Tara N. Sainath, Arun Narayanan, Rami Botros, Yanzhang He, Ehsan Variani +3 more |
2023-08-01 |
$112,973,000 |
| 11594212 |
Attention-based joint acoustic and text on-device end-to-end model |
Tara N. Sainath, Ruoming Pang, Ron J. Weiss, Yanzhang He, Chung-Cheng Chiu |
2023-02-28 |
$65,971,000 |
| 11580956 |
Emitting word timings with end-to-end models |
Tara N. Sainath, Basi Garcia, David Rybach, Ruoming Pang |
2023-02-14 |
$83,918,000 |
| 9741339 |
Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores |
Fuchun Peng, Francoise Beaufays, Brian Strope, Xin Lei, Pedro J. Moreno Mengibar |
2017-08-22 |
$19,881,000 |