| 12488791 |
Contextual biasing with text injection |
Rohit Prakash Prabhavalkar, Diamantino Antonio Caseiro, Patrick Maxim Rondon, Cyril Georges Luc Allauzen |
2025-12-02 |
| 12482455 |
Systems and methods for training dual-mode machine-learned speech recognition models |
Jiaxue Yu, Ruoming Pang, Wei Han, Anmol Gulati, Chen-hwa Chiu +2 more |
2025-11-25 |
| 12444408 |
Two-pass end to end speech recognition |
Ruoming Pang, David Rybach, Yanzhang He, Rohit Prakash Prabhavalkar, Wei Li +6 more |
2025-10-14 |
| 12437752 |
Large-scale language model data selection for rare-word speech recognition |
Wenqing Huang |
2025-10-07 |
| 12417770 |
Unified cascaded encoder ASR model for dynamic model sizes |
Shaojin Ding, Yangzhang He, Xin Wang, Weiran Wang, Trevor Strohman +8 more |
2025-09-16 |
| 12412566 |
Lookup-table recurrent language model |
Ronny Huang, Trevor Strohman, Shankar Kumar |
2025-09-09 |
| 12361927 |
Emitting word timings with end-to-end models |
Basilio Garcia Castillo, David Rybach, Trevor Strohman, Ruoming Pang |
2025-07-15 |
| 12354598 |
Rare word recognition with LM-aware MWER training |
Weiran Wang, Tongzhou Chen, Ehsan Variani, Rohit Prakash Prabhavalkar, Ronny Huang +7 more |
2025-07-08 |
| 12354595 |
Deliberation by text-only and semi-supervised training |
Ke Hu, Yanzhang He, Rohit Prakash Prabhavalkar, Sepand Mavandadi, Weiran Wang +1 more |
2025-07-08 |
| 12354597 |
Disfluency detection models for natural conversational voice systems |
Shuo-yiin Chang, Bo Li, Trevor Strohman, Chao Zhang |
2025-07-08 |
| 12322383 |
Predicting word boundaries for on-device batching of end-to-end speech recognition models |
Shaan Jagdeep Patrick Bijwadia, Jiahui Yu, Shuo-yiin Chang, Yangzhang He |
2025-06-03 |
| 12315497 |
Intended query detection using E2E modeling for continued conversation |
Shuo-yiin Chang, Guru Prakash Arumugam, Zelin Wu, Bo Li, Qiao Liang +4 more |
2025-05-27 |
| 12254865 |
Multi-dialect and multilingual speech recognition |
Zhifeng Chen, Bo Li, Eugene Weinstein, Yonghui Wu, Pedro J. Moreno Mengibar +3 more |
2025-03-18 |
| 12249317 |
Joint unsupervised and supervised training for multilingual ASR |
Bo Li, Junwen Bai, Yu Zhang, Ankur Bapna, Nikhil Siddhartha +1 more |
2025-03-11 |
| 12211509 |
Fusion of acoustic and text representations in RNN-T |
Chao Zhang, Bo Li, Zhiyun Lu, Shuo-yiin Chang |
2025-01-28 |
| 12190869 |
Optimizing inference performance for conformer |
Rami Botros, Anmol Gulati, Krzysztof Marcin Choromanski, Ruoming Pang, Trevor Strohman +2 more |
2025-01-07 |