| 12183322 |
Language agnostic multilingual end-to-end streaming on-device ASR system |
Bo Li, Tara N. Sainath, Shuo-yiin Chang, Qiumin Xu, Trevor Strohman +6 more |
2024-12-31 |
| 12175963 |
Synthesis of speech from text in a voice of a target speaker using neural networks |
Ye Jia, Zhifeng Chen, Yonghui Wu, Jonathan Shen, Ron J. Weiss +5 more |
2024-12-24 |
| 12154581 |
Cascaded encoders for simplified streaming and non-streaming ASR |
Arun Narayanan, Tara N. Sainath, Chung-Cheng Chiu, Rohit Prakash Prabhavalkar, Jiahui Yu +2 more |
2024-11-26 |
| 12148444 |
Synthesizing speech from text using neural networks |
Yonghui Wu, Jonathan Shen, Ron J. Weiss, Michael Schuster, Navdeep Jaitly +7 more |
2024-11-19 |
| 12131244 |
Hardware-optimized neural architecture search |
Sheng Li, Norman Paul Jouppi, Quoc V. Le, Mingxing Tan, Liqun Cheng +1 more |
2024-10-29 |
| 12118988 |
Transducer-based streaming deliberation for cascaded encoders |
Ke Hu, Tara N. Sainath, Arun Narayanan, Trevor Strohman |
2024-10-15 |
| 12112198 |
Asynchronous distributed data flow for machine learning workloads |
Jeffrey Adgate Dean, Sudip Roy, Michael Isard, Aakanksha Chowdhery, Brennan Saeta +9 more |
2024-10-08 |
| 12094453 |
Fast emit low-latency streaming ASR with sequence-level emission regularization utilizing forward and backward probabilities between nodes of an alignment lattice |
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath +5 more |
2024-09-17 |
| 12079703 |
Convolution-augmented transformer models |
Anmol Gulati, Niki Parmar, Jiahui Yu, Wei Han, Chung-Cheng Chiu +5 more |
2024-09-03 |
| 12073824 |
Two-pass end to end speech recognition |
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Antoine Jean Bruguier +2 more |
2024-08-27 |
| 12051404 |
Efficient streaming non-recurrent on-device end-to-end model |
Tara N. Sainath, Arun Narayanan, Rami Botros, Yanzhang He, Ehsan Variani +3 more |
2024-07-30 |
| 12027151 |
Unsupervised learning of disentangled speech content and style representation |
Andros Tjandra, Yu Zhang, Shigeki Karita |
2024-07-02 |
| 12027158 |
Deliberation model-based two-pass end-to-end speech recognition |
Ke Hu, Tara N. Sainath, Rohit Prakash Prabhavalkar |
2024-07-02 |
| 12027154 |
Emitting word timings with end-to-end models |
Tara N. Sainath, Basilio Garcia Castillo, David Rybach, Trevor Strohman |
2024-07-02 |
| 11928574 |
Neural architecture search with factorized hierarchical search space |
Mingxing Tan, Quoc V. Le, Bo Chen, Vijay Vasudevan |
2024-03-12 |
| 11908461 |
Deliberation model-based two-pass end-to-end speech recognition |
Ke Hu, Tara N. Sainath, Rohit Prakash Prabhavalkar |
2024-02-20 |