| 12373666 |
Convolution-augmented transformer models |
Anmol Gulati, Weikeng Qin, Zhengdong Zhang, Ruoming Pang, Niki Parmar +5 more |
2025-07-29 |
| 12175202 |
Enhanced attention mechanisms |
Colin Abraham Raffel |
2024-12-24 |
| 12154581 |
Cascaded encoders for simplified streaming and non-streaming ASR |
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Rohit Prakash Prabhavalkar, Jiahui Yu +2 more |
2024-11-26 |
| 12119014 |
Joint acoustic echo cancelation, speech enhancement, and voice separation for automatic speech recognition |
Arun Narayanan, Tom O'malley, Quan Wang, Alex Park, James Walker +2 more |
2024-10-15 |
| 12106749 |
Speech recognition with sequence-to-sequence models |
Rohit Prakash Prabhavalkar, Zhifeng Chen, Bo Li, Kanury Kanishka Rao, Yonghui Wu +8 more |
2024-10-01 |
| 12094453 |
Fast emit low-latency streaming ASR with sequence-level emission regularization utilizing forward and backward probabilities between nodes of an alignment lattice |
Jiahui Yu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Wei Han +5 more |
2024-09-17 |
| 12079703 |
Convolution-augmented transformer models |
Anmol Gulati, Ruoming Pang, Niki Parmar, Jiahui Yu, Wei Han +5 more |
2024-09-03 |
| 11922932 |
Minimum word error rate training for attention-based sequence-to-sequence models |
Rohit Prakash Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen +1 more |
2024-03-05 |
| 11816577 |
Augmentation of audiographic images for improved machine learning |
Daniel Sung-Joon Park, Quoc V. Le, William Chan, Ekin Dogus Cubuk, Barret Zoph +1 more |
2023-11-14 |
| 11804212 |
Streaming automatic speech recognition with non-streaming model distillation |
Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Ruoming Pang +4 more |
2023-10-31 |
| 11646019 |
Minimum word error rate training for attention-based sequence-to-sequence models |
Rohit Prakash Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen +1 more |
2023-05-09 |
| 11625572 |
Recurrent neural networks for online sequence generation |
Navdeep Jaitly, John D. Lawson, George Jay Tucker |
2023-04-11 |
| 11594212 |
Attention-based joint acoustic and text on-device end-to-end model |
Tara N. Sainath, Ruoming Pang, Ron J. Weiss, Yanzhang He, Trevor Strohman |
2023-02-28 |
| 11335333 |
Speech recognition with sequence-to-sequence models |
Wei Han, Yu Zhang, Yonghui Wu, Patrick Nguyen, Sergey Kishchenko |
2022-05-17 |
| 11210475 |
Enhanced attention mechanisms |
Colin Abraham Raffel |
2021-12-28 |
| 11145293 |
Speech recognition with sequence-to-sequence models |
Rohit Prakash Prabhavalkar, Zhifeng Chen, Bo Li, Kanury Kanishka Rao, Yonghui Wu +8 more |
2021-10-12 |
| 11138471 |
Augmentation of audiographic images for improved machine learning |
Daniel Sung-Joon Park, Quoc V. Le, William Chan, Ekin Dogus Cubuk, Barret Zoph +1 more |
2021-10-05 |
| 11107463 |
Minimum word error rate training for attention-based sequence-to-sequence models |
Rohit Prakash Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen +1 more |
2021-08-31 |
| 10656605 |
Recurrent neural networks for online sequence generation |
Navdeep Jaitly, Ilya Sutskever, Yuping Luo |
2020-05-19 |
| 10281885 |
Recurrent neural networks for online sequence generation |
Navdeep Jaitly, Ilya Sutskever, Yuping Luo |
2019-05-07 |
| 8331623 |
Method for tracking and processing image |
Bing-Fei Wu, Chao-Jung Chen, Chih-Chung Kao, Meng-Liang Chung, Min-Yu Ku +2 more |
2012-12-11 |
| 8284239 |
Asynchronous photography automobile-detecting apparatus |
Wen-Chung Chen, Meng-Liang Chung |
2012-10-09 |
| 8218877 |
Tracking vehicle method by using image processing |
Bing-Fei Wu, Chao-Jung Chen, Chih-Chung Kao, Meng-Liang Chung, Min-Yu Ku +2 more |
2012-07-10 |
| 8041079 |
Apparatus and method for detecting obstacle through stereovision |
Meng-Liang Chung, Wen-Chung Chen, Min-Yu Ku |
2011-10-18 |