| 12175202 | Enhanced attention mechanisms | Colin Abraham Raffel | 2024-12-24 |
| 12154581 | Cascaded encoders for simplified streaming and non-streaming ASR | Arun Narayanan, Tara N. Sainath, Ruoming Pang, Rohit Prakash Prabhavalkar, Jiahui Yu +2 more | 2024-11-26 |
| 12119014 | Joint acoustic echo cancelation, speech enhancement, and voice separation for automatic speech recognition | Arun Narayanan, Tom O'Malley, Quan Wang, Alex Park, James Walker +2 more | 2024-10-15 |
| 12106749 | Speech recognition with sequence-to-sequence models | Rohit Prakash Prabhavalkar, Zhifeng Chen, Bo Li, Kanury Kanishka Rao, Yonghui Wu +8 more | 2024-10-01 |
| 12094453 | Fast emit low-latency streaming ASR with sequence-level emission regularization utilizing forward and backward probabilities between nodes of an alignment lattice | Jiahui Yu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Wei Han +5 more | 2024-09-17 |
| 12079703 | Convolution-augmented transformer models | Anmol Gulati, Ruoming Pang, Niki Parmar, Jiahui Yu, Wei Han +5 more | 2024-09-03 |
| 11922932 | Minimum word error rate training for attention-based sequence-to-sequence models | Rohit Prakash Prabhavalkar, Tara N. Sainath, Yonghui Wu, Patrick Nguyen, Zhifeng Chen +1 more | 2024-03-05 |