| 12394420 |
Hypothesis stitcher for speech recognition of long-form audio |
Naoyuki KANDA, Xuankai Chang, Xiaofei Wang, Zhong Meng, Takuya Yoshioka |
2025-08-19 |
| 12353839 |
On-device streaming inverse text normalization (ITN) |
Nicholas Kibre, Issac Alphonso, Jian Xue, Jinyu Li, Piyush BEHRE +1 more |
2025-07-08 |
| 12136034 |
Dynamic gradient aggregation for training neural networks |
Dimitrios Dimitriadis, Kenichi Kumatani, Robert Peter Gmyr, Masaki Itagaki, Nanshan Zeng +1 more |
2024-11-05 |
| 11984127 |
Training and using a transcript generation model on a multi-speaker audio stream |
Naoyuki KANDA, Takuya Yoshioka, Zhuo Chen, Jinyu Li, Zhong Meng +2 more |
2024-05-14 |
| 11935542 |
Hypothesis stitcher for speech recognition of long-form audio |
Naoyuki KANDA, Xuankai Chang, Xiaofei Wang, Zhong Meng, Takuya Yoshioka |
2024-03-19 |
| 11915686 |
Speaker adaptation for attention-based encoder-decoder |
Zhong Meng, Jinyu Li, Yifan Gong |
2024-02-27 |
| 11574639 |
Hypothesis stitcher for speech recognition of long-form audio |
Naoyuki KANDA, Xuankai Chang, Xiaofei Wang, Zhong Meng, Takuya Yoshioka |
2023-02-07 |
| 11562745 |
Sequence-to-sequence speech recognition with latency threshold |
Jinyu Li, Liang Lu, Hirofumi INAGUMA, Yifan Gong |
2023-01-24 |
| 11527238 |
Internal language model for E2E models |
Zhong Meng, Sarangarajan Parthasarathy, Xie Sun, Naoyuki KANDA, Liang Lu +4 more |
2022-12-13 |
| 11232782 |
Speaker adaptation for attention-based encoder-decoder |
Zhong Meng, Jinyu Li, Yifan Gong |
2022-01-25 |
| 10971142 |
Systems and methods for robust speech recognition using generative adversarial networks |
Anuroop Sriram, Hee Woo Jun, Sanjeev Satheesh |
2021-04-06 |
| 10657955 |
Systems and methods for principled bias reduction in production speech models |
Eric Dean Battenberg, Rewon Child, Adam Coates, Christopher Fougner, Jiaji Huang +10 more |
2020-05-19 |