| 12400633 | End-to-end speech waveform generation through data density gradient estimation | Byungha Chun, Mohammad Norouzi, Nanxin Chen, Ron J. Weiss, William Chan +1 more | 2025-08-26 | |
| 12373666 | Convolution-augmented transformer models | Anmol Gulati, Weikeng Qin, Zhengdong Zhang, Ruoming Pang, Niki Parmar +5 more | 2025-07-29 | |
| 12353981 | Training of large neural networks | Slav Petrov, Andrew M. Dai, David Richard So, Dmitry Lepikhin, Erica Ann Moreira +19 more | 2025-07-08 | |
| 12282857 | Relative margin for contrastive learning | Siyuan Qiao, Chenxi Liu, Jiahui Yu | 2025-04-22 | |
| 12254865 | Multi-dialect and multilingual speech recognition | Zhifeng Chen, Bo Li, Eugene Weinstein, Pedro J. Moreno Mengibar, Ron J. Weiss +3 more | 2025-03-18 | |
| 12249315 | Unsupervised parallel tacotron non-autoregressive and controllable text-to-speech | Isaac Elias, Byungha Chun, Jonathan Shen, Ye Jia, Yu Zhang | 2025-03-11 | |
| 12222994 | Quick application startup method and related apparatus | Litao Yu, Fei Sun, Guoqiang Li | 2025-02-11 | |
| 12190860 | End-to-end text-to-speech conversion | Samuel Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Ioannis Agiomyrgiannakis +7 more | 2025-01-07 | |
| 12175963 | Synthesis of speech from text in a voice of a target speaker using neural networks | Ye Jia, Zhifeng Chen, Jonathan Shen, Ruoming Pang, Ron J. Weiss +5 more | 2024-12-24 | $142,724,000 |
| 12170667 | Fast access to local area network (LAN) graphical user interface (GUI) by client device | — | 2024-12-17 | |
| 12148444 | Synthesizing speech from text using neural networks | Jonathan Shen, Ruoming Pang, Ron J. Weiss, Michael Schuster, Navdeep Jaitly +7 more | 2024-11-19 | $89,094,000 |
| 12112198 | Asynchronous distributed data flow for machine learning workloads | Jeffrey Adgate Dean, Sudip Roy, Michael Isard, Aakanksha Chowdhery, Brennan Saeta +9 more | 2024-10-08 | $117,740,000 |
| 12106749 | Speech recognition with sequence-to-sequence models | Rohit Prakash Prabhavalkar, Zhifeng Chen, Bo Li, Chung-Cheng Chiu, Kanury Kanishka Rao +8 more | 2024-10-01 | $125,607,000 |
| 12100382 | Text-to-speech using duration prediction | Yu Zhang, Isaac Elias, Byungha Chun, Ye Jia, Mike Chrzanowski +1 more | 2024-09-24 | $174,759,000 |
| 12094453 | Fast emit low-latency streaming ASR with sequence-level emission regularization utilizing forward and backward probabilities between nodes of an alignment lattice | Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath +5 more | 2024-09-17 | $108,260,000 |
| 12087273 | Multilingual speech synthesis and cross-language voice cloning | Yu Zhang, Ron J. Weiss, Byungha Chun, Zhifeng Chen, Russell John Wyatt Skerry-Ryan +3 more | 2024-09-10 | $77,150,000 |
| 12079703 | Convolution-augmented transformer models | Anmol Gulati, Ruoming Pang, Niki Parmar, Jiahui Yu, Wei Han +5 more | 2024-09-03 | $114,566,000 |
| 12047445 | Application interface migration system and method, and related device | Fei Sun, Litao Yu | 2024-07-23 | |
| 12032920 | Direct speech-to-speech translation via machine learning | Ye Jia, Zhifeng Chen, Melvin Johnson, Fadi Biadsy, Ron J. Weiss +1 more | 2024-07-09 | $110,555,000 |
| 12020685 | Phonemes and graphemes for neural text-to-speech | Ye Jia, Byungha Chun, Yu Zhang, Jonathan Shen | 2024-06-25 | $162,704,000 |
| 11922932 | Minimum word error rate training for attention-based sequence-to-sequence models | Rohit Prakash Prabhavalkar, Tara N. Sainath, Patrick Nguyen, Zhifeng Chen, Chung-Cheng Chiu +1 more | 2024-03-05 | $62,822,000 |
| 11908448 | Parallel tacotron non-autoregressive and controllable TTS | Isaac Elias, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss +1 more | 2024-02-20 | $78,256,000 |
| 11900915 | Multi-dialect and multilingual speech recognition | Zhifeng Chen, Bo Li, Eugene Weinstein, Pedro J. Moreno Mengibar, Ron J. Weiss +3 more | 2024-02-13 | $89,356,000 |
| 11862142 | End-to-end text-to-speech conversion | Samuel Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Ioannis Agiomyrgiannakis +7 more | 2024-01-02 | $82,860,000 |
| 11848002 | Synthesis of speech from text in a voice of a target speaker using neural networks | Ye Jia, Zhifeng Chen, Jonathan Shen, Ruoming Pang, Ron J. Weiss +5 more | 2023-12-19 | $133,110,000 |