| 12400633 |
End-to-end speech waveform generation through data density gradient estimation |
Mohammad Norouzi, Nanxin Chen, Ron J. Weiss, William Chan, Yu Zhang +1 more |
2025-08-26 |
| 12327544 |
Two-level speech prosody transfer |
Lev Finkelstein, Chun-an Chan, Ye Jia, Yu Zhang, Robert Andrew James Clark +1 more |
2025-06-10 |
| 12260851 |
Two-level text-to-speech systems using synthetic training data |
Lev Finkelstein, Chun-an Chan, Norman Casagrande, Yu Zhang, Robert Andrew James Clark +1 more |
2025-03-25 |
| 12249315 |
Unsupervised parallel tacotron non-autoregressive and controllable text-to-speech |
Isaac Elias, Jonathan Shen, Ye Jia, Yu Zhang, Yonghui Wu |
2025-03-11 |
| 12100382 |
Text-to-speech using duration prediction |
Yu Zhang, Isaac Elias, Ye Jia, Yonghui Wu, Mike Chrzanowski +1 more |
2024-09-24 |
| 12087273 |
Multilingual speech synthesis and cross-language voice cloning |
Yu Zhang, Ron J. Weiss, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan +3 more |
2024-09-10 |
| 12020685 |
Phonemes and graphemes for neural text-to-speech |
Ye Jia, Yu Zhang, Jonathan Shen, Yonghui Wu |
2024-06-25 |
| 11908448 |
Parallel tacotron non-autoregressive and controllable TTS |
Isaac Elias, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss +1 more |
2024-02-20 |
| 11823656 |
Unsupervised parallel tacotron non-autoregressive and controllable text-to-speech |
Isaac Elias, Jonathan Shen, Ye Jia, Yu Zhang, Yonghui Wu |
2023-11-21 |
| 11580952 |
Multilingual speech synthesis and cross-language voice cloning |
Yu Zhang, Ron J. Weiss, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan +3 more |
2023-02-14 |
| 11514888 |
Two-level speech prosody transfer |
Lev Finkelstein, Chun-an Chan, Ye Jia, Yu Zhang, Robert Andrew James Clark +1 more |
2022-11-29 |
| 11475874 |
Generating diverse and natural text-to-speech samples |
Yu Zhang, Bhuvana Ramabhadran, Andrew Rosenberg, Yonghui Wu, Ron J. Weiss +1 more |
2022-10-18 |
| 11335321 |
Building a text-to-speech system from a small amount of speech data |
Ye Jia, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo +4 more |
2022-05-17 |
| 9905220 |
Multilingual prosody generation |
Javier Gonzalvo Fructuoso, Andrew W. Senior |
2018-02-27 |
| 9195656 |
Multilingual prosody generation |
Javier Gonzalvo Fructuoso, Andrew W. Senior |
2015-11-24 |
| 8527276 |
Speech synthesis using deep neural networks |
Andrew W. Senior, Michael Schuster |
2013-09-03 |
| 8438029 |
Confidence tying for unsupervised synthetic speech adaptation |
Matthew Nicholas Stuttle |
2013-05-07 |