| 11875809 |
Speech denoising via discrete representation learning |
Zhao Song |
2024-01-16 |
| 11869483 |
Unsupervised alignment for text to speech synthesis using neural networks |
Kevin Shih, Jose Rafael Valle Gomes da Costa, Rohan Badlani, Adrian Lancucki, Bryan Catanzaro |
2024-01-09 |
| 11769481 |
Unsupervised alignment for text to speech synthesis using neural networks |
Kevin Shih, Jose Rafael Valle Gomes da Costa, Rohan Badlani, Adrian Lancucki, Bryan Catanzaro |
2023-09-26 |
| 11651763 |
Multi-speaker neural text-to-speech |
Sercan Omer Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng +2 more |
2023-05-16 |
| 11521592 |
Small-footprint flow-based models for raw audio |
Kainan Peng, Kexin Zhao, Zhao Song |
2022-12-06 |
| 11482207 |
Waveform generation using end-to-end text-to-waveform system |
Kainan Peng, Jitong Chen |
2022-10-25 |
| 11238843 |
Systems and methods for neural voice cloning with a few samples |
Sercan Omer Arik, Jitong Chen, Kainan Peng, Yanqi Zhou |
2022-02-01 |
| 11138964 |
Inaudible watermark enabled text-to-speech framework |
Zhenyu Zhong, Yueqiang Cheng, Xing Li, Tao Wei |
2021-10-05 |
| 11017761 |
Parallel neural text-to-speech |
Kainan Peng, Zhao Song, Kexin Zhao |
2021-05-25 |
| 10896669 |
Systems and methods for multi-speaker neural text-to-speech |
Sercan Omer Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng +2 more |
2021-01-19 |
| 10872596 |
Systems and methods for parallel wave generation in end-to-end text-to-speech |
Kainan Peng, Jitong Chen |
2020-12-22 |
| 10796686 |
Systems and methods for neural text-to-speech using convolutional sequence learning |
Sercan Omer Arik, Kainan Peng, Sharan Narang, Ajay Kannan, Andrew Gibiansky +2 more |
2020-10-06 |