| 12400638 |
Using aligned text and speech representations to train automatic speech recognition models without transcribed speech data |
Zhehuai Chen, Ankur Bapna, Yu Zhang, Bhuvana Ramabhadran |
2025-08-26 |
| 12272363 |
Advancing the use of text and speech in ASR pretraining with consistency and contrastive losses |
Zhehuai Chen, Bhuvana Ramabhadran, Pedro J. Moreno Mengibar, Yuan-Fang Wang, Yu Zhang |
2025-04-08 |
| 12230249 |
Supervised and unsupervised training with contrastive loss over sequences |
Bhuvana Ramabhadran, Zhehuai Chen, Yuan-Fang Wang, Yu Zhang, Jesse Emond |
2025-02-18 |
| 12190862 |
Using non-parallel voice conversion for speech conversion models |
Gary Wang, Bhuvana Ramabhadran, Fadi Biadsy |
2025-01-07 |
| 12159617 |
Injecting text in self-supervised speech pre-training |
Zhehuai Chen, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno Mengibar |
2024-12-03 |
| 12087272 |
Training speech synthesis to generate distinct speech sounds |
Bhuvana Ramabhadran, Fadi Biadsy, Yu Zhang |
2024-09-10 |
| 12087273 |
Multilingual speech synthesis and cross-language voice cloning |
Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen +3 more |
2024-09-10 |
| 11990117 |
Using speech recognition to improve cross-language speech synthesis |
Zhehuai Chen, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno Mengibar |
2024-05-21 |
| 11929060 |
Consistency prediction on streaming sequence models |
Zhehuai Chen, Bhuvana Ramabhadran, Pedro J. Moreno Mengibar |
2024-03-12 |
| 11837216 |
Speech recognition using unspoken text and speech synthesis |
Zhehuai Chen, Bhuvana Ramabhadran, Pedro J. Moreno Mengibar |
2023-12-05 |
| 11823697 |
Improving speech recognition with speech synthesis-based model adapation |
Bhuvana Ramabhadran |
2023-11-21 |
| 11676572 |
Instantaneous learning in text-to-speech during dialog |
Vijayaditya Peddinti, Bhuvana Ramabhadran, Mateusz Golebiewski |
2023-06-13 |
| 11605368 |
Speech recognition using unspoken text and speech synthesis |
Zhehuai Chen, Bhuvana Ramabhadran, Pedro J. Moreno Mengibar |
2023-03-14 |
| 11580952 |
Multilingual speech synthesis and cross-language voice cloning |
Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen +3 more |
2023-02-14 |
| 11475874 |
Generating diverse and natural text-to-speech samples |
Yu Zhang, Bhuvana Ramabhadran, Yonghui Wu, Byungha Chun, Ron J. Weiss +1 more |
2022-10-18 |
| 11335324 |
Synthesized data augmentation using voice conversion and speech recognition models |
Fadi Biadsy, Liyang Jiang, Pedro J. Moreno Mengibar |
2022-05-17 |
| 11222620 |
Speech recognition using unspoken text and speech synthesis |
Zhehuai Chen, Bhuvana Ramabhadran, Pedro J. Moreno Mengibar |
2022-01-11 |
| 9093067 |
Generating prosodic contours for synthesized speech |
Martin Jansche, Michael Dennis Riley, Terry Tai |
2015-07-28 |
| 8321225 |
Generating prosodic contours for synthesized speech |
Martin Jansche, Michael Dennis Riley, Terry Tai |
2012-11-27 |