| Patent No. | Title | Inventors | Date |
|---|---|---|---|
| 12400633 | End-to-end speech waveform generation through data density gradient estimation | Byungha Chun, Mohammad Norouzi, Nanxin Chen, Ron J. Weiss, William Chan +1 more | 2025-08-26 |
| 12373666 | Convolution-augmented transformer models | Anmol Gulati, Weikeng Qin, Zhengdong Zhang, Ruoming Pang, Niki Parmar +5 more | 2025-07-29 |
| 12353981 | Training of large neural networks | Slav Petrov, Andrew M. Dai, David Richard So, Dmitry Lepikhin, Erica Ann Moreira +19 more | 2025-07-08 |
| 12282857 | Relative margin for contrastive learning | Siyuan Qiao, Chenxi Liu, Jiahui Yu | 2025-04-22 |
| 12254865 | Multi-dialect and multilingual speech recognition | Zhifeng Chen, Bo Li, Eugene Weinstein, Pedro J. Moreno Mengibar, Ron J. Weiss +3 more | 2025-03-18 |
| 12249315 | Unsupervised parallel tacotron non-autoregressive and controllable text-to-speech | Isaac Elias, Byungha Chun, Jonathan Shen, Ye Jia, Yu Zhang | 2025-03-11 |
| 12222994 | Quick application startup method and related apparatus | Litao Yu, Fei Sun, Guoqiang Li | 2025-02-11 |
| 12190860 | End-to-end text-to-speech conversion | Samuel Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Ioannis Agiomyrgiannakis +7 more | 2025-01-07 |