Interpretable deep learning model for the detection and reconstruction of dysarthric speech D Korzekwa, R Barra-Chicote, B Kostek, T Drugman, M Lajszczak arXiv preprint arXiv:1907.04743, 2019 | 37 | 2019 |
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data M Łajszczak, G Cámbara, Y Li, F Beyhan, A van Korlaar, F Yang, A Joly, ... arXiv preprint arXiv:2402.08093, 2024 | 36 | 2024 |
In other news: A bi-style text-to-speech model for synthesizing newscaster voice with limited data N Prateek, M Łajszczak, R Barra-Chicote, T Drugman, J Lorenzo-Trueba, ... arXiv preprint arXiv:1904.02790, 2019 | 33 | 2019 |
Simple and effective multi-sentence TTS with expressive and coherent prosody P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ... arXiv preprint arXiv:2206.14643, 2022 | 16 | 2022 |
CopyCat2: A single model for multi-speaker TTS and many-to-many fine-grained prosody transfer S Karlapati, P Karanasou, M Lajszczak, A Abbas, A Moinet, P Makarov, ... arXiv preprint arXiv:2206.13443, 2022 | 14 | 2022 |
Distribution augmentation for low-resource expressive text-to-speech M Lajszczak, A Prasad, A Van Korlaar, B Bollepalli, A Bonafonte, A Joly, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Mapache: Masked Parallel Transformer for Advanced Speech Editing and Synthesis G Cámbara, PL Tobing, M Babianski, R Vipperla, DWR Shmelkin, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Controllable Emphasis with zero data for text-to-speech A Joly, M Nicolis, E Peterova, A Lombardi, A Abbas, A van Korlaar, ... arXiv preprint arXiv:2307.07062, 2023 | 2 | 2023 |
Discrete acoustic space for an efficient sampling in neural text-to-speech M Strong, J Rohnke, A Bonafonte, M Łajszczak, T Wood arXiv preprint arXiv:2110.12539, 2021 | 2 | 2021 |
Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations Á Martín-Cortinas, D Sáez-Trigueros, I Vallés-Pérez, B Tura-Vecino, ... arXiv preprint arXiv:2402.03407, 2024 | 1 | 2024 |
Mapache: Masked parallel transformer for advanced speech editing and synthesis GC Ruiz, P Tobing, M Babianski, R chander Vipperla, D Wang, ... | | 2024 |