Bibliographie complète
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
Type de ressource
Preprint
Auteurs/contributeurs
- Gong, Cheng (Author)
- Cooper, Erica (Author)
- Wang, Xin (Author)
- Qiang, Chunyu (Author)
- Geng, Mengzhe (Author)
- Wells, Dan (Author)
- Wang, Longbiao (Author)
- Dang, Jianwu (Author)
- Tessier, Marc (Author)
- Pine, Aidan (Author)
- Richmond, Korin (Author)
- Yamagishi, Junichi (Author)
Title
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
Abstract
Self-supervised learning (SSL) representations from massively multilingual models offer a promising solution for low-resource language speech tasks. Despite advancements, language adaptation in TTS systems remains an open problem. This paper explores the language adaptation capability of ZMM-TTS, a recent SSL-based multilingual TTS system proposed in our previous work. We conducted experiments on 12 languages using limited data with various fine-tuning configurations. We demonstrate that the similarity in phonetics between the pre-training and target languages, as well as the language category, affects the target language's adaptation performance. Additionally, we find that the fine-tuning dataset size and number of speakers influence adaptability. Surprisingly, we also observed that using paired data for fine-tuning is not always optimal compared to audio-only data. Beyond speech intelligibility, our analysis covers speaker similarity, language identification, and predicted MOS.
Repository
arXiv
Archive ID
arXiv:2406.08911
Date
2024-06-13
Accessed
02/10/2024 12:09
Library Catalog
Extra
arXiv:2406.08911 [cs, eess]
Référence
Gong, C., Cooper, E., Wang, X., Qiang, C., Geng, M., Wells, D., Wang, L., Dang, J., Tessier, M., Pine, A., Richmond, K., & Yamagishi, J. (2024). An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios (No. arXiv:2406.08911). arXiv. https://doi.org/10.48550/arXiv.2406.08911
Tâche
Lien vers cette notice