Rytis Maskeliūnas1, Robertas Damaševičius1,*, Audrius Kulikajevas1, Kipras Pribuišis2, Nora Ulozaitė-Stanienė2, Virgilijus Uloza2
CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.3, pp. 4203-4223, 2025, DOI:10.32604/cmes.2025.072790
- 23 December 2025
Abstract This study introduces a novel voice cloning framework driven by Mordukhovich Subdifferential Optimization (MSO) to address the complex multi-objective challenges of pathological speech synthesis in Lithuanian, an under-resourced language with unique phonemes not present in most pre-trained models. Unlike existing voice synthesis models that often optimize for a single objective or are restricted to major languages, our approach explicitly balances four competing criteria: speech naturalness, speaker similarity, computational efficiency, and adaptability to pathological voice patterns. We evaluate four model configurations combining Lithuanian and English encoders, synthesizers, and vocoders. The hybrid model (English encoder, Lithuanian synthesizer, English …