Ekspresivni viŔejezični sintetizator govora - PhDData

Access database of worldwide thesis




Ekspresivni viŔejezični sintetizator govora

The thesis was published by Nosek Tijana, in September 2023, University of Novi Sad.

Abstract:

The aim of this thesis is to investigate the possibility of synthesizing speech in the voice of a speaker in a language which he had never spoken. Multilanguage models are created, both for the languages whose databases are annotated using the same conventions, and for the languages whose databases are annotated using different conventions, which includes the Serbian language. Regarding quality of synthesized speech, some models even surpass the quality of synthesis produced by standard monolanguage models. Beside architecture for multilanguage models, а method for adaptation of such models to the data of a new speaker is proposed. The proposed method of adaptation enables fast and simple production of new voices, while preserving the possibility to synthesize speech in any language supported by the model, regardless of the target speaker’s original language.



Read the last PhD tips