five

Listening test materials for "A study of speaker adaptation for DNN-based speech synthesis"

收藏
DataCite Commons2023-04-27 更新2025-04-17 收录
下载链接:
https://datashare.ed.ac.uk/handle/10283/792
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset contains the testing stimuli and listeners' MUSHRA test responses for the Interspeech 2015 paper, "A study of speaker adaptation for DNN-based speech synthesis". In this paper, we conduct an experimental analysis of speaker adaptation for Deep Neural Network (DNN) based speech synthesis at different levels. In particular, we augment a low-dimensional speaker-specific vector with linguistic features as input to represent speaker identity, perform model adaptation to scale the hidden activation weights, and perform a feature space transformation at the output layer to modify generated acoustic features. We systematically analyse the performance of each individual adaptation technique and that of their combinations. Experimental results confirm the adaptability of the DNN, and listening tests demonstrate that the DNN can achieve significantly better adaptation performance than the hidden Markov model (HMM) baseline in terms of naturalness and speaker similarity.
提供机构:
University of Edinburgh. The Centre for Speech Technology Research (CSTR)
创建时间:
2015-06-10
二维码
社区交流群
二维码
科研交流群
商业服务