Emotionally Incongruent Synthetic Speech Dataset (EMIS)
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/emotionally-incongruent-synthetic-speech-dataset-emis
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains 1248 speech audio samples synthetically generated by Text-to-speech systems. The audios are emotionally incongruent between transcription and voice tone. To generate each speech sample, we leverage emotion-rich sentences divided into four distinct emotions: angry, happy, neutral, and sad. For each sentence, we employ three different TTS systems to generate speech in the same four different emotions, thus resulting in three emotionally incongruent speech samples per sentence. Unlike standard emotional speech samples that are used to train and test emotion recognition systems, this dataset provided incongruency between the sentiment present in the tone of the voice and that present in the transcription of the sample.
提供机构:
Paula Costa; João Lima; Lucas Ueda; Pedro Corrêa; Victor Moreno



