Chinese Natural Speech Complex Emotion Dataset

Name: Chinese Natural Speech Complex Emotion Dataset
Creator: Xinjiang University; Mingxing Xu; Tsinghua University; Xiaolong Wu
Published: 2025-02-24 00:00:00
License: 暂无描述

科学数据银行2025-02-24 更新2026-04-23 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=394f27fbc9014cd486951b770fdefa10

下载链接

链接失效反馈

官方服务：

资源简介：

Although Chinese speech affective computing has received increasing attention, existing datasets still have defects such as lack of naturalness, single pronunciation style, and unreliable annotation, which seriously hinder the research in this field. To address these issues, this paper introduces the first Chinese Natural Speech Complex Emotion Dataset (CNSCED) to provide natural data resources for Chinese speech affective computing. CNSCED was collected from publicly broadcasted civil dispute and interview television programs in China, reflecting the authentic emotional characteristics of Chinese people in daily life. The dataset includes 14 hours of speech data from 454 speakers of various ages, totaling 15777 samples. Based on the inherent complexity and ambiguity of natural emotions, this paper proposes an emotion vector annotation method. This method utilizes a vector composed of six meta-emotional dimensions (angry, sad, aroused, happy, surprise, and fear) of different intensities to describe any single or complex emotion. The CNSCED released two subtasks: complex emotion classification and complex emotion intensity regression. In the experimental section, we evaluated the CNSCED dataset using deep neural network models and provided a baseline result. To the best of our knowledge, CNSCED is the first public Chinese natural speech complex emotion dataset, which can be used for scientific research free of charge.

提供机构：

Xinjiang University; Mingxing Xu; Tsinghua University; Xiaolong Wu

创建时间：

2025-02-14

5,000+

优质数据集

54 个

任务类型

进入经典数据集