Chinese Aligned Multimodal Sentiment Dataset v1.0

Name: Chinese Aligned Multimodal Sentiment Dataset v1.0
Creator: Science Data Bank
Published: 2025-08-29 03:36:55
License: 暂无描述

DataCite Commons2025-08-29 更新2026-05-05 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=c692036429f7444b8e8369182c2f3a0f

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset is constructed for multimodal sentiment analysis in Chinese context. It contains natural conversational and interview scenarios, covering three modalities: text, audio, and vision. All original video and audio data have been de-identified; only sentence-level aligned feature vectors and corresponding sentiment annotations are provided.The dataset consists of 2,281 Chinese utterances, and each sample includes:Text modality: [CLS] vector extracted by BERT-base-chinese (768 dimensions);Audio modality: Prosodic and spectral features extracted using COVAREP, aggregated by temporal mean pooling into a 74-dimensional vector;Vision modality: Action Units (AU), eye gaze and head pose features extracted with Facet, mean-pooled into a 35-dimensional vector;Sentiment intensity label: ranged in [-3, +3], where negative values denote negative emotions, positive values denote positive emotions, and larger absolute values indicate stronger intensity.The dataset also provides train/validation/test splits to ensure reproducibility of experiments. It is suitable for research on multimodal sentiment modeling, confidence-aware fusion methods, and training and evaluation of related machine learning and deep learning models.

提供机构：

Science Data Bank

创建时间：

2025-08-29

5,000+

优质数据集

54 个

任务类型

进入经典数据集