Chinese Aligned Multimodal Sentiment Dataset v1.0

DataCite Commons | Updated: 2025-08-29 | Indexed: 2026-05-05
Download link:
https://www.scidb.cn/detail?dataSetId=c692036429f7444b8e8369182c2f3a0f
Description:
This dataset is built for multimodal sentiment analysis in Chinese. It contains natural conversational and interview scenarios and covers three modalities: text, audio, and vision. All original video and audio data have been de-identified; only sentence-level aligned feature vectors and the corresponding sentiment annotations are provided.
The dataset consists of 2,281 Chinese utterances. Each sample includes:
Text modality: the [CLS] vector extracted by BERT-base-chinese (768 dimensions);
Audio modality: prosodic and spectral features extracted using COVAREP, aggregated by temporal mean pooling into a 74-dimensional vector;
Vision modality: Action Unit (AU), eye gaze, and head pose features extracted with Facet, mean-pooled into a 35-dimensional vector;
Sentiment intensity label: a value in [-3, +3], where negative values denote negative emotions, positive values denote positive emotions, and larger absolute values indicate stronger intensity.
The dataset also provides train/validation/test splits so that experiments can be reproduced. It is suitable for research on multimodal sentiment modeling and confidence-aware fusion methods, and for training and evaluating related machine learning and deep learning models.
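The per-sample layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the listing does not say how the files are stored, so the `Sample` class, its field names, and the helper functions are hypothetical. Only the dimensions (768, 74, 35) and the label range [-3, +3] come from the description. Treating a label of exactly 0 as "neutral" is also an assumption, since the description defines only the negative and positive sides.

```python
from dataclasses import dataclass
from typing import List

# Feature dimensions as stated in the dataset description.
TEXT_DIM, AUDIO_DIM, VISION_DIM = 768, 74, 35


@dataclass
class Sample:
    """One utterance. Field names are hypothetical, not the dataset's own."""
    text: List[float]    # BERT-base-chinese [CLS] vector
    audio: List[float]   # mean-pooled COVAREP features
    vision: List[float]  # mean-pooled Facet features
    label: float         # sentiment intensity in [-3, +3]

    def validate(self) -> None:
        """Raise ValueError if a vector length or the label is out of spec."""
        expected = (("text", self.text, TEXT_DIM),
                    ("audio", self.audio, AUDIO_DIM),
                    ("vision", self.vision, VISION_DIM))
        for name, vec, dim in expected:
            if len(vec) != dim:
                raise ValueError(f"{name}: expected {dim} values, got {len(vec)}")
        if not -3.0 <= self.label <= 3.0:
            raise ValueError(f"label {self.label} is outside [-3, +3]")


def early_fusion(sample: Sample) -> List[float]:
    """Concatenate the three modality vectors (a simple fusion baseline)."""
    return sample.text + sample.audio + sample.vision


def polarity(label: float) -> str:
    """Map an intensity score to a coarse class; 0 -> 'neutral' is an assumption."""
    if label > 0:
        return "positive"
    if label < 0:
        return "negative"
    return "neutral"
```

Concatenation is used here only because it is the simplest way to combine sentence-level vectors; it yields 768 + 74 + 35 = 877 values per utterance. A confidence-aware fusion method would replace `early_fusion` with something that weights each modality.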
Provider:
Science Data Bank
Created:
2025-08-29