MuSe-Sent: Multimodal Sentiment Classification in-the-Wild (MuSe2021)

Mendeley Data2024-05-10 更新2024-06-28 收录

下载链接：

https://zenodo.org/records/4654371

下载链接

链接失效反馈

官方服务：

资源简介：

MuSe-Sent of the 2nd Multimodal Sentiment in-the-Wild Challenge! Predicting five advanced intensity classes for each of the emotional dimensions (valence, arousal) for segments of audio-video-text data. This package includes only MuSe-Sent features (all partitions) and labels of the training and development set (test scoring via the MuSe website). More: https://www.muse-challenge.org/muse2021 General: The purpose of the Multimodal Sentiment Analysis in Real-life media Challenge and Workshop (MuSe) is to bring together communities from different disciplines. We introduce the novel dataset MuSe-CAR that covers the range of aforementioned desiderata. MuSe-CAR is a large (>36h), multimodal dataset which has been gathered in-the-wild with the intention of further understanding Multimodal Sentiment Analysis in-the-wild, e.g., the emotional engagement that takes place during product reviews (i.e., automobile reviews) where a sentiment is linked to a topic or entity. We have designed MuSe-CAR to be of high voice and video quality, as informative video social media content, as well as everyday recording devices have improved in recent years. This enables robust learning, even with a high degree of novel, in-the-wild characteristics, for example as related to: i) Video: Shot size (a mix of close-up, medium, and long shots), face-angle (side, eye, low, high), camera motion (free, free but stable, and free but unstable, switch, e.g., zoom, fixed), reviewer visibility (full body, half-body, face only, and hands only), highly varying backgrounds, and people interacting with objects (car parts). ii) Audio: Ambient noises (car noises, music), narrator and host diarisation, diverse microphone types, and speaker locations. iii) Text: Colloquialisms, and domain-specific terms.

创建时间：

2023-06-28

搜集汇总

数据集介绍

背景与挑战

背景概述

MuSe-Sent是MuSe2021挑战赛的多模态情感分类数据集，用于预测音频-视频-文本数据片段中情感维度（效价和唤醒度）的强度类别。该数据集基于大型、高质量的MuSe-CAR“在野”数据集，收集自真实汽车评论，涵盖多样化的视频、音频和文本特征，适用于真实场景下的多模态情感分析研究。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集