five

BC-AC Multimodal Emotional Speech Dataset (Partial Release)

收藏
IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/bc-ac-multimodal-emotional-speech-dataset-partial-release
下载链接
链接失效反馈
官方服务:
资源简介:
To support research on multimodal speech emotion recognition (SER), we developed a dual-channel emotional speech database featuring synchronized recordings of bone-conducted (BC) and air-conducted (AC) speech. The recordings were conducted in a professionally treated anechoic chamber with 100 gender-balanced volunteers. AC speech was captured via a digital microphone on the left channel, while BC speech was recorded from an in-ear BC microphone on the right channel, both at a 44.1 kHz sampling rate to ensure high-fidelity audio. Each participant completed a 60-minute session consisting of two phases: emotion induction and expression. Participants first watched emotion-inducing video clips and reported their emotional state using a discrete label and a 0–5 intensity scale. They then read predefined prompts while maintaining the induced emotion, generating labeled utterances. The speech material included 480 sentences (120 per emotion: angry, happy, neutral, sad) selected from a standardized iFLYTEK corpus. Emotional and neutral utterances were alternated to minimize cognitive fatigue, and the material was evenly divided into four subsets to maintain emotional balance. Manual perceptual evaluation yielded recognition accuracies of 93.56% for emotional and 82.38% for neutral utterances, confirming data quality. After segmentation, the final dataset contains 24,191 labeled utterances.   
提供机构:
Zhao, Shujie
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作