five

RAVDESS Emotional speech audio

收藏
www.kaggle.com2019-01-19 更新2025-03-23 收录
下载链接:
https://www.kaggle.com/uwrfkaggler/ravdess-emotional-speech-audio
下载链接
链接失效反馈
官方服务:
资源简介:
**Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)** ------------ Speech audio-only files (16bit, 48kHz .wav) from the RAVDESS. Full dataset of speech and song, audio and video (24.8 GB) available from [Zenodo][1]. Construction and perceptual validation of the RAVDESS is described in our Open Access [paper in PLoS ONE][2]. Check out our [Kaggle Song emotion dataset][3]. **Files** This portion of the RAVDESS contains 1440 files: 60 trials per actor x 24 actors = 1440. The RAVDESS contains 24 professional actors (12 female, 12 male), vocalizing two lexically-matched statements in a neutral North American accent. Speech emotions includes calm, happy, sad, angry, fearful, surprise, and disgust expressions. Each expression is produced at two levels of emotional intensity (normal, strong), with an additional neutral expression. **File naming convention** Each of the 1440 files has a unique filename. The filename consists of a 7-part numerical identifier (e.g., 03-01-06-01-02-01-12.wav). These identifiers define the stimulus characteristics: *Filename identifiers* - Modality (01 = full-AV, 02 = video-only, 03 = audio-only). - Vocal channel (01 = speech, 02 = song). - Emotion (01 = neutral, 02 = calm, 03 = happy, 04 = sad, 05 = angry, 06 = fearful, 07 = disgust, 08 = surprised). - Emotional intensity (01 = normal, 02 = strong). NOTE: There is no strong intensity for the 'neutral' emotion. - Statement (01 = "Kids are talking by the door", 02 = "Dogs are sitting by the door"). - Repetition (01 = 1st repetition, 02 = 2nd repetition). - Actor (01 to 24. Odd numbered actors are male, even numbered actors are female). *Filename example: 03-01-06-01-02-01-12.wav* 1. Audio-only (03) 2. Speech (01) 3. Fearful (06) 4. Normal intensity (01) 5. Statement "dogs" (02) 6. 1st Repetition (01) 7. 12th Actor (12) Female, as the actor ID number is even. **How to cite the RAVDESS** *Academic citation* If you use the RAVDESS in an academic publication, please use the following citation: Livingstone SR, Russo FA (2018) The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English. PLoS ONE 13(5): e0196391. https://doi.org/10.1371/journal.pone.0196391. *All other attributions* If you use the RAVDESS in a form other than an academic publication, such as in a blog post, school project, or non-commercial product, please use the following attribution: "The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)" by Livingstone & Russo is licensed under CC BY-NA-SC 4.0. [1]: https://zenodo.org/record/1188976 [2]: https://doi.org/10.1371/journal.pone.0196391 [3]: https://www.kaggle.com/uwrfkaggler/ravdess-emotional-song-audio

雷斯顿情感语音与歌曲视听数据库(RAVDESS)。该数据库包含仅语音的音频文件(16位,48kHz .wav格式),完整数据集包括语音和歌曲、音频和视频(24.8 GB),可从[Zenodo](https://zenodo.org/record/1188976)获取。RAVDESS的构建和感知验证在我们在PLoS ONE上发表的开放获取[论文](https://doi.org/10.1371/journal.pone.0196391)中有详细描述。 敬请参阅我们的[Kaggle歌曲情感数据集](https://www.kaggle.com/uwrfkaggler/ravdess-emotional-song-audio)。 **文件** RAVDESS的此部分包含1440个文件:每位演员60个试验 x 24位演员 = 1440。RAVDESS包含24位专业演员(12位女性,12位男性),以中性北美口音发音两个词汇上匹配的陈述。语音情感包括平静、快乐、悲伤、愤怒、恐惧、惊讶和厌恶的表达。每种表情以两种情感强度级别(正常、强烈)呈现,另附一个中性表情。 **文件命名规范** 这1440个文件中的每一个都有一个唯一的文件名。文件名由7部分数字标识符组成(例如,03-01-06-01-02-01-12.wav)。这些标识符定义了刺激特征: *文件标识符* - 模式(01 = 全视听,02 = 视频仅,03 = 音频仅)。 - 语音通道(01 = 语音,02 = 歌曲)。 - 情感(01 = 中性,02 = 平静,03 = 快乐,04 = 悲伤,05 = 愤怒,06 = 恐惧,07 = 厌恶,08 = 惊讶)。 - 情感强度(01 = 正常,02 = 强烈)。注意:'中性'情感没有强烈强度。 - 陈述(01 = "孩子们在门口说话
提供机构:
www.kaggle.com
搜集汇总
数据集介绍
main_image_url
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作