five

Dusha

收藏
arXiv2022-12-23 更新2024-06-21 收录
下载链接:
https://github.com/salute-developers/golos/tree/master/dusha
下载链接
链接失效反馈
官方服务:
资源简介:
Dusha数据集是由俄罗斯的Sber机构创建,专注于语音情感识别任务,包含约350小时的音频数据,超过30万条俄语语音及其转录文本。该数据集通过众包平台进行标注,分为表演和真实生活两个子集,前者用于模型预训练,后者用于微调和验证。Dusha数据集旨在通过其丰富的情感表达和多样的语音来源,解决现有数据集在情感识别上的局限性,特别是在面对新说话者时的表现。

The Dusha dataset was developed by Sber, a Russian institution, and focuses on the speech emotion recognition task. It contains approximately 350 hours of audio data, along with over 300,000 Russian speech utterances and their corresponding transcriptions. Annotated via crowdsourcing platforms, the dataset is split into two subsets: the acted subset and the real-life subset. The acted subset is designed for model pre-training, while the real-life subset is reserved for fine-tuning and validation. The Dusha dataset aims to address the limitations of existing speech emotion recognition datasets, especially their performance when dealing with unseen speakers, by providing rich emotional expressions and diverse speech sources.
提供机构:
Sber, 俄罗斯
创建时间:
2022-12-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作