emolia_top_1000_subsets
Source: ModelScope community (魔搭社区) · Updated 2025-12-05 · Indexed 2025-12-06
Download link:
https://modelscope.cn/datasets/laion/emolia_top_1000_subsets
Description:
# emolia_top_1000_subsets
This dataset contains tar archives of top-1000 subsets derived from the LAION EMOLIA audio emotion dataset. For each EMOLIA annotation category, we selected the 1000 clips with the highest scores and re-annotated them with Gemini 2.5 Flash. The resulting captions focus strongly on foreground versus background sounds, environmental and background noise, speaker emotion, and stable speaker attributes. These subsets provide compact, high-quality material for research on audio captioning, emotion recognition, and robust audio scene understanding.
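The per-category selection described above can be illustrated with a minimal sketch. This is not the authors' actual pipeline: the record layout `(clip_id, category, score)`, the function name `top_k_per_category`, and the category names in the usage note are all hypothetical, chosen only to show what "the 1000 highest-scoring clips per category" means in code.

```python
import heapq
from collections import defaultdict


def top_k_per_category(records, k=1000):
    """Return the ids of the k highest-scoring clips for each category.

    `records` is an iterable of (clip_id, category, score) tuples.
    This layout is an assumption made for illustration; the real
    EMOLIA annotation format may differ.
    """
    # Group (score, clip_id) pairs under their annotation category.
    by_category = defaultdict(list)
    for clip_id, category, score in records:
        by_category[category].append((score, clip_id))

    # Keep the k largest scores per category, ordered from highest to lowest.
    return {
        category: [
            clip_id
            for _, clip_id in heapq.nlargest(k, scored, key=lambda pair: pair[0])
        ]
        for category, scored in by_category.items()
    }
```

With made-up records such as `[("a", "joy", 0.9), ("b", "joy", 0.1), ("c", "joy", 0.5)]` and `k=2`, the function keeps clips `a` and `c` for the hypothetical category `joy`. A clip that scores highly in several categories can appear in more than one subset under this scheme.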
Provider:
maas
Created:
2025-12-02



