five

EzAudioCaps

收藏
魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/OpenSound/EzAudioCaps
下载链接
链接失效反馈
官方服务:
资源简介:
1. Prepare following dataset: AudioSet: https://research.google.com/audioset/ AudioSet Strongly Labeled Subset: https://research.google.com/audioset/download_strong.html VggSound: https://www.robots.ox.ac.uk/~vgg/data/vggsound/ AudioCaps: https://audiocaps.github.io/ 2. The audio file name format: AudioSet data: {ytb_id}\_{start_second\*1000}\_{end_second\*1000} AudioSet Strongly Labeled Subset: {ytb_id}_{start_second\*1000} (10s clip from the the start second) VggSound: {ytb_id}_{start_second} (10s clip from the the start second) AudioCaps: {audiocap_id}

1. 准备如下数据集: AudioSet:https://research.google.com/audioset/ AudioSet强标注子集(AudioSet Strongly Labeled Subset):https://research.google.com/audioset/download_strong.html VggSound:https://www.robots.ox.ac.uk/~vgg/data/vggsound/ AudioCaps:https://audiocaps.github.io/ 2. 音频文件命名格式: AudioSet数据集:{ytb_id}_{start_second*1000}_{end_second*1000} AudioSet强标注子集:{ytb_id}_{start_second*1000}(取自起始秒的10秒音频片段) VggSound:{ytb_id}_{start_second}(取自起始秒的10秒音频片段) AudioCaps:{audiocap_id}
提供机构:
maas
创建时间:
2025-08-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作