EzAudioCaps
ModelScope community. Updated 2025-12-05. Indexed 2025-12-06.
Download link:
https://modelscope.cn/datasets/OpenSound/EzAudioCaps
Resource description:
1. Prepare the following datasets:
AudioSet: https://research.google.com/audioset/
AudioSet Strongly Labeled Subset: https://research.google.com/audioset/download_strong.html
VggSound: https://www.robots.ox.ac.uk/~vgg/data/vggsound/
AudioCaps: https://audiocaps.github.io/
2. The audio file name format:
AudioSet data: {ytb_id}_{start_second*1000}_{end_second*1000}
AudioSet Strongly Labeled Subset: {ytb_id}_{start_second*1000} (10-second clip beginning at the start second)
VggSound: {ytb_id}_{start_second} (10-second clip beginning at the start second)
AudioCaps: {audiocap_id}
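
The naming rules above can be sketched as small helper functions. This is only an illustration, not code from the dataset authors: the function names are hypothetical, and it assumes that millisecond values are written as plain integers without zero-padding and that no file extension is part of the name, neither of which the description states.

```python
# Hypothetical helpers illustrating the file name formats listed above.
# Assumptions (not confirmed by the dataset description): millisecond values
# are plain integers without zero-padding, and no file extension is appended.

def audioset_name(ytb_id: str, start_second: float, end_second: float) -> str:
    """AudioSet: {ytb_id}_{start_second*1000}_{end_second*1000}"""
    return f"{ytb_id}_{round(start_second * 1000)}_{round(end_second * 1000)}"


def audioset_strong_name(ytb_id: str, start_second: float) -> str:
    """AudioSet Strongly Labeled Subset: {ytb_id}_{start_second*1000}"""
    return f"{ytb_id}_{round(start_second * 1000)}"


def vggsound_name(ytb_id: str, start_second: int) -> str:
    """VggSound: {ytb_id}_{start_second}"""
    return f"{ytb_id}_{int(start_second)}"


def audiocaps_name(audiocap_id) -> str:
    """AudioCaps: {audiocap_id}"""
    return str(audiocap_id)


if __name__ == "__main__":
    # "abc123" is a made-up ID used only for demonstration.
    print(audioset_name("abc123", 30, 40))     # abc123_30000_40000
    print(audioset_strong_name("abc123", 30))  # abc123_30000
    print(vggsound_name("abc123", 30))         # abc123_30
    print(audiocaps_name(1234))                # 1234
```

If the prepared files use a different convention, for example zero-padded seconds or an extension such as .wav, adjust the helpers to match before looking clips up by name.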
Provider:
maas
Created:
2025-08-26