Clotho Analysis Set
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6604108
下载链接
链接失效反馈官方服务:
资源简介:
This dataset is derived from the evaluation subset of Clotho dataset. It is designed to analyze the behavior of the captioning system under certain perturbation in order to try and identify some open challenges in automated audio captioning. The original audio clips are transformed with audio_degrader. The transformations applied are the following:
Microphone response simulation
Mixup with another clip from the dataset (ratio -6dB, -3dB and 0dB)
Additive noise from DESED (ratio -12dB, -6dB, 0dB)
创建时间:
2022-06-03



