Language-based audio retrieval DCASE 2022 evaluation dataset

NIAID Data Ecosystem2026-03-13 收录

下载链接：

https://zenodo.org/record/6590982

下载链接

链接失效反馈

官方服务：

资源简介：

This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in DCASE 2022 Challenge. This evaluation dataset is meant to be used for the purposes of the Subtask B in the Task 6 at the scientific challenge 2022. This dataset is not meant to be used for developing language-based audio retrieval methods. For developing language-based audio retrieval methods, you should use the development dataset, i.e., the Clotho v2.1 dataset, which can be found also in Zenodo, at: https://zenodo.org/record/4783391. == License == The audio files in the archives: retrieval_audio.7z and the associated meta-data in the CSV file: retrieval_audio_metadata.csv are under the corresponding licenses of Freesound [1] platform, mentioned explicitly in the CSV file for each of the audio files. That is, each audio file in the 7z archives is listed in the CSV file with the meta-data. The meta-data for each file are: File name Keywords URL for the orignal audio file Start and end samples for the excerpt that is used in the dataset Uploader/user in the Freesound platform (manufacturer) Link to the license of the file The caption queries in the file: retrieval_captions.csv are under the Tampere University license, described in the LICENSE file. ==References== [1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245

创建时间：

2022-06-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集