Language-based audio retrieval DCASE 2022 evaluation dataset
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6590982
下载链接
链接失效反馈官方服务:
资源简介:
This is the evaluation dataset for Task 6 (Subtask B), Language-based Audio Retrieval, in DCASE 2022 Challenge.
This evaluation dataset is meant to be used for the purposes of the Subtask B in the Task 6 at the scientific challenge 2022. This dataset is not meant to be used for developing language-based audio retrieval methods. For developing language-based audio retrieval methods, you should use the development dataset, i.e., the Clotho v2.1 dataset, which can be found also in Zenodo, at: https://zenodo.org/record/4783391.
== License ==
The audio files in the archives:
retrieval_audio.7z
and the associated meta-data in the CSV file:
retrieval_audio_metadata.csv
are under the corresponding licenses of Freesound [1] platform, mentioned explicitly in the CSV file for each of the audio files. That is, each audio file in the 7z archives is listed in the CSV file with the meta-data. The meta-data for each file are:
File name
Keywords
URL for the orignal audio file
Start and end samples for the excerpt that is used in the dataset
Uploader/user in the Freesound platform (manufacturer)
Link to the license of the file
The caption queries in the file:
retrieval_captions.csv
are under the Tampere University license, described in the LICENSE file.
==References==
[1] Frederic Font, Gerard Roma, and Xavier Serra. 2013. Freesound technical demo. In Proceedings of the 21st ACM international conference on Multimedia (MM '13). ACM, New York, NY, USA, 411-412. DOI: https://doi.org/10.1145/2502081.2502245
创建时间:
2022-06-01



