CAPTDURE: Captioned Sound Dataset of Single Sources
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/7965762
Resource overview:
Description
CAPTDURE is a dataset of captioned single-source sounds that can be used in various tasks involving environmental sounds. It contains 1,044 single-source sounds with 4,902 captions (3 or more captions per single-source sound). It also contains 1,044 multiple-source sounds with 3,132 captions (3 captions per multiple-source sound). Details of the dataset are described in [1].
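As a quick sanity check on the figures above, the following minimal sketch computes the captions-per-sound ratios and the overall totals. The four counts are taken from the description; the constant and function names are hypothetical and chosen only for illustration, and nothing here reflects the actual file layout of the dataset.

```python
# Counts quoted from the dataset description above.
SINGLE_SOURCE_SOUNDS = 1044
SINGLE_SOURCE_CAPTIONS = 4902
MULTI_SOURCE_SOUNDS = 1044
MULTI_SOURCE_CAPTIONS = 3132


def captions_per_sound(captions: int, sounds: int) -> float:
    """Return the average number of captions per sound (hypothetical helper)."""
    return captions / sounds


single_avg = captions_per_sound(SINGLE_SOURCE_CAPTIONS, SINGLE_SOURCE_SOUNDS)
multi_avg = captions_per_sound(MULTI_SOURCE_CAPTIONS, MULTI_SOURCE_SOUNDS)
total_sounds = SINGLE_SOURCE_SOUNDS + MULTI_SOURCE_SOUNDS
total_captions = SINGLE_SOURCE_CAPTIONS + MULTI_SOURCE_CAPTIONS

print(f"single-source: {single_avg:.2f} captions per sound on average")
print(f"multiple-source: {multi_avg:.2f} captions per sound on average")
print(f"total: {total_sounds} sounds, {total_captions} captions")
```

The multiple-source ratio comes out to exactly 3, matching the stated "3 captions per multiple-source sound", while the single-source average lies above 3, consistent with "3 or more captions" per sound.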
Conditions of use
This dataset was made by Hitachi, Ltd. and is available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.
Citation
If you use this dataset, please cite it as follows:
[1] Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, and Yohei Kawaguchi, "CAPTDURE: Captioned sound Dataset of Single Sources," Proc. INTERSPEECH, pp. 1683-1687, 2023.
Feedback
If you encounter any problems, please contact us:
Yuki Okamoto, y-okamoto@ieee.org
Yohei Kawaguchi, yohei.kawaguchi.xk@hitachi.com
Created:
2023-08-20



