RWCP-SSD-Onomatopoeia
Source: arXiv | Updated: 2020-07-09 | Indexed: 2024-06-21
Download link:
https://www.ksuke.net/dataset
Overview:
RWCP-SSD-Onomatopoeia is a dataset for environmental sound synthesis, created by Ritsumeikan University and collaborating institutions. It contains 155,568 onomatopoeic words paired with the audio samples they describe. The dataset covers 105 sound event categories with approximately 100 audio samples each, for a total of 9,722 audio samples. The onomatopoeic words were collected through crowdsourcing, and two scores were recorded alongside them: a confidence score self-reported by the crowdworker who supplied the word, and an acceptance score reported by other crowdworkers. The dataset is intended to meet the need for fine-grained control over the time-frequency structure of synthesized environmental sounds. Applications include film and game production, virtual reality content generation, and data augmentation for sound event detection and scene classification.
Provided by:
Ritsumeikan University, Japan; Doshisha University, Japan; The University of Tokyo, Japan; Kansai University, Japan
Created:
2020-07-09