FSDnoisy18k Dataset
收藏paperswithcode.com2025-03-24 收录
下载链接:
https://paperswithcode.com/dataset/fsdnoisy18k
下载链接
链接失效反馈官方服务:
资源简介:
The FSDnoisy18k dataset is an open dataset containing 42.5 hours of audio across 20 sound event classes, including a small amount of manually-labeled data and a larger quantity of real-world noisy data. The audio content is taken from Freesound, and the dataset was curated using the Freesound Annotator. The noisy set of FSDnoisy18k consists of 15,813 audio clips (38.8h), and the test set consists of 947 audio clips (1.4h) with correct labels. The dataset features two main types of label noise: in-vocabulary (IV) and out-of-vocabulary (OOV). IV applies when, given an observed label that is incorrect or incomplete, the true or missing label is part of the target class set. Analogously, OOV means that the true or missing label is not covered by those 20 classes.
FSDnoisy18k数据集系一项开放资源,囊括了20个声事件类别共计42.5小时的音频内容。该数据集包含少量人工标注数据与大量现实世界噪声数据。音频素材源自Freesound,并由Freesound Annotator进行精选。FSDnoisy18k的噪声集包含15,813个音频片段(时长38.8小时),测试集则包含947个音频片段(时长1.4小时),并附有正确标签。数据集主要呈现两种类型的标签噪声:词汇内(IV)与词汇外(OOV)。当观察到的标签错误或不完整时,若真实或缺失的标签属于目标类别集合之中,则称为词汇内;反之,若真实或缺失的标签不属于上述20个类别,则定义为词汇外。
提供机构:
paperswithcode.com



