five

simAugCENSE-18K

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/5150013
下载链接
链接失效反馈
官方服务:
资源简介:
Created By Félix Gontier and Mathieu Lagrange, LS2N, CNRS, Ecole Centrale Nantes Contact : mathieu.lagrange@ns2n.fr If used for research, please refer to: @article{gontier2021spatial, title={Spatial and temporal pretext tasks for few-shot urban sound classification}, author={Félix Gontier and Vincent Lostanlen, and Mathieu Lagrange and Nicolas Fortin and Jean-Francois Petiot and Catherine Lavandier}, journal={submitted}, year={2021} } simAugCENSE-18k contains 400 acoustic scenes of duration equal to 45 seconds. We synthesized these polyphonic scenes via the simScene software, based on monophonic audio clips from the CENSE-2k dataset. The total duration of the dataset is equal to 18k seconds, i.e., five hours.The audio is made available as third octave spectral data, see demoTob.zip for an implementation of its computation from audio in Python. >> import numpy as np >> s=np.load('simAugCENSE-18k_train_spectralData.npy') >> print(s.shape) (280, 351, 29) The three dimensions respectively corresponds to the sceneId, the frameId (time), and the spectralId (frequency). >> a=np.load('simAugCENSE-18k_train_annotations.npy') >> print(a.shape) (280, 344, 3) >> s=np.load('simAugCENSE-18k_validation_spectralData.npy') >> print(s.shape) (120, 344, 29) >> a=np.load('simAugCENSE-18k_validation_annotations.npy') >> print(a.shape) (120, 344, 3) The 3 dimensions corresponds to the sceneId, the frameId (time), the sourceId (traffic, voice, birds). Annotation is provided as a binary indicator of source presence for one second, that is 8 consecutive 125 ms frames with a hop of one frame.
创建时间:
2021-08-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作