simAugCENSE-18K
收藏NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/5150013
下载链接
链接失效反馈官方服务:
资源简介:
Created By Félix Gontier and Mathieu Lagrange, LS2N, CNRS, Ecole Centrale Nantes
Contact : mathieu.lagrange@ns2n.fr
If used for research, please refer to:
@article{gontier2021spatial,
title={Spatial and temporal pretext tasks for few-shot urban sound classification},
author={Félix Gontier and Vincent Lostanlen, and Mathieu Lagrange and Nicolas Fortin and Jean-Francois Petiot and Catherine Lavandier},
journal={submitted},
year={2021}
}
simAugCENSE-18k contains 400 acoustic scenes of duration equal to 45 seconds.
We synthesized these polyphonic scenes via the simScene software, based on monophonic audio clips from the CENSE-2k dataset.
The total duration of the dataset is equal to 18k seconds, i.e., five hours.The audio is made available as third octave spectral data, see demoTob.zip for an implementation of its computation from audio in Python.
>> import numpy as np
>> s=np.load('simAugCENSE-18k_train_spectralData.npy')
>> print(s.shape)
(280, 351, 29)
The three dimensions respectively corresponds to the sceneId, the frameId (time), and the spectralId (frequency).
>> a=np.load('simAugCENSE-18k_train_annotations.npy')
>> print(a.shape)
(280, 344, 3)
>> s=np.load('simAugCENSE-18k_validation_spectralData.npy')
>> print(s.shape)
(120, 344, 29)
>> a=np.load('simAugCENSE-18k_validation_annotations.npy')
>> print(a.shape)
(120, 344, 3)
The 3 dimensions corresponds to the sceneId, the frameId (time), the sourceId (traffic, voice, birds). Annotation is provided as a binary indicator of source presence for one second, that is 8 consecutive 125 ms frames with a hop of one frame.
创建时间:
2021-08-02



