five

环境声音分类,环境声音的原始音频分类数据集

收藏
帕依提提2024-03-04 收录
下载链接:
https://www.payititi.com/opendatasets/show-9862.html
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset consists in 50 WAV files sampled at 16KHz for 50 different classes. To each one of the classes, corresponds 40 audio sample of 5 seconds each. All of these audio files have been concatenated by class in order to have 50 wave files of 3 min. 20sec. In our example notebook, we show how to access the data and visualize a piece of it. We have not much credit in proposing the dataset here. Much of the work have been done by the authors of the ESC-50 Dataset for Environmental Sound Classification. In order to fit on Kaggle, we processed the files with the to_wav.py file present in the original repository. You might also notice that we transformed the data from OGG to WAV as the former didn't seem to be supported in Anaconda.

本数据集包含50个采样率为16kHz的WAV格式音频文件(WAV),对应50个不同类别。每个类别对应40段时长为5秒的音频采样。所有此类音频文件均按类别拼接,最终得到50段时长为3分20秒的WAV文件。在配套的示例Notebook中,我们演示了如何读取该数据集并对其中部分数据进行可视化。我们并非本数据集的原创作团队,绝大多数原创工作由环境声音分类ESC-50数据集(ESC-50 Dataset)的作者完成。为适配Kaggle平台,我们使用原始代码仓库中的`to_wav.py`脚本对文件进行了处理。您可能还会注意到,由于Anaconda似乎不支持OGG格式音频(OGG),我们已将原始数据从OGG格式转换为WAV格式。
提供机构:
帕依提提
二维码
社区交流群
二维码
科研交流群
商业服务