LAMA World Music Genre Dataset

Source: DataONE | Updated: 2020-12-10 | Indexed: 2024-06-08
Download link:
https://search.dataone.org/view/sha256:dd6ba575f0869639cb1169b059d038f42bf2f1c1aba3b34d28d62b3c32f46aea
Description:
LAMA: LatinAmerica, Asia, MiddleEastern, Africa Genre Dataset

This dataset consists of .wav audio files classified into four categories: LatinAmerica, Asia, MiddleEastern, and Africa. We went through the Google AudioSet ontology and picked the clips that we double-checked to be from each region. For each clip we added 1-minute audio (.wav), plots (.png), and numerical datapoints for training (.json). I hope this work can help Deep Learning and Machine Learning projects in music genre classification.

## Getting Started

The data contained in LAMA falls into three categories: audio, graph plots, and numerical data.

| Section | Format | LatinAmerica | Asia | MiddleEast | Africa |
|---------|--------|--------------|------|------------|--------|
| audio | .wav | 535 | 539 | 548 | 645 |
| graph plots | .png | 2140 | 2156 | 2192 | 2580 |
| numerical | .json | 101650 | 102410 | 104120 | 122550 |

### Overall statistics of LAMA

1. The numbers in the "audio" and "graph plots" rows are counts of the files included in each section.
2. The numbers in the "numerical" row are the total number of raw datapoints. Datapoints from two related files (trainMFCC.json, trainSC.json) were counted; datapoints from trainZCR.json and trainRMSE.json were not.

Note: LatinA refers to Latin America, and MiddleE refers to Middle East.

### What's in?

1. **audio (.wav)**: 1-minute audio clips from Latin America, Africa, Asia, and the Middle East
2. **graph plots (.png)**: MFCC (Mel-Frequency Cepstral Coefficients), STFT (Short-Time Fourier Transform), FFT (Fast Fourier Transform), and waveform plots
3. **numerical (.json)**: 13 MFCC datapoints and 6 spectral contrast datapoints
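
The counts in the statistics table can be cross-checked with a little arithmetic. The sketch below is only a consistency check on the published numbers, not part of the dataset's own tooling. It shows that every region has exactly 4 plots per audio clip (matching the four plot types: MFCC, STFT, FFT, waveform) and 190 numerical datapoints per clip. Note that 190 = (13 + 6) × 10; reading the factor of 10 as "ten feature vectors per clip" is an assumption, since the description does not say how the datapoints are laid out.

```python
# Counts copied from the LAMA statistics table.
audio = {"LatinAmerica": 535, "Asia": 539, "MiddleEast": 548, "Africa": 645}
plots = {"LatinAmerica": 2140, "Asia": 2156, "MiddleEast": 2192, "Africa": 2580}
numerical = {"LatinAmerica": 101650, "Asia": 102410, "MiddleEast": 104120, "Africa": 122550}

N_MFCC = 13  # MFCC datapoints, as stated in the description
N_SC = 6     # spectral contrast datapoints, as stated in the description


def per_clip_ratios(audio, plots, numerical):
    """Return {region: (plots per clip, numerical datapoints per clip)}."""
    return {
        region: (plots[region] / n_clips, numerical[region] / n_clips)
        for region, n_clips in audio.items()
    }


ratios = per_clip_ratios(audio, plots, numerical)
total_clips = sum(audio.values())

# ASSUMPTION: the leftover factor is interpreted as feature vectors per clip;
# the source only gives the totals.
vectors_per_clip = {
    region: datapoints / (N_MFCC + N_SC) for region, (_, datapoints) in ratios.items()
}

for region, (plots_per_clip, datapoints_per_clip) in ratios.items():
    print(region, plots_per_clip, datapoints_per_clip, vectors_per_clip[region])
print("total clips:", total_clips)
```

The ratios are identical across all four regions, which suggests the plot and numerical files were generated uniformly from the audio clips.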
Created:
2023-11-20