five

Lehrig/GTZAN-Collection

收藏
Hugging Face2022-06-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Lehrig/GTZAN-Collection
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 --- # Dataset Card for GTZAN Collection ## Table of Contents - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-instances) - [Data Splits](#data-instances) - [Dataset Creation](#dataset-creation) - [Curation Rationale](#curation-rationale) - [Source Data](#source-data) - [Annotations](#annotations) - [Personal and Sensitive Information](#personal-and-sensitive-information) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Social Impact of Dataset](#social-impact-of-dataset) - [Discussion of Biases](#discussion-of-biases) - [Other Known Limitations](#other-known-limitations) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) ## Dataset Description - **Homepage:** https://github.com/derekahuang/Music-Classification - **Repository:** https://github.com/derekahuang/Music-Classification - **Paper:** [Musical genre classification of audio signals](https://ieeexplore.ieee.org/document/1021072) - **Leaderboard:** [Needs More Information] - **Point of Contact:** [Needs More Information] ### Dataset Summary The dataset consists of 1000 audio tracks each 30 seconds long. It contains 10 genres, each represented by 100 tracks. The tracks are all 22050Hz Mono 16-bit audio files in .wav format. The genres are: * blues * classical * country * disco * hiphop * jazz * metal * pop * reggae * rock This collection includes the following GTZAN variants: * raw (original WAV files) * melspectrograms (from each WAV file, contiguous 2-second windows at 4 random locations are sampled and transformed to Mel Spectrograms, resulting in 8000 Mel Spectrograms) ### Supported Tasks and Leaderboards [Needs More Information] ### Languages [Needs More Information] ## Dataset Structure ### Data Instances [Needs More Information] ### Data Fields [Needs More Information] ### Data Splits [Needs More Information] ## Dataset Creation ### Curation Rationale [Needs More Information] ### Source Data #### Initial Data Collection and Normalization [Needs More Information] #### Who are the source language producers? [Needs More Information] ### Annotations #### Annotation process [Needs More Information] #### Who are the annotators? [Needs More Information] ### Personal and Sensitive Information [Needs More Information] ## Considerations for Using the Data ### Social Impact of Dataset [Needs More Information] ### Discussion of Biases [Needs More Information] ### Other Known Limitations [Needs More Information] ## Additional Information ### Dataset Curators [Needs More Information] ### Licensing Information [Needs More Information] ### Citation Information [Needs More Information]

--- 许可证:Apache-2.0 --- # GTZAN 数据集集合卡片(Dataset Card) ## 目录(Table of Contents) - [数据集描述(Dataset Description)](#dataset-description) - [数据集概览(Dataset Summary)](#dataset-summary) - [支持任务与排行榜(Supported Tasks and Leaderboards)](#supported-tasks-and-leaderboards) - [语言(Languages)](#languages) - [数据集结构(Dataset Structure)](#dataset-structure) - [数据实例(Data Instances)](#data-instances) - [数据字段(Data Fields)](#data-instances) - [数据划分(Data Splits)](#data-instances) - [数据集构建(Dataset Creation)](#dataset-creation) - [遴选依据(Curation Rationale)](#curation-rationale) - [源数据(Source Data)](#source-data) - [标注信息(Annotations)](#annotations) - [个人与敏感信息(Personal and Sensitive Information)](#personal-and-sensitive-information) - [数据集使用注意事项(Considerations for Using the Data)](#considerations-for-using-the-data) - [数据集的社会影响(Social Impact of Dataset)](#social-impact-of-dataset) - [偏差讨论(Discussion of Biases)](#discussion-of-biases) - [其他已知局限性(Other Known Limitations)](#other-known-limitations) - [附加信息(Additional Information)](#additional-information) - [数据集策展人(Dataset Curators)](#dataset-curators) - [许可证信息(Licensing Information)](#licensing-information) - [引用信息(Citation Information)](#citation-information) ## 数据集描述(Dataset Description) - **主页(Homepage)**:https://github.com/derekahuang/Music-Classification - **代码仓库(Repository)**:https://github.com/derekahuang/Music-Classification - **相关论文(Paper)**:[音频信号的音乐流派分类](https://ieeexplore.ieee.org/document/1021072) - **排行榜(Leaderboard)**:[需补充更多信息] - **联络人(Point of Contact)**:[需补充更多信息] ### 数据集概览(Dataset Summary) 本数据集包含1000条时长为30秒的音频片段,涵盖10个音乐流派,每个流派对应100条音频片段。所有音频均为22050Hz单声道16位的WAV格式文件,涵盖的流派包括: * 布鲁斯(blues) * 古典(classical) * 乡村(country) * 迪斯科(disco) * 嘻哈(hiphop) * 爵士(jazz) * 金属(metal) * 流行(pop) * 雷鬼(reggae) * 摇滚(rock) 本数据集集合包含以下两种GTZAN变体: * 原始版(raw):即原始WAV文件 * 梅尔频谱图版(melspectrograms):从每条原始WAV文件中,在4个随机位置截取连续2秒的音频窗口,并转换为梅尔频谱图(Mel Spectrograms),最终共生成8000张梅尔频谱图。 ### 支持任务与排行榜(Supported Tasks and Leaderboards) [需补充更多信息] ### 语言(Languages) [需补充更多信息] ## 数据集结构(Dataset Structure) ### 数据实例(Data Instances) [需补充更多信息] ### 数据字段(Data Fields) [需补充更多信息] ### 数据划分(Data Splits) [需补充更多信息] ## 数据集构建(Dataset Creation) ### 遴选依据(Curation Rationale) [需补充更多信息] ### 源数据(Source Data) #### 初始数据收集与归一化 [需补充更多信息] #### 源数据生产者是谁? [需补充更多信息] ### 标注信息(Annotations) #### 标注流程 [需补充更多信息] #### 标注人员是谁? [需补充更多信息] ### 个人与敏感信息(Personal and Sensitive Information) [需补充更多信息] ## 数据集使用注意事项(Considerations for Using the Data) ### 数据集的社会影响(Social Impact of Dataset) [需补充更多信息] ### 偏差讨论(Discussion of Biases) [需补充更多信息] ### 其他已知局限性(Other Known Limitations) [需补充更多信息] ## 附加信息(Additional Information) ### 数据集策展人(Dataset Curators) [需补充更多信息] ### 许可证信息(Licensing Information) [需补充更多信息] ### 引用信息(Citation Information) [需补充更多信息]
提供机构:
Lehrig
原始信息汇总

数据集概述

数据集名称

GTZAN Collection

数据集总结

  • 音频数量:1000个音频文件
  • 音频长度:每个音频30秒
  • 音频格式:22050Hz Mono 16-bit WAV格式
  • 音频类型:包含10种音乐类型,每种类型100个文件
  • 音乐类型:blues, classical, country, disco, hiphop, jazz, metal, pop, reggae, rock
  • 数据集变体:包括原始WAV文件和从每个WAV文件中提取的Mel Spectrograms(8000个)

数据集结构

  • 数据实例:[信息缺失]
  • 数据字段:[信息缺失]
  • 数据分割:[信息缺失]

数据集创建

  • 精选理由:[信息缺失]
  • 源数据:[信息缺失]
  • 注释:[信息缺失]
  • 个人和敏感信息:[信息缺失]

使用数据注意事项

  • 数据集的社会影响:[信息缺失]
  • 偏见讨论:[信息缺失]
  • 其他已知限制:[信息缺失]

附加信息

  • 数据集管理者:[信息缺失]
  • 许可信息:Apache-2.0
  • 引用信息:[信息缺失]
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
GTZAN-Collection数据集包含1000个30秒长的音频文件,涵盖10种音乐流派,每种流派100个样本。音频文件为22050Hz Mono 16-bit的.wav格式,并提供原始音频和Mel频谱图两种形式。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作