Lehrig/GTZAN-Collection
收藏Hugging Face2022-06-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Lehrig/GTZAN-Collection
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
---
# Dataset Card for GTZAN Collection
## Table of Contents
- [Dataset Description](#dataset-description)
- [Dataset Summary](#dataset-summary)
- [Supported Tasks](#supported-tasks-and-leaderboards)
- [Languages](#languages)
- [Dataset Structure](#dataset-structure)
- [Data Instances](#data-instances)
- [Data Fields](#data-instances)
- [Data Splits](#data-instances)
- [Dataset Creation](#dataset-creation)
- [Curation Rationale](#curation-rationale)
- [Source Data](#source-data)
- [Annotations](#annotations)
- [Personal and Sensitive Information](#personal-and-sensitive-information)
- [Considerations for Using the Data](#considerations-for-using-the-data)
- [Social Impact of Dataset](#social-impact-of-dataset)
- [Discussion of Biases](#discussion-of-biases)
- [Other Known Limitations](#other-known-limitations)
- [Additional Information](#additional-information)
- [Dataset Curators](#dataset-curators)
- [Licensing Information](#licensing-information)
- [Citation Information](#citation-information)
## Dataset Description
- **Homepage:** https://github.com/derekahuang/Music-Classification
- **Repository:** https://github.com/derekahuang/Music-Classification
- **Paper:** [Musical genre classification of audio signals](https://ieeexplore.ieee.org/document/1021072)
- **Leaderboard:** [Needs More Information]
- **Point of Contact:** [Needs More Information]
### Dataset Summary
The dataset consists of 1000 audio tracks each 30 seconds long.
It contains 10 genres, each represented by 100 tracks.
The tracks are all 22050Hz Mono 16-bit audio files in .wav format.
The genres are:
* blues
* classical
* country
* disco
* hiphop
* jazz
* metal
* pop
* reggae
* rock
This collection includes the following GTZAN variants:
* raw (original WAV files)
* melspectrograms (from each WAV file, contiguous 2-second windows at 4 random locations are sampled and transformed to Mel Spectrograms, resulting in 8000 Mel Spectrograms)
### Supported Tasks and Leaderboards
[Needs More Information]
### Languages
[Needs More Information]
## Dataset Structure
### Data Instances
[Needs More Information]
### Data Fields
[Needs More Information]
### Data Splits
[Needs More Information]
## Dataset Creation
### Curation Rationale
[Needs More Information]
### Source Data
#### Initial Data Collection and Normalization
[Needs More Information]
#### Who are the source language producers?
[Needs More Information]
### Annotations
#### Annotation process
[Needs More Information]
#### Who are the annotators?
[Needs More Information]
### Personal and Sensitive Information
[Needs More Information]
## Considerations for Using the Data
### Social Impact of Dataset
[Needs More Information]
### Discussion of Biases
[Needs More Information]
### Other Known Limitations
[Needs More Information]
## Additional Information
### Dataset Curators
[Needs More Information]
### Licensing Information
[Needs More Information]
### Citation Information
[Needs More Information]
---
许可证:Apache-2.0
---
# GTZAN 数据集集合卡片(Dataset Card)
## 目录(Table of Contents)
- [数据集描述(Dataset Description)](#dataset-description)
- [数据集概览(Dataset Summary)](#dataset-summary)
- [支持任务与排行榜(Supported Tasks and Leaderboards)](#supported-tasks-and-leaderboards)
- [语言(Languages)](#languages)
- [数据集结构(Dataset Structure)](#dataset-structure)
- [数据实例(Data Instances)](#data-instances)
- [数据字段(Data Fields)](#data-instances)
- [数据划分(Data Splits)](#data-instances)
- [数据集构建(Dataset Creation)](#dataset-creation)
- [遴选依据(Curation Rationale)](#curation-rationale)
- [源数据(Source Data)](#source-data)
- [标注信息(Annotations)](#annotations)
- [个人与敏感信息(Personal and Sensitive Information)](#personal-and-sensitive-information)
- [数据集使用注意事项(Considerations for Using the Data)](#considerations-for-using-the-data)
- [数据集的社会影响(Social Impact of Dataset)](#social-impact-of-dataset)
- [偏差讨论(Discussion of Biases)](#discussion-of-biases)
- [其他已知局限性(Other Known Limitations)](#other-known-limitations)
- [附加信息(Additional Information)](#additional-information)
- [数据集策展人(Dataset Curators)](#dataset-curators)
- [许可证信息(Licensing Information)](#licensing-information)
- [引用信息(Citation Information)](#citation-information)
## 数据集描述(Dataset Description)
- **主页(Homepage)**:https://github.com/derekahuang/Music-Classification
- **代码仓库(Repository)**:https://github.com/derekahuang/Music-Classification
- **相关论文(Paper)**:[音频信号的音乐流派分类](https://ieeexplore.ieee.org/document/1021072)
- **排行榜(Leaderboard)**:[需补充更多信息]
- **联络人(Point of Contact)**:[需补充更多信息]
### 数据集概览(Dataset Summary)
本数据集包含1000条时长为30秒的音频片段,涵盖10个音乐流派,每个流派对应100条音频片段。所有音频均为22050Hz单声道16位的WAV格式文件,涵盖的流派包括:
* 布鲁斯(blues)
* 古典(classical)
* 乡村(country)
* 迪斯科(disco)
* 嘻哈(hiphop)
* 爵士(jazz)
* 金属(metal)
* 流行(pop)
* 雷鬼(reggae)
* 摇滚(rock)
本数据集集合包含以下两种GTZAN变体:
* 原始版(raw):即原始WAV文件
* 梅尔频谱图版(melspectrograms):从每条原始WAV文件中,在4个随机位置截取连续2秒的音频窗口,并转换为梅尔频谱图(Mel Spectrograms),最终共生成8000张梅尔频谱图。
### 支持任务与排行榜(Supported Tasks and Leaderboards)
[需补充更多信息]
### 语言(Languages)
[需补充更多信息]
## 数据集结构(Dataset Structure)
### 数据实例(Data Instances)
[需补充更多信息]
### 数据字段(Data Fields)
[需补充更多信息]
### 数据划分(Data Splits)
[需补充更多信息]
## 数据集构建(Dataset Creation)
### 遴选依据(Curation Rationale)
[需补充更多信息]
### 源数据(Source Data)
#### 初始数据收集与归一化
[需补充更多信息]
#### 源数据生产者是谁?
[需补充更多信息]
### 标注信息(Annotations)
#### 标注流程
[需补充更多信息]
#### 标注人员是谁?
[需补充更多信息]
### 个人与敏感信息(Personal and Sensitive Information)
[需补充更多信息]
## 数据集使用注意事项(Considerations for Using the Data)
### 数据集的社会影响(Social Impact of Dataset)
[需补充更多信息]
### 偏差讨论(Discussion of Biases)
[需补充更多信息]
### 其他已知局限性(Other Known Limitations)
[需补充更多信息]
## 附加信息(Additional Information)
### 数据集策展人(Dataset Curators)
[需补充更多信息]
### 许可证信息(Licensing Information)
[需补充更多信息]
### 引用信息(Citation Information)
[需补充更多信息]
提供机构:
Lehrig
原始信息汇总
数据集概述
数据集名称
GTZAN Collection
数据集总结
- 音频数量:1000个音频文件
- 音频长度:每个音频30秒
- 音频格式:22050Hz Mono 16-bit WAV格式
- 音频类型:包含10种音乐类型,每种类型100个文件
- 音乐类型:blues, classical, country, disco, hiphop, jazz, metal, pop, reggae, rock
- 数据集变体:包括原始WAV文件和从每个WAV文件中提取的Mel Spectrograms(8000个)
数据集结构
- 数据实例:[信息缺失]
- 数据字段:[信息缺失]
- 数据分割:[信息缺失]
数据集创建
- 精选理由:[信息缺失]
- 源数据:[信息缺失]
- 注释:[信息缺失]
- 个人和敏感信息:[信息缺失]
使用数据注意事项
- 数据集的社会影响:[信息缺失]
- 偏见讨论:[信息缺失]
- 其他已知限制:[信息缺失]
附加信息
- 数据集管理者:[信息缺失]
- 许可信息:Apache-2.0
- 引用信息:[信息缺失]
搜集汇总
数据集介绍

背景与挑战
背景概述
GTZAN-Collection数据集包含1000个30秒长的音频文件,涵盖10种音乐流派,每种流派100个样本。音频文件为22050Hz Mono 16-bit的.wav格式,并提供原始音频和Mel频谱图两种形式。
以上内容由遇见数据集搜集并总结生成



