darshanmakwana/music_genre_tokenized

Name: darshanmakwana/music_genre_tokenized
Creator: darshanmakwana
Published: 2024-07-16 09:18:58
License: 暂无描述

Hugging Face2024-07-16 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/darshanmakwana/music_genre_tokenized

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含使用SemantiCodec进行标记化的音频数据，用于进行AR音乐生成的实验。数据集来源于[lewtun/music_genres](https://huggingface.co/datasets/lewtun/music_genres)，并提供了用于标记化的Python脚本。

This dataset contains tokenized audio from lewtun/music_genres using SemantiCodec for performing experiments on AR music generation. The dataset includes four features: audio_tokens (sequence of audio tokens), genre_id (genre ID), genre (genre), and song_id (song ID). It is divided into a training set with 19909 examples and a test set with 5076 examples. The download size of the dataset is 123311267 bytes, and the total size is 601934148 bytes.

提供机构：

darshanmakwana

5,000+

优质数据集

54 个

任务类型

进入经典数据集