teticio/audio-diffusion-instrumental-hiphop-256

Name: teticio/audio-diffusion-instrumental-hiphop-256
Creator: teticio
Published: 2022-11-09 10:50:58
License: No description available

Source: Hugging Face. Updated: 2022-11-09. Indexed: 2024-03-04.

Download link:

https://hf-mirror.com/datasets/teticio/audio-diffusion-instrumental-hiphop-256

Resource description:

---
annotations_creators: []
language: []
language_creators: []
license: []
multilinguality: []
pretty_name: Mel spectrograms of instrumental Hip Hop music
size_categories:
- 10K<n<100K
source_datasets: []
tags:
- audio
- spectrograms
task_categories:
- image-to-image
task_ids: []
---

256x256 mel spectrograms of 5 second samples of instrumental Hip Hop. The code to convert from audio to spectrogram and vice versa can be found in https://github.com/teticio/audio-diffusion along with scripts to train and run inference using De-noising Diffusion Probabilistic Models.

```
x_res = 256
y_res = 256
sample_rate = 22050
n_fft = 2048
hop_length = 512
```
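As a rough check of how the listed parameters relate to the stated sample length, the sketch below assumes that each of the 256 pixel columns of a spectrogram covers one hop of 512 audio samples. That mapping is an assumption for illustration; the card does not spell it out. Under it, one image spans slightly under 6 seconds, which suggests the "5 second" figure is approximate.

```python
# Parameters copied from the dataset card.
x_res = 256          # spectrogram width in pixels (time frames)
sample_rate = 22050  # audio sample rate in Hz
hop_length = 512     # audio samples between successive frames

# Assumption (not stated in the card): one pixel column per hop of audio.
samples_per_image = x_res * hop_length
seconds_per_image = samples_per_image / sample_rate

print(f"{samples_per_image} samples, about {seconds_per_image:.2f} s per spectrogram")
```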

Provider:

teticio

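Summary of original information

Dataset overview

Basic information

Name: Mel spectrograms of instrumental Hip Hop music
Size: 10K<n<100K
Language: not recorded
License: not recorded
Multilinguality: not recorded

Dataset contents

Type: Audio
Format: Mel spectrograms
Resolution: 256x256
Sample duration: 5 seconds
Audio properties:
- Sample rate: 22050 Hz
- FFT size: 2048
- Hop length: 512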
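The frequency axis implied by these properties can be worked out with a few lines of arithmetic. This is a minimal sketch, assuming the 256-pixel height equals the number of mel bands (the listing gives only the resolution, so that reading is an assumption). Mel bands are not uniformly spaced, so the final ratio is only an average.

```python
# Values taken from the listing above.
sample_rate = 22050  # Hz
n_fft = 2048         # FFT window size in samples
y_res = 256          # spectrogram height; assumed here to be the mel band count

nyquist_hz = sample_rate / 2        # highest frequency the audio can represent
stft_bins = n_fft // 2 + 1          # linear frequency bins of a real-valued FFT
bin_width_hz = sample_rate / n_fft  # spacing between linear bins

# Average number of linear bins pooled into each (assumed) mel band.
bins_per_band = stft_bins / y_res

print(f"Nyquist: {nyquist_hz} Hz, {stft_bins} bins of {bin_width_hz:.2f} Hz each")
print(f"About {bins_per_band:.1f} linear bins per mel band on average")
```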

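Task information

Task category: image-to-image
Task ID: not recorded

Additional information

Conversion code: Code for converting audio to spectrograms and back is available at https://github.com/teticio/audio-diffusion, along with scripts to train and run inference using De-noising Diffusion Probabilistic Models.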
