five

WaivOps WRLD-SMB: Open Audio Resources for Machine Learning in Music

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13921289
下载链接
链接失效反馈
官方服务:
资源简介:
WRLD-SMB Dataset WRLD-SMB is an open audio dataset featuring a collection of synthetic drum recordings in the style of Brazilian samba music. It includes 1,100 audio loops recorded in uncompressed stereo WAV format, along with paired JSON files intended for the supervised training of generative AI audio models. Overview This dataset was developed using multi-velocity audio samples and a paired MIDI dataset. The intended use of this dataset is to train or fine-tune AI models in learning high-performance drum notations, aiming to replicate the live sound of a small drum ensemble. To facilitate augmentation and supervised training with labeled audio data, a dropout technique was employed on the rendered audio files to generate variational mixes of the drum tracks. The primary purpose of this dataset is to provide accessible content for machine learning applications in music and audio. Potential use cases include generative music, feature extraction, tempo detection, audio classification, rhythm analysis, drum synthesis, music information retrieval (MIR), sound design and signal processing. Specifications 1,100 audio loops (approximately 5.5 hours) 16-bit 44.1kHz WAV format Tempo range: 90–120 BPM Paired label data (WAV + JSON) Variational drum patterns Subgenre styles (Traditional and modern samba, bossa nova, fusion) A JSON file is provided for referencing and converting MIDI note numbers to text labels. You can update the text labels to suit your preferences. License This dataset was compiled by WaivOps, a crowdsourced music project managed by the sound label company Patchbanks. All recordings have been compiled by verified sources for copyright clearance. The WRLD-SMB dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). Additional Info For audio examples or more information about this dataset, please refer to the GitHub repository.
创建时间:
2024-10-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作