five

Audio Dataset of Traditional Portuguese Musical Instruments

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/yjdfnymgf2
下载链接
链接失效反馈
官方服务:
资源简介:
DESCRIPTION: This dataset is a curated and annotated audio dataset designed specifically for Music Information Retrieval (MIR) tasks in low-resource contexts and for the preservation of intangible cultural heritage. The dataset is derived from the initiative ‘A Música Portuguesa a Gostar Dela Própria’, an extensive ethnographic video library dedicated to documenting Portugal's oral and musical traditions. The corpus contains audio clips of seven instruments representative of various Portuguese regions, including string instruments (Portuguese guitar, Viola Braguesa), woodwind instruments (Pífaro, Gaita de Foles, Saxophone) and free-reed instruments (Concertina, Harmonica). Unlike traditional study datasets, this is characterised by its nature (field recordings). The samples present uncontrolled acoustic conditions, including natural reverberation, ambient noise and variability in microphone distance, which provides high validity for evaluating the robustness of deep learning models. DATASET STATISTICS: The dataset consists of 1,734 audio files (WAV) with the following class distribution: - Concertina: 419 samples - Harmonica (Armónica): 407 samples - Portuguese Bagpipes (Gaita de Foles): 375 samples - Saxophone (Saxofone): 190 samples - Portuguese Guitar (Guitarra Portuguesa): 160 samples - Viola Braguesa: 92 samples - Pífaro: 91 samples METADATA STRUCTURE: The accompanying CSV file contains the following columns: - filename: Name of the audio file. - target: Numeric class identifier (0-6). - category: Instrument name (label). - take: Unique recording session identifier. (Note: It is highly recommended to use this field for take-stratified splitting to prevent data leakage between sets).
创建时间:
2026-01-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作