
Howling Corrupted Music Speech (HCMS)

Source: DataCite Commons. Updated 2025-03-25; indexed 2025-04-16.
Download link:
https://rdr.kuleuven.be/citation?persistentId=doi:10.48804/EOW7OF
Description:
The database contains a total of 28 music files and 30 speech files. The original speech files are taken from 8 original signal files comprising recorded male and female speech in four languages (Chinese, English, Dutch, and Russian) from an audiobook database [1], while the original music files are taken from 7 pieces spanning various genres (e.g., jazz, opera) from 2 different music databases [2, 3]. Each file is a 20 s excerpt in which the howling is simulated to start between the 8th and 9th second. The howling is simulated by feeding the music or speech source signal to a closed-loop system with a varying broadband gain. A total of 8 acoustic impulse responses (AIRs) were used for the simulations, hence covering a wide range of howling frequencies. The final dataset contains fewer excerpts than were originally generated, as it went through a pruning step conducted by three experts, aiming to eliminate unsuitable examples (those exhibiting howling at multiple howling frequencies or no howling at all). For more details, see Ch. 7 of "Acoustic Event Detection: Feature, Evaluation and Dataset design", PhD thesis by Mina Mounir.

[1] Kearns, J.: LibriVox: Free Public Domain Audiobooks. Emerald Group Publishing Limited (2014)
[2] Emiya, V.: MAPS Database: A piano database for multipitch estimation and automatic transcription of music. http://www.tsi.telecom-paristech.fr/aao/en/2010/07/08/maps-database-a-piano-database-for-multipitch-estimation-and-automatic-transcription-of-music/ (2008)
[3] Defferrard, M., Benzi, K., Vandergheynst, P., Bresson, X.: FMA: A Dataset for Music Analysis. Proc. 18th Int. Symp. on Music Information Retrieval (ISMIR ’17), 8 (2017)
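The closed-loop simulation described above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `simulate_closed_loop`, the one-sample loop delay, the hard-clipping loudspeaker model, and the toy single-reflection feedback path are all assumptions made for illustration. The actual AIRs, gain trajectories, and sample rate used for the dataset are not given in this description.

```python
import numpy as np


def simulate_closed_loop(source, air, gain, limit=1.0):
    """Feed `source` through a feedback loop (hypothetical helper).

    Assumed model, with a one-sample delay in the loop:
        mic[n]     = source[n] + sum_k air[k] * speaker[n - 1 - k]
        speaker[n] = clip(gain[n] * mic[n], -limit, limit)
    `gain` may be a scalar or an array with one value per sample.
    Hard clipping is an assumption that keeps unstable loops bounded.
    """
    source = np.asarray(source, dtype=float)
    air = np.asarray(air, dtype=float)
    gain = np.broadcast_to(np.asarray(gain, dtype=float), source.shape)
    taps = len(air)
    # Loudspeaker history, padded with `taps` zeros so the loop starts from silence.
    speaker = np.zeros(len(source) + taps)
    mic = np.zeros(len(source))
    for n in range(len(source)):
        # Past loudspeaker samples, most recent first (times n-1 ... n-taps).
        past = speaker[n:n + taps][::-1]
        mic[n] = source[n] + np.dot(air, past)
        speaker[n + taps] = np.clip(gain[n] * mic[n], -limit, limit)
    return mic


# Toy example with made-up values (not one of the dataset's AIRs or excerpts).
fs = 8000                                     # assumed sample rate in Hz
rng = np.random.default_rng(0)
source = 0.05 * rng.standard_normal(2 * fs)   # 2 s of noise standing in for speech/music
air = np.zeros(40)
air[-1] = 0.5                                 # single reflection, attenuation 0.5
gain = np.linspace(1.0, 3.0, len(source))     # loop gain rises from 0.5 to 1.5
mic = simulate_closed_loop(source, air, gain)
```

In this toy setup the loop gain (broadband gain times path attenuation) crosses 1 about halfway through the signal, after which the feedback component grows until the clipping limit bounds it. Placing the onset in a chosen time window, as in the dataset, would amount to shaping the gain trajectory accordingly.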
Provider:
KU Leuven RDR
Created:
2025-02-28