five

DEAR Dataset

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14646594
下载链接
链接失效反馈
官方服务:
资源简介:
The DEAR benchmark is generated by adding speech signals to background sound scenes to ensure full control over the acoustic properties of the final mixture. The background recordings were selected from the HOA-SSR dataset sound scene library (Force Technology, Denmark),1 which is a curated collection of 150 audiovisual scenes captured using specialized equipment, designed for comprehensive evaluations in audio product development. In particular, we use the 4th order ambisonics audio, which was recorded using an Eigenmikeem32 and encoded in 25-channel AmbiX format at 48 kHz with a bit depth of 24. The category selection has the purpose of capturing typical everyday situations. The speech signals are proprietary anechoic monologues recorded with Lavalier microphones. They span different vocal effort levels, which are elicited by playing pink noise through headphones at different levels. The anechoic speech signals are then convolved with a set of impulse responses to produce sound mixtures with different combinations of speakers, positions, reverberation, and SNRs. Throughout the process, attention was paid to avoid violations of the overall consistency of the generated sound scenes.
创建时间:
2025-01-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作