five

Creating speech zones with self-distributing acoustic swarms (Augmented Dataset Part 1 of 2)

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8222713
下载链接
链接失效反馈
官方服务:
资源简介:
Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms" This deposit contains the first part of the augmented dataset containing simulated and real world collected data. The datasets contains 18000 training mixtures of 3-5 speakers, of which 6000 are simulated using PyRoomAcoustics, 6000 are created from synchronized real world recordings in an anechoic chamber, and 6000 are created from synchronized recordings in ordinary reverberant rooms. It also includes a validation set of 500 mixtures from reverberant rooms, and a testing set of 1000 mixtures from reverberant rooms. The source sounds are various utterances from the VCTK dataset. For real world data, the utterances are played over a Rokono Bass+ Mini Speaker. The recordings are captured from an array of 7 microphones, as they are recorded by our robotic swarm as it is distributed across the table. The recorded audio in the real world has been subjected to audio compression and decompression using the Opus Codec to enable multiple simultaneous streams. You must download both the first and the second part of this dataset in order to use it properly. To uncompress the two datasets, download both and execute: ```cat *.tar.gz.* | tar xvfz -``` Please see the Readme for more information. Please see related identifiers for other datasets.
创建时间:
2023-08-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作