Creating speech zones with self-distributing acoustic swarms (Simulated + Clutter)
收藏Mendeley Data2024-05-10 更新2024-06-28 收录
下载链接:
https://zenodo.org/records/8219720
下载链接
链接失效反馈官方服务:
资源简介:
Datasets used in the paper: "Creating speech zones with self-distributing acoustic swarms" This deposit contains 2 distinct datasets: A dataset of speech mixtures containing 2-5 speakers simulated using PyRoomAcoustics. The dataset consists of 8000 training mixtures, 500 validation mixtures and 1000 testing mixtures. A dataset of speech mixtures containing 3-5 speakers created from synchronized recordings in reverberant rooms with objects cluttering the table. The dataset consists of 500 testing mixtures. The source sounds are various utterances from the VCTK dataset. For real world data, the utterances are played over a Rokono Bass+ Mini Speaker. The recordings are captured from an array of 7 microphones, as they are recorded by our robotic swarm as it is distributed across the table. The recorded audio in the real world has been subjected to audio compression and decompression using the Opus Codec to enable multiple simultaneous streams. Please see the Readme for more infromation. Please see related identifiers for other datasets.
创建时间:
2023-08-22



