Deep Xi Training Set
收藏Mendeley Data2024-03-27 更新2024-06-27 收录
下载链接:
https://ieee-dataport.org/open-access/deep-xi-training-set
下载链接
链接失效反馈官方服务:
资源简介:
The clean-speech and noise recordings used to train Deep Xi (https://github.com/anicolson/DeepXi). A validation set is also included. Clean speech:The clean-speech recordings are from the test-clean-100 set of Librispeech (http://www.openslr.org/12/) and from the CSTR VCTK corpus (https://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html). Noise:The noise recordings are from the Environmental Background Noise dataset (https://personal.utdallas.edu/~nxk019000/VAD-dataset/), the Nonspeech dataset (http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus.html), the QUT-NOISE dataset (https://research.qut.edu.au/saivt/databases/qut-noise-databases-and-protocols/), multiple Freesound packs (https://freesound.org/), the noise set of the MUSAN corpus (https://www.openslr.org/17/), the RSG-10 noise database (http://www.steeneken.nl/wp-content/uploads/2014/04/RSG-10_Noise-data-base.pdf) and the Urban Sound dataset (http://www.justinsalamon.com/uploads/4/3/9/4/4394963/salamon_urbansound_acmmm14.pdf). Note that the clean-speech and noise recordings used for this training set are separate from those used in the following test sets: Deep Xi Test Set (), Test Set from (), and the DEMAND Voicebank test set ().
本数据集包含用于训练Deep Xi(https://github.com/anicolson/DeepXi)的纯净语音与噪声录音,并附带验证集。纯净语音:该数据集的纯净语音录音取自Librispeech的test-clean-100子集与CSTR VCTK语料库,相关资源链接分别为http://www.openslr.org/12/ 及https://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html。噪声:本数据集的噪声录音来源于多类公开数据集与资源,包括环境背景噪声数据集(https://personal.utdallas.edu/~nxk019000/VAD-dataset/)、非语音数据集(http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus.html)、QUT-NOISE数据集(https://research.qut.edu.au/saivt/databases/qut-noise-databases-and-protocols/)、多款Freesound素材包(https://freesound.org/)、MUSAN语料库的噪声子集(https://www.openslr.org/17/)、RSG-10噪声数据库(http://www.steeneken.nl/wp-content/uploads/2014/04/RSG-10_Noise-data-base.pdf)以及城市声音数据集(http://www.justinsalamon.com/uploads/4/3/9/4/4394963/salamon_urbansound_acmmm14.pdf)。需特别说明,本训练集所使用的纯净语音与噪声录音,与后续三类测试集所用数据相互独立,这三类测试集分别为Deep Xi测试集()、来源未指定测试集()以及DEMAND Voicebank测试集()。
创建时间:
2023-06-28
搜集汇总
数据集介绍

背景与挑战
背景概述
Deep Xi数据集是一个专为语音增强和语音分离任务设计的开源数据集,包含训练集和测试集。训练集结合了来自Librispeech和CSTR VCTK的干净语音以及多个噪声数据库的噪声,测试集则用于评估模型性能,包含特定噪声条件下的语音样本。数据集以.wav格式提供,总大小为21.69 GB。
以上内容由遇见数据集搜集并总结生成



