five

Deep Xi Training Set

收藏
Mendeley Data2024-03-27 更新2024-06-27 收录
下载链接:
https://ieee-dataport.org/open-access/deep-xi-training-set
下载链接
链接失效反馈
官方服务:
资源简介:
The clean-speech and noise recordings used to train Deep Xi (https://github.com/anicolson/DeepXi). A validation set is also included. Clean speech:The clean-speech recordings are from the test-clean-100 set of Librispeech (http://www.openslr.org/12/) and from the CSTR VCTK corpus (https://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html) (the recordings from speakers p232 and p257 are excluded as they are used in the test set of the DEMAND Voicebank dataset (http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf)). Noise:The noise recordings are from the Environmental Background Noise dataset (https://personal.utdallas.edu/~nxk019000/VAD-dataset/), the Nonspeech dataset (http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus.html), the QUT-NOISE dataset (https://research.qut.edu.au/saivt/databases/qut-noise-databases-and-protocols/), multiple Freesound packs (https://freesound.org/), the noise set of the MUSAN corpus (https://www.openslr.org/17/), the RSG-10 noise database (http://www.steeneken.nl/wp-content/uploads/2014/04/RSG-10_Noise-data-base.pdf) (voice babble, F16, and factory (welding) are excluded as they are used in the Deep Xi Test Set and the Test Set From 10.1016/J.SPECOM.2019.06.002) and the Urban Sound dataset (http://www.justinsalamon.com/uploads/4/3/9/4/4394963/salamon_urbansound_acmmm14.pdf) (street music no. 26,270 is excluded as it is used in the Deep Xi Test Set and the Test Set From 10.1016/J.SPECOM.2019.06.002). Note that the clean-speech and noise recordings used for this training set are separate from those used in the following test sets: Deep Xi Test Set, the Test Set From 10.1016/J.SPECOM.2019.06.002, and the DEMAND Voicebank test set (http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf).

本数据集包含用于训练Deep Xi(https://github.com/anicolson/DeepXi)的纯净语音与噪声录音,同时附带验证集。纯净语音:本数据集的纯净语音录音来源于Librispeech(http://www.openslr.org/12/)的test-clean-100子集,以及CSTR VCTK语料库(https://homepages.inf.ed.ac.uk/jyamagis/page3/page58/page58.html),其中排除了编号为p232与p257的说话人录音,因该部分录音被用于DEMAND Voicebank数据集(http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf)的测试集。噪声:本数据集的噪声录音来源于环境背景噪声数据集(https://personal.utdallas.edu/~nxk019000/VAD-dataset/)、非语音语料库(http://web.cse.ohio-state.edu/pnl/corpus/HuNonspeech/HuCorpus.html)、QUT-NOISE数据集(https://research.qut.edu.au/saivt/databases/qut-noise-databases-and-protocols/)、多个Freesound素材包(https://freesound.org/)、MUSAN语料库的噪声子集(https://www.openslr.org/17/)、RSG-10噪声数据库(http://www.steeneken.nl/wp-content/uploads/2014/04/RSG-10_Noise-data-base.pdf),其中人声嘈杂、F16与工厂(焊接)噪声被排除,因该类噪声被用于Deep Xi测试集及10.1016/J.SPECOM.2019.06.002的测试集;此外城市声音数据集(http://www.justinsalamon.com/uploads/4/3/9/4/4394963/salamon_urbansound_acmmm14.pdf)中的编号26270的街头音乐录音被排除,因该录音被用于Deep Xi测试集及10.1016/J.SPECOM.2019.06.002的测试集。注意:本训练集所使用的纯净语音与噪声录音,与以下三类测试集所使用的录音相互独立:Deep Xi测试集、10.1016/J.SPECOM.2019.06.002的测试集,以及DEMAND Voicebank测试集(http://ssw9.talp.cat/papers/ssw9_PS2-4_Valentini-Botinhao.pdf)。
创建时间:
2023-06-28
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作