five

Google Speech Commands - Musan Dataset

收藏
paperswithcode.com2025-03-24 收录
下载链接:
https://paperswithcode.com/dataset/google-speech-commands-musan
下载链接
链接失效反馈
官方服务:
资源简介:
This noisy speech test set is created from the Google Speech Commands v2 [1] and the Musan dataset[2]. It could be downloaded here: https://zenodo.org/record/6066174#.Yn7NPJPMLyU Specifically, we created this test set by mixing the speech in the Google Speech Commands v2 test set with random noise in the Musan dataset at different signal to noise ratio -12.5,-10,0,10,20,30 and 40 decibel (dB). The Google Speech Commands v2 dataset is under the Creative Commons BY 4.0 license. It could be downloaded at: http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz The Musan dataset is under Attribution 4.0 International (CC BY 4.0). It could be downlowned at https://www.openslr.org/17/ Citations: [1] Pete Warden, “Speech commands: A dataset for limited-vocabulary speech recognition,” arXiv preprint arXiv:1804.03209, 2018. [2] David Snyder, Guoguo Chen, and Daniel Povey, “Musan: A music, speech, and noise corpus,” arXiv preprint arXiv:1510.08484, 2015.

本噪声语音测试集由谷歌语音命令v2数据集[1]与Musan数据集[2]融合而成。该测试集可通过以下链接获取:https://zenodo.org/record/6066174#.Yn7NPJPMLyU。具体而言,我们通过将谷歌语音命令v2测试集中的语音与Musan数据集中的随机噪声以不同的信噪比进行混合,包括-12.5、-10、0、10、20、30和40分贝(dB),构建了此测试集。谷歌语音命令v2数据集遵循Creative Commons BY 4.0许可协议,可从以下地址下载:http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz。Musan数据集遵循Attribution 4.0国际(CC BY 4.0)许可协议,可从以下网址下载:https://www.openslr.org/17/。 参考文献: [1] Pete Warden,“Speech commands: A dataset for limited-vocabulary speech recognition,” arXiv预印本 arXiv:1804.03209,2018。 [2] David Snyder,Guoguo Chen,及Daniel Povey,“Musan: A music, speech, and noise corpus,” arXiv预印本 arXiv:1510.08484,2015。
提供机构:
paperswithcode.com
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作