
Youtube-Dataset for Language Identification in Speech Signals

Indexed by NIAID Data Ecosystem on 2026-03-11
Download link:
https://zenodo.org/record/3968291
Description:
For scientific use only. For questions, contact: jakob.abesser@idmt.fraunhofer.de

Reference

If you use this dataset in your research, please cite:
Alexandra Draghici, Jakob Abeßer & Hanna Lukashevich: A Study on Spoken Language Identification using Deep Neural Networks, Proceedings of the Audio Mostly Conference 2020

Dataset

The YouTube News Collection is a collection of videos from various YouTube news channels. We gathered data from channels like BBC News, France24, DW News, and Noticias Telemundo.

- 135664 npy files (NumPy matrices exported from Python)
- each npy file contains a mel spectrogram (see below) of an audio file
- the subfolders "0" - "5" encode the language id:
  0 - English
  1 - French
  2 - German
  3 - Greek
  4 - Italian
  5 - Spanish

Audio Processing

- mono, sample rate 22.05 kHz
- mel spectrogram (librosa Python package)
- window size 512 samples
- hop size 441 samples (20 ms)
- 129 mel bands
- file-level spectrograms are normalized to a maximum of 1
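
The folder layout and frame timing described above can be sketched in Python as follows. This is a minimal sketch, not the authors' loading code. It assumes the npy files sit directly inside the subfolders "0" to "5" and that each array is stored as (mel bands, frames), which is librosa's default layout but is not confirmed by the description. The names LANGUAGES, iter_spectrograms, and duration_seconds are hypothetical helpers introduced for illustration.

```python
from pathlib import Path

import numpy as np

# Folder-name-to-language mapping as listed in the dataset description.
LANGUAGES = {
    0: "English",
    1: "French",
    2: "German",
    3: "Greek",
    4: "Italian",
    5: "Spanish",
}

SAMPLE_RATE = 22050  # Hz, from the description (22.05 kHz)
HOP_SIZE = 441       # samples per frame step, i.e. 441 / 22050 = 20 ms


def iter_spectrograms(root):
    """Yield (spectrogram, language_id) for every .npy file in root/<id>/.

    Assumption: files are stored flat inside each language subfolder.
    """
    root = Path(root)
    for lang_id in sorted(LANGUAGES):
        folder = root / str(lang_id)
        if not folder.is_dir():
            continue  # tolerate partially downloaded copies
        for path in sorted(folder.glob("*.npy")):
            yield np.load(path), lang_id


def duration_seconds(spec, time_axis=1):
    """Approximate clip duration from the frame count.

    Assumption: frames run along `time_axis` (axis 1 for a
    (mel bands, frames) array); pass time_axis=0 if the files
    turn out to be stored transposed.
    """
    return spec.shape[time_axis] * HOP_SIZE / SAMPLE_RATE
```

A typical use would be to loop over `iter_spectrograms("path/to/dataset")`, where the path is a placeholder for wherever the archive was extracted, and look up `LANGUAGES[lang_id]` for a readable label. Checking the shape of the first loaded array against the 129 mel bands stated above is a quick way to confirm which axis holds time.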
Created:
2020-08-01