Youtube-Dataset for Language Identification in Speech Signals

NIAID Data Ecosystem2026-03-11 收录

下载链接：

https://zenodo.org/record/3968291

下载链接

链接失效反馈

官方服务：

资源简介：

Youtube-Dataset for Language Identification in Speech Signals - for scientific use only, for questions contact: jakob.abesser@idmt.fraunhofer.de Reference In case you use this dataset for your research, please cite Alexandra Draghici, Jakob Abeßer & Hanna Lukashevich: A Study on Spoken Language Identification using Deep Neural Networks, Proceedings of the Audio Mostly Conference 2020 Dataset The YouTube News Collection is a collection of videos from various Youtube news channels. We gathered data from channels like BBC news, France24, DW News, and Noticias Telemundo. - 135664 npy files (numpy matrices exported from Python) - each npy file includes a mel spectrogram (see below) of an audio file - the subfolders "0" - "5" encode the language id: 0 - English 1 - French 2 - German 3 - Greek 4 - Italian 5 - Spanish Audio Processing - mono, sample rate 22.05 kHz - mel spectrogram (librosa python package) - windows size 512 samples - hopsize 441 samples (20 ms) - 129 mel bands - file-level spectrogram are normalized to maximum of 1

创建时间：

2020-08-01

5,000+

优质数据集

54 个

任务类型

进入经典数据集