Youtube-Dataset for Language Identification in Speech Signals
Source: NIAID Data Ecosystem, indexed 2026-03-11
Download link:
https://zenodo.org/record/3968291
Description:
Youtube-Dataset for Language Identification in Speech Signals
- For scientific use only. For questions, contact: jakob.abesser@idmt.fraunhofer.de
Reference
If you use this dataset in your research, please cite:
Alexandra Draghici, Jakob Abeßer & Hanna Lukashevich: A Study on Spoken Language Identification
using Deep Neural Networks, Proceedings of the Audio Mostly Conference 2020
Dataset
The YouTube News Collection is a collection of videos from various
YouTube news channels. We gathered data from channels such as BBC
News, France24, DW News, and Noticias Telemundo.
- 135664 npy files (NumPy arrays saved from Python)
- each npy file contains the mel spectrogram (see below) of one audio file
- the subfolders "0" to "5" encode the language id:
0 - English
1 - French
2 - German
3 - Greek
4 - Italian
5 - Spanish
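
The folder layout above can be read with a few lines of Python. This is only a minimal sketch: the helper names `iter_examples` and `load_spectrogram` are hypothetical, and it assumes the npy files sit somewhere below the subfolders "0" to "5" of a local copy of the dataset. The orientation of the stored arrays (bands x frames or frames x bands) is not stated in the description, so check `.shape` after loading.

```python
from pathlib import Path

import numpy as np

# Subfolder name -> language, as listed in the dataset description.
LANGUAGES = {
    "0": "English",
    "1": "French",
    "2": "German",
    "3": "Greek",
    "4": "Italian",
    "5": "Spanish",
}


def iter_examples(root):
    """Yield (path, language_id, language) for each npy file under the label folders.

    Assumption: `root` is the directory that contains the subfolders "0" to "5".
    """
    root = Path(root)
    for folder, language in sorted(LANGUAGES.items()):
        label_dir = root / folder
        if not label_dir.is_dir():
            continue  # tolerate partial downloads
        for path in sorted(label_dir.rglob("*.npy")):
            yield path, int(folder), language


def load_spectrogram(path):
    """Load one mel spectrogram as a NumPy array."""
    return np.load(path)
```

For example, `next(iter_examples("path/to/dataset"))` would return the first file found together with its language id and name, where the path is whatever location the archive was extracted to.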
Audio Processing
- mono, sample rate 22.05 kHz
- mel spectrogram (librosa python package)
- window size 512 samples
- hop size 441 samples (20 ms)
- 129 mel bands
- file-level spectrograms are normalized to a maximum of 1
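
The parameters above could be applied to new audio roughly as follows. This is a sketch under assumptions, not the authors' extraction script: it treats the 512-sample window as librosa's `n_fft`, keeps librosa's default power spectrogram, and applies no log scaling, none of which is confirmed by the description. The function and constant names are hypothetical.

```python
import numpy as np

SAMPLE_RATE = 22050  # Hz, mono
N_FFT = 512          # window size in samples (assumed to equal the FFT size)
HOP_LENGTH = 441     # hop size in samples
N_MELS = 129         # number of mel bands


def hop_duration_ms(hop_length=HOP_LENGTH, sample_rate=SAMPLE_RATE):
    """Return the hop size in milliseconds."""
    return 1000.0 * hop_length / sample_rate


def normalize_to_max(spec):
    """Scale a spectrogram so that its maximum is 1.

    An all-zero (silent) input is returned unchanged to avoid dividing by zero.
    """
    spec = np.asarray(spec, dtype=np.float32)
    peak = spec.max()
    return spec / peak if peak > 0 else spec


def mel_spectrogram(audio_path):
    """Compute a file-level mel spectrogram normalized to a maximum of 1."""
    import librosa  # imported here so the helpers above work without librosa

    y, sr = librosa.load(audio_path, sr=SAMPLE_RATE, mono=True)
    mel = librosa.feature.melspectrogram(
        y=y, sr=sr, n_fft=N_FFT, hop_length=HOP_LENGTH, n_mels=N_MELS
    )
    return normalize_to_max(mel)
```

The hop size matches the stated 20 ms, since 441 / 22050 s = 0.02 s. With these settings librosa returns an array of shape (129, number of frames).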
Created:
2020-08-01



