Cnam-LMSSC/multilingual_librispeech_spanish_phoneme
收藏Hugging Face2026-03-11 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Cnam-LMSSC/multilingual_librispeech_spanish_phoneme
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators:
- machine-generated
language:
- es
language_creators:
- crowdsourced
license: cc-by-4.0
multilinguality:
- monolingual
size_categories:
- 1K<n<10K
- 10K<n<100K
source_datasets:
- facebook/multilingual_librispeech
task_categories:
- automatic-speech-recognition
- text-to-speech
pretty_name: Multilingual LibriSpeech Spanish (Phonemized & Curated)
tags:
- audio
- speech
- phonemes
- ipa
- spanish
- mls
- librispeech
- cnam-lmssc
---
# Multilingual LibriSpeech Spanish Phoneme
## Dataset Description
- **Repository:** [Cnam-LMSSC/multilingual_librispeech_spanish_phoneme](https://huggingface.co/datasets/Cnam-LMSSC/multilingual_librispeech_spanish_phoneme)
- **Point of Contact:** [Cnam-LMSSC Team](https://huggingface.co/Cnam-LMSSC)
### Dataset Summary
This dataset is a **curated version of the Spanish subset of Multilingual LibriSpeech (MLS)**, enriched with a phonetic transcription column (`phoneme`).
The **Laboratoire de Mécanique des Structures et des Systèmes Couplés (Cnam-LMSSC)** created this version to facilitate research into Spanish acoustic modeling, phoneme recognition, and speech synthesis. It builds upon the high-quality audio derived from LibriVox audiobooks found in the original MLS dataset.
### Supported Tasks
- **Phoneme Recognition:** Training models to map audio features directly to IPA phonemes.
- **Automatic Speech Recognition (ASR):** Robust speech-to-text tasks using audiobook data.
- **Text-to-Speech (TTS):** Utilizing the high-quality, read-speech nature of LibriSpeech for synthesis.
### Languages
The dataset contains audio in **Spanish (`es`)**.
## Dataset Structure
### Data Instances
A typical data point contains the audio, metadata about the speaker and chapter (derived from LibriVox), the original text, and the generated phonetic transcription.
```python
{
'id': '1234_5678_00001',
'audio': {
'path': '1234_5678_00001.flac',
'array': array([-0.012, 0.045, ...], dtype=float32),
'sampling_rate': 16000
},
'text': "pestañeado quedóse en donde estaba apoyándose apenas en su muleta",
'phoneme': "pestaɲeaðo keðose en donde estaβa apoʝandose apenas en su muleta",
'speaker_id': 1234,
'chapter_id': 5678,
}
提供机构:
Cnam-LMSSC



