trivitaai/Blum_vi_voice
Source: Hugging Face. Updated 2026-04-08; indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/trivitaai/Blum_vi_voice
Description:
---
language:
- vi
license: cc-by-4.0
task_categories:
- automatic-speech-recognition
---
# Blum Vi Voice
Vietnamese speech dataset with 757,400 samples (~148 GB of audio).
## Fields
- `split`: train/test split
- `index`: sample index
- `audio_path`: relative path to audio file in this repo (under `data/`)
- `sampling_rate`: audio sampling rate in Hz (16000)
- `duration`: duration in seconds
- `text`: transcript
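As a rough illustration of how these fields fit together, the sketch below sums `duration` per `split` to estimate hours of audio. The sample records, including the file names and `.wav` extension, are invented for illustration and are not taken from the dataset; `hours_per_split` is a hypothetical helper, not part of any library.

```python
from collections import defaultdict

# Hypothetical records shaped like the field list above.
# All values (paths, durations, transcripts) are made up for illustration.
records = [
    {"split": "train", "index": 0, "audio_path": "data/example_0.wav",
     "sampling_rate": 16000, "duration": 2.5, "text": "xin chào"},
    {"split": "train", "index": 1, "audio_path": "data/example_1.wav",
     "sampling_rate": 16000, "duration": 4.0, "text": "cảm ơn"},
    {"split": "test", "index": 0, "audio_path": "data/example_2.wav",
     "sampling_rate": 16000, "duration": 3.0, "text": "tạm biệt"},
]


def hours_per_split(rows):
    """Return total audio duration in hours for each value of `split`."""
    totals = defaultdict(float)
    for row in rows:
        # `duration` is documented as seconds, so accumulate seconds first.
        totals[row["split"]] += row["duration"]
    return {split: seconds / 3600.0 for split, seconds in totals.items()}


for split, hours in hours_per_split(records).items():
    print(f"{split}: {hours:.6f} h")
```

The same function should work on rows from the real dataset, provided each row exposes `split` and `duration` as described above.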
## Loading
```python
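# Requires the Hugging Face `datasets` library (pip install datasets).
from datasets import load_dataset

# Load the dataset from the Hugging Face Hub by its repository ID.
ds = load_dataset("trivitaai/Blum_vi_voice")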
```
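Since `audio_path` is described as a path relative to this repo, one way to locate a single audio file is to build its download URL from the Hub's `resolve` URL pattern. This is a minimal sketch under several assumptions: that `audio_path` is relative to the repository root (and so already includes the `data/` prefix), that the files live on the `main` revision, and that the example path is made up. `audio_url` is a hypothetical helper, not a library function.

```python
from urllib.parse import quote

REPO_ID = "trivitaai/Blum_vi_voice"


def audio_url(audio_path, repo_id=REPO_ID, revision="main",
              base_url="https://huggingface.co"):
    """Build a download URL for one file in the dataset repository.

    Assumes `audio_path` is relative to the repository root.
    """
    # quote() percent-encodes unsafe characters but keeps "/" separators.
    return f"{base_url}/datasets/{repo_id}/resolve/{revision}/{quote(audio_path)}"


# "data/example_0.wav" is a made-up path used only for illustration.
print(audio_url("data/example_0.wav"))
```

If the assumption about the `data/` prefix turns out to be wrong for the actual rows, the path would need to be adjusted before it is passed to the helper.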
Provided by:
trivitaai



