YMA-MamunAI/barcha-speech-datasetlar
收藏Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/YMA-MamunAI/barcha-speech-datasetlar
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
- name: text
dtype: string
splits:
- name: train
num_examples: 968654
download_size: 120259174400
dataset_size: 172018243066
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
# Barcha O'zbek Speech Datasetlari
O'zbek tili uchun yig'ilgan barcha ochiq audio-matn datasetlari.
- **Rows:** 968,654
- **Audio:** 16kHz, mono
- **Duration filter:** 0.5s - 31s
- **Language:** Uzbek
---
dataset_info:
features:
- name: 音频
dtype:
audio:
sampling_rate: 16000
- name: 文本
dtype: 字符串
splits:
- name: 训练集
num_examples: 968654
download_size: 120259174400
dataset_size: 172018243066
configs:
- config_name: 默认
data_files:
- split: 训练集
path: data/train-*
---
# 全部乌兹别克语语音数据集
专为乌兹别克语收集的全部开源音频-文本数据集。
- **样本总数:** 968,654
- **音频规格:** 16kHz,单声道
- **时长筛选范围:** 0.5秒至31秒
- **语言:** 乌兹别克语
提供机构:
YMA-MamunAI



