Paytmlabs/S2R_Kathbhat_hindi
收藏Hugging Face2026-04-10 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Paytmlabs/S2R_Kathbhat_hindi
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
config_name: hindi
features:
- name: audio
dtype: audio
- name: text
dtype: string
- name: continuation
dtype: string
configs:
- config_name: hindi
data_files:
- split: train
path: data/train/train-*.parquet
- split: validation
path: data/validation/validation-*.parquet
---
# Paytmlabs/S2R_Kathbhat_hindi
Hindi speech dataset prepared from [ai4bharat/Kathbath](https://huggingface.co/datasets/ai4bharat/Kathbath) for Ultravox training.
## Schema
| Column | Type | Description |
|---|---|---|
| `audio` | Audio | Speech audio |
| `text` | string | Verbatim transcript |
| `continuation` | string | LLM-generated continuation (≤50 words) |
## Progress
- Train chunks: 19/19
- Validation: done
提供机构:
Paytmlabs



