danielk73/commonvoice-indian_accent
Source: Hugging Face. Updated 2026-03-02; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/danielk73/commonvoice-indian_accent
Resource description:
---
dataset_info:
  features:
  - name: client_id
    dtype: string
  - name: audio
    dtype: audio
  - name: sentence_id
    dtype: string
  - name: sentence
    dtype: string
  - name: sentence_domain
    dtype: float64
  - name: up_votes
    dtype: int64
  - name: down_votes
    dtype: int64
  - name: age
    dtype: string
  - name: gender
    dtype: string
  - name: accents
    dtype: string
  - name: variant
    dtype: float64
  - name: locale
    dtype: string
  - name: segment
    dtype: string
  - name: duration_ms
    dtype: int64
  splits:
  - name: train
    num_bytes: 4470860426.32
    num_examples: 110088
  download_size: 4124966078
  dataset_size: 4470860426.32
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
# CommonVoice Indian Accent Dataset
## Dataset Summary
- **Total Audio Duration**: 163.89 hours
- **Number of Recordings**: 110,088
- **Language**: English (Indian Accent)
- **Source**: Mozilla CommonVoice Corpus v21
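
The totals above imply an average clip length of roughly 5.36 seconds (163.89 hours over 110,088 recordings). The sketch below reproduces that arithmetic and shows how the total could be recomputed from the `duration_ms` column. The helper names `mean_clip_seconds` and `total_hours_from_ms` are illustrative and not part of the dataset or any library.

```python
# Figures copied from the dataset card above.
TOTAL_HOURS = 163.89
NUM_RECORDINGS = 110_088


def mean_clip_seconds(total_hours: float, num_recordings: int) -> float:
    """Average clip length in seconds implied by the card's totals."""
    return total_hours * 3600 / num_recordings


def total_hours_from_ms(durations_ms) -> float:
    """Sum per-clip `duration_ms` values and convert the result to hours."""
    return sum(durations_ms) / 3_600_000


if __name__ == "__main__":
    # Prints the mean clip length implied by the summary figures.
    print(f"{mean_clip_seconds(TOTAL_HOURS, NUM_RECORDINGS):.2f} s per clip")
```

Passing the full `duration_ms` column to `total_hours_from_ms` should give a value close to the 163.89 hours reported above, assuming the column covers every row in the train split.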
## Licensing
CC0 - Follows Mozilla CommonVoice dataset terms
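## Usage
A minimal loading sketch, assuming the Hugging Face `datasets` library is installed along with its audio dependencies. Streaming avoids downloading the full ~4.1 GB before reading the first row. The vote-based filter is a hypothetical heuristic for illustration, not something the dataset card prescribes; the helper names are likewise assumptions.

```python
from typing import Any, Dict, Iterable, Iterator

REPO_ID = "danielk73/commonvoice-indian_accent"


def stream_train_split() -> Iterator[Dict[str, Any]]:
    """Stream the single `train` split row by row."""
    # Imported lazily so the helpers below work without `datasets` installed.
    from datasets import load_dataset

    return iter(load_dataset(REPO_ID, split="train", streaming=True))


def more_up_than_down(row: Dict[str, Any]) -> bool:
    """Hypothetical filter: keep clips with more up-votes than down-votes."""
    return row["up_votes"] > row["down_votes"]


def transcripts(rows: Iterable[Dict[str, Any]]) -> Iterator[str]:
    """Yield the `sentence` text of every row that passes the filter."""
    for row in rows:
        if more_up_than_down(row):
            yield row["sentence"]
```

For example, `next(transcripts(stream_train_split()))` would return the first transcript that passes the filter. Each streamed row is a dict keyed by the feature names listed in the card's metadata (`audio`, `sentence`, `accents`, `duration_ms`, and so on).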
Provider:
danielk73



