zainulhakim/Stella_Four_Accents_ASR
收藏Hugging Face2024-06-06 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/zainulhakim/Stella_Four_Accents_ASR
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
- name: text
dtype: string
splits:
- name: arabs_train
num_bytes: 45468480.0
num_examples: 71
- name: arabs_validation
num_bytes: 9606017.0
num_examples: 15
- name: arabs_test
num_bytes: 10246418.0
num_examples: 16
- name: indians_train
num_bytes: 44828079.0
num_examples: 70
- name: indians_validation
num_bytes: 9606017.0
num_examples: 15
- name: indians_test
num_bytes: 10246418.0
num_examples: 16
- name: chinese_train
num_bytes: 67242119.0
num_examples: 105
- name: chinese_validation
num_bytes: 14088825.0
num_examples: 22
- name: chinese_test
num_bytes: 15369627.0
num_examples: 24
- name: us_train
num_bytes: 44187678.0
num_examples: 69
- name: us_validation
num_bytes: 8965616.0
num_examples: 14
- name: us_test
num_bytes: 10246418.0
num_examples: 16
download_size: 286427705
dataset_size: 290101712.0
configs:
- config_name: default
data_files:
- split: arabs_train
path: data/arabs_train-*
- split: arabs_validation
path: data/arabs_validation-*
- split: arabs_test
path: data/arabs_test-*
- split: indians_train
path: data/indians_train-*
- split: indians_validation
path: data/indians_validation-*
- split: indians_test
path: data/indians_test-*
- split: chinese_train
path: data/chinese_train-*
- split: chinese_validation
path: data/chinese_validation-*
- split: chinese_test
path: data/chinese_test-*
- split: us_train
path: data/us_train-*
- split: us_validation
path: data/us_validation-*
- split: us_test
path: data/us_test-*
---
提供机构:
zainulhakim
原始信息汇总
数据集概述
数据集特征
- 音频(audio)
- 数据类型:音频
- 采样率:16000
- 文本(text)
- 数据类型:字符串
数据集分割
- 阿拉伯语数据集
- 训练集(arabs_train)
- 字节数:45468480.0
- 示例数:71
- 验证集(arabs_validation)
- 字节数:9606017.0
- 示例数:15
- 测试集(arabs_test)
- 字节数:10246418.0
- 示例数:16
- 训练集(arabs_train)
- 印度语数据集
- 训练集(indians_train)
- 字节数:44828079.0
- 示例数:70
- 验证集(indians_validation)
- 字节数:9606017.0
- 示例数:15
- 测试集(indians_test)
- 字节数:10246418.0
- 示例数:16
- 训练集(indians_train)
- 中文数据集
- 训练集(chinese_train)
- 字节数:67242119.0
- 示例数:105
- 验证集(chinese_validation)
- 字节数:14088825.0
- 示例数:22
- 测试集(chinese_test)
- 字节数:15369627.0
- 示例数:24
- 训练集(chinese_train)
- 美国英语数据集
- 训练集(us_train)
- 字节数:44187678.0
- 示例数:69
- 验证集(us_validation)
- 字节数:8965616.0
- 示例数:14
- 测试集(us_test)
- 字节数:10246418.0
- 示例数:16
- 训练集(us_train)
数据集大小
- 下载大小:286427705
- 数据集总大小:290101712.0



