amanuelbyte/african_speech_dataset_new_uncleaned
收藏Hugging Face2026-03-17 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/amanuelbyte/african_speech_dataset_new_uncleaned
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: afr
features:
- name: text
dtype: string
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 4860234691.0
num_examples: 49432
download_size: 4807482901
dataset_size: 4860234691.0
- config_name: amh
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: text
dtype: string
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 78064630539.019
num_examples: 210617
download_size: 73254772458
dataset_size: 78064630539.019
- config_name: arz
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: text
dtype: string
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 28824557569.304
num_examples: 208432
download_size: 28334304081
dataset_size: 28824557569.304
- config_name: hau
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: text
dtype: string
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 5433399422.0
num_examples: 20494
download_size: 5398892883
dataset_size: 5433399422.0
- config_name: som
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: text
dtype: string
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 5648152174.54
num_examples: 23565
download_size: 5563935936
dataset_size: 5648152174.54
- config_name: zul
features:
- name: audio
dtype:
audio:
sampling_rate: 16000
decode: false
- name: text
dtype: string
- name: lang
dtype: string
- name: source
dtype: string
splits:
- name: train
num_bytes: 3530661145.056
num_examples: 1899
download_size: 2968407804
dataset_size: 3530661145.056
configs:
- config_name: afr
data_files:
- split: train
path: afr/train-*
- config_name: amh
data_files:
- split: train
path: amh/train-*
- config_name: arz
data_files:
- split: train
path: arz/train-*
- config_name: hau
data_files:
- split: train
path: hau/train-*
- config_name: som
data_files:
- split: train
path: som/train-*
- config_name: zul
data_files:
- split: train
path: zul/train-*
---
提供机构:
amanuelbyte



