israel/waxal-autolabled
收藏Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/israel/waxal-autolabled
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: string
- name: speaker_id
dtype: string
- name: transcription
dtype: string
- name: language
dtype: string
- name: gender
dtype: string
- name: audio
dtype: audio
splits:
- name: amh_asr
num_bytes: 138046409849.875
num_examples: 80321
- name: orm_asr
num_bytes: 138890235695.75
num_examples: 76738
- name: sid_asr
num_bytes: 131939182435.875
num_examples: 78777
- name: wal_asr
num_bytes: 145520665649.375
num_examples: 87381
- name: tir_asr
num_bytes: 132465457266.375
num_examples: 76093
download_size: 826777635270
dataset_size: 686861950897.25
configs:
- config_name: default
data_files:
- split: amh_asr
path: data/unlabeled-*
- split: orm_asr
path: data/oromo-*
- split: sid_asr
path: data/sid_asr-*
- split: wal_asr
path: data/wal_asr-*
- split: tir_asr
path: data/tir_asr-*
language:
- am
- om
- ti
---
# Auot-Lableing Waxal unlabeled dataset on Best Multilingual Ethio-ASR models

```
@article{abdullah2026ethio,
title={Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages},
author={Abdullah, Badr M and Azime, Israel Abebe and Tonja, Atnafu Lambebo and Alabi, Jesujoba O and Alemu, Abel Mulat and Hagos, Eyob G and Balcha, Bontu Fufa and Nerea, Mulubrhan A and Yadeta, Debela Desalegn and Marilign, Dagnachew Mekonnen and others},
journal={arXiv preprint arXiv:2603.23654},
year={2026}
}
```
提供机构:
israel



