egirma/afrivoice-swahili-agriculture-subset
收藏Hugging Face2026-03-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/egirma/afrivoice-swahili-agriculture-subset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
language:
- sw
pretty_name: Afrivoice Swahili Agriculture Subset
task_categories:
- automatic-speech-recognition
tags:
- DigitalUmuganda
- DU
- sw
- swa
- Swahili
- asr
- stt
- voice
- speech
- agriculture
size_categories:
- 100K<n<1M
---
# Dataset Card for the image text and voice dataset
## Dataset Description
Subset of Afrivoice dataset:
- approx. 100 hours of train (sampled, stratified)
- Full dev + test from original repo (DigitalUmuganda/Afrivoice_Swahili)
- Audio: .webm
- Includes images + transcriptions from original repo (DigitalUmuganda/Afrivoice_Swahili)
- Includes metadata csv
## License
CC-BY-4.0 (derived from DigitalUmuganda/Afrivoice_Swahili)
## Notes
- Train split sampled using stratification + image clusters
提供机构:
egirma



