five

willson8972/aura-african-language-corpus

收藏
Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/willson8972/aura-african-language-corpus
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - tw license: cc-by-4.0 task_categories: - text-generation - translation - automatic-speech-recognition tags: - african-languages - twi - asante-twi - nlp - low-resource - instruction-tuning pretty_name: Aura African Language Corpus size_categories: - 1K<n<10K --- # Aura African Language Corpus Collected and curated by [Aura OS](https://auraos.uk) — an AI operating system built Africa-first. ## Dataset Description 5450 approved samples covering Asante Twi expressions, code-switching, proverbs, and instruction pairs. All community-contributed and human-reviewed. ## Languages twi_Latn ## Sample Types - **expression**: Natural Asante Twi phrases and expressions - **codeswitching**: Mixed Twi-English phrases (real conversational style) - **proverb**: Traditional Akan proverbs with English meanings - **instruction_pair**: Prompt → response pairs for fine-tuning LLMs in Asante Twi ## License CC-BY 4.0 — free to use with attribution to [Aura OS](https://auraos.uk).
提供机构:
willson8972
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作