allandclive/UgandaLex
Source: Hugging Face. Updated 2023-07-10. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/allandclive/UgandaLex
Resource description:
---
task_categories:
- text-generation
- translation
language:
- ach
- alz
- teo
- gwr
- adh
- keo
- kin
- laj
- lgg
- myx
- kdj
- nyn
- nuj
- xog
- lg
- en
- luc
- kbo
- tjl
- rub
pretty_name: UgandaLex
size_categories:
- 1K<n<10K
---
### UgandaLex: A Parallel Text Translation Corpus in 21 Ugandan Languages
UgandaLex is a dataset of parallel texts sourced from Bible translations in 21 Ugandan languages. The corpus is a resource for studying linguistic variation across Uganda's languages. Because the texts are aligned across translations, researchers, linguists, and developers can compare the languages directly, explore translation patterns, and investigate the cultural and linguistic heritage of different communities. It is intended to support research in computational linguistics, cross-linguistic analysis, and the development of language technologies for Ugandan languages.
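As a rough illustration of how aligned parallel text can be turned into translation pairs, the sketch below assumes one record per verse with one field per language code. The field names (`verse_id`, `en`, `lg`, `ach`), the helper `translation_pairs`, and the placeholder strings are hypothetical and are not taken from the actual UgandaLex files; check the dataset's real schema before adapting it.

```python
# Hypothetical aligned rows: one dict per verse, keyed by language code.
# Field names and placeholder strings are illustrative only; they do not
# come from the actual UgandaLex files.
rows = [
    {"verse_id": "GEN 1:1", "en": "<English text>", "lg": "<Ganda text>", "ach": "<Acholi text>"},
    {"verse_id": "GEN 1:2", "en": "<English text>", "lg": "<Ganda text>", "ach": ""},
]


def translation_pairs(rows, src, tgt):
    """Return (source, target) pairs for rows where both sides are non-empty."""
    pairs = []
    for row in rows:
        source_text = (row.get(src) or "").strip()
        target_text = (row.get(tgt) or "").strip()
        # Skip verses that are missing in either language.
        if source_text and target_text:
            pairs.append((source_text, target_text))
    return pairs


en_lg = translation_pairs(rows, "en", "lg")
en_ach = translation_pairs(rows, "en", "ach")
print(len(en_lg), len(en_ach))
```

Dropping rows where either side is empty matters for a corpus like this, since not every translation necessarily covers the same set of verses.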
### Languages
Acholi, Alur, Aringa, Ateso, Ganda, Gwere, Jopadhola, Kakwa, Kinyarwanda, Kumam, Lango, Lugbara, Masaaba, Ng'akarimojong, Nyankore, Nyole, Soga, Swahili, English, Gungu, Keliko, Talinga-Bwisi
### Contributors
@allandclive & @oumo_os
Provider:
allandclive
Summary of original information
Dataset overview
Name: UgandaLex
Task categories:
- Text generation
- Translation
Languages:
- Acholi
- Alur
- Aringa
- Ateso
- Ganda
- Gwere
- Jopadhola
- Kakwa
- Kinyarwanda
- Kumam
- Lango
- Lugbara
- Masaaba
- Ng'akarimojong
- Nyankore
- Nyole
- Soga
- Swahili
- English
- Gungu
- Keliko
- Talinga-Bwisi
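Pretty name: UgandaLex
Size categories:
- 1K<n<10K
Dataset details
UgandaLex is a parallel text translation corpus covering 21 Ugandan languages, sourced mainly from Bible translations. The dataset is a resource for studying the diversity and fine-grained differences of Ugandan languages. Because texts from the different Bible translations are aligned, researchers, linguists, and developers can examine these languages in depth, explore translation patterns, and study the cultural and linguistic heritage of different communities. UgandaLex supports research in computational linguistics, cross-linguistic analysis, and the development of language technologies for Ugandan languages.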



