five

allandclive/UgandaLex

收藏
Hugging Face2023-07-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/allandclive/UgandaLex
下载链接
链接失效反馈
官方服务:
资源简介:
--- task_categories: - text-generation - translation language: - ach - alz - teo - gwr - adh - keo - kin - laj - lgg - myx - kdj - nyn - nuj - xog - lg - en - luc - kbo - tjl - rub pretty_name: UgandaLex size_categories: - 1K<n<10K --- ### UgandaLex: A Parallel Text Translation Corpus in 21 Ugandan Languages UgandaLex Parallel Texts in Ugandan Languages is a remarkable dataset consisting of parallel texts sourced from Bible translations across 21 Ugandan languages. This expansive corpus provides an invaluable resource for studying and analyzing the linguistic variations and nuances within Uganda's diverse language landscape. With aligned texts from various Bible translations, researchers, linguists, and developers can delve into the intricacies of Ugandan languages, explore translation patterns, and investigate the cultural and linguistic heritage of different communities. UgandaLex opens up avenues for advancing research in computational linguistics, cross-linguistic analysis, and the development of language technologies tailored specifically for Ugandan languages. ### Languages Acholi, Alur, Aringa, Ateso, Ganda, Gwere, Jopadhola, Kakwa, Kinyarwanda, Kumam, Lango, Lugbara, Masaaba, Ng'akarimojong, Nyankore, Nyole, Soga, Swahili, English, Gungu, Keliko, Talinga-Bwisi ### Contributors @allandclive & @oumo_os
提供机构:
allandclive
原始信息汇总

数据集概述

名称: UgandaLex

任务类别:

  • 文本生成
  • 翻译

语言:

  • Acholi
  • Alur
  • Aringa
  • Ateso
  • Ganda
  • Gwere
  • Jopadhola
  • Kakwa
  • Kinyarwanda
  • Kumam
  • Lango
  • Lugbara
  • Masaaba
  • Ngakarimojong
  • Nyankore
  • Nyole
  • Soga
  • Swahili
  • English
  • Gungu
  • Keliko
  • Talinga-Bwisi

美观名称: UgandaLex

大小类别:

  • 1K<n<10K

数据集详情

UgandaLex是一个包含21种乌干达语言的平行文本翻译语料库,主要来源于圣经翻译。该数据集为研究乌干达语言的多样性和语言细微差别提供了宝贵的资源。通过不同圣经翻译的文本对齐,研究人员、语言学家和开发者可以深入探讨乌干达语言的复杂性,探索翻译模式,并研究不同社区的文化和语言遗产。UgandaLex为计算语言学、跨语言分析和针对乌干达语言的语言技术开发提供了研究途径。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作