oumo-os/ugalang_0
收藏Hugging Face2023-07-10 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/oumo-os/ugalang_0
下载链接
链接失效反馈官方服务:
资源简介:
---
language: en
license: mit
tags:
- translation
- east-african-languages
- english
- bible-texts
datasets:
- name: ugalang_0
description: >
The ugalang_0 dataset contains Bible texts translated into East African
languages, including English.
It can be used for various translation tasks and language-related research
in the context of East African languages.
Languages included in the dataset:
- Daasanach
- Masaaba
- Rendille
- Ganda
- Aringa
- Kakwa
- Lugbara
- Talinga-Bwisi
- Samburu
- Lango
- Rundi
- Swahili
- Ateso
- Somali
- English
- Chidigo
- Kinyarwanda
- Gwere
- Acholi
- Kumam
- Jopadhola
- Keliko
- Suba
- Gungu
- Soga
- Nyankore
- Kipfokomo
- Ng'akarimojong
- Nyole
- Kiswahili
- Alur English
task_categories:
- machine-translation
- natural-language-understanding
- multilingual
languages:
- Daasanach
- Masaaba
- Rendille
- Ganda
- Aringa
- Kakwa
- Lugbara
- Talinga-Bwisi
- Samburu
- Lango
- Rundi
- Swahili
- Ateso
- Somali
- English
- Chidigo
- Kinyarwanda
- Gwere
- Acholi
- Kumam
- Jopadhola
- Keliko
- Suba
- Gungu
- Soga
- Nyankore
- Kipfokomo
- Ng'akarimojong
- Nyole
- Kiswahili
- Alur English
licenses:
- MIT
size_in_bytes: <size_in_bytes>
download_size_in_bytes: <download_size_in_bytes>
task_ids:
- machine-translation
- language-modeling
huggingface_hub:
- repository: <link_to_huggingface_hub_repository>
commit: <commit_sha>
---
The ugalang_0 dataset contains Bible texts translated into East African languages, including English. It can be used for various translation tasks and language-related research in the context of East African languages.
## Dataset Details
- Languages: Daasanach, Masaaba, Rendille, Ganda, Aringa, Kakwa, Lugbara, Talinga-Bwisi, Samburu, Lango, Rundi, Swahili, Ateso, Somali, English, Chidigo, Kinyarwanda, Gwere, Acholi, Kumam, Jopadhola, Keliko, Suba, Gungu, Soga, Nyankore, Kipfokomo, Ng'akarimojong, Nyole, Kiswahili, Alur English
- License: MIT
## Dataset Preparation
The dataset was created by collecting Bible texts translated into various East African languages, including English. The texts were obtained from open-source sources with permission to use for research purposes.
提供机构:
oumo-os
原始信息汇总
数据集概述
数据集名称
- 名称: ugalang_0
数据集描述
- 描述: ugalang_0 数据集包含翻译成东非语言的圣经文本,包括英语。该数据集适用于多种翻译任务和东非语言相关的语言研究。
包含语言
- 语言: Daasanach, Masaaba, Rendille, Ganda, Aringa, Kakwa, Lugbara, Talinga-Bwisi, Samburu, Lango, Rundi, Swahili, Ateso, Somali, English, Chidigo, Kinyarwanda, Gwere, Acholi, Kumam, Jopadhola, Keliko, Suba, Gungu, Soga, Nyankore, Kipfokomo, Ngakarimojong, Nyole, Kiswahili, Alur English
许可协议
- 许可: MIT
任务类别
- 任务类别: 机器翻译, 自然语言理解, 多语言
数据集创建
- 创建方法: 通过收集翻译成多种东非语言(包括英语)的圣经文本来创建。文本来源于开放源代码,并获得用于研究目的的使用许可。



