
oumo-os/ugalang_0

Source: Hugging Face · Updated 2023-07-10 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/oumo-os/ugalang_0
Resource overview:
---
language: en
license: mit
tags:
- translation
- east-african-languages
- english
- bible-texts
datasets:
- name: ugalang_0
  description: >
    The ugalang_0 dataset contains Bible texts translated into East African languages, including English.
    It can be used for various translation tasks and language-related research in the context of East African languages.

    Languages included in the dataset:
    - Daasanach
    - Masaaba
    - Rendille
    - Ganda
    - Aringa
    - Kakwa
    - Lugbara
    - Talinga-Bwisi
    - Samburu
    - Lango
    - Rundi
    - Swahili
    - Ateso
    - Somali
    - English
    - Chidigo
    - Kinyarwanda
    - Gwere
    - Acholi
    - Kumam
    - Jopadhola
    - Keliko
    - Suba
    - Gungu
    - Soga
    - Nyankore
    - Kipfokomo
    - Ng'akarimojong
    - Nyole
    - Kiswahili
    - Alur English
  task_categories:
  - machine-translation
  - natural-language-understanding
  - multilingual
  languages:
  - Daasanach
  - Masaaba
  - Rendille
  - Ganda
  - Aringa
  - Kakwa
  - Lugbara
  - Talinga-Bwisi
  - Samburu
  - Lango
  - Rundi
  - Swahili
  - Ateso
  - Somali
  - English
  - Chidigo
  - Kinyarwanda
  - Gwere
  - Acholi
  - Kumam
  - Jopadhola
  - Keliko
  - Suba
  - Gungu
  - Soga
  - Nyankore
  - Kipfokomo
  - Ng'akarimojong
  - Nyole
  - Kiswahili
  - Alur English
  licenses:
  - MIT
  size_in_bytes: <size_in_bytes>
  download_size_in_bytes: <download_size_in_bytes>
  task_ids:
  - machine-translation
  - language-modeling
  huggingface_hub:
  - repository: <link_to_huggingface_hub_repository>
    commit: <commit_sha>
---

The ugalang_0 dataset contains Bible texts translated into East African languages, including English. It can be used for various translation tasks and language-related research in the context of East African languages.

## Dataset Details

- Languages: Daasanach, Masaaba, Rendille, Ganda, Aringa, Kakwa, Lugbara, Talinga-Bwisi, Samburu, Lango, Rundi, Swahili, Ateso, Somali, English, Chidigo, Kinyarwanda, Gwere, Acholi, Kumam, Jopadhola, Keliko, Suba, Gungu, Soga, Nyankore, Kipfokomo, Ng'akarimojong, Nyole, Kiswahili, Alur English
- License: MIT

## Dataset Preparation

The dataset was created by collecting Bible texts translated into various East African languages, including English. The texts were obtained from open sources, with permission to use them for research purposes.
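
Since the card mentions translation tasks, the sketch below shows one way verse-aligned texts could be turned into parallel sentence pairs. The card does not document the actual file layout or column names of ugalang_0, so everything here is an assumption: the `VerseTable` layout (verse ID mapped to language name mapped to text), the `parallel_pairs` helper, the verse IDs, and the placeholder strings are all hypothetical and only illustrate the idea.

```python
from typing import Dict, Iterator, Tuple

# Hypothetical layout: verse ID -> {language name -> verse text}.
# The real ugalang_0 schema is not documented in this card.
VerseTable = Dict[str, Dict[str, str]]


def parallel_pairs(
    verses: VerseTable, src: str, tgt: str
) -> Iterator[Tuple[str, str, str]]:
    """Yield (verse_id, source_text, target_text) for verses present in both languages."""
    for verse_id, by_lang in sorted(verses.items()):
        # Skip verses that lack a translation on either side.
        if src in by_lang and tgt in by_lang:
            yield verse_id, by_lang[src], by_lang[tgt]


# Placeholder records standing in for real verse text.
sample: VerseTable = {
    "GEN.1.1": {"English": "<English verse text>", "Ganda": "<Ganda verse text>"},
    "GEN.1.2": {"English": "<English verse text>"},  # no Ganda translation: skipped
    "GEN.1.3": {
        "English": "<English verse text>",
        "Ganda": "<Ganda verse text>",
        "Acholi": "<Acholi verse text>",
    },
}

pairs = list(parallel_pairs(sample, "English", "Ganda"))
print([verse_id for verse_id, _, _ in pairs])  # ['GEN.1.1', 'GEN.1.3']
```

Keying on a shared verse ID keeps the alignment explicit and drops verses that are missing in one language, which matters when coverage differs across the listed languages.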
Provider:
oumo-os
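Summary of Original Information

Dataset Overview

Dataset Name

  • Name: ugalang_0

Dataset Description

  • Description: The ugalang_0 dataset contains Bible texts translated into East African languages, including English. It is suitable for various translation tasks and for language research related to East African languages.

Languages Included

  • Languages: Daasanach, Masaaba, Rendille, Ganda, Aringa, Kakwa, Lugbara, Talinga-Bwisi, Samburu, Lango, Rundi, Swahili, Ateso, Somali, English, Chidigo, Kinyarwanda, Gwere, Acholi, Kumam, Jopadhola, Keliko, Suba, Gungu, Soga, Nyankore, Kipfokomo, Ng'akarimojong, Nyole, Kiswahili, Alur English

License

  • License: MIT

Task Categories

  • Task categories: machine translation, natural language understanding, multilingual

Dataset Creation

  • Creation method: Created by collecting Bible texts translated into various East African languages, including English. The texts come from open sources, with permission to use them for research purposes.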