lblommesteyn/papuan-climate-science-corpus
收藏Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/lblommesteyn/papuan-climate-science-corpus
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含500条记录,记录了巴布亚新几内亚两种严重缺乏文档的巴布亚语言(Usan和Yélî Dnye)中的气候科学和环境知识。这两种语言的使用者数量较少且处于濒危状态。数据集的结构包括文本、英文翻译、领域、子领域、语言等多种信息。巴布亚新几内亚拥有800多种语言,其中大多数语言缺乏数字文档,而土著语言中的气候知识随着语言的濒危而消失。数据集的使用案例包括土著气候知识保护、气候适应研究、计算语言学等。
This dataset contains 500 records documenting climate science and environmental knowledge in two severely under-documented Papuan languages (Usan and Yélî Dnye) from Papua New Guinea. These languages have small speaker populations and are vulnerable. The dataset structure includes text, English translation, domain, subdomain, language, and other information. Papua New Guinea has over 800 languages, most of which lack digital documentation, and indigenous climate knowledge is disappearing as languages become endangered. Use cases for the dataset include indigenous climate knowledge preservation, climate adaptation research, and computational linguistics.
提供机构:
lblommesteyn



