five

nharshavardhana/Santali-Ol-Chiki-Agriculture_Question-Answer_Dataset

收藏
Hugging Face2026-04-23 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/nharshavardhana/Santali-Ol-Chiki-Agriculture_Question-Answer_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
Santali (Ol Chiki)农业问答数据集是一个精心策划的问答对集合,专注于Santali语言中的农业知识,使用Ol Chiki文字书写。数据集包含Santali语言中关于农业、畜牧业和农村健康主题的问答对。内容涵盖针对部落社区的作物疾病、土壤管理、牲畜护理和耕作技术。该数据集旨在支持低资源语言NLP的研究和开发,同时为代表性不足的语言社区提供特定领域的知识。农业仍然是许多讲Santali语言人口的主要生计,但以他们的母语提供的数字可访问知识极为有限。该数据集旨在通过提供结构化的、机器可读的Santali农业问答数据来弥合这一差距。

The Santali (Ol Chiki) Agriculture Question-Answer Dataset is a curated collection of question–answer pairs focused on agricultural knowledge in the Santali language, written in the Ol Chiki script. The dataset consists of question-answer pairs in the Santali language focusing on agriculture, animal husbandry, and rural health topics. The content covers crop diseases, soil management, livestock care, and farming techniques tailored for tribal communities. This dataset is designed to support research and development in low-resource language NLP, while also enabling access to domain-specific knowledge for underrepresented linguistic communities. Agriculture remains a primary livelihood for many Santali-speaking populations, yet digitally accessible knowledge in their native language is extremely limited. This dataset aims to bridge that gap by providing structured, machine-readable agricultural Q&A data in Santali.
提供机构:
nharshavardhana
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作