five

Amharic health question answering dataset

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/8cks7m5f8s
下载链接
链接失效反馈
官方服务:
资源简介:
The **AmHQA** dataset is an Amharic health question answering corpus curated to support research in low-resource language natural language processing and medical question answering. The dataset consists of **1600 question–answer pairs for training** and **400 pairs for testing**, all provided in **CSV format**. The content is written entirely in **Amharic** and is intended to facilitate the development and evaluation of extractive and neural question answering systems in the health domain. AmHQA is released under the **Creative Commons Attribution 4.0 International (CC BY 4.0) licence**, allowing unrestricted use, distribution, and adaptation with appropriate attribution. Researchers using this dataset are requested to cite it as: *Bogale, B., et al. (2026). AmHQA: An Amharic Health Question Answering Dataset. Mendeley Data*. For further information or inquiries, please contact **[berhanubogale0101@gmail.com](mailto:berhanubogale0101@gmail.com)** (mobile: **0938282528**).
创建时间:
2026-01-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作