Amharic health question answering dataset

NIAID Data Ecosystem2026-05-10 收录

下载链接：

https://data.mendeley.com/datasets/8cks7m5f8s

下载链接

链接失效反馈

官方服务：

资源简介：

The **AmHQA** dataset is an Amharic health question answering corpus curated to support research in low-resource language natural language processing and medical question answering. The dataset consists of **1600 question–answer pairs for training** and **400 pairs for testing**, all provided in **CSV format**. The content is written entirely in **Amharic** and is intended to facilitate the development and evaluation of extractive and neural question answering systems in the health domain. AmHQA is released under the **Creative Commons Attribution 4.0 International (CC BY 4.0) licence**, allowing unrestricted use, distribution, and adaptation with appropriate attribution. Researchers using this dataset are requested to cite it as: *Bogale, B., et al. (2026). AmHQA: An Amharic Health Question Answering Dataset. Mendeley Data*. For further information or inquiries, please contact **[berhanubogale0101@gmail.com](mailto:berhanubogale0101@gmail.com)** (mobile: **0938282528**).

创建时间：

2026-01-05

5,000+

优质数据集

54 个

任务类型

进入经典数据集