Bilingual Personal Health Conversational Dataset for English and Yoruba
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/bilingual-personal-health-conversational-dataset-english-and-yoruba
下载链接
链接失效反馈官方服务:
资源简介:
This dataset features 26,000 personal health inquiries and responses in a conversational format, in both English and Yoruba. Sourced from reputable health forums such as iCliniq, eHealth Forum, Question Doctors, and WebMD, it is licensed under the MIT license, with all personal identifiers removed to ensure privacy. Certified health professionals addressed all English inquiries. The English dataset was translated into Yoruba using Neural Machine Translation (NMT) via the Google Translate API. The English data was split into chunks of 1,000 samples for translation, utilizing default Google Translate parameters to maintain consistency and accuracy.
提供机构:
Femi Godslove, Julius



