five

HealthAid: Extracting domain targeted high precision procedural knowledge from online communities.

收藏
Mendeley Data2024-03-27 更新2024-06-27 收录
下载链接:
https://data.mendeley.com/datasets/scp7bv5sv4
下载链接
链接失效反馈
官方服务:
资源简介:
HealthAidKB, a Knowledge Base, is the result of an automatic extraction and clustering pipeline of common procedural knowledge in the domain of health. Our goal is to construct domain targeted high precision procedural knowledge base containing task frames. We developed a pipeline of methods leveraging Open IE to extract procedural knowledge by tapping into on-line communities. In addition, we devise a mechanism to canonicalize the task frames into clusters based on the similarity of the problems they intend to solve. The resulting knowledge base shows high precision based on an evaluation by human experts in the domain. We extracted the procedural knowledge by tapping into the health category of wiki how (https://www.wikihow.com/Category:Health ) and how to cure (https://howtocure.com/).

HealthAidKB作为一款知识库(Knowledge Base),是针对健康领域常见流程性知识开展自动提取与聚类流水线工作的成果。本项目的目标是构建面向特定领域、包含任务框架的高精度流程性知识库。我们研发了一套方法流水线,借助开放信息抽取(Open IE)技术,通过挖掘在线社区资源来提取流程性知识。此外,我们设计了一套规范化机制,可根据任务框架拟解决的问题相似度,将其聚类归类。经该领域人类专家评估,最终产出的知识库展现出了较高的精确性。本次研究的流程性知识提取来源为wiki How的健康分类板块(https://www.wikihow.com/Category:Health)以及howtocure网站(https://howtocure.com/)。
创建时间:
2024-01-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作