five

Webis Health CauseNet 2022

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7123482
下载链接
链接失效反馈
官方服务:
资源简介:
An efficient assessment of the health relatedness of text passages is important to mine the web at scale to conduct health sociological analyses or to develop a health search engine. We propose a new efficient and effective termhood score for predicting the health relatedness of phrases and sentences, which achieves 69% recall at over 90% precision on a web dataset with cause–effect statements. It is more effective than state-of-the-art medical entity linkers and as effective but much faster than BERT-based approaches. Using our method, we compile the Webis Health CauseNet 2022, a new resource of 7.8 million health-related cause–effect statements such as “Studies show that stress induces insomnia” in which the cause (‘stress’) and effect (‘insomnia’) are labeled. @InProceedings{schlatt2022health-causenet, author = {Ferdinand Schlatt and Dieter Bettin and Matthias Hagen and Benno Stein and Martin Potthast}, booktitle = {29th International Conference on Computational Linguistics (COLING 2022)}, publisher = {Association for Computational Linguistics}, site = {Gyeongju, Republic of Korea}, title = {{Mining Health-related Cause-Effect Statements with High Precision at Large Scale}}, year = 2022 }
创建时间:
2022-09-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作