Disease Mentions
Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/99tkhbwvfg
Description:
The data comprises five CSV files containing phrases that mention different disease terms. The largest file contains 13,004 annotated phrases that mention “influenza”, “flu”, “common cold” and “listeria”. The phrases were obtained by paraphrasing tweets using the Hugging Face Pegasus transformer model. This file is intended as training and validation data for developing language models. The other four files contain mentions of “norovirus”, “gastroenteritis”, “stomach flu” and “conjunctivitis”, including conjunctivitis referred to as “pink eye”. The data could be used to build classifiers for web-based disease surveillance systems.
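As a first look at files like these, one might tally which disease terms each phrase mentions before training anything. The sketch below is only an illustration under stated assumptions: the term list comes from the description above, but the column name `phrase` and the sample rows are hypothetical, since the actual CSV headers are not given here.

```python
import csv
import io
import re
from collections import Counter

# Disease terms named in the dataset description.
TERMS = ["influenza", "flu", "common cold", "listeria", "norovirus",
         "gastroenteritis", "stomach flu", "conjunctivitis", "pink eye"]

# Longer terms are listed first so that multi-word terms are preferred
# when several alternatives could start at the same position.
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(t) for t in sorted(TERMS, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)


def find_mentions(phrase):
    """Return the disease terms mentioned in a phrase, lowercased, in order."""
    return [m.group(1).lower() for m in _PATTERN.finditer(phrase)]


def count_mentions(csv_text, text_column="phrase"):
    """Count term mentions in CSV text.

    `text_column` is a hypothetical header name; adjust it to the real file.
    """
    counts = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        counts.update(find_mentions(row[text_column]))
    return counts


# Hypothetical sample rows, not taken from the dataset.
sample = "phrase,label\nCaught the flu again,1\nPink eye is going around,1\n"
print(count_mentions(sample))
```

Word-boundary matching keeps “flu” from being counted inside “influenza”, which matters because both terms appear in the largest file.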
Created:
2023-04-17



