Classification of Text Data on Rare Diseases
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/13882002
Description:
A dataset with text and labels for 3 categories:
- Rare Diseases
- Non-Rare Diseases
- Other
It is a subset of abstracts obtained from PubMed and sorted into the 3 classes on the basis of their MeSH terms.
The dataset is provided for demonstration and methodology validation purposes. The original PubMed data was randomly under-sampled.
The dataset consists of 3 files in the Tab-Separated Values (TSV) format, corresponding to the 3 splits used in the article:
Rei L, Pita Costa J, Zdolšek Draksler T. Automatic Classification and Visualization of Text Data on Rare Diseases. _Journal of Personalized Medicine_. 2024; 14(5):545. https://doi.org/10.3390/jpm14050545
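As an illustration of how one of the TSV splits might be read, here is a minimal sketch using only the Python standard library. The listing does not state the file names or column headers, so the `text` and `label` column names below are assumptions and should be replaced with the headers actually present in the downloaded files.

```python
import csv
from collections import Counter
from pathlib import Path


def load_split(path, text_col="text", label_col="label"):
    """Read one TSV split and return a list of (text, label) pairs.

    The default column names are assumptions; the dataset listing
    does not document the real headers.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        # QUOTE_NONE keeps quotation marks inside abstracts as literal text.
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        return [(row[text_col], row[label_col]) for row in reader]


def label_counts(rows):
    """Count how many examples fall into each class."""
    return Counter(label for _, label in rows)
```

Calling `label_counts(load_split(path))` on each of the 3 files gives a quick check of the class balance after under-sampling, where `path` points to whichever split file was downloaded.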
Created:
2024-10-02



