five

Datas of Disease Patterns

收藏
DataCite Commons2025-05-01 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/Datas_of_Disease_Patterns/5035775/2
下载链接
链接失效反馈
官方服务:
资源简介:
1.the "dingxiang_datas.xls"contains all the original data which is crawling from DingXiang forum, and also the word segmentation result for each medical record in the dataset.<br>2.the "pmi_new_words.txt" is the result of new medical word found by calculating mutual information, and the "new_medical_dict.txt" is the updated dictionary which we have used for word segmentation.<br>3.the "structure_medical_record.txt" is the structured text after the word segmentation for original data, which contains the symptoms and diseases in each medical record,and the symptom word frequency is showed in "wordfreq.txt".<br>4.the"0.3_disease-disease_rules.txt"、"0.3_symptom-disease_rules.txt"、"0.3_symptom-symptom_rules.txt"are association rules mined from the dataset which h-confidence threshold is set 0.3 and support threshold is set 0.0001.<br>5.the "network_data.txt" contains the data for building the symptom-disease bipartite network.<br>6.the "network_communities.csv" describes the communities in which each disease belongs.<br>p.s. if you encounter a "d", it means the word is a disease description vocabulary, and "z" or "s" represents a symptom description vocabulary.
提供机构:
figshare
创建时间:
2017-05-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作