流感类流行病学数据集
收藏天津市数据知识产权登记平台2024-10-16 更新2024-10-30 收录
下载链接:
https://dengji.tjippc.cn/xxgg_nr?id=657fc59b-d2bf-4007-bdc0-d3c49542c679
下载链接
链接失效反馈官方服务:
资源简介:
多维诊疗数据构建患者主索引:将患者数据特征向量定义为患者性别、住址、家族遗传病、过敏原等信息,使用DBSCAN算法,基于特征向量的密度,将密度相近的数据点划为同一个簇,将患者数据点进行聚类,每个聚类可以视为一个患者群体,作为主索引的标识。
专病诊断名称分类模型:通过分析医学文献、临床数据和专家知识,建立一个诊断数据库。经过分词和打乱顺序的预处理后,使用 train_supervised 函数进行训练(迭代200次,学习率0.1,词N-grams长度为1,损失函数为"hs")。模型性能通过 classification_report 方法评估,表现良好。参数更新通过命令同步模型、标签和标签名,从而快速、准确地诊断专病类型。
Constructing Patient Master Index with Multi-dimensional Medical and Treatment Data: Define the feature vector of patient data as information such as patient's gender, address, family genetic history, allergens and other relevant details. Use the DBSCAN algorithm to cluster patient data points based on the density of their feature vectors, dividing data points with similar densities into the same cluster. Each cluster can be regarded as a patient group, which serves as the identifier for the Patient Master Index.
Specialized Disease Diagnosis Name Classification Model: Establish a diagnosis database by analyzing medical literature, clinical data and expert knowledge. After preprocessing steps including word segmentation and data shuffling, use the train_supervised function for training (trained for 200 iterations, with a learning rate of 0.1, word N-grams length of 1, and loss function set as "hs"). The model performance is evaluated using the classification_report method, showing excellent results. Parameters are updated by synchronizing the model, labels and label names via commands, enabling fast and accurate diagnosis of specialized disease types.
提供机构:
天津健康医疗大数据有限公司
创建时间:
2024-10-14
搜集汇总
数据集介绍

特点
该数据集包含25万条流感类流行病学数据,每月更新,涵盖患者性别、出生日期、诊断疾病名称等多维诊疗信息。适用于医疗、教学和科研领域,支持疾病预防和控制策略的制定,采用DBSCAN算法和专病诊断名称分类模型进行数据处理和分析。
以上内容由遇见数据集搜集并总结生成



