
Alzheimer's Disease Dataset

Source: Tianjin Data Intellectual Property Registration Platform (天津市数据知识产权登记平台). Updated 2024-12-27. Indexed 2025-01-13.
Download link:
https://dengji.tjippc.cn/xxgg_nr?id=78354840-d675-4126-90df-3de422cdcb4f
Resource description:
Specialized disease diagnosis name classification model: A diagnostic database is built from medical literature, clinical data, and expert knowledge. After word segmentation and shuffling, the model is trained with the `train_supervised` function (200 iterations, learning rate 0.1, word n-gram length 1, loss function "hs"). Performance is evaluated with the `classification_report` method and is reported as good. Parameter updates are run through commands that synchronize the model, labels, and label names, so that specialized disease types can be diagnosed quickly and accurately.

Electronic medical record quality control classification model: This model applies natural language processing (NLP) to free-text fields of electronic medical records, such as the chief complaint, history of present illness, and past medical history, extracting key information and classifying it. The data contains 7 categories with 250 samples each. Data processing consists of labeling, word segmentation, and conversion to TXT files. The BERT tokenizer converts the record text into the input format BERT requires, and the quality control labels are converted to numeric labels. The training and test sets are split 9:1. The model is trained with `BertForSequenceClassification` and evaluated with the `classification_report` method. To update parameters, the data is placed in the specified folder and the training and update commands are run, which keeps the model, labels, and label names in sync.
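The preprocessing and training settings described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the provider's code: the listing does not name the library behind `train_supervised`, and the sketch assumes it is fastText, whose `train_supervised` accepts `epoch`, `lr`, `wordNgrams`, and `loss="hs"`. Mapping "200 iterations" to `epoch=200` is likewise an assumption. All helper names, the random seed, the `max_length` value, and the `checkpoint` argument are hypothetical.

```python
import random
from typing import Dict, List, Sequence, Tuple

# (label name, text that has already been word-segmented)
Sample = Tuple[str, str]


def build_label_map(samples: Sequence[Sample]) -> Dict[str, int]:
    """Convert label names to numeric labels (sorted so ids are stable)."""
    names = sorted({label for label, _ in samples})
    return {name: idx for idx, name in enumerate(names)}


def shuffle_and_split(
    samples: Sequence[Sample], test_ratio: float = 0.1, seed: int = 42
) -> Tuple[List[Sample], List[Sample]]:
    """Shuffle, then split train:test at 9:1 (seed is an assumption)."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def to_fasttext_line(label: str, text: str) -> str:
    """Format one sample the way fastText's supervised mode expects."""
    return f"__label__{label} {text}"


def train_diagnosis_model(train_path: str):
    """Hypothetical wrapper, assuming `train_supervised` is fastText's."""
    import fasttext  # third-party; imported lazily so the helpers run without it

    return fasttext.train_supervised(
        input=train_path,
        epoch=200,      # "200 iterations" (assumed to mean epochs)
        lr=0.1,         # learning rate from the listing
        wordNgrams=1,   # word n-gram length from the listing
        loss="hs",      # hierarchical softmax
    )


def encode_for_bert(texts: List[str], checkpoint: str, max_length: int = 512):
    """Hypothetical helper: tokenize record text into BERT input format.

    `checkpoint` is not given in the listing and must be supplied by the caller.
    """
    from transformers import BertTokenizer  # third-party; imported lazily

    tokenizer = BertTokenizer.from_pretrained(checkpoint)
    return tokenizer(texts, padding=True, truncation=True, max_length=max_length)


# Synthetic stand-in for the 7 categories x 250 samples mentioned in the listing.
samples = [(f"class_{i}", f"token{i} token{j}") for i in range(7) for j in range(250)]
label_map = build_label_map(samples)
train_set, test_set = shuffle_and_split(samples)
```

With the 7 × 250 layout from the listing, the 9:1 split leaves 1,575 training and 175 test samples. The two training helpers are not called here because they depend on third-party packages and on data and checkpoints the listing does not provide.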
Provider:
Tianjin Health and Medical Big Data Co., Ltd. (天津健康医疗大数据有限公司)
Created:
2024-12-10
Collected summary
Dataset introduction
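Features
The Alzheimer's Disease Dataset contains 300,000 records and is updated monthly. It covers patient visit information, diagnostic results, and medication use, and is intended for medical, teaching, and research use. The dataset is processed with the specialized disease diagnosis model and the electronic medical record quality control classification model, and it supports research on diagnosis and treatment patterns and on pharmacoeconomics.
The above content was collected and summarized by 遇见数据集.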