five

区域完整健康档案数据集

收藏
贵州省数据知识产权登记平台2025-11-13 更新2025-11-14 收录
下载链接:
https://gzdipp.gzsis.cn:12020/noticeDetail?id=1515&type=1
下载链接
链接失效反馈
官方服务:
资源简介:
1、数据采集:通过医卫通AI一体机结合多种专业医疗检测设备,在用户授权下依法合规采集居民健康相关的全量结构化数据。原始字段涵盖健康档案元数据(健康档案名称、id、创建时间、来源、完善度)、个人属性(关联用户id、姓名、性别、年龄、手机号、所属区域)、健康行为与状态(吸烟情况、喝酒情况、牙齿健康情况)及服务关系(签约医生、最新随访时间)等,确保数据覆盖全面、来源可溯。 2、数据处理:1)以“健康档案id”和“关联用户id”为联合主键,对多源数据进行清洗、去重、标准化与关联融合,解决数据碎片化问题;2)对敏感字段(如姓名、手机号)实施分级管理策略,用于内部系统协同时保留必要标识,用于外部分析或共享时执行脱敏或匿名化处理;3)统一数据格式与编码标准(如性别编码、区域编码),确保数据一致性;4)基于字段完整性规则动态计算“档案完善度”,并标记数据质量等级;5)构建标准化、结构化的完整健康档案数据集,支持多维查询、聚合分析与模型训练。 3、数据应用:输出高质量、全维度的健康档案数据,作为大数据分析平台的核心数据源、人工智能模型的训练集/测试集、健康监测可视化系统的数据底座及跨机构数据共享的基准数

1. Data Collection: Collect full-volume structured data related to residents' health legally and compliantly with user authorization, using the Yiweitong AI integrated device combined with various professional medical testing equipment. The original fields cover health record metadata (health record name, ID, creation time, source, completeness), personal attributes (associated user ID, name, gender, age, phone number, affiliated region), health behaviors and status (smoking status, drinking status, dental health status), and service relationships (contracting physician, latest follow-up time), ensuring comprehensive data coverage and traceable sources. 2. Data Processing: 1) Clean, deduplicate, standardize, correlate and integrate multi-source data with "health record ID" and "associated user ID" as the joint primary key to solve the problem of data fragmentation; 2) Implement hierarchical management strategies for sensitive fields (such as name and phone number): retain necessary identifiers for internal system collaboration, and perform desensitization or anonymization processing when used for external analysis or sharing; 3) Unify data formats and coding standards (such as gender coding and regional coding) to ensure data consistency; 4) Dynamically calculate the "record completeness" based on field integrity rules and mark the data quality level; 5) Build a standardized and structured complete health record dataset to support multi-dimensional query, aggregate analysis and model training. 3. Data Application: Output high-quality, full-dimensional health record data, which serves as the core data source for big data analysis platforms, training and test sets for artificial intelligence models, the data foundation for health monitoring visualization systems, and benchmark data for cross-institutional data sharing.
提供机构:
贵州和泰皓璟科技有限公司
创建时间:
2025-10-31
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是一个综合性健康数据资源,整合了居民健康档案的全量核心字段,覆盖个体属性、健康状态和服务关系等,用于支持全域健康数据管理和多维度分析。数据来源于多源采集,通过标准化处理和敏感字段分级管理,确保数据质量和合规性。适用于AI模型训练、慢性病预测和公共卫生应急等场景,为智慧医疗体系提供高质量数据基础。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作