five

COVID-19 Dataset

收藏
www.kaggle.com2022-11-13 更新2025-01-16 收录
下载链接:
https://www.kaggle.com/meirnizri/covid19-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
## Context Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus. Most people infected with COVID-19 virus will experience mild to moderate respiratory illness and recover without requiring special treatment. Older people, and those with underlying medical problems like cardiovascular disease, diabetes, chronic respiratory disease, and cancer are more likely to develop serious illness. During the entire course of the pandemic, one of the main problems that healthcare providers have faced is the shortage of medical resources and a proper plan to efficiently distribute them. In these tough times, being able to predict what kind of resource an individual might require at the time of being tested positive or even before that will be of immense help to the authorities as they would be able to procure and arrange for the resources necessary to save the life of that patient. The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not. ## content The dataset was provided by the Mexican government [(link)](https://datos.gob.mx/busca/dataset/informacion-referente-a-casos-covid-19-en-mexico). This dataset contains an enormous number of anonymized patient-related information including pre-conditions. The raw dataset consists of 21 unique features and 1,048,576 unique patients. **In the Boolean features, 1 means "yes" and 2 means "no". values as 97 and 99 are missing data**. - sex: 1 for female and 2 for male. - age: of the patient. - classification: covid test findings. Values 1-3 mean that the patient was diagnosed with covid in different degrees. 4 or higher means that the patient is not a carrier of covid or that the test is inconclusive. - patient type: type of care the patient received in the unit. 1 for returned home and 2 for hospitalization. - pneumonia: whether the patient already have air sacs inflammation or not. - pregnancy: whether the patient is pregnant or not. - diabetes: whether the patient has diabetes or not. - copd: Indicates whether the patient has Chronic obstructive pulmonary disease or not. - asthma: whether the patient has asthma or not. - inmsupr: whether the patient is immunosuppressed or not. - hypertension: whether the patient has hypertension or not. - cardiovascular: whether the patient has heart or blood vessels related disease. - renal chronic: whether the patient has chronic renal disease or not. - other disease: whether the patient has other disease or not. - obesity: whether the patient is obese or not. - tobacco: whether the patient is a tobacco user. - usmr: Indicates whether the patient treated medical units of the first, second or third level. - medical unit: type of institution of the National Health System that provided the care. - intubed: whether the patient was connected to the ventilator. - icu: Indicates whether the patient had been admitted to an Intensive Care Unit. - date died: If the patient died indicate the date of death, and 9999-99-99 otherwise.

本数据集由墨西哥政府提供 [(链接)](https://datos.gob.mx/busca/dataset/informacion-referente-a-casos-covid-19-en-mexico)。该数据集包含大量匿名化的患者相关信息,包括既往病史。原始数据集由21个独特特征组成,涵盖1,048,576例独特患者的病例。在布尔特征中,1代表“是”,2代表“否”,而数值97和99则表示数据缺失。 - 性别:1代表女性,2代表男性。 - 年龄:患者的年龄。 - 分类:新冠病毒检测结果。数值1至3表示患者被诊断为不同程度的COVID-19,4或以上则表示患者非新冠病毒携带者或检测结果不明确。 - 患者类型:患者在单位接收的护理类型。1代表回家,2代表住院。 - 肺炎:患者是否已患有肺泡炎症。 - 妊娠:患者是否怀孕。 - 糖尿病:患者是否患有糖尿病。 - 慢阻肺:指示患者是否患有慢性阻塞性肺疾病。 - 支气管哮喘:患者是否患有哮喘。 - 免疫抑制:患者是否免疫抑制。 - 高血压:患者是否患有高血压。 - 心血管疾病:患者是否患有心脏或血管相关疾病。 - 慢性肾病:患者是否患有慢性肾病。 - 其他疾病:患者是否患有其他疾病。 - 肥胖:患者是否肥胖。 - 吸烟:患者是否吸烟。 - 医疗单位级别:指示患者接受治疗的第一、第二或第三级医疗单位。 - 医疗机构:提供护理的国家级卫生系统机构类型。 - 机械通气:患者是否已连接到呼吸机。 - 重症监护:指示患者是否已被收入重症监护室。 - 死亡日期:如果患者死亡,则显示死亡日期,否则为9999-99-99。
提供机构:
Kaggle
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作