Pima Indians Diabetes Database
Source: www.kaggle.com | Updated: 2016-10-06 | Indexed: 2025-03-23
Download link:
https://www.kaggle.com/uciml/pima-indians-diabetes-database
Description:
## Context
This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.
## Content
The dataset consists of several medical predictor variables and one target variable, `Outcome`. The predictor variables include the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.
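As a minimal sketch of how the predictors and the `Outcome` target might be separated, the snippet below parses CSV text with Python's standard library. The helper `load_xy` is hypothetical, and the column names `Pregnancies`, `BMI`, `Insulin`, and `Age` are assumptions based on the variables listed above; only `Outcome` is named in the description. The sample rows are made up for illustration and are not records from the dataset.

```python
import csv
import io


def load_xy(csv_file, target="Outcome"):
    """Split each CSV row into a dict of numeric predictors and an integer label."""
    reader = csv.DictReader(csv_file)
    features, labels = [], []
    for row in reader:
        # Remove the target column from the row, leaving only predictors.
        labels.append(int(row.pop(target)))
        features.append({name: float(value) for name, value in row.items()})
    return features, labels


# Made-up rows for illustration only; NOT real records from the dataset.
# Column names other than Outcome are assumed.
SAMPLE = """Pregnancies,BMI,Insulin,Age,Outcome
2,31.5,90,34,1
0,24.0,0,22,0
5,28.3,120,45,0
"""

features, labels = load_xy(io.StringIO(SAMPLE))
```

With the real file, the same function could be called on an open file handle, for example `load_xy(open("diabetes.csv", newline=""))`, where the file name is also an assumption.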
## Acknowledgements
Smith, J.W., Everhart, J.E., Dickson, W.C., Knowler, W.C., & Johannes, R.S. (1988). [Using the ADAP learning algorithm to forecast the onset of diabetes mellitus][1]. In *Proceedings of the Symposium on Computer Applications and Medical Care* (pp. 261-265). IEEE Computer Society Press.
## Inspiration
Can you build a machine learning model that accurately predicts whether the patients in the dataset have diabetes?
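One way to make "accurately" concrete is to compare a model against a trivial baseline that always predicts the most common class. The sketch below is an illustration rather than a prescribed evaluation method: the helper names `accuracy` and `majority_class` are hypothetical, and the label lists are made up, not taken from the dataset.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def majority_class(y_train):
    """Most frequent label in the training labels."""
    return Counter(y_train).most_common(1)[0][0]


# Made-up Outcome labels for illustration only (1 = diabetes, 0 = no diabetes).
y_train = [0, 0, 1, 0, 1, 0]
y_test = [0, 1, 0, 0]

# The baseline predicts the training majority class for every test patient.
baseline_pred = [majority_class(y_train)] * len(y_test)
baseline_acc = accuracy(y_test, baseline_pred)
```

A trained model is only informative if its held-out accuracy clearly exceeds `baseline_acc` computed on the same test split.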
[1]: http://rexa.info/paper/04587c10a7c92baa01948f71f2513d5928fe8e81
Provider:
UCI Machine Learning



