lotusbdr-9/adult-census-income
Source: Hugging Face. Updated 2025-12-13; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/lotusbdr-9/adult-census-income
Description:
The Adult Census Income dataset comes from the UCI Machine Learning Repository. The data was extracted from the 1994 Census Bureau database, and a set of reasonably clean records was selected using the conditions ((AAGE>16) && (AGI>100) && (AFNLWGT>1) && (HRSWK>0)), that is: age over 16, adjusted gross income over 100, final weight over 1, and more than 0 hours worked per week. The prediction task is to determine whether a person makes over $50K a year. The dataset also describes fnlwgt (final weight) in detail: the weights on the Current Population Survey (CPS) files are controlled to independent estimates of the civilian noninstitutional population of the US, using three sets of controls: a single-cell estimate of the population aged 16+ for each state, controls for Hispanic origin by age and sex, and controls by race, age, and sex.
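The extraction filter quoted in the description can be read as a simple per-record predicate. The following is a minimal sketch only: the dictionary keys reuse the variable names from the filter expression (AAGE, AGI, AFNLWGT, HRSWK), while the function name `passes_filter` and the sample records are hypothetical and are not taken from the dataset or from the original extraction code.

```python
# Sketch of the four extraction conditions as a record-level predicate.
# Assumption: each record is a dict keyed by the variable names used in
# the filter expression; the sample values below are invented.

def passes_filter(record):
    """Return True when a record meets all four extraction conditions."""
    return (
        record["AAGE"] > 16         # age over 16
        and record["AGI"] > 100     # adjusted gross income over 100
        and record["AFNLWGT"] > 1   # final weight over 1
        and record["HRSWK"] > 0     # more than 0 hours worked per week
    )


# Hypothetical records for illustration only.
sample_records = [
    {"AAGE": 39, "AGI": 40000, "AFNLWGT": 77516, "HRSWK": 40},
    {"AAGE": 16, "AGI": 5000, "AFNLWGT": 120000, "HRSWK": 20},  # fails the age condition
    {"AAGE": 52, "AGI": 60000, "AFNLWGT": 200000, "HRSWK": 0},  # fails the hours condition
]

clean_records = [r for r in sample_records if passes_filter(r)]
print(len(clean_records), "of", len(sample_records), "records pass the filter")
```

All four comparisons are strict, so a record with an age of exactly 16 or zero weekly hours is excluded.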
Provider:
lotusbdr-9



