five

jlh/uci-census-income-94

收藏
Hugging Face2023-04-25 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/jlh/uci-census-income-94
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: age dtype: string - name: class_of_worker dtype: int64 - name: detailed_industry_recode dtype: int64 - name: detailed_occupation_recode dtype: string - name: education dtype: int64 - name: wage_per_hour dtype: string - name: enroll_in_edu_inst_last_wk dtype: string - name: marital_stat dtype: string - name: major_industry_code dtype: string - name: major_occupation_code dtype: string - name: race dtype: string - name: hispanic_origin dtype: string - name: sex dtype: string - name: member_of_a_labor_union dtype: string - name: reason_for_unemployment dtype: string - name: full_or_part_time_employment_stat dtype: int64 - name: capital_gains dtype: int64 - name: capital_losses dtype: int64 - name: dividends_from_stocks dtype: string - name: tax_filer_stat dtype: string - name: region_of_previous_residence dtype: string - name: state_of_previous_residence dtype: string - name: detailed_household_and_family_stat dtype: string - name: detailed_household_summary_in_household dtype: float64 - name: migration_code-change_in_msa dtype: string - name: migration_code-change_in_reg dtype: string - name: migration_code-move_within_reg dtype: string - name: live_in_this_house_1_year_ago dtype: string - name: migration_prev_res_in_sunbelt dtype: string - name: num_persons_worked_for_employer dtype: int64 - name: family_members_under_18 dtype: string - name: country_of_birth_father dtype: string - name: country_of_birth_mother dtype: string - name: country_of_birth_self dtype: string - name: citizenship dtype: string - name: own_business_or_self_employed dtype: int64 - name: fill_inc_questionnaire_for_veteran's_admin dtype: string - name: veterans_benefits dtype: int64 - name: weeks_worked_in_year dtype: int64 - name: year dtype: int64 - name: income dtype: class_label: names: '0': ' - 50000.' '1': ' 50000+.' splits: - name: train num_bytes: 129952005 num_examples: 199523 download_size: 7989520 dataset_size: 129952005 --- # Dataset Card for "uci-census-income-94" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
jlh
原始信息汇总

数据集概述

数据集名称

uci-census-income-94

数据集特征

  • age: 字符串类型
  • class_of_worker: 整数类型 (int64)
  • detailed_industry_recode: 整数类型 (int64)
  • detailed_occupation_recode: 字符串类型
  • education: 整数类型 (int64)
  • wage_per_hour: 字符串类型
  • enroll_in_edu_inst_last_wk: 字符串类型
  • marital_stat: 字符串类型
  • major_industry_code: 字符串类型
  • major_occupation_code: 字符串类型
  • race: 字符串类型
  • hispanic_origin: 字符串类型
  • sex: 字符串类型
  • member_of_a_labor_union: 字符串类型
  • reason_for_unemployment: 字符串类型
  • full_or_part_time_employment_stat: 整数类型 (int64)
  • capital_gains: 整数类型 (int64)
  • capital_losses: 整数类型 (int64)
  • dividends_from_stocks: 字符串类型
  • tax_filer_stat: 字符串类型
  • region_of_previous_residence: 字符串类型
  • state_of_previous_residence: 字符串类型
  • detailed_household_and_family_stat: 字符串类型
  • detailed_household_summary_in_household: 浮点数类型 (float64)
  • migration_code-change_in_msa: 字符串类型
  • migration_code-change_in_reg: 字符串类型
  • migration_code-move_within_reg: 字符串类型
  • live_in_this_house_1_year_ago: 字符串类型
  • migration_prev_res_in_sunbelt: 字符串类型
  • num_persons_worked_for_employer: 整数类型 (int64)
  • family_members_under_18: 字符串类型
  • country_of_birth_father: 字符串类型
  • country_of_birth_mother: 字符串类型
  • country_of_birth_self: 字符串类型
  • citizenship: 字符串类型
  • own_business_or_self_employed: 整数类型 (int64)
  • fill_inc_questionnaire_for_veterans_admin: 字符串类型
  • veterans_benefits: 整数类型 (int64)
  • weeks_worked_in_year: 整数类型 (int64)
  • year: 整数类型 (int64)
  • income: 分类标签,包含两个类别:
    • 0: - 50000.
    • 1: 50000+.

数据集分割

  • train: 训练集
    • 数据大小: 129952005 字节
    • 样本数量: 199523

数据集大小

  • 下载大小: 7989520 字节
  • 数据集总大小: 129952005 字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作