Ryan-Pupia/CS482-HousingDataSet
Hugging Face | Updated 2024-01-28 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/Ryan-Pupia/CS482-HousingDataSet
Resource description:
---
dataset_info:
  features:
  - name: log_stand__housing_median_age
    dtype: float64
  - name: log_stand__total_rooms
    dtype: float64
  - name: log_stand__total_bedrooms
    dtype: float64
  - name: log_stand__population
    dtype: float64
  - name: log_stand__households
    dtype: float64
  - name: log_stand__median_income
    dtype: float64
  - name: log_stand__median_house_value
    dtype: float64
  - name: encode__ocean_proximity_<1H OCEAN
    dtype: float64
  - name: encode__ocean_proximity_INLAND
    dtype: float64
  - name: encode__ocean_proximity_ISLAND
    dtype: float64
  - name: encode__ocean_proximity_NEAR BAY
    dtype: float64
  - name: encode__ocean_proximity_NEAR OCEAN
    dtype: float64
  - name: scale__longitude
    dtype: float64
  - name: scale__latitude
    dtype: float64
  splits:
  - name: train
    num_bytes: 1648864
    num_examples: 14722
  - name: test
    num_bytes: 412272
    num_examples: 3681
  download_size: 1130408
  dataset_size: 2061136
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
license: mit
language:
- en
pretty_name: Pre-Processed Housing Data
---
This dataset contains preprocessed California housing data, already split into train and test sets for fitting and validating a model.
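A minimal sketch of how the two splits might be loaded, assuming the Hugging Face `datasets` package is installed and the repository is still reachable. The helper names `load_housing_splits` and `split_features_target` are hypothetical, and treating `log_stand__median_house_value` as the prediction target is an assumption: the card does not name a target column.

```python
REPO_ID = "Ryan-Pupia/CS482-HousingDataSet"
# Assumed target column; the card itself does not say which column is the label.
TARGET = "log_stand__median_house_value"


def split_features_target(column_names, target=TARGET):
    """Return (feature_columns, target) with the target removed from the features."""
    if target not in column_names:
        raise KeyError(f"target column {target!r} not found")
    return [c for c in column_names if c != target], target


def load_housing_splits(repo_id=REPO_ID):
    """Download the train and test splits (needs `datasets` and network access)."""
    # Imported lazily so the helper above can be used without the package.
    from datasets import load_dataset

    ds = load_dataset(repo_id)
    return ds["train"], ds["test"]
```

Typical use would be `train, test = load_housing_splits()` followed by `features, target = split_features_target(train.column_names)`.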
Provider:
Ryan-Pupia
Summary of original information
Dataset overview
Features
- log_stand__housing_median_age: data type float64
- log_stand__total_rooms: data type float64
- log_stand__total_bedrooms: data type float64
- log_stand__population: data type float64
- log_stand__households: data type float64
- log_stand__median_income: data type float64
- log_stand__median_house_value: data type float64
- encode__ocean_proximity_<1H OCEAN: data type float64
- encode__ocean_proximity_INLAND: data type float64
- encode__ocean_proximity_ISLAND: data type float64
- encode__ocean_proximity_NEAR BAY: data type float64
- encode__ocean_proximity_NEAR OCEAN: data type float64
- scale__longitude: data type float64
- scale__latitude: data type float64
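The column prefixes hint at the preprocessing: `log_stand__` reads like a log transform followed by standardization, and `encode__` like one-hot encoding of `ocean_proximity`. The card does not document the actual pipeline, so the following is only a sketch under that assumption. The function names are hypothetical, and the choice of natural log and population standard deviation is a guess rather than the author's confirmed method.

```python
import math
from statistics import fmean, pstdev

# Category labels taken from the encode__ocean_proximity_* column names.
OCEAN_CATEGORIES = ["<1H OCEAN", "INLAND", "ISLAND", "NEAR BAY", "NEAR OCEAN"]


def log_standardize(values):
    """Apply a natural log, then rescale to zero mean and unit population variance."""
    logged = [math.log(v) for v in values]  # assumes strictly positive inputs
    mean, std = fmean(logged), pstdev(logged)
    return [(x - mean) / std for x in logged]


def one_hot(label, categories=OCEAN_CATEGORIES):
    """Return a float indicator vector with a single 1.0 at the label's position."""
    if label not in categories:
        raise ValueError(f"unknown category: {label!r}")
    return [1.0 if label == c else 0.0 for c in categories]
```

Under this reading, `one_hot("INLAND")` yields `[0.0, 1.0, 0.0, 0.0, 0.0]`, which matches the five `float64` indicator columns listed above.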
Data splits
- train: 14722 examples, 1648864 bytes
- test: 3681 examples, 412272 bytes
Dataset size
- Download size: 1130408 bytes
- Dataset size: 2061136 bytes
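The reported sizes can be cross-checked with simple arithmetic: 14 `float64` columns take 112 bytes per example, which reproduces both split sizes exactly. The short check below uses only the numbers stated on the card. The variable names are illustrative, and reading `num_bytes` as the uncompressed size of the column values is an assumption.

```python
FLOAT64_BYTES = 8
NUM_COLUMNS = 14  # number of features listed on the card

splits = {
    "train": {"num_examples": 14722, "num_bytes": 1648864},
    "test": {"num_examples": 3681, "num_bytes": 412272},
}

row_bytes = NUM_COLUMNS * FLOAT64_BYTES  # 112 bytes per example
for name, info in splits.items():
    # Each split's byte count should equal examples * bytes per row.
    assert info["num_examples"] * row_bytes == info["num_bytes"], name

total_examples = sum(s["num_examples"] for s in splits.values())
total_bytes = sum(s["num_bytes"] for s in splits.values())
train_fraction = splits["train"]["num_examples"] / total_examples

print(total_examples)            # 18403
print(total_bytes)               # 2061136, matching dataset_size
print(round(train_fraction, 3))  # 0.8
```

The result indicates an 80/20 train/test split over 18403 examples.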
Configuration
- config_name: default
- data_files:
  - train: path data/train-*
  - test: path data/test-*
License
- license: MIT
Language
- language: English
Dataset name
- pretty_name: Pre-Processed Housing Data



