Dallas Residential Property Listing SAMPLE
收藏Databricks2025-11-05 收录
下载链接:
https://marketplace.databricks.com/details/65d890bb-7d1f-4038-a588-fbf3cefa7dcc/AIDC-Inc-_Dallas-Residential-Property-Listing-SAMPLE
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
This dataset provides a structured collection of 5,430 residential and multi-family property listings across Dallas, Texas. It includes both current and historical (sold) properties, making it valuable for analytics, modeling, and research on housing trends. The dataset captures structural, locational, and transactional data points suitable for price prediction, market segmentation, and geospatial analysis.
A sample of 30 records is available for preview. The dataset was compiled and analyzed by Maths with Kanchana LLC in partnership with AIDC, completed on June 9, 2025.
**Use cases**
- **Price Prediction:** Train regression models to estimate list or sale price based on property attributes.
- **Market Segmentation:** Cluster listings by ZIP, lot size, or home type to identify investment trends.
- **Spatial Analysis:** Visualize heatmaps of price per square foot across Dallas neighborhoods.
- **Feature Engineering:** Derive metrics like home age, lot-to-living area ratio, or renovation effects.
**Product details**
This dataset was sourced exclusively from publicly available real estate listings, using ethical research methods. All personally identifiable information (PII) — including names, addresses, phone numbers, and emails — has been redacted and replaced with [Redacted_Entity]. The dataset complies fully with GDPR and U.S. data privacy standards.
**Column Description**
- **`type`** – General property category (e.g., `single_family`, `multi_family`).
- **`sub_type`** – Further classification such as townhouse or duplex.
- **`text`** – Redacted property listing description for NLP applications.
- **`status`** – Indicates whether a property is currently `for_sale` or previously `sold_on`.
- **`year_built`** – Year property was built.
- **`soldOn`** – Date of sale, if applicable.
- **`lot_sqft`** – Lot size in square feet.
- **`sqft`** – Interior living space in square feet.
- **`stories`** – Number of floors.
- **`baths`** – Total bathrooms.
- **`baths_full`** – Full bathrooms only.
- **`baths_full_calc`** – Engineered full bath count for consistency.
- **`beds`** – Bedrooms.
- **`garage`** – Garage spaces.
- **`listPrice`** – Current or last listing price (USD).
- **`zip`** – Five-digit ZIP code representing Dallas neighborhood.
**Full Dataset Available for Purchase**
Click through the documentation link to see purchase criteria or respond to our outreach.
**数据集概览**
本数据集收录了得克萨斯州达拉斯市范围内共计5430条住宅及多户房产挂牌信息,采用结构化格式存储。数据集涵盖在售及历史成交(已售出)房产两类数据,可用于住房趋势相关的分析、建模与研究工作,具备较高应用价值。其采集了房产结构、区位及交易相关的多维度数据字段,可支撑房价预测、市场细分与地理空间分析等场景。
平台提供30条数据记录的样本供预览。本数据集由Maths with Kanchana LLC与AIDC合作编制并分析,于2025年6月9日完成。
**应用场景**
- **房价预测**:基于房产属性训练回归模型,以估算挂牌价或成交价。
- **市场细分**:按邮政编码、地块面积或房产类型对挂牌信息进行聚类,以挖掘投资趋势。
- **空间分析**:可视化达拉斯各社区的每平方英尺房价热力图。
- **特征工程**:衍生各类指标,如房龄、地块面积与居住面积比、翻新改造影响等。
**产品详情**
本数据集仅通过公开可获取的房产挂牌信息采集,采用合规的研究方法获取。所有个人身份识别信息(Personally Identifiable Information),包括姓名、地址、电话号码及电子邮箱等,均已做脱敏处理,替换为`[Redacted_Entity]`。本数据集完全符合GDPR及美国数据隐私标准。
**字段说明**
- **`type`**:房产通用分类(例如`single_family`独栋住宅、`multi_family`多户住宅)。
- **`sub_type`**:更细致的分类,例如`townhouse`联排别墅、`duplex`双拼住宅。
- **`text`**:已脱敏的房产挂牌描述文本,适用于自然语言处理(Natural Language Processing)应用。
- **`status`**:标识房产当前状态,可为`for_sale`在售或`sold_on`已成交。
- **`year_built`**:房产建造年份。
- **`soldOn`**:房产成交日期(仅适用于已成交房产)。
- **`lot_sqft`**:地块面积,单位为平方英尺。
- **`sqft`**:室内居住面积,单位为平方英尺。
- **`stories`**:房产楼层数。
- **`baths`**:卫浴间总数。
- **`baths_full`**:完整卫浴间数量。
- **`baths_full_calc`**:为保证一致性通过工程方法计算得到的完整卫浴间数量。
- **`beds`**:卧室数量。
- **`garage`**:车库车位数量。
- **`listPrice`**:当前挂牌价或最后一次挂牌价(单位:美元)。
- **`zip`**:代表达拉斯社区的5位ZIP码。
**完整数据集可购买**
点击文档链接查看购买标准,或回复我们的对接邀约。
提供机构:
AIDC, Inc.



