electricsheepafrica/africa-synth-flooding-property-insurance-risk-all
收藏Hugging Face2026-04-14 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-synth-flooding-property-insurance-risk-all
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
task_categories:
- tabular-classification
tags:
- insurance
- property
- risk
- flood
- fire
- storm
- sub-saharan-africa
- synthetic
- catastrophe
pretty_name: African Property Insurance Risk
size_categories:
- 10K<n<100K
language:
- en
configs:
- config_name: baseline
data_files: data/property_risk_baseline.csv
- config_name: high_risk_period
data_files: data/property_risk_high_risk_period.csv
- config_name: climate_adapted
data_files: data/property_risk_climate_adapted.csv
data_type: synthetic
---
> ⚠️ **Synthetic dataset** — Parameterized from published SSA literature, not real observations. Not suitable for empirical analysis or policy inference.
# African Property Insurance Risk
Synthetic property insurance risk dataset covering 12 Sub-Saharan African countries across 3 underwriting scenarios. Designed for catastrophe risk modeling, premium pricing, loss forecasting, and insurance portfolio management research.
## Dataset Description
- **100,000** synthetic property insurance risk records
- **12 countries**: South Africa, Nigeria, Kenya, Ghana, Tanzania, Uganda, Rwanda, Ethiopia, Senegal, Cote d'Ivoire, Zambia, Mozambique
- **3 scenarios**: baseline, high_risk_period, climate_adapted
- **24 variables** per record
## Variables
| Variable | Type | Description |
|---|---|---|
| policy_id | string | Unique policy identifier |
| country | string | Country of origin |
| region | string | Sub-region (East, West, Southern) |
| year | int | Policy year |
| property_type | string | Property category (residential, commercial, industrial, agricultural) |
| property_value_usd | float | Property insured value in USD |
| construction_type | string | Construction (brick, concrete, wood, metal, mudbrick, mixed) |
| building_age | int | Building age in years (1-80) |
| floors | int | Number of floors (1-20) |
| flood_risk_score | float | Flood risk score (0-1) |
| fire_risk_score | float | Fire risk score (0-1) |
| storm_risk_score | float | Storm/hail risk score (0-1) |
| earthquake_risk_score | float | Seismic risk score (0-1) |
| crime_risk_score | float | Theft/vandalism risk score (0-1) |
| combined_risk_score | float | Weighted aggregate risk score (0-1) |
| premium_usd | float | Annual premium in USD |
| deductible_usd | float | Policy deductible in USD |
| coverage_type | string | coverage level (basic, standard, comprehensive) |
| flood_zone | string | FEMA-style flood zone (A, B, C, X, V) |
| proximity_to_water_km | float | Distance to nearest water body in km |
| elevation_m | float | Property elevation in meters |
| previous_claims_count | int | Number of prior claims (0-10) |
| previous_losses_usd | float | Total prior losses in USD |
| risk_class | string | low_risk, medium_risk, high_risk, extreme_risk |
| scenario | string | baseline, high_risk_period, or climate_adapted |
## Scenarios
- **baseline** (50K records): Standard market conditions, 2020-2024. Risk distribution reflects current underwriting practices.
- **high_risk_period** (30K records): Elevated risk due to climate change impacts, increased extreme weather events, 2023-2025.
- **climate_adapted** (20K records): Climate-adjusted risk assessment with updated models, 2024-2025.
## Generation Methodology
Parameters are calibrated against published data from:
- Santam Insurance Barometer 2024/2025
- Deloitte Africa Insurance Outlook 2024/2025
- South African Insurance Association catastrophe data
- Kenya Insurance Regulatory Authority risk assessments
- Munich Re Africa catastrophe loss data
- World Bank disaster risk financing for Sub-Saharan Africa
- FEMA flood zone classification adapted for African contexts
Risk scores follow beta distributions modulated by property characteristics, geographic factors, and climate projections. Premiums are calculated using actuarial models adjusted for country-specific loss experience.
## Use Cases
- Catastrophe risk modeling and accumulation management
- Climate change impact on property insurance
- Premium pricing optimization
- Cross-country risk comparison
- Portfolio risk segmentation
- Synthetic data generation evaluation
## Citation
```bibtex
@dataset{african_property_risk_2026,
title={African Property Insurance Risk Dataset},
author={ElectricSheepAfrica},
year={2026},
license={cc-by-4.0}
}
```
## License
CC BY 4.0 - This is synthetic data generated for research and educational purposes.
许可证:cc-by-4.0
任务类别:
- 表格分类(tabular-classification)
标签:
- 保险
- 财产
- 风险
- 洪水
- 火灾
- 风暴
- 撒哈拉以南非洲(sub-saharan-africa)
- 合成数据
- 巨灾(catastrophe)
数据集名称:非洲财产保险风险(African Property Insurance Risk)
规模类别:10K<n<100K
语言:英语
配置项:
- 配置名称:baseline(基准场景),数据文件:data/property_risk_baseline.csv
- 配置名称:high_risk_period(高风险期场景),数据文件:data/property_risk_high_risk_period.csv
- 配置名称:climate_adapted(气候适配场景),数据文件:data/property_risk_climate_adapted.csv
数据类型:合成数据(synthetic)
⚠️ **合成数据集** — 本数据集参数基于已发表的撒哈拉以南非洲相关文献设定,并非真实观测数据,不适用于实证分析或政策推演。
# 非洲财产保险风险数据集
本合成财产保险风险数据集覆盖撒哈拉以南非洲12个国家,包含3种承保场景,旨在服务于巨灾风险建模、保费定价、损失预测以及保险组合管理等研究方向。
## 数据集概况
- **100,000**条合成财产保险风险记录
- **12个覆盖国家**:南非、尼日利亚、肯尼亚、加纳、坦桑尼亚、乌干达、卢旺达、埃塞俄比亚、塞内加尔、科特迪瓦、赞比亚、莫桑比克
- **3种研究场景**:基准场景(baseline)、高风险期场景(high_risk_period)、气候适配场景(climate_adapted)
- 每条记录包含**24个变量**
## 变量说明
| 变量 | 类型 | 描述 |
|---|---|---|
| policy_id | 字符串 | 唯一保单标识符 |
| country | 字符串 | 保单所属国家 |
| region | 字符串 | 所属大区(东部、西部、南部非洲) |
| year | 整数 | 保单承保年份 |
| property_type | 字符串 | 财产类别(住宅、商业、工业、农业) |
| property_value_usd | 浮点数 | 以美元计价的投保财产总价值 |
| construction_type | 字符串 | 建筑结构类型(砖结构、混凝土结构、木结构、金属结构、泥砖结构、混合结构) |
| building_age | 整数 | 建筑已使用年限(1-80年) |
| floors | 整数 | 建筑楼层数(1-20层) |
| flood_risk_score | 浮点数 | 洪水风险评分(取值范围0-1) |
| fire_risk_score | 浮点数 | 火灾风险评分(取值范围0-1) |
| storm_risk_score | 浮点数 | 风暴/冰雹风险评分(取值范围0-1) |
| earthquake_risk_score | 浮点数 | 地震风险评分(取值范围0-1) |
| crime_risk_score | 浮点数 | 盗窃及故意损毁财物风险评分(取值范围0-1) |
| combined_risk_score | 浮点数 | 加权综合风险评分(取值范围0-1) |
| premium_usd | 浮点数 | 以美元计价的年度保费 |
| deductible_usd | 浮点数 | 以美元计价的保单免赔额 |
| coverage_type | 字符串 | 保障等级(基础型、标准型、全面型) |
| flood_zone | 字符串 | 参照美国联邦紧急事务管理局(FEMA)标准的洪水区域等级(A、B、C、X、V) |
| proximity_to_water_km | 浮点数 | 距最近水体的直线距离(单位:千米) |
| elevation_m | 浮点数 | 财产所在地海拔高度(单位:米) |
| previous_claims_count | 整数 | 既往索赔次数(0-10次) |
| previous_losses_usd | 浮点数 | 以美元计价的既往总损失金额 |
| risk_class | 字符串 | 风险等级(低风险、中风险、高风险、极高风险) |
| scenario | 字符串 | 所属场景类型(基准场景、高风险期场景或气候适配场景) |
## 场景详情
- **基准场景(baseline)**:共50,000条记录,对应标准市场环境,时间跨度为2020-2024年,风险分布贴合当前行业承保实践。
- **高风险期场景(high_risk_period)**:共30,000条记录,对应气候变化引发的风险抬升、极端天气事件频发的场景,时间跨度为2023-2025年。
- **气候适配场景(climate_adapted)**:共20,000条记录,采用更新后的模型开展气候调整后的风险评估,时间跨度为2024-2025年。
## 数据生成方法
数据集参数基于以下公开文献及行业数据进行校准:
- Santam保险晴雨表2024/2025
- 德勤(Deloitte)非洲保险展望2024/2025
- 南非保险协会巨灾损失数据
- 肯尼亚保险监管局风险评估报告
- 慕尼黑再保险(Munich Re)非洲巨灾损失数据库
- 世界银行撒哈拉以南非洲灾害风险融资报告
- 适配非洲地区场景的美国联邦紧急事务管理局(FEMA)洪水区域分类标准
风险评分遵循Beta分布,并结合财产特征、地理区位因素及气候预测数据进行动态调整。保费计算采用精算模型,并针对各国本土损失经验进行适配优化。
## 应用场景
- 巨灾风险建模与风险累积管理
- 气候变化对财产保险市场的影响研究
- 保费定价策略优化
- 跨国区域风险对比分析
- 保险组合风险细分
- 合成数据生成算法效果评估
## 引用格式
bibtex
@dataset{african_property_risk_2026,
title={African Property Insurance Risk Dataset},
author={ElectricSheepAfrica},
year={2026},
license={cc-by-4.0}
}
## 许可证
CC BY 4.0 — 本数据集为供研究及教育用途生成的合成数据。
提供机构:
electricsheepafrica



