five

electricsheepafrica/climate-adaptation-tech-ssa-synthetic

收藏
Hugging Face2025-11-18 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/climate-adaptation-tech-ssa-synthetic
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - tabular-regression - tabular-classification tags: - agriculture - food-security - africa - synthetic-data - climate-adaptation - technology-adoption - smallholder-farming - climate-change - resilience size_categories: - 1M<n<10M language: - en --- # Climate Adaptation & Technology - Sub-Saharan Africa Synthetic Dataset ## Dataset Summary **Dataset 4 in the African Agriculture Portfolio** This dataset contains **1,000,000 synthetic household records** focused on climate adaptation strategies and technology adoption in Sub-Saharan African agriculture. Building on Datasets 1-3 (farm systems, livestock, and post-harvest), this adds critical climate change response variables. **Key Features:** - 🌍 **57 variables** (41 inherited + 16 new climate/tech variables) - 🌡️ **Climate exposure**: Droughts, floods, temperature trends - 🚜 **Technology adoption**: Seeds, irrigation, conservation, mechanization, digital tools - 🏗️ **Adaptation strategies**: Diversification, migration responses - 💰 **Support access**: Credit, insurance, climate finance - 📚 **16 literature sources** for new variables - 🔒 **100% synthetic**: No real households - ✅ **Literature-grounded**: All parameters from peer-reviewed research ## Dataset Details ### Dataset Description **Size**: 1,000,000 rows × 57 variables **Format**: CSV (617 MB) + Parquet (193 MB) **License**: CC-BY-4.0 **Created**: November 2025 **Version**: 1.0 **Random Seed**: 42 (reproducible) ### New Variables (16 Climate & Technology) #### **Climate Shocks & Exposure (4 variables)** | Variable | Type | Description | Range | |----------|------|-------------|-------| | `drought_events_5yr` | count | Droughts in past 5 years | 0-5 events | | `flood_exposure` | binary | Experienced flood damage | yes/no | | `temperature_trend` | continuous | Perceived temp change (°C) | -1 to +3°C | | `climate_info_access` | binary | Access to weather forecasts | yes/no | **Key Stats**: - Average **1.8 droughts per 5 years** (matches Sheffield et al. 2014) - **22% flood exposure** rate (matches regional data) - **27% have climate info access** (growing with mobile penetration) #### **Technology Adoption (6 variables)** | Variable | Type | Description | Adoption Rate | |----------|------|-------------|---------------| | `improved_seed_use` | binary | Hybrid/improved seeds | ~18% | | `irrigation_access` | categorical | Irrigation type | 7% total | | `conservation_agriculture` | binary | CA practices | ~12% | | `agroforestry` | binary | Tree integration | ~30% | | `mechanization_level` | categorical | Land prep tech | 60% manual | | `digital_ag_tools` | binary | Smartphone ag apps | ~10% | **Reflects reality**: Low tech adoption rates typical of SSA smallholders #### **Adaptation Strategies (3 variables)** | Variable | Type | Description | |----------|------|-------------| | `crop_diversification_index` | continuous | Crop diversity (0-100) | | `livelihood_diversification` | categorical | Off-farm income levels | | `migration_response` | binary | Climate-induced migration | **Key Stats**: - Mean crop diversity: **48.7** (moderate diversification) - **60% have some off-farm income** (coping strategy) - **15% climate-induced migration** (vulnerable areas) #### **Support Access (3 variables)** | Variable | Type | Description | Access Rate | |----------|------|-------------|-------------| | `credit_access` | categorical | Credit source | 27% total | | `crop_insurance` | categorical | Insurance type | 6% covered | | `climate_finance_access` | binary | Climate adaptation finance | 7% | **Reflects gap**: Very low financial service access in rural SSA ### Inherited Variables (41 from Datasets 1-3) **From Dataset 1 - Smallholder Farming (12 vars)**: - Farm characteristics: AEZ, region, farm size, soil quality - Climate: Annual & seasonal rainfall - Demographics: Household size - Market access & inputs: Distance, livestock, extension, fertilizer - Yields: Maize production **From Dataset 2 - Livestock (15 vars)**: - Herd structure: Cattle, small ruminants, poultry - Health: Disease incidence, vaccination, mortality - Veterinary access: Distance, visits, treatment - Management: Grazing, pasture, feed, water **From Dataset 3 - Post-Harvest (14 vars)**: - Storage: Type, duration, losses, hermetic tech - Handling: Drying, pest control, sorting - Markets: Sales, buyers, prices, transport This creates a **comprehensive household profile** spanning production, livestock, post-harvest, and adaptation. ## Uses ### Direct Use ✅ **Permitted**: - Climate adaptation research & algorithm development - Technology adoption modeling - Policy scenario analysis (with caveats) - Agricultural extension program design - Climate finance targeting research - Educational training on climate-smart agriculture - Resilience indicator development ❌ **Prohibited**: - Real-world policy targeting of actual households - Climate finance allocation decisions - Technology subsidy targeting - Any use that could harm vulnerable populations ### Research Applications **Climate Adaptation Studies**: - Modeling household adaptation responses to climate shocks - Technology diffusion under resource constraints - Migration as adaptation strategy - Diversification patterns **Agricultural Development**: - Extension service optimization - Technology adoption barriers - Credit access impact on adaptation - Digital agriculture potential **Policy Analysis**: - Climate finance targeting algorithms (test only) - Agricultural insurance design - Technology subsidy effectiveness - Integrated adaptation pathways ## Dataset Creation ### Curation Rationale Climate change is the defining challenge for African agriculture, yet **household-level adaptation data is scarce**: - Privacy-sensitive (household finances, migration) - Expensive to collect (multi-year, multi-topic surveys) - Limited geographic coverage - Difficult to share This synthetic dataset enables **climate adaptation research without privacy concerns**. ### Source Data #### Literature Sources (16 peer-reviewed papers for new variables) **Climate Shocks**: - Sheffield et al. (2014) - Drought monitoring SSA - Paeth et al. (2011) - Flood exposure patterns - Mbow et al. (2019) - IPCC Africa temperature trends - Tall et al. (2018) - Climate information access **Technology Adoption**: - Suri (2011) - Improved seed adoption economics - You et al. (2011) - Irrigation potential Africa - Giller et al. (2009) - Conservation agriculture SSA - Garrity et al. (2010) - Agroforestry adoption - Diao et al. (2014) - Mechanization levels - Tsan et al. (2019) - Digital agriculture uptake **Adaptation Strategies**: - Di Falco & Chavas (2009) - Crop diversification - Barrett et al. (2001) - Livelihood diversification - Gray & Mueller (2012) - Climate migration **Support Access**: - Fletschner & Kenney (2014) - Credit access rural women - Carter et al. (2017) - Index insurance developing countries - Buchner et al. (2019) - Climate finance landscape **Plus 24+ sources** for inherited variables from Datasets 1-3. ### Personal and Sensitive Information **None** - This dataset contains NO personal information. All data is synthetically generated. ## Bias, Risks, and Limitations ### Known Limitations 1. **Simplified climate-tech relationships**: Real adoption decisions more complex 2. **Cross-sectional only**: No panel dynamics (see Dataset 5 when available) 3. **Missing variables**: Social networks, labor availability, detailed prices 4. **No spatial coordinates**: Zone-level only 5. **Adoption barriers simplified**: Cultural, information, structural barriers not fully modeled ### Biases - **Literature bias**: Reflects published research, may under-represent marginalized groups - **Tech optimism**: May overstate technology effectiveness vs. literature - **Temporal**: Primarily 2010-2020 patterns, climate change accelerating ### Recommendations Users should: 1. **Always validate** on real climate adaptation data before deployment 2. **Acknowledge synthetic nature** in all publications 3. **Test sensitivity** to key assumptions (e.g., adoption rates) 4. **Consider context**: SSA-specific, may not transfer to other regions 5. **Cite properly** (see below) ## Additional Information ### Dataset Curators **Electric Sheep Africa** Generated using **Synthetic Data Generation Playbook** methodology ### Portfolio Context This is **Dataset 4** in the African Agriculture Synthetic Data Portfolio: 1. ✅ [Smallholder Farming Systems](https://huggingface.co/datasets/electricsheepafrica/african-agriculture-synthetic) 2. ✅ [Livestock Health & Disease](https://huggingface.co/datasets/electricsheepafrica/livestock-health-disease-ssa-synthetic) 3. ✅ [Post-Harvest Value Chains](https://huggingface.co/datasets/electricsheepafrica/postharvest-value-chains-ssa-synthetic) 4. 🔄 **Climate Adaptation & Technology** (this dataset) 5. 📋 Panel Data (planned) **Combined**: 4,000,000 records, 93 unique variables, 40+ literature sources ### Licensing Information - **Dataset**: Creative Commons Attribution 4.0 (CC-BY-4.0) - **Code**: MIT License - **Documentation**: CC-BY-4.0 **Requirements**: - ✅ Attribution required - ✅ Must acknowledge synthetic nature - ✅ Follow acceptable use policy - ❌ No warranty provided ### Citation Information ```bibtex @dataset{african_climate_adaptation_synthetic_2025, title = {Climate Adaptation and Technology - Sub-Saharan Africa Synthetic Dataset}, author = {Electric Sheep Africa}, year = {2025}, publisher = {HuggingFace}, version = {1.0.0}, url = {https://huggingface.co/datasets/electricsheepafrica/climate-adaptation-tech-ssa-synthetic}, note = {Dataset 4 of African Agriculture Portfolio. 1 million synthetic records, 57 variables, 16 new literature sources for climate/tech variables} } ``` **Cite the full portfolio**: ```bibtex @collection{african_agriculture_portfolio_2025, title = {African Agriculture Synthetic Data Portfolio}, author = {Electric Sheep Africa}, year = {2025}, publisher = {HuggingFace}, note = {4 datasets, 4 million records, 93 variables, 40+ literature sources} } ``` ### Contributions Report issues or improvements: - Data quality issues - Documentation improvements - New variable suggestions - Validation findings against real climate adaptation data ## Quick Start ```python # Load dataset from datasets import load_dataset ds = load_dataset("electricsheepafrica/climate-adaptation-tech-ssa-synthetic") # Or with pandas import pandas as pd df = pd.read_parquet("hf://datasets/electricsheepafrica/climate-adaptation-tech-ssa-synthetic/climate_adaptation_data.parquet") # Explore climate adaptation patterns print(df[['drought_events_5yr', 'improved_seed_use', 'irrigation_access']].describe()) # Technology adoption by climate exposure adoption_by_drought = df.groupby('drought_events_5yr')['improved_seed_use'].value_counts(normalize=True) print(adoption_by_drought) # Credit access and technology adoption relationship credit_tech = pd.crosstab(df['credit_access'], df['conservation_agriculture'], normalize='index') print(credit_tech) ``` ## Validation The dataset has been validated against: - ✅ Literature benchmarks (adoption rates match published studies) - ✅ Logical consistency (e.g., climate info access higher with extension) - ✅ Conditional dependencies (e.g., irrigation higher in peri-urban) - ✅ Missing data patterns realistic See metadata files for detailed validation results. --- **Questions?** See full documentation in repository **Ready to use!** Load and start analyzing climate adaptation patterns immediately. **Part of a larger portfolio**: Combine with Datasets 1-3 for integrated farm-livestock-postharvest-climate analysis.
提供机构:
electricsheepafrica
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作