electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024
收藏Hugging Face2026-04-07 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators:
- no-annotation
language_creators:
- found
language:
- en
license: cc-by-4.0
multilinguality:
- monolingual
size_categories:
- n<1K
source_datasets:
- original
task_categories:
- tabular-classification
task_ids: []
tags:
- africa
- humanitarian
- hdx
- electric-sheep-africa
- disease
- health
- eth
pretty_name: "Ethiopia: Zonal-Level HIV/AIDS Estimates"
dataset_info:
splits:
- name: train
num_examples: 111
- name: test
num_examples: 27
---
# Ethiopia: Zonal-Level HIV/AIDS Estimates
**Publisher:** 3iS · **Source:** [HDX](https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024) · **License:** `cc-by` · **Updated:** 2025-08-26
---
## Abstract
The HIV Estimates Dataset for Ethiopia (2024) provides a comprehensive analysis of the HIV epidemic at the zonal level, offering critical insights into trends, intervention impacts, and projections. This dataset is a vital resource for policymakers, researchers, and public health professionals seeking to understand and address the HIV/AIDS situation in Ethiopia.
Developed through rigorous validation and consultation with key stakeholders—including the Ethiopian Ministry of Health, UNAIDS, USAID, WHO, and the CDC—this dataset serves as a crucial tool in the ongoing fight against HIV/AIDS in Ethiopia. It facilitates targeted interventions, policy formulation, and progress tracking toward epidemic control.
Each row in this dataset represents first-level administrative unit observations. Data was last updated on HDX on 2025-08-26. Geographic scope: **ETH**.
*Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).*
---
## Dataset Characteristics
| | |
|---|---|
| **Domain** | Public health |
| **Unit of observation** | First-level administrative unit observations |
| **Rows (total)** | 139 |
| **Columns** | 11 (1 numeric, 10 categorical, 0 datetime) |
| **Train split** | 111 rows |
| **Test split** | 27 rows |
| **Geographic scope** | ETH |
| **Publisher** | 3iS |
| **HDX last updated** | 2025-08-26 |
---
## Variables
**Geographic** — `region` (Oromia, Amhara, South Ethiopia), `zone` (Addis Ketema, West Arsi, Oromiya Zone).
**Demographic** — `art_coverage_15` (96.8% (94.8-98.2%), 87.8% (76.6-94.9%), 92.6% (88.4-95.8%)).
**Outcome / Measurement** — `number_of_residents_on_art_15` (100 (0-200), 100 (0-300), 600 (300-1,100)), `number_clients_receiving_art_15` (range 0.0–15700.0).
**Identifier / Metadata** — `incidence_15_49_per_1000` (0.0 (0.0-0.1), 0.1 (0.0-0.1), 0.1 (0.0-0.2)), `esa_source` (HDX), `esa_processed` (2026-04-07).
**Other** — `plhiv15` (100 (0-200), 100 (0-300), 400 (200-800)), `hiv_prevalence_15_49` (0.2% (0.1-0.3%), 0.3% (0.2-0.5%), 0.4% (0.2-0.7%)), `new_infection_15` (10 (10-20), 20 (10-40), 0 (0-10)).
---
## Quick Start
```python
from datasets import load_dataset
ds = load_dataset("electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024")
train = ds["train"].to_pandas()
test = ds["test"].to_pandas()
print(train.shape)
train.head()
```
---
## Schema
| Column | Type | Null % | Range / Sample Values |
|---|---|---|---|
| `region` | object | 0.0% | Oromia, Amhara, South Ethiopia |
| `zone` | object | 0.0% | Addis Ketema, West Arsi, Oromiya Zone |
| `plhiv15` | object | 0.0% | 100 (0-200), 100 (0-300), 400 (200-800) |
| `hiv_prevalence_15_49` | object | 0.0% | 0.2% (0.1-0.3%), 0.3% (0.2-0.5%), 0.4% (0.2-0.7%) |
| `incidence_15_49_per_1000` | object | 0.0% | 0.0 (0.0-0.1), 0.1 (0.0-0.1), 0.1 (0.0-0.2) |
| `new_infection_15` | object | 0.0% | 10 (10-20), 20 (10-40), 0 (0-10) |
| `art_coverage_15` | object | 0.0% | 96.8% (94.8-98.2%), 87.8% (76.6-94.9%), 92.6% (88.4-95.8%) |
| `number_of_residents_on_art_15` | object | 0.0% | 100 (0-200), 100 (0-300), 600 (300-1,100) |
| `number_clients_receiving_art_15` | int64 | 0.0% | 0.0 – 15700.0 (mean 3588.4892) |
| `esa_source` | object | 0.0% | HDX |
| `esa_processed` | object | 0.0% | 2026-04-07 |
---
## Numeric Summary
| Column | Min | Max | Mean | Median |
|---|---|---|---|---|
| `number_clients_receiving_art_15` | 0.0 | 15700.0 | 3588.4892 | 2700.0 |
---
## Curation
Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. The dataset was split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet.
---
## Limitations
- Data originates from 3iS and has not been independently validated by ESA.
- Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection.
- Refer to the [original HDX dataset page](https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024) for the publisher's own methodology notes and caveats.
---
## Citation
```bibtex
@dataset{hdx_africa_ethiopia_zonal_level_hiv_aids_estimates_for_year_2024,
title = {Ethiopia: Zonal-Level HIV/AIDS Estimates},
author = {3iS},
year = {2025},
url = {https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024},
note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)}
}
```
---
*[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*
提供机构:
electricsheepafrica



