five

electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024

收藏
Hugging Face2026-04-07 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - no-annotation language_creators: - found language: - en license: cc-by-4.0 multilinguality: - monolingual size_categories: - n<1K source_datasets: - original task_categories: - tabular-classification task_ids: [] tags: - africa - humanitarian - hdx - electric-sheep-africa - disease - health - eth pretty_name: "Ethiopia: Zonal-Level HIV/AIDS Estimates" dataset_info: splits: - name: train num_examples: 111 - name: test num_examples: 27 --- # Ethiopia: Zonal-Level HIV/AIDS Estimates **Publisher:** 3iS · **Source:** [HDX](https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024) · **License:** `cc-by` · **Updated:** 2025-08-26 --- ## Abstract The HIV Estimates Dataset for Ethiopia (2024) provides a comprehensive analysis of the HIV epidemic at the zonal level, offering critical insights into trends, intervention impacts, and projections. This dataset is a vital resource for policymakers, researchers, and public health professionals seeking to understand and address the HIV/AIDS situation in Ethiopia. Developed through rigorous validation and consultation with key stakeholders—including the Ethiopian Ministry of Health, UNAIDS, USAID, WHO, and the CDC—this dataset serves as a crucial tool in the ongoing fight against HIV/AIDS in Ethiopia. It facilitates targeted interventions, policy formulation, and progress tracking toward epidemic control. Each row in this dataset represents first-level administrative unit observations. Data was last updated on HDX on 2025-08-26. Geographic scope: **ETH**. *Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).* --- ## Dataset Characteristics | | | |---|---| | **Domain** | Public health | | **Unit of observation** | First-level administrative unit observations | | **Rows (total)** | 139 | | **Columns** | 11 (1 numeric, 10 categorical, 0 datetime) | | **Train split** | 111 rows | | **Test split** | 27 rows | | **Geographic scope** | ETH | | **Publisher** | 3iS | | **HDX last updated** | 2025-08-26 | --- ## Variables **Geographic** — `region` (Oromia, Amhara, South Ethiopia), `zone` (Addis Ketema, West Arsi, Oromiya Zone). **Demographic** — `art_coverage_15` (96.8% (94.8-98.2%), 87.8% (76.6-94.9%), 92.6% (88.4-95.8%)). **Outcome / Measurement** — `number_of_residents_on_art_15` (100 (0-200), 100 (0-300), 600 (300-1,100)), `number_clients_receiving_art_15` (range 0.0–15700.0). **Identifier / Metadata** — `incidence_15_49_per_1000` (0.0 (0.0-0.1), 0.1 (0.0-0.1), 0.1 (0.0-0.2)), `esa_source` (HDX), `esa_processed` (2026-04-07). **Other** — `plhiv15` (100 (0-200), 100 (0-300), 400 (200-800)), `hiv_prevalence_15_49` (0.2% (0.1-0.3%), 0.3% (0.2-0.5%), 0.4% (0.2-0.7%)), `new_infection_15` (10 (10-20), 20 (10-40), 0 (0-10)). --- ## Quick Start ```python from datasets import load_dataset ds = load_dataset("electricsheepafrica/africa-ethiopia-zonal-level-hiv-aids-estimates-for-year-2024") train = ds["train"].to_pandas() test = ds["test"].to_pandas() print(train.shape) train.head() ``` --- ## Schema | Column | Type | Null % | Range / Sample Values | |---|---|---|---| | `region` | object | 0.0% | Oromia, Amhara, South Ethiopia | | `zone` | object | 0.0% | Addis Ketema, West Arsi, Oromiya Zone | | `plhiv15` | object | 0.0% | 100 (0-200), 100 (0-300), 400 (200-800) | | `hiv_prevalence_15_49` | object | 0.0% | 0.2% (0.1-0.3%), 0.3% (0.2-0.5%), 0.4% (0.2-0.7%) | | `incidence_15_49_per_1000` | object | 0.0% | 0.0 (0.0-0.1), 0.1 (0.0-0.1), 0.1 (0.0-0.2) | | `new_infection_15` | object | 0.0% | 10 (10-20), 20 (10-40), 0 (0-10) | | `art_coverage_15` | object | 0.0% | 96.8% (94.8-98.2%), 87.8% (76.6-94.9%), 92.6% (88.4-95.8%) | | `number_of_residents_on_art_15` | object | 0.0% | 100 (0-200), 100 (0-300), 600 (300-1,100) | | `number_clients_receiving_art_15` | int64 | 0.0% | 0.0 – 15700.0 (mean 3588.4892) | | `esa_source` | object | 0.0% | HDX | | `esa_processed` | object | 0.0% | 2026-04-07 | --- ## Numeric Summary | Column | Min | Max | Mean | Median | |---|---|---|---|---| | `number_clients_receiving_art_15` | 0.0 | 15700.0 | 3588.4892 | 2700.0 | --- ## Curation Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. The dataset was split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet. --- ## Limitations - Data originates from 3iS and has not been independently validated by ESA. - Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection. - Refer to the [original HDX dataset page](https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024) for the publisher's own methodology notes and caveats. --- ## Citation ```bibtex @dataset{hdx_africa_ethiopia_zonal_level_hiv_aids_estimates_for_year_2024, title = {Ethiopia: Zonal-Level HIV/AIDS Estimates}, author = {3iS}, year = {2025}, url = {https://data.humdata.org/dataset/ethiopia-zonal-level-hiv-aids-estimates-for-year-2024}, note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)} } ``` --- *[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*
提供机构:
electricsheepafrica
二维码
社区交流群
二维码
科研交流群
商业服务