electricsheepafrica/africa-structures-de-sante-guinee-vf
Source: Hugging Face · Updated: 2026-04-07 · Indexed: 2026-04-12
Download link: https://hf-mirror.com/datasets/electricsheepafrica/africa-structures-de-sante-guinee-vf
---
annotations_creators:
- no-annotation
language_creators:
- found
language:
- en
license: other
multilinguality:
- monolingual
size_categories:
- 1K<n<10K
source_datasets:
- original
task_categories:
- tabular-classification
- other
task_ids: []
tags:
- africa
- humanitarian
- hdx
- electric-sheep-africa
- health
- health-facilities
- gin
pretty_name: "Structures de Santé - Guinée"
dataset_info:
  splits:
  - name: train
    num_examples: 1949
  - name: test
    num_examples: 487
---
# Structures de Santé - Guinée
**Publisher:** American Red Cross (inactive) · **Source:** [HDX](https://data.humdata.org/dataset/structures_de_sante_guinee_vf) · **License:** `other-pd-nr` · **Updated:** 2025-02-06
---
## Abstract
This dataset lists the health facilities (structures de santé) of Guinea. It contains 2,430 facilities, of which 1,773 have geographic coordinates (latitude and longitude) and 666 have no GPS coordinates. These counts are the publisher's; the two subtotals sum to 2,439 rather than 2,430.
Each row represents one health facility, with point coordinates where available. The data was last updated on HDX on 2025-02-06. Geographic scope: **GIN** (Guinea).
*Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).*
---
## Dataset Characteristics
| | |
|---|---|
| **Domain** | Public health |
| **Unit of observation** | Geolocated point observations |
| **Rows (total)** | 2,437 |
| **Columns** | 3 (0 numeric, 3 categorical, 0 datetime) |
| **Train split** | 1,949 rows |
| **Test split** | 487 rows |
| **Geographic scope** | GIN |
| **Publisher** | American Red Cross (inactive) |
| **HDX last updated** | 2025-02-06 |
---
## Variables
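**Geographic:** `nom_code_prefecture_latitude_longiture`. Each value is a single semicolon-delimited string that packs the facility name, code, prefecture, latitude, and longitude; individual fields may be empty. Sample values: `CSR de Albadariah centre;;CSR524;KISSIDOUGOU;;9.55066709;-10.09971425`, `PS Dandakara;;;DABOLA;;;`, `CSR de konendou;;CSR893;DABOLA;;10.68601728;-10.85957025`.
**Identifier / Metadata:** `esa_source` (`HDX`), `esa_processed` (`2026-04-07`).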
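
Because the geographic column arrives as one packed string, most uses start by splitting it. The following is a minimal sketch using only the standard library. The field positions (0 = name, 2 = code, 3 = prefecture, 5 = latitude, 6 = longitude, with positions 1 and 4 empty) are an assumption inferred from the three sample values and the column name, not a documented layout; `Facility`, `_to_float`, and `parse_facility` are hypothetical helper names.

```python
from typing import NamedTuple, Optional


class Facility(NamedTuple):
    name: str
    code: Optional[str]
    prefecture: Optional[str]
    latitude: Optional[float]
    longitude: Optional[float]


def _to_float(text: str) -> Optional[float]:
    """Return a float, or None for an empty or non-numeric field."""
    try:
        return float(text)
    except ValueError:
        return None


def parse_facility(raw: str) -> Facility:
    """Split one packed record on ';' (assumed layout, see the note above)."""
    parts = [p.strip() for p in raw.split(";")]
    # Pad short records so positional access never fails.
    parts += [""] * (7 - len(parts))
    return Facility(
        name=parts[0],
        code=parts[2] or None,
        prefecture=parts[3] or None,
        latitude=_to_float(parts[5]),
        longitude=_to_float(parts[6]),
    )


sample = "CSR de Albadariah centre;;CSR524;KISSIDOUGOU;;9.55066709;-10.09971425"
facility = parse_facility(sample)
print(facility)
```

Empty fields become `None` rather than empty strings, so records such as `PS Dandakara;;;DABOLA;;;` can be filtered by checking whether `latitude` is `None`.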
---
## Quick Start
```python
from datasets import load_dataset

# Download both splits from the Hugging Face Hub.
ds = load_dataset("electricsheepafrica/africa-structures-de-sante-guinee-vf")

# Convert each split to a pandas DataFrame.
train = ds["train"].to_pandas()
test = ds["test"].to_pandas()

print(train.shape)
print(train.head())
```
---
## Schema
| Column | Type | Null % | Range / Sample Values |
|---|---|---|---|
| `nom_code_prefecture_latitude_longiture` | object | 0.0% | `CSR de Albadariah centre;;CSR524;KISSIDOUGOU;;9.55066709;-10.09971425`, `PS Dandakara;;;DABOLA;;;`, `CSR de konendou;;CSR893;DABOLA;;10.68601728;-10.85957025` |
| `esa_source` | object | 0.0% | HDX |
| `esa_processed` | object | 0.0% | 2026-04-07 |
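
All three columns are typed `object`, so latitude and longitude are not directly usable as numbers. As a rough sketch, the packed column can be expanded column-wide with pandas and the coordinates coerced to floats. The three rows below are the sample values from the schema table; the field positions are the same inferred assumption (0 = name, 2 = code, 3 = prefecture, 5 = latitude, 6 = longitude), and the output column names are illustrative.

```python
import pandas as pd

COL = "nom_code_prefecture_latitude_longiture"

# Three sample values copied from the schema table.
df = pd.DataFrame({COL: [
    "CSR de Albadariah centre;;CSR524;KISSIDOUGOU;;9.55066709;-10.09971425",
    "PS Dandakara;;;DABOLA;;;",
    "CSR de konendou;;CSR893;DABOLA;;10.68601728;-10.85957025",
]})

# expand=True yields one column per ';'-separated field (7 for these rows).
fields = df[COL].str.split(";", expand=True)

# Assumed positions: 0 = name, 2 = code, 3 = prefecture, 5 = lat, 6 = lon.
parsed = pd.DataFrame({
    "name": fields[0],
    "code": fields[2].replace("", pd.NA),
    "prefecture": fields[3],
    # errors="coerce" turns empty or malformed values into NaN.
    "latitude": pd.to_numeric(fields[5], errors="coerce"),
    "longitude": pd.to_numeric(fields[6], errors="coerce"),
})

has_coords = parsed["latitude"].notna() & parsed["longitude"].notna()
print(f"{has_coords.sum()} of {len(parsed)} rows have coordinates")
```

Applied to the full `train` and `test` frames, the same `has_coords` mask gives a quick way to compare the geolocated share against the figures reported in the abstract.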
---
## Numeric Summary
_No numeric columns._
---
## Curation
Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. Two exact duplicate rows were removed. The dataset was then split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet.
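
The steps described above can be approximated with pandas. This is an illustrative sketch, not the actual ESA pipeline: the helper names `to_snake_case` and `curate`, the exact-match handling of missing-value markers, the use of `DataFrame.sample` for the split, and the toy input frame are all assumptions.

```python
import re

import numpy as np
import pandas as pd

# Markers listed in the curation notes.
MISSING_MARKERS = ["N/A", "null", "none", "-", "unknown", "no data", "#N/A"]


def to_snake_case(name: str) -> str:
    """Lowercase a column name and collapse non-alphanumeric runs to '_'."""
    return re.sub(r"[^0-9a-z]+", "_", name.strip().lower()).strip("_")


def curate(raw: pd.DataFrame, seed: int = 42, test_frac: float = 0.2):
    """Return (train, test) after renaming, NaN unification, and deduplication."""
    df = raw.rename(columns=to_snake_case)
    # Exact-match replacement only; case-insensitive matching is not assumed.
    df = df.replace(MISSING_MARKERS, np.nan)
    df = df.drop_duplicates().reset_index(drop=True)
    # Draw the test rows with a fixed seed; the remainder is the train split.
    test = df.sample(frac=test_frac, random_state=seed)
    train = df.drop(test.index)
    return train, test


# Toy input (hypothetical values) with one duplicate row and two markers.
raw = pd.DataFrame({
    "Nom Structure": ["A", "B", "B", "C", "D", "E"],
    "Code Prefecture": ["X", "N/A", "N/A", "-", "Y", "Z"],
})
train, test = curate(raw)
print(train.shape, test.shape)
```

Each split could then be written with `DataFrame.to_parquet(path, compression="snappy")`, which requires a Parquet engine such as `pyarrow` to be installed.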
---
## Limitations
- Data originates from American Red Cross (inactive) and has not been independently validated by ESA.
- Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection.
- Refer to the [original HDX dataset page](https://data.humdata.org/dataset/structures_de_sante_guinee_vf) for the publisher's own methodology notes and caveats.
---
## Citation
```bibtex
@dataset{hdx_africa_structures_de_sante_guinee_vf,
title = {Structures de Santé - Guinée},
author = {American Red Cross (inactive)},
year = {2025},
url = {https://data.humdata.org/dataset/structures_de_sante_guinee_vf},
note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)}
}
```
---
*[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*



