five

electricsheepafrica/africa-hrp-projects-caf

收藏
Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-hrp-projects-caf
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - no-annotation language_creators: - found language: - en license: cc-by-4.0 multilinguality: - monolingual size_categories: - n<1K source_datasets: - original task_categories: - other task_ids: [] tags: - africa - humanitarian - hdx - electric-sheep-africa - humanitarian-response-plan-hrp - who-is-doing-what-and-where-3w-4w-5w - caf pretty_name: "Central African Republic: Response Plan projects" dataset_info: splits: - name: train num_examples: 200 - name: test num_examples: 50 --- # Central African Republic: Response Plan projects **Publisher:** OCHA Humanitarian Programme Cycle Tools (HPC Tools) · **Source:** [HDX](https://data.humdata.org/dataset/hrp-projects-caf) · **License:** `cc-by-igo` · **Updated:** 2026-04-01 --- ## Abstract Projects proposed, in progress, or completed as part of the annual Central African Republic Humanitarian Response Plans (HRPs) or other Humanitarian Programme Cycle plans. The original data is available on https://hpc.tools **Important:** some projects in Central African Republic might be missing, and others might not apply specifically to Central African Republic. See _Caveats_ under the _Additional information_ tab. Each row in this dataset represents time-series observations. Temporal coverage is indicated by the `startdate`, `enddate` column(s). Geographic scope: **CAF**. *Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).* --- ## Dataset Characteristics | | | |---|---| | **Domain** | Humanitarian and development data | | **Unit of observation** | Time-series observations | | **Rows (total)** | 251 | | **Columns** | 13 (1 numeric, 10 categorical, 2 datetime) | | **Train split** | 200 rows | | **Test split** | 50 rows | | **Geographic scope** | CAF | | **Publisher** | OCHA Humanitarian Programme Cycle Tools (HPC Tools) | | **HDX last updated** | 2026-04-01 | --- ## Variables **Geographic** — `locations` (ETH, EGY, LBY). **Temporal** — `startdate`, `enddate`. **Identifier / Metadata** — `name` (Ethiopia: ACT Alliance/Ethiopian Orthodox Church-Development and Inter-Church Aid Commission (Education response), Uganda: ACT Alliance / Lutheran World Federation (Protection - Gender-Based Violence response), Egypt: United Nations High Commissioner for Refugees (Protection - Gender-Based Violence response)), `versioncode` (RRSDN24-EDU-213166-1, RRSDN24-PRO-220173-1, RRSDN24-PRO-213244-1), `response_plan_code` (RRSDN24), `esa_source` (HDX), `esa_processed` (2026-04-06). **Other** — `currentrequestedfunds` (range 5000.0–32130390.0), `objective` (International Rescue Committee - Protection response for Sudan Regional Plan in Libya, Cooperazione E Sviluppo - CESVI - Protection response for Sudan Regional Plan in Libya, ACT Alliance / Lutheran World Federation - Protection response for Sudan Regional Plan in Uganda), `globalclusters` (Health, Early Recovery, Protection), `organizations` (United Nations High Commissioner for Refugees, United Nations Children's Fund, International Organization for Migration), `plans` (Sudan Emergency: Regional Refugee Response Plan 2024). --- ## Quick Start ```python from datasets import load_dataset ds = load_dataset("electricsheepafrica/africa-hrp-projects-caf") train = ds["train"].to_pandas() test = ds["test"].to_pandas() print(train.shape) train.head() ``` --- ## Schema | Column | Type | Null % | Range / Sample Values | |---|---|---|---| | `name` | object | 0.0% | Ethiopia: ACT Alliance/Ethiopian Orthodox Church-Development and Inter-Church Aid Commission (Education response), Uganda: ACT Alliance / Lutheran World Federation (Protection - Gender-Based Violence response), Egypt: United Nations High Commissioner for Refugees (Protection - Gender-Based Violence response) | | `versioncode` | object | 0.0% | RRSDN24-EDU-213166-1, RRSDN24-PRO-220173-1, RRSDN24-PRO-213244-1 | | `currentrequestedfunds` | int64 | 0.0% | 5000.0 – 32130390.0 (mean 2010167.0876) | | `objective` | object | 0.0% | International Rescue Committee - Protection response for Sudan Regional Plan in Libya, Cooperazione E Sviluppo - CESVI - Protection response for Sudan Regional Plan in Libya, ACT Alliance / Lutheran World Federation - Protection response for Sudan Regional Plan in Uganda | | `startdate` | datetime64[ns] | 0.0% | | | `enddate` | datetime64[ns] | 0.0% | | | `globalclusters` | object | 0.0% | Health, Early Recovery, Protection | | `locations` | object | 0.0% | ETH, EGY, LBY | | `organizations` | object | 0.0% | United Nations High Commissioner for Refugees, United Nations Children's Fund, International Organization for Migration | | `plans` | object | 0.0% | Sudan Emergency: Regional Refugee Response Plan 2024 | | `response_plan_code` | object | 0.0% | RRSDN24 | | `esa_source` | object | 0.0% | HDX | | `esa_processed` | object | 0.0% | 2026-04-06 | --- ## Numeric Summary | Column | Min | Max | Mean | Median | |---|---|---|---|---| | `currentrequestedfunds` | 5000.0 | 32130390.0 | 2010167.0876 | 740000.0 | --- ## Curation Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. 1 column(s) with >80% missing values were removed: `partners`. 2 column(s) were cast from string to numeric or datetime based on parse-success rate (>85% threshold). The dataset was split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet. --- ## Limitations - Data originates from OCHA Humanitarian Programme Cycle Tools (HPC Tools) and has not been independently validated by ESA. - Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection. - Refer to the [original HDX dataset page](https://data.humdata.org/dataset/hrp-projects-caf) for the publisher's own methodology notes and caveats. --- ## Citation ```bibtex @dataset{hdx_africa_hrp_projects_caf, title = {Central African Republic: Response Plan projects}, author = {OCHA Humanitarian Programme Cycle Tools (HPC Tools)}, year = {2026}, url = {https://data.humdata.org/dataset/hrp-projects-caf}, note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)} } ``` --- *[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*

--- annotations_creators: - 无注释 language_creators: - 公开采集 language: - 英语 license: cc-by-4.0 multilinguality: - 单语言 size_categories: - 样本量小于1000 source_datasets: - 原创数据集 task_categories: - 其他 task_ids: [] tags: - 非洲 - 人道主义 - 人道主义数据交换平台(Humanitarian Data Exchange,HDX) - electric-sheep-africa - 人道主义响应计划(Humanitarian Response Plan,HRP) - who-is-doing-what-and-where-3w-4w-5w - caf pretty_name: "中非共和国:响应计划项目" dataset_info: splits: - name: train num_examples: 200 - name: test num_examples: 50 --- # 中非共和国:响应计划项目 **发布方:** 联合国人道主义事务协调厅人道主义计划周期工具(Office for the Coordination of Humanitarian Affairs Humanitarian Programme Cycle Tools,HPC Tools) · **数据源:** [HDX(Humanitarian Data Exchange)](https://data.humdata.org/dataset/hrp-projects-caf) · **授权协议:** `cc-by-igo` · **更新时间:** 2026-04-01 --- ## 摘要 纳入中非共和国年度人道主义响应计划(Humanitarian Response Plan,HRP)或其他人道主义计划周期计划的已提议、进行中或已完成的项目。原始数据可从 https://hpc.tools 获取。 **重要提示:** 中非共和国的部分项目可能存在缺失,或并非专门针对该国。请参阅「附加信息」标签下的「注意事项」。 本数据集的每一行均代表时序观测数据,时间覆盖范围由`startdate`(开始日期)、`enddate`(结束日期)列标注。地理覆盖范围:**CAF(中非共和国)**。 *本数据集已由[Electric Sheep Africa](https://huggingface.co/electricsheepafrica)整理为适用于机器学习的Parquet格式。* --- ## 数据集特征 | | | |---|---| | **领域** | 人道主义与发展数据 | | **观测单元** | 时序观测数据 | | **总行数** | 251 | | **列数** | 13(1个数值型、10个分类型、2个日期时间型) | | **训练集划分** | 200行 | | **测试集划分** | 50行 | | **地理覆盖范围** | CAF(中非共和国) | | **发布方** | 联合国人道主义事务协调厅人道主义计划周期工具(HPC Tools) | | **HDX最后更新时间** | 2026-04-01 | --- ## 变量 **地理类** — `locations`(国家代码示例:ETH、EGY、LBY)。 **时间类** — `startdate`(开始日期)、`enddate`(结束日期)。 **标识符/元数据类** — `name`(项目名称示例:埃塞俄比亚:ACT联盟/埃塞俄比亚正教会发展与教会间援助委员会(教育响应项目)、乌干达:ACT联盟/世界路德会联合会(保护——性别暴力响应项目)、埃及:联合国难民署(保护——性别暴力响应项目))、`versioncode`(版本代码示例:RRSDN24-EDU-213166-1、RRSDN24-PRO-220173-1、RRSDN24-PRO-213244-1)、`response_plan_code`(响应计划代码:RRSDN24)、`esa_source`(数据来源:HDX)、`esa_processed`(数据处理日期:2026-04-06)。 **其他类** — `currentrequestedfunds`(所需资金范围:5000.0–32130390.0)、`objective`(项目目标示例:国际救援委员会——利比亚苏丹区域计划保护响应项目、Cooperazione E Sviluppo - CESVI——利比亚苏丹区域计划保护响应项目、ACT联盟/世界路德会联合会——乌干达苏丹区域计划保护响应项目)、`globalclusters`(全球集群领域示例:卫生、早期恢复、保护)、`organizations`(参与组织示例:联合国难民署、联合国儿童基金会、国际移民组织)、`plans`(所属计划:苏丹紧急情况:2024年区域难民响应计划)。 --- ## 快速入门 python from datasets import load_dataset ds = load_dataset("electricsheepafrica/africa-hrp-projects-caf") train = ds["train"].to_pandas() test = ds["test"].to_pandas() print(train.shape) train.head() --- ## 数据结构 | 列名 | 数据类型 | 缺失率 | 取值范围/示例值 | |---|---|---| | `name` | 字符型 | 0.0% | 埃塞俄比亚:ACT联盟/埃塞俄比亚正教会发展与教会间援助委员会(教育响应项目)、乌干达:ACT联盟/世界路德会联合会(保护——性别暴力响应项目)、埃及:联合国难民署(保护——性别暴力响应项目) | | `versioncode` | 字符型 | 0.0% | RRSDN24-EDU-213166-1、RRSDN24-PRO-220173-1、RRSDN24-PRO-213244-1 | | `currentrequestedfunds` | 64位整型 | 0.0% | 5000.0 – 32130390.0(均值:2010167.0876) | | `objective` | 字符型 | 0.0% | 国际救援委员会——利比亚苏丹区域计划保护响应项目、Cooperazione E Sviluppo - CESVI——利比亚苏丹区域计划保护响应项目、ACT联盟/世界路德会联合会——乌干达苏丹区域计划保护响应项目 | | `startdate` | 日期时间型[ns] | 0.0% | 无 | | `enddate` | 日期时间型[ns] | 0.0% | 无 | | `globalclusters` | 字符型 | 0.0% | 卫生、早期恢复、保护 | | `locations` | 字符型 | 0.0% | ETH、EGY、LBY | | `organizations` | 字符型 | 0.0% | 联合国难民署、联合国儿童基金会、国际移民组织 | | `plans` | 字符型 | 0.0% | 苏丹紧急情况:2024年区域难民响应计划 | | `response_plan_code` | 字符型 | 0.0% | RRSDN24 | | `esa_source` | 字符型 | 0.0% | HDX | | `esa_processed` | 字符型 | 0.0% | 2026-04-06 | --- ## 数值型变量统计摘要 | 列名 | 最小值 | 最大值 | 均值 | 中位数 | |---|---|---|---|---| | `currentrequestedfunds` | 5000.0 | 32130390.0 | 2010167.0876 | 740000.0 | --- ## 数据整理流程 原始数据通过CKAN API从HDX下载,并转换为Parquet格式。列名统一转换为小写并标准化为蛇形命名法。将常见缺失值标记(`N/A`、`null`、`none`、`-`、`unknown`、`no data`、`#N/A`)统一替换为`NaN`。移除了1列缺失率超过80%的列:`partners`。根据解析成功率(阈值>85%),将2列从字符串类型转换为数值型或日期时间型。本数据集以固定随机种子(42)按80/20比例划分为训练集与测试集,并保存为Snappy压缩的Parquet格式。 --- ## 数据集局限性 - 数据源自联合国人道主义事务协调厅人道主义计划周期工具(HPC Tools),未由Electric Sheep Africa(ESA)进行独立验证。 - 自动化清洗无法修正原始数据收集中的错报值、定义不一致或抽样偏差问题。 - 请参阅[原始HDX数据集页面](https://data.humdata.org/dataset/hrp-projects-caf)查看发布方提供的方法论说明与注意事项。 --- ## 引用 bibtex @dataset{hdx_africa_hrp_projects_caf, title = {Central African Republic: Response Plan projects}, author = {OCHA Humanitarian Programme Cycle Tools (HPC Tools)}, year = {2026}, url = {https://data.humdata.org/dataset/hrp-projects-caf}, note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)} } --- *[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — 非洲机器学习数据集基础设施。尼日利亚拉各斯。*
提供机构:
electricsheepafrica
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作