five

electricsheepafrica/africa-mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya

收藏
Hugging Face2026-04-17 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - no-annotation language_creators: - found language: - en license: other multilinguality: - monolingual size_categories: - 1K<n<10K source_datasets: - original task_categories: - tabular-regression - other task_ids: [] tags: - africa - humanitarian - hdx - electric-sheep-africa - economics - ken pretty_name: "Kenya - Distribution of Donor and Government funded Projects" dataset_info: splits: - name: train num_examples: 1752 - name: test num_examples: 438 --- # Kenya - Distribution of Donor and Government funded Projects **Publisher:** Kenya Open Data Initiative (inactive) · **Source:** [HDX](https://data.humdata.org/dataset/mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya) · **License:** `other-pd-nr` · **Updated:** 2022-01-04 --- ## Abstract This dataset contains the summarized distribution of Donor and GOK funded Projects across the Country. Together with the location information on projects the dataset contains descriptive information about the project, the total financial estimates and the average estimated costs per year based on the duration of the project. Each row in this dataset represents time-series observations. Data was last updated on HDX on 2022-01-04. Geographic scope: **KEN**. *Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).* --- ## Dataset Characteristics | | | |---|---| | **Domain** | Humanitarian and development data | | **Unit of observation** | Time-series observations | | **Rows (total)** | 2,191 | | **Columns** | 24 (6 numeric, 18 categorical, 0 datetime) | | **Train split** | 1,752 rows | | **Test split** | 438 rows | | **Geographic scope** | KEN | | **Publisher** | Kenya Open Data Initiative (inactive) | | **HDX last updated** | 2022-01-04 | --- ## Variables **Geographic** — `y` (range -4.5938–4.6435), `x` (range 33.9963–41.6592), `ward` (WASO, TOWNSHIP, KWAVONZA/YATTA), `constituency` (SAMBURU EAST, MAKUENI, KIBWEZI WEST), `county` (MAKUENI, SAMBURU, NAIROBI) and 3 others. **Temporal** — `duration_months` (71 months, 95 months, 59 months). **Outcome / Measurement** — `total_project_cost_kes` (range -2139934592.0–2142000000.0). **Identifier / Metadata** — `objectid` (range 1.0–2191.0), `projectid` (2010/052061, 2011/053187, 2010/052263), `epgeoname` (Unspecified, Nation Wide - Kenya, Kenya), `project_title` (Cash Trasfer For Orphans & Vulnerable Children (CT-OVC), Kenya Agricutural Productivity & Agribusiness Project (KAPAP), Community Empowerement And Institutional Support Project (CEISP)), `esa_source` and 1 others. **Other** — `duration` (71 months 29 days, 95 months 8 days, 87 months 27 days), `project_description` (Transfering regular and predictable cash to identified poor households living with orphans and vulnerable children to reduce the level of poverty and vulnerability and promote social inclusion and participation of the vulnerable groups., To support the agricultural research systems and agricultural extension, farmer and other Stakeholder empowerment., The project comprises of two components; i) Capacity building component which aims at developing the capacity of communities and other local level stakeholders to make better, more focused and equitable use of resources made available through Devolved f), `project_objectives` (To reduce vulnerability and subsequent suffering among the porest kenyan households taking care of the OVC by providing regular and preditable cash transfer to poor households taking care of the orphans and vulnerable children in order to strengthen th, To increase the agricultural productivity and incomes of participating small-holder farmers in 20 districs in Kenya, The project objective is to help empower poor communities to access socio-economic services in order to reduce poverty and to improve the management of local socio-economic development.), `ng_programme`, `vision_2030_flagship_project_pr` and 3 others. --- ## Quick Start ```python from datasets import load_dataset ds = load_dataset("electricsheepafrica/africa-mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya") train = ds["train"].to_pandas() test = ds["test"].to_pandas() print(train.shape) train.head() ``` --- ## Schema | Column | Type | Null % | Range / Sample Values | |---|---|---|---| | `objectid` | int64 | 0.0% | 1.0 – 2191.0 (mean 1096.0) | | `projectid` | object | 0.0% | 2010/052061, 2011/053187, 2010/052263 | | `epgeoname` | object | 0.0% | Unspecified, Nation Wide - Kenya, Kenya | | `y` | float64 | 53.1% | -4.5938 – 4.6435 (mean -0.984) | | `x` | float64 | 53.1% | 33.9963 – 41.6592 (mean 37.1089) | | `ward` | object | 38.7% | WASO, TOWNSHIP, KWAVONZA/YATTA | | `constituency` | object | 16.6% | SAMBURU EAST, MAKUENI, KIBWEZI WEST | | `county` | object | 16.7% | MAKUENI, SAMBURU, NAIROBI | | `project_cost_yearly_breakdown` | float64 | 29.9% | -2139934592.0 – 2142000000.0 (mean 247723019.0248) | | `total_project_cost_kes` | float64 | 24.9% | -2139934592.0 – 2142000000.0 (mean 311121475.7874) | | `duration` | object | 24.9% | 71 months 29 days, 95 months 8 days, 87 months 27 days | | `duration_months` | object | 24.9% | 71 months, 95 months, 59 months | | `project_title` | object | 24.9% | Cash Trasfer For Orphans & Vulnerable Children (CT-OVC), Kenya Agricutural Productivity & Agribusiness Project (KAPAP), Community Empowerement And Institutional Support Project (CEISP) | | `project_description` | object | 25.5% | Transfering regular and predictable cash to identified poor households living with orphans and vulnerable children to reduce the level of poverty and vulnerability and promote social inclusion and participation of the vulnerable groups., To support the agricultural research systems and agricultural extension, farmer and other Stakeholder empowerment., The project comprises of two components; i) Capacity building component which aims at developing the capacity of communities and other local level stakeholders to make better, more focused and equitable use of resources made available through Devolved f | | `project_objectives` | object | 25.4% | To reduce vulnerability and subsequent suffering among the porest kenyan households taking care of the OVC by providing regular and preditable cash transfer to poor households taking care of the orphans and vulnerable children in order to strengthen th, To increase the agricultural productivity and incomes of participating small-holder farmers in 20 districs in Kenya, The project objective is to help empower poor communities to access socio-economic services in order to reduce poverty and to improve the management of local socio-economic development. | | `ng_programme` | object | 24.9% | | | `vision_2030_flagship_ministry` | object | 25.0% | | | `vision_2030_flagship_project_pr` | object | 24.9% | | | `implementing_agency` | object | 24.9% | | | `implementation_status` | object | 24.9% | | | `mtef_sector` | object | 24.9% | | | `work_plan_progress` | int64 | 0.0% | 0.0 – 100.0 (mean 1.1584) | | `esa_source` | object | 0.0% | | | `esa_processed` | object | 0.0% | | --- ## Numeric Summary | Column | Min | Max | Mean | Median | |---|---|---|---|---| | `objectid` | 1.0 | 2191.0 | 1096.0 | 1096.0 | | `y` | -4.5938 | 4.6435 | -0.984 | -0.9599 | | `x` | 33.9963 | 41.6592 | 37.1089 | 37.2847 | | `project_cost_yearly_breakdown` | -2139934592.0 | 2142000000.0 | 247723019.0248 | 70564139.0 | | `total_project_cost_kes` | -2139934592.0 | 2142000000.0 | 311121475.7874 | 80000000.0 | | `work_plan_progress` | 0.0 | 100.0 | 1.1584 | 0.0 | --- ## Curation Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. 5 column(s) with >80% missing values were removed: `approval_date`, `start_date_planned`, `start_date_actual`, `end_date_planned`, `end_date_actual`. The dataset was split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet. --- ## Limitations - Data originates from Kenya Open Data Initiative (inactive) and has not been independently validated by ESA. - Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection. - The following columns have >20% missing values and should be treated with caution in modelling: `y`, `x`, `ward`, `project_cost_yearly_breakdown`, `total_project_cost_kes`, `duration`, `duration_months`, `project_title`.... - Refer to the [original HDX dataset page](https://data.humdata.org/dataset/mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya) for the publisher's own methodology notes and caveats. --- ## Citation ```bibtex @dataset{hdx_africa_mapped_distribution_of_donor_and_government_funded_projects_for_2013_2015_in_kenya, title = {Kenya - Distribution of Donor and Government funded Projects}, author = {Kenya Open Data Initiative (inactive)}, year = {2022}, url = {https://data.humdata.org/dataset/mapped-distribution-of-donor-and-government-funded-projects-for-2013-2015-in-kenya}, note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)} } ``` --- *[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*
提供机构:
electricsheepafrica
二维码
社区交流群
二维码
科研交流群
商业服务