five

electricsheepafrica/africa-wfp-food-prices-for-madagascar

收藏
Hugging Face2026-04-05 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/electricsheepafrica/africa-wfp-food-prices-for-madagascar
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - no-annotation language_creators: - found language: - en license: cc-by-4.0 multilinguality: - monolingual size_categories: - 10K<n<100K source_datasets: - original task_categories: - tabular-regression - other task_ids: [] tags: - africa - humanitarian - hdx - electric-sheep-africa - economics - food-security - indicators - markets - mdg pretty_name: "Madagascar - Food Prices" dataset_info: splits: - name: train num_examples: 18491 - name: test num_examples: 4622 --- # Madagascar - Food Prices **Publisher:** WFP - World Food Programme · **Source:** [HDX](https://data.humdata.org/dataset/wfp-food-prices-for-madagascar) · **License:** `cc-by-igo` · **Updated:** 2026-04-05 --- ## Abstract This dataset contains Food Prices data for Madagascar, sourced from the World Food Programme Price Database. The World Food Programme Price Database covers foods such as maize, rice, beans, fish, and sugar for 98 countries and some 3000 markets. It is updated weekly but contains to a large extent monthly data. The data goes back as far as 1992 for a few countries, although many countries started reporting from 2003 or thereafter. Each row in this dataset represents subnational administrative unit observations. Temporal coverage is indicated by the `date` column(s). Geographic scope: **MDG**. *Curated into ML-ready Parquet format by [Electric Sheep Africa](https://huggingface.co/electricsheepafrica).* --- ## Dataset Characteristics | | | |---|---| | **Domain** | Food security and nutrition | | **Unit of observation** | Subnational administrative unit observations | | **Rows (total)** | 23,114 | | **Columns** | 18 (6 numeric, 11 categorical, 1 datetime) | | **Train split** | 18,491 rows | | **Test split** | 4,622 rows | | **Geographic scope** | MDG | | **Publisher** | WFP - World Food Programme | | **HDX last updated** | 2026-04-05 | --- ## Variables **Geographic** — `admin1` (Atsimo Andrefana, Androy, Anosy), `admin2` (Betioky Atsimo, Ampanihy Ouest, Ambovombe-Androy), `latitude` (range -25.54–-12.28), `longitude` (range 43.31–50.17), `category` (cereals and tubers, oil and fats, pulses and nuts) and 4 others. **Temporal** — `date`. **Outcome / Measurement** — `priceflag` (aggregate, actual), `price` (range 220.0–16000.0), `usdprice` (range 0.1–3.73). **Identifier / Metadata** — `market_id` (range 732.0–11286.0), `esa_source` (HDX), `esa_processed`. **Other** — `market` (Amboasary Sud, Tsihombe, Ampanihy), `unit` (KG, L). --- ## Quick Start ```python from datasets import load_dataset ds = load_dataset("electricsheepafrica/africa-wfp-food-prices-for-madagascar") train = ds["train"].to_pandas() test = ds["test"].to_pandas() print(train.shape) train.head() ``` --- ## Schema | Column | Type | Null % | Range / Sample Values | |---|---|---|---| | `date` | datetime64[ns] | 0.0% | | | `admin1` | object | 0.0% | Atsimo Andrefana, Androy, Anosy | | `admin2` | object | 0.0% | Betioky Atsimo, Ampanihy Ouest, Ambovombe-Androy | | `market` | object | 0.0% | Amboasary Sud, Tsihombe, Ampanihy | | `market_id` | int64 | 0.0% | 732.0 – 11286.0 (mean 5547.3975) | | `latitude` | float64 | 0.0% | -25.54 – -12.28 (mean -22.6118) | | `longitude` | float64 | 0.0% | 43.31 – 50.17 (mean 46.0967) | | `category` | object | 0.0% | cereals and tubers, oil and fats, pulses and nuts | | `commodity` | object | 0.0% | Rice (local), Rice (imported), Maize (crushed) | | `commodity_id` | int64 | 0.0% | 51.0 – 1332.0 (mean 345.2263) | | `unit` | object | 0.0% | KG, L | | `priceflag` | object | 0.0% | aggregate, actual | | `pricetype` | object | 0.0% | Retail, Producer | | `currency` | object | 0.0% | MGA | | `price` | float64 | 0.0% | 220.0 – 16000.0 (mean 3690.1987) | | `usdprice` | float64 | 0.0% | 0.1 – 3.73 (mean 0.9432) | | `esa_source` | object | 0.0% | HDX | | `esa_processed` | object | 0.0% | | --- ## Numeric Summary | Column | Min | Max | Mean | Median | |---|---|---|---|---| | `market_id` | 732.0 | 11286.0 | 5547.3975 | 6568.0 | | `latitude` | -25.54 | -12.28 | -22.6118 | -23.58 | | `longitude` | 43.31 | 50.17 | 46.0967 | 45.86 | | `commodity_id` | 51.0 | 1332.0 | 345.2263 | 283.0 | | `price` | 220.0 | 16000.0 | 3690.1987 | 2800.0 | | `usdprice` | 0.1 | 3.73 | 0.9432 | 0.72 | --- ## Curation Raw data was downloaded from HDX via the CKAN API and converted to Parquet. Column names were lowercased and standardised to snake_case. Common missing-value markers (`N/A`, `null`, `none`, `-`, `unknown`, `no data`, `#N/A`) were unified to `NaN`. 1 column(s) were cast from string to numeric or datetime based on parse-success rate (>85% threshold). The dataset was split 80/20 into train and test partitions using a fixed random seed (42) and saved as Snappy-compressed Parquet. --- ## Limitations - Data originates from WFP - World Food Programme and has not been independently validated by ESA. - Automated cleaning cannot correct for misreported values, definitional inconsistencies, or sampling bias in the original collection. - Refer to the [original HDX dataset page](https://data.humdata.org/dataset/wfp-food-prices-for-madagascar) for the publisher's own methodology notes and caveats. --- ## Citation ```bibtex @dataset{hdx_africa_wfp_food_prices_for_madagascar, title = {Madagascar - Food Prices}, author = {WFP - World Food Programme}, year = {2026}, url = {https://data.humdata.org/dataset/wfp-food-prices-for-madagascar}, note = {Repackaged for machine learning by Electric Sheep Africa (https://huggingface.co/electricsheepafrica)} } ``` --- *[Electric Sheep Africa](https://huggingface.co/electricsheepafrica) — Africa's ML dataset infrastructure. Lagos, Nigeria.*
提供机构:
electricsheepafrica
二维码
社区交流群
二维码
科研交流群
商业服务