five

EPI-Eval/opendengue

收藏
Hugging Face2026-04-26 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/EPI-Eval/opendengue
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: OpenDengue — national dengue case counts (V1.3) license: cc-by-4.0 size_categories: - 10K<n<100K tags: - cadence-irregular - geo-global - surveillance-arboviral - pathogen-dengue - tier-1 - availability-open schema_version: '0.1' source_id: opendengue source_url: https://opendengue.org/ manifest_section: §6.1 surveillance_category: arboviral pathogens: - dengue availability: open availability_notes: Static archive — V1.3 release on figshare. Updates are sporadic figshare uploads; this dataset pins V1.3 and must be re-ingested manually when a new release drops. access_type: github tier: 1 cadence: irregular geography_levels: - national geography_countries: - multiple gold_standard_for: [] vintaged_version_of: null succeeds: null derived_from: [] value_columns: - name: dengue_total unit: cases value_type: incident description: Reported dengue cases in the period. May include suspected, probable, and laboratory-confirmed cases — see `case_status` (schema-standard) and `case_definition` (source-original) for the certainty tier of each row. Includes all severity levels (dengue, dengue with/without warning signs, severe dengue, dengue haemorrhagic fever, dengue shock syndrome, dengue deaths) where reported. aggregation: sum notes: extra_columns: - column: case_definition description: 'Source-original case definition: `Suspected`, `Probable`, `Confirmed`, `Probable and confirmed`, `Suspected and confirmed`, `Total`. Preserved alongside the schema-standard `case_status` so original fidelity isn''t lost. The same (country, date) cell can have multiple rows when the source distinguishes definitions — filter to one for a single time series.' - column: temporal_resolution description: Per-row temporal granularity (`Week` or `Month`). Coarser source aggregates (`Year`, `Total`) are filtered out at ingest — the dashboard can sum kept rows on the fly when an annual roll-up is needed. - column: location_name description: Source's English country name (uppercase). `location_id` (ISO 3166-1 alpha-2) is canonical. interpretation_caveats: - column: case_status caveat: 'Mapping from the source''s `case_definition`: `Probable and confirmed` → `probable` (looser tier of the union, the minimum certainty represented); `Suspected and confirmed` → `suspect`; `Total` → `not-classified`. The original definition is preserved in `case_definition` for full fidelity.' - column: dengue_total caveat: Cross-country comparability is limited — case definitions, reporting completeness, and surveillance intensity vary substantially. OpenDengue normalizes structure but does not normalize underlying definitions. For modeling, prefer one country at a time or one case_status consistently. - column: case_definition caveat: '`Probable and confirmed` and `Suspected and confirmed` are *summed* categories from sources that combine tiers — they are not standalone classifications. Don''t sum them with `Confirmed`/`Probable`/`Suspected` rows from the same country and date.' - column: temporal_resolution caveat: A country may switch between Week and Month over its history. Pick one resolution per country when constructing a continuous series; don't naively concatenate rows of different temporal_resolution. general: 'OpenDengue (Clarke et al. 2024, https://doi.org/10.1038/s41597-024-03120-7) is a standardized archive of dengue case reports from public national surveillance systems. This dataset is the V1.3 **national** extract; subnational extracts (`Spatial_extract`, `Temporal_extract`) are larger and could be ingested as sibling datasets later (`opendengue-admin1`, `opendengue-admin2`). Coverage spans 1993-present (with a small 1924-1925 fragment), but per-country availability varies widely — Brazil has weekly data 2014+, others only monthly. Annual and Total aggregates from the source are filtered out so the dataset is series-shaped; sum kept rows in the dashboard when an annual roll-up is needed. Use the `case_status` / `case_definition` and `temporal_resolution` row-level columns to construct a coherent series. ' extra: release_tag: V1.3 figshare_doi: https://doi.org/10.6084/m9.figshare.24259573 github_repo: https://github.com/OpenDengue/master-repo source_csv: data/releases/V1.3/National_extract_V1_3.zip paper_doi: https://doi.org/10.1038/s41597-024-03120-7 case_definition_to_status_mapping: Confirmed: confirmed Probable: probable Suspected: suspect Probable and confirmed: probable Suspected and confirmed: suspect Total: not-classified case_definition_normalization: lowercase 'confirmed' rows in source were merged with 'Confirmed' to deduplicate filter values filtered_temporal_resolutions: - Year - Total computed: last_ingested: '2026-04-26T02:53:03Z' row_count: 26378 time_coverage: - start: '1924-01-26' end: '1925-01-17' - start: '1993-01-31' end: '2025-04-30' geography_unit_count: 127 observed_cadence_days: 7 missing_gaps: - start: '1924-01-26' end: '1924-03-01' weeks: 5 - start: '1924-03-01' end: '1924-04-12' weeks: 6 - start: '1924-04-12' end: '1924-07-19' weeks: 14 - start: '1924-07-19' end: '1924-08-16' weeks: 4 - start: '1924-09-06' end: '1924-09-27' weeks: 3 - start: '1924-09-27' end: '1925-01-10' weeks: 15 - start: '1993-01-31' end: '1993-02-28' weeks: 4 - start: '1993-02-28' end: '1993-03-31' weeks: 4 - start: '1993-03-31' end: '1993-04-30' weeks: 4 - start: '1993-04-30' end: '1993-05-31' weeks: 4 - start: '1993-05-31' end: '1993-06-30' weeks: 4 - start: '1993-06-30' end: '1993-07-31' weeks: 4 - start: '1993-07-31' end: '1993-08-31' weeks: 4 - start: '1993-08-31' end: '1993-09-30' weeks: 4 - start: '1993-09-30' end: '1993-10-31' weeks: 4 - start: '1993-10-31' end: '1993-11-30' weeks: 4 - start: '1993-11-30' end: '1993-12-31' weeks: 4 - start: '1993-12-31' end: '1994-01-31' weeks: 4 - start: '1994-01-31' end: '1994-02-28' weeks: 4 - start: '1994-02-28' end: '1994-03-31' weeks: 4 - start: '1994-03-31' end: '1994-04-30' weeks: 4 - start: '1994-04-30' end: '1994-05-31' weeks: 4 - start: '1994-05-31' end: '1994-06-30' weeks: 4 - start: '1994-06-30' end: '1994-07-31' weeks: 4 - start: '1994-07-31' end: '1994-08-31' weeks: 4 - start: '1994-08-31' end: '1994-09-30' weeks: 4 - start: '1994-09-30' end: '1994-10-31' weeks: 4 - start: '1994-10-31' end: '1994-11-30' weeks: 4 - start: '1994-11-30' end: '1994-12-31' weeks: 4 - start: '1994-12-31' end: '1995-01-31' weeks: 4 - start: '1995-01-31' end: '1995-02-28' weeks: 4 - start: '1995-02-28' end: '1995-03-31' weeks: 4 - start: '1995-03-31' end: '1995-04-30' weeks: 4 - start: '1995-04-30' end: '1995-05-31' weeks: 4 - start: '1995-05-31' end: '1995-06-30' weeks: 4 - start: '1995-06-30' end: '1995-07-31' weeks: 4 - start: '1995-07-31' end: '1995-08-31' weeks: 4 - start: '1995-08-31' end: '1995-09-30' weeks: 4 - start: '1995-09-30' end: '1995-10-31' weeks: 4 - start: '1995-10-31' end: '1995-11-30' weeks: 4 - start: '1995-11-30' end: '1995-12-31' weeks: 4 - start: '1995-12-31' end: '1996-01-31' weeks: 4 - start: '1996-01-31' end: '1996-02-29' weeks: 4 - start: '1996-02-29' end: '1996-03-31' weeks: 4 - start: '1996-03-31' end: '1996-04-30' weeks: 4 - start: '1996-04-30' end: '1996-05-31' weeks: 4 - start: '1996-05-31' end: '1996-06-30' weeks: 4 - start: '1996-06-30' end: '1996-07-31' weeks: 4 - start: '1996-07-31' end: '1996-08-31' weeks: 4 - start: '1996-08-31' end: '1996-09-30' weeks: 4 - start: '1996-09-30' end: '1996-10-31' weeks: 4 - start: '1996-10-31' end: '1996-11-30' weeks: 4 - start: '1996-11-30' end: '1996-12-31' weeks: 4 - start: '1996-12-31' end: '1997-01-31' weeks: 4 - start: '1997-01-31' end: '1997-02-28' weeks: 4 - start: '1997-02-28' end: '1997-03-31' weeks: 4 - start: '1997-03-31' end: '1997-04-30' weeks: 4 - start: '1997-04-30' end: '1997-05-31' weeks: 4 - start: '1997-05-31' end: '1997-06-30' weeks: 4 - start: '1997-06-30' end: '1997-07-31' weeks: 4 - start: '1997-07-31' end: '1997-08-31' weeks: 4 - start: '1997-08-31' end: '1997-09-30' weeks: 4 - start: '1997-09-30' end: '1997-10-31' weeks: 4 - start: '1997-10-31' end: '1997-11-30' weeks: 4 - start: '1997-11-30' end: '1997-12-31' weeks: 4 - start: '1997-12-31' end: '1998-01-17' weeks: 2 - start: '1998-01-31' end: '1998-02-21' weeks: 3 - start: '1999-01-02' end: '1999-01-31' weeks: 4 - start: '1999-01-31' end: '1999-02-28' weeks: 4 - start: '1999-02-28' end: '1999-03-31' weeks: 4 - start: '1999-03-31' end: '1999-04-30' weeks: 4 - start: '1999-04-30' end: '1999-05-31' weeks: 4 - start: '1999-05-31' end: '1999-06-30' weeks: 4 - start: '1999-06-30' end: '1999-07-31' weeks: 4 - start: '1999-07-31' end: '1999-08-31' weeks: 4 - start: '1999-08-31' end: '1999-09-30' weeks: 4 - start: '1999-09-30' end: '1999-10-31' weeks: 4 - start: '1999-10-31' end: '1999-11-30' weeks: 4 - start: '1999-11-30' end: '1999-12-31' weeks: 4 - start: '2024-12-31' end: '2025-01-31' weeks: 4 - start: '2025-01-31' end: '2025-02-28' weeks: 4 - start: '2025-02-28' end: '2025-03-31' weeks: 4 - start: '2025-03-31' end: '2025-04-30' weeks: 4 data_hash: b5be5b622fec7445 --- # OpenDengue — national dengue case counts (V1.3) OpenDengue (Clarke et al. 2024, https://doi.org/10.1038/s41597-024-03120-7) is a standardized archive of dengue case reports from public national surveillance systems. This dataset is the V1.3 **national** extract; subnational extracts (`Spatial_extract`, `Temporal_extract`) are larger and could be ingested as sibling datasets later (`opendengue-admin1`, `opendengue-admin2`). Coverage spans 1993-present (with a small 1924-1925 fragment), but per-country availability varies widely — Brazil has weekly data 2014+, others only monthly. Annual and Total aggregates from the source are filtered out so the dataset is series-shaped; sum kept rows in the dashboard when an annual roll-up is needed. Use the `case_status` / `case_definition` and `temporal_resolution` row-level columns to construct a coherent series. **Source:** <https://opendengue.org/> ## Coverage - **Time:** 1924-01-26 → 1925-01-17; 1993-01-31 → 2025-04-30 - **Cadence:** `irregular` (observed median spacing: 7 days) - **Geography levels:** `national` — 127 unique location IDs - **Countries:** multiple - **Pathogens:** `dengue` - **Surveillance category:** `arboviral` - **Rows:** 26,378 ### Reporting gaps within coverage - 1924-01-26 → 1924-03-01 (5 weeks) - 1924-03-01 → 1924-04-12 (6 weeks) - 1924-04-12 → 1924-07-19 (14 weeks) - 1924-07-19 → 1924-08-16 (4 weeks) - 1924-09-06 → 1924-09-27 (3 weeks) - 1924-09-27 → 1925-01-10 (15 weeks) - 1993-01-31 → 1993-02-28 (4 weeks) - 1993-02-28 → 1993-03-31 (4 weeks) - 1993-03-31 → 1993-04-30 (4 weeks) - 1993-04-30 → 1993-05-31 (4 weeks) - 1993-05-31 → 1993-06-30 (4 weeks) - 1993-06-30 → 1993-07-31 (4 weeks) - 1993-07-31 → 1993-08-31 (4 weeks) - 1993-08-31 → 1993-09-30 (4 weeks) - 1993-09-30 → 1993-10-31 (4 weeks) - 1993-10-31 → 1993-11-30 (4 weeks) - 1993-11-30 → 1993-12-31 (4 weeks) - 1993-12-31 → 1994-01-31 (4 weeks) - 1994-01-31 → 1994-02-28 (4 weeks) - 1994-02-28 → 1994-03-31 (4 weeks) - 1994-03-31 → 1994-04-30 (4 weeks) - 1994-04-30 → 1994-05-31 (4 weeks) - 1994-05-31 → 1994-06-30 (4 weeks) - 1994-06-30 → 1994-07-31 (4 weeks) - 1994-07-31 → 1994-08-31 (4 weeks) - 1994-08-31 → 1994-09-30 (4 weeks) - 1994-09-30 → 1994-10-31 (4 weeks) - 1994-10-31 → 1994-11-30 (4 weeks) - 1994-11-30 → 1994-12-31 (4 weeks) - 1994-12-31 → 1995-01-31 (4 weeks) - 1995-01-31 → 1995-02-28 (4 weeks) - 1995-02-28 → 1995-03-31 (4 weeks) - 1995-03-31 → 1995-04-30 (4 weeks) - 1995-04-30 → 1995-05-31 (4 weeks) - 1995-05-31 → 1995-06-30 (4 weeks) - 1995-06-30 → 1995-07-31 (4 weeks) - 1995-07-31 → 1995-08-31 (4 weeks) - 1995-08-31 → 1995-09-30 (4 weeks) - 1995-09-30 → 1995-10-31 (4 weeks) - 1995-10-31 → 1995-11-30 (4 weeks) - 1995-11-30 → 1995-12-31 (4 weeks) - 1995-12-31 → 1996-01-31 (4 weeks) - 1996-01-31 → 1996-02-29 (4 weeks) - 1996-02-29 → 1996-03-31 (4 weeks) - 1996-03-31 → 1996-04-30 (4 weeks) - 1996-04-30 → 1996-05-31 (4 weeks) - 1996-05-31 → 1996-06-30 (4 weeks) - 1996-06-30 → 1996-07-31 (4 weeks) - 1996-07-31 → 1996-08-31 (4 weeks) - 1996-08-31 → 1996-09-30 (4 weeks) - 1996-09-30 → 1996-10-31 (4 weeks) - 1996-10-31 → 1996-11-30 (4 weeks) - 1996-11-30 → 1996-12-31 (4 weeks) - 1996-12-31 → 1997-01-31 (4 weeks) - 1997-01-31 → 1997-02-28 (4 weeks) - 1997-02-28 → 1997-03-31 (4 weeks) - 1997-03-31 → 1997-04-30 (4 weeks) - 1997-04-30 → 1997-05-31 (4 weeks) - 1997-05-31 → 1997-06-30 (4 weeks) - 1997-06-30 → 1997-07-31 (4 weeks) - 1997-07-31 → 1997-08-31 (4 weeks) - 1997-08-31 → 1997-09-30 (4 weeks) - 1997-09-30 → 1997-10-31 (4 weeks) - 1997-10-31 → 1997-11-30 (4 weeks) - 1997-11-30 → 1997-12-31 (4 weeks) - 1997-12-31 → 1998-01-17 (2 weeks) - 1998-01-31 → 1998-02-21 (3 weeks) - 1999-01-02 → 1999-01-31 (4 weeks) - 1999-01-31 → 1999-02-28 (4 weeks) - 1999-02-28 → 1999-03-31 (4 weeks) - 1999-03-31 → 1999-04-30 (4 weeks) - 1999-04-30 → 1999-05-31 (4 weeks) - 1999-05-31 → 1999-06-30 (4 weeks) - 1999-06-30 → 1999-07-31 (4 weeks) - 1999-07-31 → 1999-08-31 (4 weeks) - 1999-08-31 → 1999-09-30 (4 weeks) - 1999-09-30 → 1999-10-31 (4 weeks) - 1999-10-31 → 1999-11-30 (4 weeks) - 1999-11-30 → 1999-12-31 (4 weeks) - 2024-12-31 → 2025-01-31 (4 weeks) - 2025-01-31 → 2025-02-28 (4 weeks) - 2025-02-28 → 2025-03-31 (4 weeks) - 2025-03-31 → 2025-04-30 (4 weeks) ## Columns | Column | Unit | value_type | Aggregation | Description | |--------|------|------------|-------------|-------------| | `dengue_total` | cases | `incident` | `sum` | Reported dengue cases in the period. May include suspected, probable, and laboratory-confirmed cases — see `case_status` (schema-standard) and `case_definition` (source-original) for the certainty tier of each row. Includes all severity levels (dengue, dengue with/without warning signs, severe dengue, dengue haemorrhagic fever, dengue shock syndrome, dengue deaths) where reported. | ### Additional data columns - **`case_definition`** — Source-original case definition: `Suspected`, `Probable`, `Confirmed`, `Probable and confirmed`, `Suspected and confirmed`, `Total`. Preserved alongside the schema-standard `case_status` so original fidelity isn't lost. The same (country, date) cell can have multiple rows when the source distinguishes definitions — filter to one for a single time series. - **`temporal_resolution`** — Per-row temporal granularity (`Week` or `Month`). Coarser source aggregates (`Year`, `Total`) are filtered out at ingest — the dashboard can sum kept rows on the fly when an annual roll-up is needed. - **`location_name`** — Source's English country name (uppercase). `location_id` (ISO 3166-1 alpha-2) is canonical. ## Interpretation caveats Things that may differ from how other sources define a similar measure. If you're combining this dataset with another, read these first. - **`case_status`** — Mapping from the source's `case_definition`: `Probable and confirmed` → `probable` (looser tier of the union, the minimum certainty represented); `Suspected and confirmed` → `suspect`; `Total` → `not-classified`. The original definition is preserved in `case_definition` for full fidelity. - **`dengue_total`** — Cross-country comparability is limited — case definitions, reporting completeness, and surveillance intensity vary substantially. OpenDengue normalizes structure but does not normalize underlying definitions. For modeling, prefer one country at a time or one case_status consistently. - **`case_definition`** — `Probable and confirmed` and `Suspected and confirmed` are *summed* categories from sources that combine tiers — they are not standalone classifications. Don't sum them with `Confirmed`/`Probable`/`Suspected` rows from the same country and date. - **`temporal_resolution`** — A country may switch between Week and Month over its history. Pick one resolution per country when constructing a continuous series; don't naively concatenate rows of different temporal_resolution. ## Access - **Availability:** `open` - **Access type:** `github` - **License:** cc-by-4.0 - **Tier:** 1 --- *Schema version `0.1` · Last ingested 2026-04-26T02:53:03Z · `source_id: opendengue` · Manifest section §6.1*
提供机构:
EPI-Eval
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作