VRKomari/30k_CA_synthetic_patients_FHIR_Bundles_JSON_NDJSON
收藏Hugging Face2026-03-09 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/VRKomari/30k_CA_synthetic_patients_FHIR_Bundles_JSON_NDJSON
下载链接
链接失效反馈官方服务:
资源简介:
---
---
license: cc-by-4.0
task_categories:
- text-classification
- question-answering
- tabular-classification
language:
- en
tags:
- healthcare
- fhir
- synthetic
- ehr
- clinical
- synthea
- medical
pretty_name: Synthea 30K Synthetic Patient Dataset (FHIR R4)
size_categories:
- 10M<n<100M
---
# Synthea 30K Synthetic Patient Dataset
Synthetic FHIR R4 patient data generated using [Synthea](https://github.com/synthetichealth/synthea), an open-source synthetic patient generator developed by MITRE Corporation.
## Contents
The dataset includes 30,000 + (deceased) patient records across the following FHIR R4 resource types, provided as NDJSON files and a bundled ZIP archive:
- AllergyIntolerance
- Condition
- DiagnosticReport
- Encounter
- ImagingStudy
- Immunization
- Medication
- MedicationAdministration
- MedicationRequest
- Observation
- Patient
Each NDJSON file contains one FHIR resource per line. The ZIP archive contains the full 30K patient bundles.
## Attribution
Generated using [Synthea](https://github.com/synthetichealth/synthea) by MITRE Corporation.
Dataset assembled and published by Raghu (raghuk).
## License
[Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/)
You are free to share and adapt this dataset for any purpose, provided appropriate credit is given.
## Citation
If you use this dataset, please cite:
> Walonoski J, et al. Synthea: An approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record. J Am Med Inform Assoc. 2018;25(3):230-238. doi:10.1093/jamia/ocx079
license: cc-by-sa-4.0
---
提供机构:
VRKomari



