brhkim/education_data_portal_mirror
Source: Hugging Face. Updated 2026-02-09; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/brhkim/education_data_portal_mirror
Description:
---
license: odc-by
---
# Education Data Portal — Parquet Mirror
A complete mirror of the [Urban Institute Education Data Portal](https://educationdata.urban.org) datasets (version 0.24.0), collected on February 7, 2026, and converted from CSV to Apache Parquet format for efficient analytical use. Please note that the maintainers of this Hugging Face dataset have no affiliation with the Urban Institute or the Education Data Portal team.
Many thanks to the Education Data Portal team for all they do. If you use this mirror, please follow the attribution guidelines below. The Urban Institute also encourages users to write to educationdata@urban.org about any public projects or research that use these data, including the title of and a link to the published work. Doing so can raise the visibility of your work and helps the team make the case for continued funding.
## Contents
This repository contains **487 files** (396 data files + 91 codebooks) across **14 data sources**, covering U.S. education data from 1979 to 2023.
| Source | Full Name | Original Provider | Data Files | Codebooks | Years | Levels |
| --- | --- | --- | --- | --- | --- | --- |
| `ccd` | Common Core of Data | US Department of Education, NCES | 79 | 5 | 1986–2023 | Schools, Districts |
| `crdc` | Civil Rights Data Collection | US Department of Education, OCR | 66 | 24 | 2011–2021 | Schools |
| `csafety` | Campus Safety and Security | US Department of Education, OPE | 1 | 1 | 2005–2021 | Colleges |
| `eada` | Equity in Athletics Disclosure Act | US Department of Education, OPE | 1 | 1 | 2002–2021 | Colleges |
| `edfacts` | EDFacts | US Department of Education | 42 | 4 | 2009–2018 | Schools, Districts |
| `fsa` | Federal Student Aid | US Department of Education, FSA | 5 | 5 | 1999–2021 | Colleges |
| `ipeds` | Integrated Postsecondary Education Data System | US Department of Education, NCES | 163 | 32 | 1979–2023 | Colleges |
| `meps` | Model Estimates of Poverty in Schools | Urban Institute | 1 | 1 | 2009–2022 | Schools |
| `nacubo` | NACUBO-Commonfund Study of Endowments | NACUBO / Commonfund Institute | 1 | 1 | 2012–2022 | Colleges |
| `nccs` | National Center for Charitable Statistics | Urban Institute / IRS | 1 | 1 | 1993–2016 | Colleges |
| `nhgis` | National Historical Geographic Information System | IPUMS / US Census Bureau | 8 | 8 | 1980–2023 | Schools, Colleges |
| `pseo` | Post-Secondary Employment Outcomes | US Census Bureau, LEHD | 21 | 1 | 2001–2021 | Colleges |
| `saipe` | Small Area Income and Poverty Estimates | US Census Bureau | 1 | 1 | 1995–2023 | Districts |
| `scorecard` | College Scorecard | US Department of Education | 6 | 6 | 1996–2020 | Colleges |
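As a quick consistency check, the per-source counts in the table can be summed and compared with the totals stated above. This is a minimal sketch: the `COUNTS` dictionary is simply the table's "Data Files" and "Codebooks" columns copied by hand, not something read from the repository itself.

```python
# (data files, codebooks) per source, copied by hand from the table above.
COUNTS = {
    "ccd": (79, 5),
    "crdc": (66, 24),
    "csafety": (1, 1),
    "eada": (1, 1),
    "edfacts": (42, 4),
    "fsa": (5, 5),
    "ipeds": (163, 32),
    "meps": (1, 1),
    "nacubo": (1, 1),
    "nccs": (1, 1),
    "nhgis": (8, 8),
    "pseo": (21, 1),
    "saipe": (1, 1),
    "scorecard": (6, 6),
}

data_files = sum(d for d, _ in COUNTS.values())
codebooks = sum(c for _, c in COUNTS.values())

# Number of sources, data files, codebooks, and total files.
print(len(COUNTS), data_files, codebooks, data_files + codebooks)
# -> 14 396 91 487
```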
## Directory Structure
Files are organized in a flat `{source}/{filename}` layout mirroring the Urban Institute's CSV download structure:
```
{source}/{filename}.parquet # Data files (Parquet format)
{source}/codebook_{name}.xls # Codebooks (original Excel format)
```
Each file corresponds 1:1 with an entry in the Urban Institute's [download manifest](https://educationdata.urban.org/api/v1/api-downloads/). The Urban CSV download URL for any file is:
```
https://educationdata.urban.org/csv/{source}/{basename}.csv
```
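where `{basename}` is the filename in this repository without its `.parquet` extension (that is, the `.parquet` file here corresponds to the `.csv` file of the same name on the Urban Institute's server).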
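The mapping from a path in this repository to the original CSV URL can be sketched as a small helper. The function name `urban_csv_url` and the example filename are illustrative assumptions; only the URL pattern itself comes from the text above.

```python
from pathlib import PurePosixPath

URBAN_CSV_ROOT = "https://educationdata.urban.org/csv"


def urban_csv_url(repo_path: str) -> str:
    """Map '{source}/{basename}.parquet' to the Urban Institute CSV URL."""
    path = PurePosixPath(repo_path)
    # The repository layout is flat: exactly one source directory plus a file.
    if len(path.parts) != 2 or path.suffix != ".parquet":
        raise ValueError(
            f"expected '{{source}}/{{basename}}.parquet', got {repo_path!r}"
        )
    source = path.parts[0]
    # path.stem drops only the final '.parquet' extension.
    return f"{URBAN_CSV_ROOT}/{source}/{path.stem}.csv"


# Hypothetical filename, used only to illustrate the pattern.
print(urban_csv_url("ccd/schools_ccd_directory.parquet"))
# -> https://educationdata.urban.org/csv/ccd/schools_ccd_directory.csv
```

Codebook paths (`.xls`) are rejected on purpose, since the URL pattern above is stated only for data files.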
## Format
- **Data files** are stored in [Apache Parquet](https://parquet.apache.org/) format, converted from the original CSV downloads. Parquet provides columnar compression, type preservation, and significantly faster analytical queries compared to CSV.
- **Codebooks** are stored in their original `.xls` (Microsoft Excel) format as provided by the Urban Institute. Each codebook documents the variables, coded values, and definitions for its corresponding dataset.
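One way to read a data file directly from the Hub is through pandas, which can open `hf://` paths when the `huggingface_hub` package is installed. The sketch below assumes `pandas`, `pyarrow`, and `huggingface_hub` are available; the helper names `hf_uri` and `load_table` are illustrative and not part of this repository.

```python
REPO_ID = "brhkim/education_data_portal_mirror"


def hf_uri(source: str, filename: str) -> str:
    """Build the hf:// path for a file stored as '{source}/{filename}'."""
    return f"hf://datasets/{REPO_ID}/{source}/{filename}"


def load_table(source: str, filename: str, columns=None):
    """Read one Parquet data file from the Hub into a pandas DataFrame.

    Assumes pandas, pyarrow, and huggingface_hub are installed.
    Passing `columns` reads only those columns, which is where the
    columnar layout of Parquet saves time and memory.
    """
    import pandas as pd  # imported lazily so hf_uri works without pandas

    return pd.read_parquet(hf_uri(source, filename), columns=columns)
```

For example, `load_table("ccd", "<filename>.parquet")` would download and parse a single CCD file, with `<filename>` taken from the repository's file listing.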
## License
Aligning with the [Urban Institute Education Data Portal's license and Data Policy and Terms of Use](https://educationdata.urban.org/documentation/#terms), data in this repository is made available under the [**Open Data Commons Attribution License (ODC-By) v1.0**](https://opendatacommons.org/licenses/by/1-0/).
### You are free to
- **Share** — copy, distribute, and use the database
- **Create** — produce works from the database
- **Adapt** — modify, transform, and build upon the database
### As long as you
- **Attribute** — You must attribute any public use of the database, or works produced from the database, in the manner specified below. For any use or redistribution of the database, or works produced from it, you must make clear to others the license of the database and keep intact any notices on the original database.
### Required Attribution
**For formal research and publications:**
> [Dataset name(s)], Education Data Portal (Version 0.24.0), Urban Institute, accessed [Month DD, YYYY], https://educationdata.urban.org/documentation/, made available under the ODC Attribution License.
**For informal works (blogs, visualizations, applications):**
> [Dataset name(s)], via Education Data Portal, Urban Institute, under ODC Attribution License.
**Example (formal):**
> Common Core of Data; Integrated Postsecondary Education Data System, Education Data Portal (Version 0.24.0), Urban Institute, accessed February 9, 2026, https://educationdata.urban.org/documentation/, made available under the ODC Attribution License.
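For projects that cite several datasets, the formal template above can be filled in programmatically. This is a minimal sketch: the function name `formal_attribution` is hypothetical, and joining multiple dataset names with a semicolon is an assumption based on the formal example.

```python
from datetime import date

# Spelled out so the result does not depend on the system locale.
MONTHS = (
    "January", "February", "March", "April", "May", "June", "July",
    "August", "September", "October", "November", "December",
)


def formal_attribution(datasets, accessed: date, version: str = "0.24.0") -> str:
    """Fill in the formal attribution template for the given dataset names."""
    names = "; ".join(datasets)
    when = f"{MONTHS[accessed.month - 1]} {accessed.day}, {accessed.year}"
    return (
        f"{names}, Education Data Portal (Version {version}), Urban Institute, "
        f"accessed {when}, https://educationdata.urban.org/documentation/, "
        "made available under the ODC Attribution License."
    )


print(formal_attribution(
    ["Common Core of Data", "Integrated Postsecondary Education Data System"],
    date(2026, 2, 9),
))
```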
### Disclaimer
The data is provided "as is," without warranty of any kind, express or implied. The Urban Institute disclaims all liability for claims arising from the use of this data. See the [Urban Institute Terms of Service](https://www.urban.org/terms-service) for full terms.
## Original Data Sources
The Education Data Portal aggregates, harmonizes, and standardizes data from multiple U.S. federal agencies and research organizations. The original data providers are:
- **US Department of Education, National Center for Education Statistics (NCES)** — CCD, IPEDS
- **US Department of Education, Office for Civil Rights (OCR)** — CRDC
- **US Department of Education, Office of Postsecondary Education (OPE)** — Campus Safety, EADA
- **US Department of Education, Federal Student Aid (FSA)** — FSA
- **US Department of Education** — EDFacts, College Scorecard
- **US Census Bureau** — SAIPE, PSEO (LEHD program)
- **IPUMS / US Census Bureau** — NHGIS
- **Urban Institute** — MEPS, NCCS
- **NACUBO / Commonfund Institute** — NACUBO Endowment Study
For detailed documentation on individual data sources, endpoints, variables, and methodological notes, see the [Education Data Portal documentation](https://educationdata.urban.org/documentation/).
## Contact
Questions or feedback about the Education Data Portal: [educationdata@urban.org](mailto:educationdata@urban.org)
Provider:
brhkim



