five

Census Data Workflow using IPUMS NHGIS API: Data Request and Download

收藏
DataCite Commons2026-03-18 更新2026-05-03 收录
下载链接:
https://www.openicpsr.org/openicpsr/project/120305/version/V1/view?path=/openicpsr/120305/fcr:versions/V1/NHGIS_2_getdata_2020-07-29.ipynb&type=file
下载链接
链接失效反馈
官方服务:
资源简介:
This archive provides two Jupyter Notebooks to explore metadata and retrieve data using the IPUMS NHGIS API. American Community Survey (ACS) 5-year estimates 2005-2009 at the block group and county levels are requested and downloaded for Galveston County, Texas. Data of interest are race and ethnicity, and median household income. Block group and county shapefiles are also downloaded.<br><br>The Python code was developed in Google Colaboratory, or Google Colab for short, which is an Integrated Development Environment (IDE) of JupyterLab and streamlines package installation, code collaboration and management.The notebooks use Google Drive for file storage and include extensive markdown and comments. The notebooks can be adapted for use in other environments (i.e., Jupyter Notebook) as well as reading and writing files to a local or shared drive, or cloud drive (i.e., Google Drive).<br><br>The first notebook explores metadata in order to identify relevant datasets and tables and necessary parameters for subsequent data request and retrieval. The second notebook uses the parameters identified from the first notebook. A data request is constructed and the data extract is downloaded and files unzipped and made ready for analysis. The data that were downloaded are also stored separately with this archive.<br><br>The data referenced in this archive have research applications listed in the Related Publications section and in ongoing research at the Texas A&amp;M University Department of Landscape Architecture and Urban Planning (LAUP), and the Hazard Reduction and Recovery Center (HRRC).

本数据集存档包含两份Jupyter笔记本,用于通过IPUMS NHGIS API探索元数据并检索数据。针对得克萨斯州加尔维斯顿县,获取并下载了2005-2009年美国社区调查(American Community Survey, ACS)5年估算数据,统计单元涵盖街区组(block group)与县级尺度。本研究关注的数据包括种族与族裔信息以及家庭收入中位数,同时还下载了对应街区组与县级的shapefile格式矢量文件(shapefile)。 本Python代码基于Google Colaboratory(简称Google Colab)开发——该工具是基于JupyterLab的集成开发环境(Integrated Development Environment, IDE),可简化依赖包安装、代码协作与管理流程。笔记本采用Google Drive进行文件存储,并配备了详尽的Markdown文档与注释。此类笔记本可适配其他运行环境(如原生Jupyter Notebook),也支持读取、写入本地磁盘、共享磁盘或云盘(如Google Drive)中的文件。 第一份笔记本用于探索元数据,以确定后续数据请求与检索所需的相关数据集、数据表及必要参数。第二份笔记本则调用第一份笔记本确定的参数,构建数据请求任务,下载数据提取结果,解压文件并完成分析前的准备工作。本次下载的数据集也已单独存储于本存档中。 本存档中引用的数据的研究用途已在「相关出版物」板块列明,同时可应用于德克萨斯农工大学景观建筑与城市规划系(Landscape Architecture and Urban Planning, LAUP)以及减灾与恢复中心(Hazard Reduction and Recovery Center, HRRC)的在研项目。
提供机构:
ICPSR - Interuniversity Consortium for Political and Social Research
创建时间:
2026-03-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作