Census Data by Zip Code 2012-2016
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/72a9bf62-c0ab-4386-8f9f-63661eb263ca/John-Snow-Labs_Census-Data-by-Zip-Code-2012-2016
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
This data package has the purpose to offer data for demographic indicators, part of 5-years American Community Census, that could be needed in the analysis made along with health-related data or as stand-alone. The American Community Survey based on 5-years estimates is, according to U.S Census Bureau, the most reliable, because the samples used are the largest and the data collected cover all country areas, regardless of the population number.
**Description**
The U.S. Census Bureau, American Community Survey (ACS) is a nationwide survey designed to provide communities a fresh look at how they are changing. The ACS replaced the decennial census long form in 2010 and thereafter by collecting long form type information throughout the decade rather than only once every 10 years. The American Community Survey produces demographic, social, housing and economic estimates in the form of 1-year, 3-year and 5-year estimates based on population thresholds. The strength of the ACS is in estimating population and housing characteristics. It produces estimates for small areas, including census tracts and population subgroups.
The main indicators categories and subcategories that can be found in the content of this data package are as follows:
- General demographics: age, gender (sex) and race/ethnicity
- Language: language spoken at home and English ability
- Origins: nativity, place of birth and ancestry
- Socio-economic demographics: general social and economic characteristics, poverty status
Every dataset in this data package has over 100 demographic indicators, each accompanied by the margin of error of the estimated value of the indicator.
Demography has an important role in public health planning and assessment. At the same time demographics plays an important role in the field of health-related research and development. Socio-economic factors are health influencers that are assessed in order to understand better the root causes of health differences among population groups. Personal factors like age, gender and race/ethnicity are directly influencing the risk levels for various diseases. The English ability has an important in relation to health-related communication and health literacy. Place of birth is important because of the different exposures to health risk factors and of differences related to health services (for example immunizations).
**Benefits**
- Useful for public health specialists and specialized institutions to plan and assess the health services and projects. useful for researchers seeking to improve the methods, regulations, health policies and strategy. the raw version can be found on the author's website, but jsl is offering a clean, standardized and easier to use version, alone or in combination with other datasets found in jsl dataset library. the jsl version was enriched by adding for each zip code tabulation area the geographic coordinates and the land and water surfaces size. the fields descriptions along with the dataset description are again useful for the user to quickly understand the data and the dataset.
**License Information**
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.
**Included Datasets**
- [Age and Sex by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/age-and-sex-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) data set identifies age and sex for the population by zip code tabulation areas within the United States. The data includes an estimate of the population by gender and age categories, total estimates as well as these category’s margin of error and margin of error ratio.
- [Ancestry by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/ancestry-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) data set identifies ancestry by zip code tabulation areas within the United States. Ancestry refers to a person’s ethnic origin or descent, "roots," or heritage, or the place of birth of the person or the person’s parents or ancestors before their arrival in the United States.
- [Characteristics Language Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/characteristics-language-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset provides characteristics of people by language spoken at home by zip code tabulation areas within the United States. The characteristics of the people include age, nativity and citizenship status, poverty status, educational attainment.
- [Characteristics by Nativity Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/characteristics-by-nativity-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) data set identifies selected characteristics of the total and native population by zip code tabulation area within the United States. The dataset identifies population by native and foreign-born, including age, sex, language spoken at home, ability to speak English, marital status, educational attainment, income, poverty and citizenship status by zip code tabulation area.
- [Characteristics by Poverty by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/characteristics-by-poverty-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies selected characteristics of people at specified levels of poverty by zip code tabulation areas within the United States, from 2012 through 2016. The selected characteristics include sex, age, race, living arrangement, educational attainment, nativity and citizenship status, disability status, and work status.
- [Demographic Housing Estimates Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/demographic-housing-estimates-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies demographic and housing estimates by zip code tabulation areas within the United States, from 2012 through 2016. The dataset identifies sex and age, race and housing units by Zip Code Tabulation Area.
- [Detailed Race by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/detailed-race-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies race in detail by zip code tabulation areas within the United States, from 2012 through 2016. The races included in this dataset are White, Black or African American, American Indian and Alaskan Native, Asian, Native Hawaiian and other Pacific Islander, and other. The survey also looks at races alone, and two or more races combined.
- [Language Spoken at Home by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/language-spoken-at-home-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) data set identifies the language spoken at home by zip code tabulation area within the United States, from 2012 through 2016. The dataset identifies languages spoken and how well English is spoken by Zip Code Tabulation Area.
- [Place of Birth by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/place-of-birth-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset specifies the place of birth for the foreign-born population by zip code tabulation areas within the United States.
- [Poverty Status by Age and Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/poverty-status-by-age-and-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies poverty status by age and zip code tabulation areas within the United States, from 2012 through 2016.
- [Poverty Status by Sex Age and Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/poverty-status-by-sex-age-and-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies poverty status by sex, age and zip code tabulation areas within the United States, from 2012 through 2016.
- [Poverty Status by Zip Code Tabulation Area 2012-2016](https://www.johnsnowlabs.com/marketplace/poverty-status-by-zip-code-tabulation-area-2012-2016)
- This American Community Survey (ACS) dataset identifies poverty status by zip code tabulation areas within the United States, from 2012 through 2016. The poverty characteristics include race, sex, age, educational attainment, employment status, work experience, and poverty level.
**Data Engineering Overview**
**We deliver high-quality data**
- Each dataset goes through 3 levels of quality review
- 2 Manual reviews are done by domain experts
- Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
- Data is normalized into one unified type system
- All dates, unites, codes, currencies look the same
- All null values are normalized to the same value
- All dataset and field names are SQL and Hive compliant
- Data and Metadata
- Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
- Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
- Data Updates
- Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
**Our data is curated and enriched by domain experts**
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the data sets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
**Need Help?**
If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
提供机构:
John Snow Labs



