US Cities Demographics
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/148971cf-317e-4287-951d-ecbd9b3ffade/John-Snow-Labs_US-Cities-Demographics
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
The purpose of this data package is to offer demographic data for U.S. cities. The data sources are multiple, the most important one being the U.S. Census Bureau, American Community Survey. In this case, the data was organized by the Big Cities Health Coalition (BCHC). Others are the New York City Department of City Planning and Department of Parks and Recreation, data being available through the NYC Open Data.
**Description**
The U.S. Census Bureau, American Community Survey (ACS) is a nationwide survey designed to provide communities a fresh look at how they are changing by producing demographic, social, housing and economic estimates in the form of 1-year, 3-year and 5-year estimates based on population thresholds. The strength of the ACS is in estimating population and housing characteristics. In this data package the following type of indicators can be found:
- Population structure (static): population by age, gender and race/ethnicity;
- Population dynamics: mortality and migration-related indicators;
- Socio-economic characteristics: income, employment, education, language spoken at home, living conditions and health insurance. The NYC Open Data ensures, through the free shared data, transparency and supports in this way the improvement of the quality of life for the citizens of New York City. The following data about the City of New York is available:
- Population size by districts;
- The results of the New York City streets trees mapping for the trees location and their characteristics (like scientific name and health status).
**Benefits**
- Useful for public health specialists and specialized institutions to plan and assess the health services and projects at the city level or for other categories of professionals. useful for researchers seeking to improve the methods, regulations, health policies and strategies at the city level or for other types of projects (like environmental projects). useful for researchers seeking to improve the methods, regulations, health policies and strategies at the city level or to maintain and improve individuals living surroundings. the raw version can be found on the author's website, but jsl is offering a clean, standardized and easier to use version, alone or in combination with other datasets found in jsl dataset library. the fields descriptions along with the dataset description are again useful for the user to quickly understand the data and the dataset.
**License Information**
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the [Data Library](https://www.johnsnowlabs.com/marketplace/) on John Snow Labs website. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.
**Included Datasets**
- [Big Cities Demographic Indicators](https://www.johnsnowlabs.com/marketplace/big-cities-demographic-indicators)
- This dataset contains estimates for demographic indicators shared by the Big Cities Health Coalition members represented by the largest metropolitan health departments in United States. The estimated values of demographic indicators cover the 2010-2015 period and are described by location, sex and race/ethnicity.
- [NYC Population By Community Districts](https://www.johnsnowlabs.com/marketplace/nyc-population-by-community-districts)
- This dataset contains the New York City Population By Community Districts.The community boards of the New York City government are the appointed advisory groups of the community districts of the five boroughs. There are currently 59 community districts, including twelve in Manhattan, twelve in the Bronx, eighteen in Brooklyn, fourteen in Queens, and three in Staten Island.
- [NYC Street Tree Census 2015](https://www.johnsnowlabs.com/marketplace/nyc-street-tree-census-2015)
- This dataset shows the New York City (NYC) street tree census data for the year 2015 provided by the Department of Parks and Recreation (DPR).
**Data Engineering Overview**
**We deliver high-quality data**
- Each dataset goes through 3 levels of quality review
- 2 Manual reviews are done by domain experts
- Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
- Data is normalized into one unified type system
- All dates, unites, codes, currencies look the same
- All null values are normalized to the same value
- All dataset and field names are SQL and Hive compliant
- Data and Metadata
- Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
- Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
- Data Updates
- Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
**Our data is curated and enriched by domain experts**
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the data sets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
**Need Help?**
If you have questions about our products, contact us at [info@johnsnowlabs.com](mailto:info@johnsnowlabs.com).
提供机构:
John Snow Labs



