U.S. Community Water Systems Service Boundaries, v2.4.0
收藏DataONE2022-09-28 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:c3ebe421c92b696b2b0c7d35f511d08fbd101667bf185a2fadc100255156753e
下载链接
链接失效反馈官方服务:
资源简介:
This is a layer of water service boundaries for 46,014 community water systems that deliver tap water to 307.7 million people in the US. This amounts to 97% of the population reportedly served by active community water systems and 91% of active community water systems. The layer is based on multiple data sources and a methodology developed by SimpleLab and collaborators called a Tiered, Explicit, Match, and Model approach–or TEMM, for short. The name of the approach reflects exactly how the nationwide data layer was developed. The TEMM is composed of three hierarchical tiers, arranged by data and model fidelity. First, we use explicit water service boundaries provided by states. These are spatial polygon data, typically provided at the state-level. We call systems with explicit boundaries Tier 1. In the absence of explicit water service boundary data, we use a matching algorithm to match water systems to the boundary of a town or city (Census Place TIGER polygons). When multiple water systems match to the same TIGER boundary, we employ a \"best match\" algorithm that assigns one water system to one TIGER place based on features like population served and other locational information about the water system. Finally, in the absence of an explicit water service boundary (Tier 1) or a TIGER place polygon match (Tier 2a), a statistical model trained on explicit water service boundary data (Tier 1) is used to estimate a reasonable radius at provided water system centroids, and model a spherical water system boundary (Tier 3). Water system centroids are taken from the ECHO database; however, where a system centroid is labeled as a county or state centroid, we take several steps to assign a better centroid (using sources like UCMR or TIGER). A summary of the systems and population assigned to different tiers is as follows:
Population coverage rates per Tier, for systems with population reported:
- Tier 1: 45.6% population covered (140,302,401 people)
- Tier 2: 39.98% population covered (123,028,626 people)
- Tier 3: 14.42% population covered (44,372,326 people)
Active community water systems coverage rates per Tier:
- Tier 1: 35.61% system covered (17600 systems)
- Tier 2: 22.49% system covered (11117 systems)
- Tier 3: 35% system covered (17297 systems)
- No Tier/Geometry: 6.9% system covered (3410 systems)
Several limitations to this data exist–and the layer should be used with these in mind. The case of assigning a Census Place TIGER polygon to the \"best match\" water system first introduced in v2.0.0 requires further validation. Tier 3 boundaries have modeled radii stemming from a lat/long centroid of a water system facility; but the underlying lat/long centroids for water system facilities are of variable quality. It is critical to evaluate the \"geometry quality\" column (included from the EPA ECHO data source) when looking at Tier 3 boundaries; fidelity is very low when geometry quality is a county or state centroid– but we did not exclude the data from the layer. Since v 2.0.0 we have improved the percentage of Tier 3 geometries with state centroids and county centroids from 50% of Tier 3 boundaries to 30% of Tier 3 boundaries. Missing water systems are typically those without a centroid, in a U.S. territory, or missing population and connection data. Finally, Tier 1 systems are assumed to be high fidelity, but rely on the accuracy of state data collection and maintenance.
创建时间:
2023-12-30



