BOD and TSS Numeric Violations in the US
收藏Databricks2025-06-05 收录
下载链接:
https://marketplace.databricks.com/details/caa55ea8-b491-4271-880c-e601af7a38a2/KETOS-Inc_BOD-and-TSS-Numeric-Violations-in-the-US
下载链接
链接失效反馈官方服务:
资源简介:
**Datasets**: Biological Oxygen Demand (BOD) Violations and Total Suspended Solids (TSS) NPDES Permit Violations in the US - Sample data.
## Overview
This README provides detailed information about two datasets: `public_bod_violations_sample` and `public_suspended_violations_sample`. These datasets contain information about violations related to Biochemical Oxygen Demand (BOD) and suspended solids, respectively. The datasets include various columns, with a focus on geolocation, industry name, type, violation description, parameter description, and exceedances.
## Datasets
### 1. `public_bod_violations_sample`
This dataset contains information about violations related to Biochemical Oxygen Demand (BOD). Below are the key columns of interest:
- **Geolocation:**
- `LOCATION_ADDRESS`: The address of the facility.
- `city`: The city where the facility is located.
- `county_code`: The county code of the facility's location.
- `state_code`: The state code of the facility's location.
- `zip`: The ZIP code of the facility's location.
- `latitude`: The latitude coordinate of the facility.
- `longitude`: The longitude coordinate of the facility.
- **Industry Information:**
- `facility_name`: The name of the facility.
- `naics_code`: The North American Industry Classification System (NAICS) code for the facility.
- `naics_desc`: The description of the NAICS code.
- **Violation Information:**
- `VIOLATION_DESC`: A description of the violation.
- `PARAMETER_DESC`: A description of the parameter related to the violation.
- `EXCEEDENCE_PCT`: The percentage by which the parameter exceeded the limit.
### 2. `public_suspended_violations_sample`
This dataset contains information about violations related to suspended solids. Below are the key columns of interest:
- **Geolocation:**
- `LOCATION_ADDRESS`: The address of the facility.
- `city`: The city where the facility is located.
- `county_code`: The county code of the facility's location.
- `state_code`: The state code of the facility's location.
- `zip`: The ZIP code of the facility's location.
- `latitude`: The latitude coordinate of the facility.
- `longitude`: The longitude coordinate of the facility.
- **Industry Information:**
- `facility_name`: The name of the facility.
- `naics_code`: The North American Industry Classification System (NAICS) code for the facility.
- `naics_desc`: The description of the NAICS code.
- **Violation Information:**
- `VIOLATION_DESC`: A description of the violation.
- `PARAMETER_DESC`: A description of the parameter related to the violation.
- `EXCEEDENCE_PCT`: The percentage by which the parameter exceeded the limit.
## Column Descriptions
### Common Columns
- `NPDES_ID`: The National Pollutant Discharge Elimination System (NPDES) ID of the facility.
- `VERSION_NMBR`: The version number of the record.
- `ACTIVITY_ID`: The activity ID associated with the violation.
- `NPDES_VIOLATION_ID`: The NPDES violation ID.
- `PERM_FEATURE_NMBR`: The permit feature number.
- `PERMIT_ACTIVITY_ID`: The permit activity ID.
- `LIMIT_SET_DESIGNATOR`: The limit set designator.
- `MONITORING_LOCATION_CODE`: The monitoring location code.
- `DMR_FORM_VALUE_ID`: The Discharge Monitoring Report (DMR) form value ID.
- `DMR_VALUE_NMBR`: The DMR value number.
- `DMR_VALUE_ID`: The DMR value ID.
- `DMR_PARAMETER_ID`: The DMR parameter ID.
- `NODI_CODE`: The No Data Indicator (NODI) code.
- `ADJUSTED_DMR_VALUE_NMBR`: The adjusted DMR value number.
- `LIMIT_VALUE_STANDARD_UNITS`: The limit value in standard units.
- `VIOLATION_TYPE_CODE`: The violation type code.
- `VIOLATION_TYPE_DESC`: The violation type description.
- `VIOLATION_CODE`: The violation code.
- `PARAMETER_CODE`: The parameter code.
- `STANDARD_UNIT_CODE`: The standard unit code.
- `STANDARD_UNIT_DESC`: The standard unit description.
- `MONITORING_PERIOD_END_DATE`: The end date of the monitoring period.
- `NMBR_OF_REPORT`: The number of reports.
- `VALUE_QUALIFIER_CODE`: The value qualifier code.
- `UNIT_CODE`: The unit code.
- `VALUE_RECEIVED_DATE`: The date the value was received.
- `DAYS_LATE`: The number of days late.
- `ADJUSTED_DMR_STANDARD_UNITS`: The adjusted DMR value in standard units.
- `LIMIT_ID`: The limit ID.
- `DMR_VALUE_STANDARD_UNITS`: The DMR value in standard units.
- `VALUE_TYPE_CODE`: The value type code.
- `RNC_DETECTION_CODE`: The RNC detection code.
- `RNC_DETECTION_DESC`: The RNC detection description.
- `RNC_DETECTION_DATE`: The RNC detection date.
- `RNC_RESOLUTION_CODE`: The RNC resolution code.
- `RNC_RESOLUTION_DESC`: The RNC resolution description.
- `RNC_RESOLUTION_DATE`: The RNC resolution date.
- `STATISTICAL_BASE_CODE`: The statistical base code.
- `STATISTICAL_BASE_MONTHLY_AVG`: The statistical base monthly average.
- `STATISTICAL_BASE_SHORT_DESC`: The statistical base short description.
- `registry_id`: The registry ID.
- `facility_type_code`: The facility type code.
- `IMPAIRED_WATERS`: Indicates if the waters are impaired.
## Usage
These datasets can be used for various analyses, including:
- Identifying facilities with frequent violations.
- Analyzing the geographic distribution of violations.
- Understanding the types of industries and their compliance with environmental regulations.
- Examining the exceedance percentages to identify severe violations.
## Access
These datasets are publicly accessible and can be queried using Databricks SQL.
## Contact
For any questions or further information, please contact support_prism@ketos.co
提供机构:
KETOS Inc



