five

The National Eutrophication Survey: lake characteristics and historical nutrient concentrations

收藏
DataCite Commons2020-09-18 更新2025-04-16 收录
下载链接:
https://knb.ecoinformatics.org/view/doi:10.5063/F10G3H3Z
下载链接
链接失效反馈
官方服务:
资源简介:
Historical ecological surveys are an important source of baseline information to provide context for contemporary research, yet many of these records are not preserved in a way to ensure their long-term usability. The National Eutrophication Survey database is currently only available as portable document files (PDF) scanned from printed records with no embedded character information. This limits its searchability and the ability of current and future scientists to systematically evaluate its contents. The data was collected by the United States Environmental Protection Agency between 1972 and 1975, as part of an effort to investigate eutrophication in freshwater lakes and reservoirs. Although several studies have hand-transcribed portions of the database in support of specific studies, there have been no systematic attempts to transcribe and preserve the database in its entirety. Here we use a combination of automated optical character recognition and manual quality assurance to make this data available for analysis. The performance of the optical character recognition procedure was highly variable depending on scan quality. For each of the four archival pdf files, we found an error rate of between 5.9 and 17%. The goal of our approach was to strike a balance efficiency and data quality by combining hand-entry of data with digital transcription technologies. The finished database contains information on the physical characteristics, hydrology, and water quality of about 800 lakes in the continental United States. This database could be combined with more recent data to generate meta analyses of water quality trends and spatial variation across the continental United States.
提供机构:
KNB Data Repository
创建时间:
2017-04-07
二维码
社区交流群
二维码
科研交流群
商业服务