five

Waterhackweek2019 Data Access and Time-series Statistics Cyberseminar

收藏
DataCite Commons2025-12-12 更新2026-04-25 收录
下载链接:
http://www.hydroshare.org/resource/9985b3cb38c94cee872b28f6dcdef739
下载链接
链接失效反馈
官方服务:
资源简介:
Data about water are found in many types of formats distributed by many different sources and depicting different spatial representations such as points, polygons and grids. How do we find and explore the data we need for our specific research or application? This seminar will present common challenges and strategies for finding and accessing relevant datasets, focusing on time series data from sites commonly represented as fixed geographical points. This type of data may come from automated monitoring stations such as river gauges and weather stations, from repeated in-person field observations and samples, or from model output and processed data products. We will present and explore useful data catalogs, including the CUAHSI HIS catalog accessible via HydroClient, CUAHSI HydroShare, the EarthCube Data Discovery Studio, Google Dataset search, and agency-specific catalogs. We will also discuss programmatic data access approaches and tools in Python, particularly the ulmo data access package, touching on the role of community standards for data formats and data access protocols. Once we have accessed datasets we are interested in, the next steps are typically exploratory, focusing on visualization and statistical summaries. This seminar will illustrate useful approaches and Python libraries used for processing and exploring time series data, with an emphasis on the distinctive needs posed by temporal data. Core Python packages used include Pandas, GeoPandas, Matplotlib and the geospatial visualization tools introduced at the last seminar. Approaches presented can be applied to other data types that can be summarized as single time series, such as averages over a watershed or data extracts from a single cell in a gridded dataset – the topic for the next seminar. Cyberseminar recording is available on Youtube at https://youtu.be/uQXuS1AB2M0

水资源相关数据存在多种格式,由众多不同来源发布,且涵盖多种空间表达形式,例如点、多边形与网格。我们该如何查找并探索适配特定研究或应用场景的所需数据呢?本次网络研讨会将介绍查找与获取相关数据集的常见挑战及应对策略,重点聚焦于以固定地理点为常见表达形式的站点时序数据。此类数据可来源于自动监测站点(如水位站、气象站)、重复开展的实地观测与采样工作,或是模型输出结果及经过处理的数据产品。本次研讨会将介绍并实操探索多款实用数据目录,包括可通过HydroClient访问的CUAHSI HIS目录(CUAHSI HIS Catalog)、CUAHSI HydroShare、EarthCube数据发现工作室(EarthCube Data Discovery Studio)、谷歌数据集搜索(Google Dataset Search)以及各机构专属数据目录。此外,我们还将探讨Python语言下的程序化数据获取方法与工具,重点介绍ulmo数据获取包,并简要提及数据格式与数据访问协议领域的社区标准所发挥的作用。在成功获取目标数据集后,后续步骤通常为探索性分析,重点围绕数据可视化与统计汇总展开。本次研讨会将演示用于处理与探索时序数据的实用方法及Python库,重点关注时序数据特有的分析需求。本次将用到的核心Python库包括Pandas、GeoPandas、Matplotlib,以及上一期研讨会介绍的地理空间可视化工具。本次介绍的分析方法可推广至可归纳为单一时序序列的其他数据类型,例如流域平均数据或网格数据集中单格数据提取结果——这也将是下一期研讨会的主题。本次网络研讨会的录播视频可在YouTube观看,链接为:https://youtu.be/uQXuS1AB2M0
提供机构:
Consortium of Universities for the Advancement of Hydrologic Science, Inc
创建时间:
2025-12-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作