five

Aggregated Data: Environmental Monitoring and Observations Effort Jan 2010-Mar 2023

收藏
Research Data Australia2025-12-20 收录
下载链接:
https://researchdata.edu.au/aggregated-data-environmental-mar-2023/3653794
下载链接
链接失效反馈
官方服务:
资源简介:
This collection contains aggregated metadata on environmental monitoring and observing activities from three Australian national research infrastructures (NRIs): biodiversity survey events from the Atlas of Living Australia (ALA), marine observations collected by the Integrated Marine Observing System (IMOS), and site-based monitoring and survey efforts by the Terrestrial Ecosystem Research Network (TERN). This dataset provides a summary breakdown of these efforts by survey topic, region, and time period from January 2010 to March 2023.Survey topics are mapped to an EcoAssets Earth Science Features vocabulary based on the Earth Science keywords from the Global Change Master Directory (GCMD) vocabulary, modified to use taxonomic concept URIs from the Australian National Species List (ANSL) in place of the GCMD Earth Science > Biological Classification vocabulary. ANSL categories map more readily to biodiversity survey categories, since GCMD depends on a top-level division between vertebrates and invertebrates rather than offering an animal category. The EcoAssets Earth Science Features vocabulary, including alternative keywords used in ALA, IMOS, or TERN datasets, is included in this collection.The primary asset is aggregated_env_monitoring.csv. This contains all faceted data records for the period and supported facets related to time, space, and features observed.Two derived assets (summary_monitoring_effort_terrestrial.csv, summary_monitoring_effort_marine.csv) further summarise the faceted data. Each is a pivot of the aggregated dataset.vocabulary_earth_science_features.csv contains the hierarchical terms used within this asset to categorise earth science features. treeview_earth_science_features.txt provides a simpler, more readable view. keyword_mapping.csv shows the mappings between these terms and the keywords used in source datasets. The data_sources.csv file includes information on the source datasets that contributed to this asset.Lineage: This dataset was created by the following pipeline:1. Metadata records were collected from the TERN linked data portal (https://linkeddata.tern.org.au/) for all TERN monitoring sites and survey activities. Feature terms follow the TERN Feature Type vocabulary, mapped to the EcoAssets Earth Science Features vocabulary. For features that have been measured continuously at the site, metadata records were created for each relevant year since commission of the site. For other sites and features, metadata records were generated only for years in which the site was visited. TERN metadata records are associated with site coordinates.2. Metadata records were harvested for datasets in the Australian Ocean Data Network (AODN, https://portal.aodn.org.au/) portal maintained by IMOS (iso19115-3.2018 format over OAI-PMH). Feature terms follow the GCMD keywords used in these metadata records. Metadata records were created for each year overlapping the data collection period for each dataset. Where the datasets were associated with a bounding box, records were created for each IMCRA region intersecting the bounding box.3. Metadata records were created for each biodiversity sample event published to the ALA and associated with a Darwin Core event ID and a named sampling protocol (see https://dwc.tdwg.org/terms/#event). Events were excluded if the set of sampled taxa included multiple kingdoms OR the sampling protocol was associated with 1 species. The remaining samples were mapped to feature terms based on the taxonomic scope of all species recorded for the associated protocol. Year and coordinates were taken from the associate sample event.4. Metadata records from all sources were combined and include the following values. The feature facet values are offered as a convenience for grouping records without using the hierarchical structure of the EcoAssets Earth Science Features vocabulary:• Source National Research Institute (NRI – one of ALA, IMOS, TERN)• Dataset name • Dataset URI • Original keyword from NRI (TERN feature type, IMOS GCMD keyword, ALA taxon)• Decimal latitude (where appropriate)• Decimal longitude (where appropriate)• Year• State or Territory• IBRA7 terrestrial region• IMCRA 4.0 mesoscale marine bioregion• Feature ID from EcoAssets Earth Science Features vocabulary• Feature name associated with feature ID• Feature facet 1 – high-level facet based on feature ID – a top-level GCMD Earth Science category (6 terms)• Feature facet 2 – intermediate-level facet based on feature ID – second-level GCMD/ANSL category (29 terms)• Feature facet 3 – lower-level facet with more fine-grained taxonomic structure based on feature ID – typically a third-level GCMD/ANSL category (36 terms)

本数据集集合涵盖了来自3个澳大利亚国家研究基础设施(National Research Infrastructures, NRIs)的环境监测与观测活动聚合元数据:分别为澳大利亚生物多样性图谱(Atlas of Living Australia, ALA)的生物多样性调查事件数据、综合海洋观测系统(Integrated Marine Observing System, IMOS)采集的海洋观测数据,以及陆地生态系统研究网络(Terrestrial Ecosystem Research Network, TERN)开展的站点级监测与调查工作数据。 本数据集按调查主题、区域与时间跨度(2010年1月至2023年3月)对上述工作进行了分类汇总。 调查主题被映射至基于全球变化主目录(Global Change Master Directory, GCMD)词汇表中地球科学关键词构建的EcoAssets地球科学特征词汇表,该词汇表已修改为采用澳大利亚国家物种名录(Australian National Species List, ANSL)的分类学概念统一资源标识符(URI),替代原GCMD地球科学>生物分类词汇表。由于GCMD仅以脊椎动物与无脊椎动物作为顶级分类划分,未设置动物大类,因此ANSL分类体系更适配生物多样性调查的分类需求。本数据集集合同时收录了EcoAssets地球科学特征词汇表,包含ALA、IMOS及TERN数据集所使用的替代关键词。 核心数据资产为aggregated_env_monitoring.csv,该文件包含研究时段内的所有分面数据记录,以及与时间、空间及观测特征相关的支持性分面字段。另有2个衍生数据资产:summary_monitoring_effort_terrestrial.csv与summary_monitoring_effort_marine.csv,用于对分面数据进行进一步汇总,二者均为聚合数据集的数据透视表。 vocabulary_earth_science_features.csv收录了本数据集中用于地球科学特征分类的层级化术语;treeview_earth_science_features.txt则提供了更简洁易读的术语视图;keyword_mapping.csv展示了这些术语与源数据集所用关键词之间的映射关系;data_sources.csv文件则记录了为本数据集提供数据的源数据集相关信息。 数据溯源:本数据集通过以下流程构建: 1. 从TERN关联数据门户(https://linkeddata.tern.org.au/)采集所有TERN监测站点与调查活动的元数据记录。特征术语遵循TERN特征类型词汇表,并映射至EcoAssets地球科学特征词汇表。对于站点上持续监测的特征,自站点启用起的每个相关年度均生成元数据记录;对于其他站点与特征,则仅在站点实际开展调查的年度生成元数据记录。TERN元数据记录均关联站点坐标信息。 2. 从IMOS维护的澳大利亚海洋数据网络(Australian Ocean Data Network, AODN, https://portal.aodn.org.au/)门户采集数据集元数据记录,采用基于OAI-PMH协议的iso19115-3.2018格式。特征术语遵循这些元数据中使用的GCMD关键词。针对每个数据集数据采集时段所覆盖的每个年度,均生成元数据记录;若数据集关联空间边界框,则为与该边界框相交的每个IMCRA 4.0中尺度海洋生物区生成元数据记录。 3. 为发布至ALA且关联达尔文核心(Darwin Core)事件ID与命名采样方案的所有生物多样性采样事件生成元数据记录(详见https://dwc.tdwg.org/terms/#event)。若采样类群涵盖多个界,或采样方案仅对应1个物种,则排除该采样事件。剩余采样事件将根据关联采样方案所记录的所有物种的分类学范围,映射至对应的特征术语。采样事件的年度与坐标信息直接取自原始记录。 4. 整合所有来源的元数据记录,最终数据集包含以下字段。为方便用户无需借助EcoAssets地球科学特征词汇表的层级结构即可对记录进行分组,本数据集提供了特征分面字段: • 来源国家研究基础设施(National Research Institute, NRI——可选值为ALA、IMOS、TERN之一) • 数据集名称 • 数据集统一资源标识符(Dataset URI) • 来源NRI的原始关键词(TERN特征类型、IMOS的GCMD关键词、ALA的类群名称) • 十进制纬度(如适用) • 十进制经度(如适用) • 调查年度 • 州或领地 • IBRA7陆地生物地理区 • IMCRA 4.0中尺度海洋生物区 • EcoAssets地球科学特征词汇表中的特征ID • 与特征ID关联的特征名称 • 特征分面1:基于特征ID生成的高级分面,对应GCMD地球科学顶级分类(共6个术语) • 特征分面2:基于特征ID生成的中级分面,对应GCMD/ANSL二级分类(共29个术语) • 特征分面3:基于特征ID生成的低级细粒度分类分面,通常对应GCMD/ANSL三级分类(共36个术语)
提供机构:
Commonwealth Scientific and Industrial Research Organisation
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作