
Fils - APPLICATION OF OPEN WEB PATTERNS AND STRUCTURED DATA ON THE WEB TO GEOINFORMATICS

Source: www.hydroshare.org | Updated: 2018-12-06 | Indexed: 2025-01-16
Download link:
https://www.hydroshare.org/resource/8f81956f98ae458ab3373d7baa1776d6
Resource description:
FILS, Douglas, Ocean Leadership, 1201 New York Ave, NW, 4th Floor, Washington, DC 20005; SHEPHERD, Adam, Woods Hole Oceanographic Inst, 266 Woods Hole Road, Woods Hole, MA 02543-1050; and LINGERFELT, Eric, Earth Science Support Office, Boulder, CO 80304

The growth in the amount of geoscience data on the internet is paralleled by the need to address issues of data citation, access and reuse. Additionally, new research tools are driving a demand for machine-accessible data as part of researcher workflows. In the commercial sector, elements of this have been addressed by the use of the Schema.org vocabulary, encoded via JSON-LD and coupled with web publishing patterns. Adaptable publishing approaches are already in use by many data facilities as they work to address publishing and FAIR patterns. While these approaches often lack structured data elements, the same workflows could be leveraged to additionally implement schema.org-style publishing patterns.

This presentation will report on work that grew out of the EarthCube Council of Data Facilities, known as Project 418. Project 418 was a proof of concept funded by the EarthCube Science Support Office to explore the publishing of JSON-LD with schema.org and extensions by a set of NSF data facilities. The goal was to use this approach to describe data set resources and to evaluate the use of this structured metadata for discovery. Additionally, we will discuss growing interest by Google and others in leveraging this approach for data set discovery.

The work scoped 47,650 datasets from 10 NSF-funded data facilities. Across these datasets, the harvester found 54,665 data download URLs, approximately 560,000 dataset variables and 35,000 unique identifiers (DOIs, IGSNs or ORCIDs). The various publishing workflows used by the participating data facilities will be presented along with the harvesting and interface developments. Details on how resources were indexed into text, spatial and graph systems and used for search interfaces will be presented, along with future directions building on this foundation.
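As an illustration of the kind of record the abstract describes, the following is a minimal sketch of a schema.org Dataset expressed as JSON-LD, together with a small Python routine that pulls out the three things the harvester is said to have counted: download URLs, variables and identifiers. Everything here is an assumption made for illustration: the example record, its URL and DOI are placeholders, and the function names `as_list` and `summarize_dataset` are hypothetical. It is not the Project 418 harvester, whose implementation is not described in this abstract.

```python
import json

# Hypothetical landing-page markup. All values are placeholders,
# not data from Project 418 or any real facility.
EXAMPLE_JSONLD = """
{
  "@context": {"@vocab": "https://schema.org/"},
  "@type": "Dataset",
  "name": "Example dataset",
  "identifier": "doi:10.0000/example",
  "variableMeasured": [
    {"@type": "PropertyValue", "name": "temperature"},
    {"@type": "PropertyValue", "name": "salinity"}
  ],
  "distribution": [
    {"@type": "DataDownload",
     "contentUrl": "https://example.org/data/example.csv"}
  ]
}
"""


def as_list(value):
    """Normalize a JSON-LD property that may be absent, a single value, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def summarize_dataset(doc):
    """Collect download URLs, variable names and identifiers from one Dataset record."""
    download_urls = [
        d["contentUrl"]
        for d in as_list(doc.get("distribution"))
        if isinstance(d, dict) and "contentUrl" in d
    ]

    # A variable may be a plain string or a PropertyValue object with a name.
    variables = []
    for v in as_list(doc.get("variableMeasured")):
        name = v.get("name") if isinstance(v, dict) else v
        if name:
            variables.append(name)

    # An identifier may be a plain string or a PropertyValue object with a value.
    identifiers = []
    for i in as_list(doc.get("identifier")):
        value = i.get("value") if isinstance(i, dict) else i
        if value:
            identifiers.append(value)

    return {
        "download_urls": download_urls,
        "variables": variables,
        "identifiers": identifiers,
    }


summary = summarize_dataset(json.loads(EXAMPLE_JSONLD))
print(summary)
```

The sketch treats the record as plain JSON rather than running a full JSON-LD processor, which keeps it dependency-free but means it only handles the simple property shapes shown here. Indexing into text, spatial or graph systems, as mentioned in the abstract, would be a separate step.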

Provider:
HydroShare