Replication Data for: Synchronic Curation for Assessing Reuse and Integration Fitness of Multiple Data Collections
Source: DataCite Commons. Updated: 2026-03-27. Indexed: 2026-05-05.
Download link:
https://dataverse.tdl.org/citation?persistentId=doi:10.18738/T8/OOTALX
Resource description:
The dataset in this publication demonstrates the implementation and capabilities of Synchronic Curation (SC) in ASTRIAGraph.
SC is a framework for curating multiple large datasets so that they can be integrated and reused in research applications. Data-driven applications often require data integrated from different large, continuously updated collections. These collections may contain gaps and overlaps, and may conflict with or complement each other. A recurring curation need is therefore to continuously assess whether data are fit for integration and reuse. The SC framework involves processing steps that map different collections to a unifying data model, which represents research problems in a scientific area as well as the collections' provenance. Data points from the collections that are integrated into the system are mapped to the data model, and a unified data dictionary is maintained centrally and expanded as needed. The data model is implemented in a graph database where collections are continuously ingested and queried. SC includes a collection analysis and comparison module to track collection updates and to identify gaps, changes, and irregularities within and across collections. Users can query the database or explore comparison results through an interactive graph.
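As an illustration of the mapping step described above, the following is a minimal Python sketch, not the ASTRIAGraph implementation. The collection names, source field names, and dictionary terms (`object_name`, `launch_date`, `state_of_registry`) are hypothetical. It shows one way that fields from different collections could be mapped to shared dictionary terms while keeping each value's provenance, so that conflicts between collections can be detected.

```python
from __future__ import annotations
from dataclasses import dataclass

# Hypothetical mappings: source field name -> unified data dictionary term.
FIELD_MAPS = {
    "collection_a": {"name": "object_name", "launch": "launch_date"},
    "collection_b": {
        "OBJECT": "object_name",
        "LAUNCH_DT": "launch_date",
        "COUNTRY": "state_of_registry",
    },
}

@dataclass(frozen=True)
class DataPoint:
    term: str    # term from the unified data dictionary
    value: str   # value as reported by the source collection
    source: str  # provenance: the collection the value came from

def map_record(collection: str, record: dict) -> list[DataPoint]:
    """Map one source record to data points, skipping unmapped or empty fields."""
    mapping = FIELD_MAPS[collection]
    return [
        DataPoint(mapping[field], value, collection)
        for field, value in record.items()
        if field in mapping and value not in (None, "")
    ]

def find_conflicts(points: list[DataPoint]) -> dict:
    """Return the terms for which different collections report different values."""
    by_term: dict = {}
    for p in points:
        by_term.setdefault(p.term, {})[p.source] = p.value
    return {t: vals for t, vals in by_term.items() if len(set(vals.values())) > 1}
```

In this sketch, fields with no entry in the mapping are silently dropped; a real curation pipeline would more likely log them so that the data dictionary can be expanded as needed.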
We present three files:
1) The state of the Synchronic Curation data model in ASTRIAGraph as of the date of this publication. The data model includes labeled classes identified by domain scientists as comprising research problems in the space, and their corresponding properties. Classes and properties are defined according to a unified data dictionary maintained by the ASTRIAGraph team. Some terms/labels and definitions are extracted from the Unified Astronomy Thesaurus. The data model also includes the names of the collections ingested into ASTRIAGraph, as well as the relationships between their data points and the classes and properties to which they have been mapped.
2) Schema for comparing data fields of two versions of the collection of the United Nations Office for Outer Space Affairs (UNOOSA) Space Object Register.
3) Matrix with the final tally of the comparison of the two versions. The results can be accessed via web-based interactive graphs whose URLs are noted in the metadata.
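To make the idea of a field-level comparison tally concrete, here is a minimal Python sketch. It is an assumption about how such a tally could be computed, not the schema or code distributed with this dataset. The function name `compare_versions`, the status labels, and the field names in the usage example are all hypothetical.

```python
from collections import Counter

def compare_versions(old: dict, new: dict, fields: list) -> tuple:
    """Tally field-level differences between two versions of a collection.

    `old` and `new` map a record identifier to a dict of field values.
    Only records present in both versions are compared field by field;
    records present in just one version are counted separately.
    """
    tally = {f: Counter() for f in fields}
    for rid in old.keys() & new.keys():
        for f in fields:
            before, after = old[rid].get(f), new[rid].get(f)
            if before == after:
                status = "unchanged"  # includes the case where both are missing
            elif before is None:
                status = "filled"     # value present only in the new version
            elif after is None:
                status = "emptied"    # value present only in the old version
            else:
                status = "changed"
            tally[f][status] += 1
    records = {
        "added": len(new.keys() - old.keys()),
        "removed": len(old.keys() - new.keys()),
    }
    return tally, records
```

A possible usage, with invented records:

```python
old = {"1": {"name": "A", "status": None}, "2": {"name": "B", "status": "in orbit"}}
new = {"1": {"name": "A", "status": "decayed"}, "2": {"name": "B2", "status": "in orbit"}, "3": {"name": "C"}}
tally, records = compare_versions(old, new, ["name", "status"])
```

Each per-field `Counter` corresponds to one row of a tally matrix, with the status labels as columns.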
Originally developed for ASTRIAGraph, SC can be applied to other areas of knowledge. It is especially useful for very large and frequently updated datasets. This dataset can be used to learn about the methodology used to process the data for SC and to replicate results.
Provider:
Texas Data Repository
Created:
2022-06-27



