NIMS polymer database PoLyInfo (III): modularizing ShEx schemas for descriptors and properties in PoLyInfoRDF
收藏Figshare2025-09-11 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/NIMS_polymer_database_PoLyInfo_III_modularizing_ShEx_schemas_for_descriptors_and_properties_in_PoLyInfoRDF/30104840
下载链接
链接失效反馈官方服务:
资源简介:
PoLyInfo is a polymer database of the National Institute for Materials Science (NIMS) of Japan. In our previous work, to make the PoLyInfo data machine-readable and further machine-understandable, we built PoLyInfoRDF to store these data in the standard Resource Description Framework (RDF) format and then defined its schema in the Shape Expressions (ShEx) language. When designing the schema, it is important to modularize the schema such that the common components are reusable. This is the objective of this study and is essential for efficiently defining schemas of the descriptors and properties, which constitute the core of PoLyInfo, a large collection of experimentally measured polymer characteristics. As an example of modularization, descriptors of the source-based name and molecular formula both include a string value, hence their schemas may well share (‘inherit’) the schema for string values, which would be defined once and subsequently reused throughout the entire set of schemas. Actually we noticed a considerable amount of common portions among schemas of descriptors and properties, and clarified a ‘schema hierarchy’ to reflect the above ‘inheritance’ relationships, separately from the ontological ‘concept hierarchy’. We then investigated the extent to which the adapted strategy was able to successfully define the PoLyInfoRDF schema. Under this schema hierarchy, inheritance mechanisms in ShEx played a significant role in sharing common portions effectively in a well-organized manner. We expect future developments based on our approach to contribute to the standardization of scientific data representation in RDF by providing a library of reusable schemas. We have developed a new method for modularizing scientific data schemas and managing them hierarchically, and demonstrated it in PoLyInfo. This paves the way for data fusion in materials chemistry.
创建时间:
2025-09-11



