Extracted Schemas from the Life Sciences Linked Open Data Cloud
Source: Mendeley Data | Updated: 2024-06-27 | Indexed: 2024-06-27
Download link:
https://figshare.com/articles/dataset/Extracted_Schemas_from_the_Life_Sciences_Linked_Open_Data_Cloud/12402425
Description:
This dataset accompanies the manuscript "An empirical meta-analysis of the life sciences linked open data on the web", published in Nature Scientific Data. If you use the dataset, please cite the manuscript as follows:

Kamdar, M.R., Musen, M.A. An empirical meta-analysis of the life sciences linked open data on the web. Sci Data 8, 24 (2021). https://doi.org/10.1038/s41597-021-00797-y

We extracted schemas from more than 80 publicly available biomedical linked data graphs in the Life Sciences Linked Open Data (LSLOD) cloud into an LSLOD schema graph, and conducted an empirical meta-analysis to evaluate the extent of semantic heterogeneity across the LSLOD cloud.

The dataset published here contains the following files:
- The set of Linked Data Graphs from the LSLOD cloud from which schemas are extracted.
- Refined sets of extracted classes, object properties, data properties, and datatypes shared across the Linked Data Graphs on the LSLOD cloud. Where a schema element is reused from a Linked Open Vocabulary or an ontology, this is explicitly indicated.
- The LSLOD Schema Graph, which contains all of the above extracted schema elements, interlinked with each other based on the underlying content. Sample instances and sample assertions are also provided, along with broad-level characteristics of the modeled content.

The LSLOD Schema Graph is saved as a JSON Pickle file. To read the JSON object in this Pickle file, use the following Python commands:

import pickle
with open('LSLOD-Schema-Graph.json.pickle', 'rb') as infile:
    x = pickle.load(infile, encoding='iso-8859-1')

Check the Referenced Link for more details on this research, raw data files, and code references.
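The loading step above can be wrapped in a small helper. The following is a minimal sketch: the file name and the `encoding='iso-8859-1'` argument come from the dataset description, while the function names `load_schema_graph` and `describe` are hypothetical and introduced here only for illustration. The internal layout of the loaded object is not documented in this listing, so the sketch only reports its top-level type and size rather than assuming particular keys.

```python
import pickle
from pathlib import Path


def load_schema_graph(path):
    """Load the pickled LSLOD schema graph and return the Python object.

    Hypothetical helper name; the call itself mirrors the dataset description.
    """
    # Only unpickle files from a source you trust: pickle can execute
    # arbitrary code while loading.
    with open(path, "rb") as infile:
        # The encoding argument follows the dataset description; it controls
        # how 8-bit strings written by Python 2 are decoded.
        return pickle.load(infile, encoding="iso-8859-1")


def describe(obj, max_keys=5):
    """Return a short summary without assuming the object's internal layout."""
    if isinstance(obj, dict):
        keys = list(obj)[:max_keys]
        return f"dict with {len(obj)} keys, first keys: {keys}"
    if isinstance(obj, (list, tuple, set)):
        return f"{type(obj).__name__} with {len(obj)} items"
    return type(obj).__name__


if __name__ == "__main__":
    path = Path("LSLOD-Schema-Graph.json.pickle")
    if path.exists():
        print(describe(load_schema_graph(path)))
    else:
        print(f"{path} not found; download it from the dataset page first.")
```

Inspecting the top-level type first is a cautious way to explore the file, since the schema graph's exact structure should be confirmed against the manuscript and the referenced code.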
Created:
2023-06-28



