
QAW: A Quality Assurance Workflow for Ontologies based on Detecting Semantic Regularities - Dataset on SNOMED

Source: Figshare. Updated: 2013-05-20. Indexed: 2026-04-29.
Download link:
https://figshare.com/articles/dataset/QAW_A_Quality_Assurance_Workflow_for_Ontologies_based_on_Detecting_Semantic_Regularities/701284
Description:
This page contains supplementary material for the EKAW 2014 submission titled "QAW: A Quality Assurance Workflow for Ontologies based on Detecting Semantic Regularities" by Eleni Mikroyannidi, Manuel Quesada-Martínez, Dmitry Tsarkov, Jesualdo Tomas Fernandez Breis, Robert Stevens, and Ignazio Palmisano. The regularities were detected with the Regularity Inspector for Ontologies (RIO) framework. The project is open source and can be downloaded from the links below. The fileset contains the data for the qualitative and quantitative analyses presented in the paper.

In the qualitative analysis, six lexical patterns (keywords) were processed: "chronic", "acute", "absent", "present", "right", and "left". For each of these, the reader can browse and download the following data:

1. XML files with the generic name $keyword_syntactic_usage.xml, which contain the detected syntactic regularities for the referencing asserted axioms of the entities that contain the corresponding keyword in their label. There should be six files in total, one per keyword.
2. XML files with the generic name $keyword_semantic_usage.xml, which contain the detected semantic regularities for the referencing asserted axioms of the entities that contain the keyword in their label.
3. Text files with the generic name $keyword_syntactic_usage_readable.txt, which contain the syntactic regularities in a more readable format with label rendering.
4. Text files with the generic name $keyword_semantic_usage_readable.txt, which contain the semantic regularities in a more readable format with label rendering.

In the quantitative analysis, 308 lexical patterns were processed, and the corresponding syntactic and semantic regularities were detected. The dataset available to the reader contains the following:

1. LexAnal_Snomed_2013_NoSensitiveAnalysis_Cov_0.1_100.0.xml, which contains all lexical patterns that could be detected in the January 2013 version of SNOMED-CT.
2. Snomed_2013_LexAnal_Full_0.1-0.4Perc_.xml, which contains all lexical patterns with a lexical pattern threshold of 0.1%-0.4%.
3. syntactic_regularities_dataset.zip, which contains 308 XML files with the syntactic regularities generated by RIO.
4. semantic_regularities_dataset.zip, which contains 308 XML files with the semantic regularities generated by RIO.
5. quantitative_syntactic_regularity_analysis.csv, which contains the statistical analysis of the syntactic regularities for the 308 processed cases.
6. quantitative_semantic_regularity_analysis.csv, which contains the statistical analysis of the semantic regularities for the 308 processed cases.
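The naming scheme for the qualitative-analysis files can be sketched as a short Python snippet that lists the file names one would expect for the six keywords. This is only an illustration: it assumes that `$keyword` is replaced literally by the lowercase keyword, and the helper name `expected_qualitative_files` is hypothetical. The actual names in the fileset may differ.

```python
# Keywords listed in the description of the qualitative analysis.
KEYWORDS = ["chronic", "acute", "absent", "present", "right", "left"]

# Generic file names from the description, with "$keyword" written as a
# format placeholder. Assumption: the keyword is substituted as-is.
NAME_PATTERNS = [
    "{keyword}_syntactic_usage.xml",
    "{keyword}_semantic_usage.xml",
    "{keyword}_syntactic_usage_readable.txt",
    "{keyword}_semantic_usage_readable.txt",
]


def expected_qualitative_files(keywords=KEYWORDS):
    """Return the expected file names, grouped by keyword (hypothetical helper)."""
    return [
        pattern.format(keyword=keyword)
        for keyword in keywords
        for pattern in NAME_PATTERNS
    ]


if __name__ == "__main__":
    for name in expected_qualitative_files():
        print(name)
```

Under these assumptions the list has four entries per keyword, which makes it easy to check a downloaded copy of the fileset for missing files.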
Created:
2013-05-20