PALADIN: Benchmarks, Experimental Settings, and Evaluation
收藏DataCite Commons2024-04-11 更新2024-07-13 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/4e81c650-6b82-4a0a-bf3c-ab81394cf8df
下载链接
链接失效反馈官方服务:
资源简介:
This collection includes all the data and scripts necessary to reproduce the results from the experimental study of [PALADIN](https://github.com/SDM-TIB/PALADIN).
__Data__
The data is generated using the [Synthetic Data Generator](https://github.com/SDM-TIB/Synthetic-Data-Generator) which generates process-based breast cancer treatment data following the distribution in a real population of breast cancer patients. The collection comprises a total of 18 data sets, nine for relational databases and nine for RDF-based knowledge graphs. For each data format, there are three different sizes of data sets:
+ _Small_ models 1,000 patients
+ _Medium-sized_ models 10,000 patients
+ _Large_ models 100,000 patients
There are three data sets of each size. They differ in the parameter used for the mutation probability of the data generator. The lower this value is, the closer the data is to following the treatment guideline for breast cancer patients with an amplified HER2 gene.
The data is available for download in
* Turtle format: `synth_data_ttl.zip`
* Preloaded for the use with Virtuoso 7.20.3237: `synth_data_virtuoso.zip`
* MySQL 8.1 dump: `synth_data_sql.zip`
__PALADIN Schemas__
The file `paladin_schemas.zip` contains the different PALADIN schemas used in the experimental study. There are mainly seven different schemas. One of them represents the treatment guideline for breast cancer patients with an amplified HER2 gene. The remaining six shemas are used in the study of the scalability. They divide the patients based on the ranges over their IDs. They comprise of 16, 32, 64, 128, 256, 512, and 1024 nodes, respectively.
__Experimental Environment__
In order to reproduce the results, download the file `experiments.zip`. Once unzipped, execute the file `run_experiments.sh`. Note that you need to have Docker installed. The script `run_experiments.sh` should be executed with sudo permissions in order to let the script automatically transfer the ownership of the files created with Docker to your user.
提供机构:
TIB
创建时间:
2023-11-15



