PALADIN: Benchmarks, Experimental Settings, and Evaluation

DataCite Commons, updated 2024-04-11, indexed 2024-07-13
Download link:
https://service.tib.eu/ldmservice/dataset/4e81c650-6b82-4a0a-bf3c-ab81394cf8df
Description:
This collection includes all the data and scripts necessary to reproduce the results of the experimental study of [PALADIN](https://github.com/SDM-TIB/PALADIN).

__Data__

The data is generated using the [Synthetic Data Generator](https://github.com/SDM-TIB/Synthetic-Data-Generator), which generates process-based breast cancer treatment data following the distribution in a real population of breast cancer patients. The collection comprises a total of 18 data sets, nine for relational databases and nine for RDF-based knowledge graphs. For each data format, there are three different sizes of data sets:

+ _Small_ models 1,000 patients
+ _Medium-sized_ models 10,000 patients
+ _Large_ models 100,000 patients

There are three data sets of each size. They differ in the mutation probability parameter of the data generator. The lower this value, the more closely the data follows the treatment guideline for breast cancer patients with an amplified HER2 gene.

The data is available for download in the following forms:

* Turtle format: `synth_data_ttl.zip`
* Preloaded for use with Virtuoso 7.20.3237: `synth_data_virtuoso.zip`
* MySQL 8.1 dump: `synth_data_sql.zip`

__PALADIN Schemas__

The file `paladin_schemas.zip` contains the PALADIN schemas used in the experimental study. There are mainly seven different schemas. One of them represents the treatment guideline for breast cancer patients with an amplified HER2 gene. The remaining six schemas are used in the scalability study. They divide the patients based on ranges over their IDs. They comprise 16, 32, 64, 128, 256, 512, and 1024 nodes, respectively.

__Experimental Environment__

To reproduce the results, download the file `experiments.zip`. Once unzipped, execute the file `run_experiments.sh`. Note that Docker must be installed. The script `run_experiments.sh` should be executed with sudo permissions so that it can automatically transfer ownership of the files created with Docker to your user.
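
The data set layout described under __Data__ (two formats, three sizes, three mutation settings) can be sketched as a small enumeration. This is only an illustration of the counting, not part of the collection's scripts. The format keys and the mutation labels `mutation_a` to `mutation_c` are hypothetical placeholders, since the description does not state the actual mutation probabilities or file names.

```python
from itertools import product

# Patients per data set size, as stated in the description.
SIZES = {"small": 1_000, "medium": 10_000, "large": 100_000}

# The two data formats named in the description (keys are illustrative).
FORMATS = ("relational", "rdf")

# Hypothetical labels: the actual mutation probabilities are not given.
MUTATION_LEVELS = ("mutation_a", "mutation_b", "mutation_c")


def enumerate_data_sets():
    """Return one record per (format, size, mutation level) combination."""
    return [
        {"format": fmt, "size": size, "patients": SIZES[size], "mutation": level}
        for fmt, size, level in product(FORMATS, SIZES, MUTATION_LEVELS)
    ]


data_sets = enumerate_data_sets()
rdf_count = sum(1 for d in data_sets if d["format"] == "rdf")

print(len(data_sets))  # 18 data sets in total
print(rdf_count)       # 9 of them for RDF-based knowledge graphs
```

The product of 2 formats, 3 sizes, and 3 mutation settings gives the 18 data sets mentioned above, nine per format.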
Provider:
TIB
Created:
2023-11-15