PostgreSQL Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://groups.cs.umass.edu/kdl/causal-eval-data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了11,252条查询的执行信息,这些查询对应了90,016种不同的协变量-治疗方法组合,在Postgres数据库上执行。研究重点在于探讨各种系统参数对查询运行时间的影响。数据集还包括了影响查询执行时间的处理变量,如内存级别(MemoryLevel)、索引级别(IndexLevel)和页面成本(PageCost)。规模上,数据集涵盖了11,252条查询和90,016个协变量-治疗方法组合。此外,该数据集的任务是用于机器学习中的异常检测和解释。
This dataset contains execution records for 11,252 queries, which correspond to 90,016 distinct covariate-treatment pairs executed on a Postgres database. The research focuses on investigating the impact of various system parameters on query runtime. The dataset also includes processing variables that affect query execution time, namely MemoryLevel, IndexLevel, and PageCost. In terms of scale, the dataset covers 11,252 queries and 90,016 covariate-treatment pairs. Additionally, this dataset is designed for anomaly detection and explanation tasks in machine learning.
提供机构:
University of Massachusetts



