Public BI benchmark - part 1

Source: NIAID Data Ecosystem (indexed 2026-03-13)
Download link:
https://zenodo.org/record/6277286
Description:
Originally published at: https://github.com/cwida/public_bi_benchmark

Originally compressed with bzip2; compressed here with gzip.

"User generated benchmark derived from the DBTest'18 paper [1] by Tableau. It contains real data and queries from 46 public workbooks in Tableau Public [2]. We downloaded 46 of the biggest workbooks and converted the data to .csv files and collected the SQL queries that appear in the Tableau log when the workbooks are visualized. We processed the .csv files and queries with the purpose of making them load and run on different database systems.

Each directory is associated with a workbook and contains:

- samples: a sample of each .csv file (first 20 rows)
- tables: .sql files containing the schema of each .csv file
- queries: .sql files containing the queries
- data-urls.txt: links for downloading the full .csv.bz2 compressed files

There are 46 workbooks containing 206 tables (.csv files) with the total size of 41 GB compressed and 386 GB uncompressed. Multiple .csv files may overlap but are not identical. This is because Tableau extracts the same workbook in multiple different ways for different queries."
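
The per-workbook directory layout described above could be inspected with a short script before downloading any of the full data. The following is a minimal Python sketch, not part of the benchmark itself. It assumes only the names given in the description (samples, tables, queries, data-urls.txt). The function name summarize_workbook, the assumption that data-urls.txt holds one URL per line, and the exact file names inside each subdirectory are illustrative guesses.

```python
from pathlib import Path


def summarize_workbook(workbook_dir):
    """Summarize one workbook directory.

    Assumes the layout named in the dataset description:
    samples/, tables/, queries/ and data-urls.txt.
    Missing pieces are reported as empty lists rather than errors.
    """
    root = Path(workbook_dir)

    # Assumption: data-urls.txt lists one download URL per line.
    urls_file = root / "data-urls.txt"
    urls = []
    if urls_file.is_file():
        urls = [
            line.strip()
            for line in urls_file.read_text().splitlines()
            if line.strip()
        ]

    def list_files(subdir, pattern):
        # Return sorted file names, or an empty list if the folder is absent.
        folder = root / subdir
        if not folder.is_dir():
            return []
        return sorted(p.name for p in folder.glob(pattern) if p.is_file())

    return {
        "workbook": root.name,
        "samples": list_files("samples", "*"),    # first-20-row samples
        "tables": list_files("tables", "*.sql"),  # schema definitions
        "queries": list_files("queries", "*.sql"),
        "data_urls": urls,                        # links to .csv.bz2 files
    }
```

Calling summarize_workbook on each top-level directory of a local copy gives a quick count of tables and queries per workbook, which can be compared against the totals stated in the description.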
Created:
2022-05-06