Dataset - Review Paper Data Vault
Source: DataCite Commons. Updated: 2026-04-22. Indexed: 2026-05-04.
Download link:
https://data.mendeley.com/datasets/73mzkf6rn4
Description:
This dataset supports a systematic review of automation techniques in modern data warehouse design, with emphasis on dimensional modeling, Data Vault, ETL/ELT automation, schema handling, and emerging AI-based approaches such as machine learning (ML) and large language models (LLMs). It includes an Excel file listing the scientific papers retrieved by the NLP toolkit, together with their metadata, screening outcomes, and classification attributes, as well as the JSON configuration file that defines the search keywords, synonym groups, mandatory properties, exclusion criteria, and time range used in the review process. The data were gathered through an automated, PRISMA-aligned literature search across major digital libraries, followed by deduplication, preprocessing, filtering, and manual validation. The dataset can be used to reproduce the review, examine publication and methodological trends, identify research gaps, and support further studies on automated data engineering, particularly the current fragmentation of solutions and the limited end-to-end automation of Data Vault and related architectures.
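As a rough illustration of how a configuration of this kind could drive the search and screening steps described above, the following Python sketch builds a boolean query from synonym groups, removes duplicate titles, and applies time-range and exclusion filters. Everything in it is an assumption made for illustration: the key names (`synonym_groups`, `exclusion_terms`, `year_from`, `year_to`), the placeholder values, the helper functions, and the sample records are hypothetical and are not taken from the dataset's actual JSON or Excel files.

```python
import json

# Hypothetical configuration. The key names and values are illustrative
# assumptions, NOT the actual schema or contents of the dataset's JSON file.
CONFIG_JSON = """
{
  "synonym_groups": [
    ["data vault", "data warehouse"],
    ["automation", "automated"]
  ],
  "exclusion_terms": ["blockchain"],
  "year_from": 2015,
  "year_to": 2025
}
"""


def build_query(synonym_groups):
    """OR the terms inside each synonym group, then AND the groups together."""
    clauses = [
        "(" + " OR ".join(f'"{term}"' for term in group) + ")"
        for group in synonym_groups
    ]
    return " AND ".join(clauses)


def deduplicate(records):
    """Keep the first record per normalized title (lowercase, single spaces)."""
    seen = set()
    unique = []
    for rec in records:
        key = " ".join(rec["title"].lower().split())
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique


def passes_filters(record, config):
    """Check time range, one match per synonym group, and no exclusion term."""
    text = record["title"].lower()
    in_range = config["year_from"] <= record["year"] <= config["year_to"]
    has_all_groups = all(
        any(term in text for term in group) for group in config["synonym_groups"]
    )
    excluded = any(term in text for term in config["exclusion_terms"])
    return in_range and has_all_groups and not excluded


config = json.loads(CONFIG_JSON)
query = build_query(config["synonym_groups"])

# Invented sample records for demonstration only; not rows from the dataset.
records = [
    {"title": "Automated Data Vault Modeling", "year": 2021},
    {"title": "automated  data vault modeling", "year": 2021},  # duplicate title
    {"title": "Data Warehouse Automation on Blockchain", "year": 2020},
    {"title": "Data Vault Automation Survey", "year": 2010},
    {"title": "Graph Databases", "year": 2022},
]

selected = [r for r in deduplicate(records) if passes_filters(r, config)]
print(query)
print(selected)
```

Matching on titles alone keeps the sketch short; a real screening step would likely also consider abstracts and keywords, and the manual validation mentioned in the description has no counterpart here.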
Provider:
Mendeley Data
Created:
2026-04-22



