Data Warehouse Accelerator
收藏Databricks2024-08-03 收录
下载链接:
https://marketplace.databricks.com/details/652d5721-9ede-4cf7-b042-52791155b161/Pingahla-Inc_Data-Warehouse-Accelerator
下载链接
链接失效反馈官方服务:
资源简介:
**Overview**
This Accelerator automates the creation of a datawarehouse in Databricks lakehouse and provides users with robust collaboration tools to efficiently develop a datawarehouse from various sources.
To run Pingahla's Data Warehouse Accelerator, execute the provided notebook. This accelerator allows you to build a data warehouse from different sources (such as SQLServer, AWS-S3, Oracle, Redshift and many others) with a single click. It will read data from the sources and create landing, staging, and dimension tables in Databricks. The landing load will be a delta load with truncate, the staging will be SCD1, and the dimension will be SCD2.
\
**Use cases**
- Data Warehouse Creation: Build a data warehouse from various sources with a single click.
- Tracking Batch Schedules: Utilize a control table to monitor the start and completion times of each processing layer (Landing, Staging, Dimension) for a given cycle.
- Monitoring via Email Notification: Error messages are recorded and sent via email to a designated group, providing all necessary information.
For more details, refer to the embedded notebook
\
**Additional Insights**
This data warehouse project is designed for integration with multiple platforms, such as Databricks and AWS. Users can configure the project through a JSON configuration file, allowing for flexibility and customization to meet their specific needs.
提供机构:
Pingahla Inc



