five

Data Warehouse Accelerator

收藏
Databricks2024-08-03 收录
下载链接:
https://marketplace.databricks.com/details/652d5721-9ede-4cf7-b042-52791155b161/Pingahla-Inc_Data-Warehouse-Accelerator
下载链接
链接失效反馈
官方服务:
资源简介:
**Overview** This Accelerator automates the creation of a datawarehouse in Databricks lakehouse and provides users with robust collaboration tools to efficiently develop a datawarehouse from various sources. To run Pingahla's Data Warehouse Accelerator, execute the provided notebook. This accelerator allows you to build a data warehouse from different sources (such as SQLServer, AWS-S3, Oracle, Redshift and many others) with a single click. It will read data from the sources and create landing, staging, and dimension tables in Databricks. The landing load will be a delta load with truncate, the staging will be SCD1, and the dimension will be SCD2. \ **Use cases** - Data Warehouse Creation: Build a data warehouse from various sources with a single click. - Tracking Batch Schedules: Utilize a control table to monitor the start and completion times of each processing layer (Landing, Staging, Dimension) for a given cycle. - Monitoring via Email Notification: Error messages are recorded and sent via email to a designated group, providing all necessary information. For more details, refer to the embedded notebook \ **Additional Insights** This data warehouse project is designed for integration with multiple platforms, such as Databricks and AWS. Users can configure the project through a JSON configuration file, allowing for flexibility and customization to meet their specific needs.
提供机构:
Pingahla Inc
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作