
DQ-Data Quality APP

Source: Snowflake. Updated: 2023-10-31. Indexed: 2024-05-01.
Download link:
https://app.snowflake.com/marketplace/listing/GZSTZCR1UTT
Description:
DQ - Data Quality APP

Implementing data quality checks is an essential part of designing data pipelines. Data quality ensures that a dataset meets criteria for accuracy, completeness, validity, consistency, uniqueness, timeliness, and fitness for purpose, and it is critical to all data governance initiatives within an organization.

This app uses a Python library for data quality validation and Snowflake's Snowpark API for data frames. Once the app is onboarded into a Snowflake account, data quality checks can be implemented by calling stored procedures with the relevant parameters. Installing the app on a Snowflake account creates the tables, sequences, and stored procedures that the framework needs in order to function. The app lets you use the built-in data quality checks and add your own rules as required, with little effort.

The Data Quality app provides the following capabilities:
- Add data quality rules
- View data quality results, with failed records in detail

The objects used in this framework are listed below.

***** SEQUENCES *****
- GE_FUNCTION_RULE_MAP_SEQ
- DQ_VIOLATIONS_SEQ
- DQ_RULES_SEQ
- PROCESS_METADATA_SEQ

***** TABLES *****
- DATASETS - Stores table-level information.
- ATTRIBUTE - Stores column-level information.
- DQ_RULES - Stores data quality rule information.
- GE_FUNCTION_RULE_MAP - Stores the mapping between functions and data quality rules.
- DQ_VIOLATIONS - Stores failed records in JSON format.
- SENSITIVE_ATTRIBUTES - Stores attributes to be identified as PI and PII information.
- PROCESS_METADATA - Stores process metadata information.

***** STORED PROCEDURES *****
- CREATE_METADATA(DB VARCHAR, SCHEMA VARCHAR, TBL_NAME VARCHAR);
- CREATE_DQ_RULE(TABLE_NAME VARCHAR, COLUMN_NAME VARCHAR, RULE_ID VARCHAR, EXPRESSION VARCHAR);
- CREATE PROCESS_METADATA(DB VARCHAR, SCHEMA VARCHAR, TBL_NAME VARCHAR);

For the full steps, refer to the sample worksheet.
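The description says that checks are set up by calling stored procedures with the relevant parameters. As a minimal sketch of what that might look like from Python, the snippet below builds `CALL` statements for `CREATE_METADATA` and `CREATE_DQ_RULE` using the signatures listed in the description. The database, schema, table, column, rule ID, and expression values are hypothetical, as is the order of the calls; the app's sample worksheet is the authority on the real steps. The helpers `sql_literal`, `build_call`, and `run_all` are illustrative names and are not part of the app.

```python
def sql_literal(value: str) -> str:
    """Quote a Python string as a SQL string literal by doubling single quotes.

    Backslash escapes are not handled in this sketch.
    """
    return "'" + value.replace("'", "''") + "'"


def build_call(procedure: str, *args: str) -> str:
    """Build a CALL statement for a procedure whose arguments are all VARCHAR."""
    rendered_args = ", ".join(sql_literal(arg) for arg in args)
    return f"CALL {procedure}({rendered_args})"


# Hypothetical values: SALES_DB, PUBLIC, ORDERS, ORDER_ID, RULE_001 and the
# expression are placeholders, not names or rule IDs defined by the app.
statements = [
    # Assumed first step: register table-level metadata for the target table.
    build_call("CREATE_METADATA", "SALES_DB", "PUBLIC", "ORDERS"),
    # Assumed second step: attach a rule to one column of that table.
    build_call("CREATE_DQ_RULE", "ORDERS", "ORDER_ID", "RULE_001", "ORDER_ID IS NOT NULL"),
]


def run_all(session, sql_statements):
    """Run each statement on an object that exposes sql(...).collect().

    A Snowpark Session provides that interface; no connection is opened here.
    """
    return [session.sql(statement).collect() for statement in sql_statements]


for statement in statements:
    print(statement)
```

Building the statements separately from running them keeps the sketch runnable without a Snowflake connection; with a real Snowpark session, `run_all(session, statements)` would submit them in order.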
Provider:
Publicis Sapient
Created:
2023-10-03
Collected summary
Dataset introduction
Background and challenges
Background overview
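DQ-Data Quality APP is a data quality validation tool built on Python and Snowflake's Snowpark API. It provides built-in check rules and support for custom rules, and it records detailed quality check results along with the failed records. The app creates the tables, sequences, and stored procedures in the Snowflake account that the data quality framework needs in order to run.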
The content above was collected and summarized by 遇见数据集.
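To illustrate the idea of recording failed records in JSON format, here is a small, self-contained sketch in plain Python. It is not the app's implementation: the sample rows, the rule IDs, and the layout of each violation entry are all hypothetical, and the real app runs its checks through a Python validation library and Snowpark rather than over in-memory dictionaries.

```python
import json

# Hypothetical rows standing in for a table under validation.
rows = [
    {"ORDER_ID": 1, "AMOUNT": 25.0},
    {"ORDER_ID": None, "AMOUNT": 10.0},
    {"ORDER_ID": 3, "AMOUNT": -4.0},
]

# Hypothetical rules: (rule id, column, predicate returning True when a value passes).
rules = [
    ("RULE_NOT_NULL", "ORDER_ID", lambda value: value is not None),
    ("RULE_NON_NEGATIVE", "AMOUNT", lambda value: value is not None and value >= 0),
]


def find_violations(rows, rules):
    """Return one entry per (rule, failing row), with the row serialized as JSON."""
    violations = []
    for rule_id, column, passes in rules:
        for row in rows:
            if not passes(row[column]):
                violations.append(
                    {
                        "rule_id": rule_id,
                        "column": column,
                        # Store the whole failing record as a JSON string.
                        "failed_record": json.dumps(row, sort_keys=True),
                    }
                )
    return violations


violations = find_violations(rows, rules)
for violation in violations:
    print(violation)
```

Serializing the whole failing row keeps each violation self-describing, so it can be inspected later without re-querying the source table.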