Deequ
Deequ is an open-source library developed by AWS for data validation and quality monitoring. It is designed to help data engineers and data scientists ensure the quality and integrity of their data pipelines. Deequ provides a set of tools and APIs that allow users to define and enforce data quality rules, detect anomalies, and monitor data quality over time.
The library supports various data sources, including Apache Spark, Apache Hive, and AWS Glue. It offers a
One of the key features of Deequ is its ability to handle large-scale data processing. It leverages
Deequ is licensed under the Apache License 2.0, which allows for free use, modification, and distribution. It