datapr
datapr is a Python library for data profiling and analysis. It aims to provide a comprehensive set of tools for understanding a dataset's characteristics: identifying data types, detecting missing values, calculating descriptive statistics, and uncovering potential data quality issues. The library can also generate reports that summarize key aspects of a dataset, so users can quickly gain insight into its structure and content.
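To make the profiling steps described above concrete, here is a minimal sketch written with only the Python standard library. It does not use or reflect datapr's actual API, which this text does not show; the names `profile_column` and `profile_dataset` and the sample data are hypothetical, chosen purely for illustration.

```python
import statistics
from typing import Any


def profile_column(values: list[Any]) -> dict[str, Any]:
    """Summarize one column (hypothetical helper, not part of datapr)."""
    # Treat None as the missing-value marker for this sketch.
    present = [v for v in values if v is not None]
    summary: dict[str, Any] = {
        "count": len(values),
        "missing": len(values) - len(present),
        "types": sorted({type(v).__name__ for v in present}),
    }
    # Descriptive statistics only apply to purely numeric columns.
    numeric = bool(present) and all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    if numeric:
        summary["mean"] = statistics.mean(present)
        summary["median"] = statistics.median(present)
        # Sample standard deviation and quartiles need at least two values.
        if len(present) >= 2:
            summary["stdev"] = statistics.stdev(present)
            summary["quartiles"] = statistics.quantiles(present, n=4)
    return summary


def profile_dataset(columns: dict[str, list[Any]]) -> dict[str, dict[str, Any]]:
    """Profile every column of a dataset given as {column name: values}."""
    return {name: profile_column(vals) for name, vals in columns.items()}


# Illustrative data only: one numeric and one text column, each with a gap.
dataset = {
    "age": [34, 28, None, 45, 39],
    "city": ["Oslo", None, "Lima", "Oslo", "Pune"],
}
report = profile_dataset(dataset)
for column, summary in report.items():
    print(column, summary)
```

Each column's summary records how many values are missing and which Python types occur, and numeric columns additionally get the mean, median, standard deviation, and quartiles. A dedicated profiling library would typically layer report generation and data quality checks on top of summaries like these.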
The primary goal of datapr is to automate the often time-consuming process of initial data exploration. By
Key features of datapr include the ability to compute measures like mean, median, standard deviation, quantiles,