DataFrame-level#
Exploration Methods#
Generates a styled overview table with key statistics about the DataFrame. |
|
Generates a comprehensive report for rows containing specified anomalies across ALL columns. |
|
Unified method to detect different types of anomalies by column. |
|
|
Advanced correlation matrix showing only columns with anomalies |
|
Unified method to analyze co-occurring anomalies in column combinations. |
Unified method to analyze rows containing specified anomalies across ALL columns by categories. |
|
Detect and analyze rows with simultaneous anomalies of specified type in multiple columns. |
|
|
Plot anomalies across ALL columns over time using resampling. |
Preprocessing Methods#
|
Perform missing value imputation on specified numerical columns. |
|
Restores a full index for a DataFrame by filling in missing dates and categories. |
Analysis Methods#
|
Enhanced cohort analysis with multiple metrics and professional features. |
|
Builds an advanced correlation matrix for numeric columns in the dataframe. |
|
Perform comprehensive RFM (Recency, Frequency, Monetary) analysis with visualization. |
|
Creates a combined visualization showing segment distribution (left) and metric profiles (right). |
Creates faceted horizontal bar charts for a metric across multiple dimensions. |
|
|
Perform comprehensive sentiment analysis on text data with multiple visualization options. |
|
Generate bar plots showing the most/least frequent words in a text column. |
Visualization Methods#
|
Creates a bar chart using the Plotly Express library. |
|
Creates a line chart using the Plotly Express library. |
|
Creates an area chart using the Plotly Express library. |
|
Creates a box plot using Plotly Express. |
|
Creates an enhanced heatmap visualization with support for data aggregation, filtering, and advanced customization. |
|
Create an advanced pairplot of numerical variables with multiple display options. |
|
Creates a combined pie and bar chart visualization. |
|
Creates a histogram chart using the Plotly Express library. |
|
Compare two categorical variables in a DataFrame and visualize the results. |
|
Generate an interactive word cloud visualization using Plotly. |
|
Create a parallel categories plot with customizable colors and category filtering. |
|
Plot period-over-period changes for a given metric using pd.Grouper with enhanced customization. |
Statistics Methods#
|
Perform normality tests for each group and display QQ-plots. |
|
Perform Levene's test for equality of variances. |
|
Perform comprehensive independent samples t-test analysis with step-by-step reporting. |
|
Perform Mann-Whitney U test (non-parametric alternative to t-test) with comprehensive reporting. |
|
Perform ANOVA analysis with automatic variance checking and appropriate post-hoc tests. |
|
Perform Kruskal-Wallis H-test (non-parametric alternative to one-way ANOVA). |
|
Categorical variables association report using pingouin.chi2_independence |
|
Perform bootstrap analysis on DataFrame columns with parallel and non-parallel modes. |
|
Fit Ordinary Least Squares (OLS) regression. |
|
Fit Robust Linear Model (RLM) using M-estimators. |
|
Fit Generalized Linear Model (GLM). |
|
Fit Quantile Regression (median by default). |
|
Fit an ordinal regression model (Proportional Odds Model). |
|
Fit Linear Mixed Effects Model. |
Advanced feature importance analysis using multiple methods and models. |