Install
pip install analysistoolbox
Requires Python 3.9 or later.
Configure plot style
Add this to the top of any analysis notebook to get clean, consistent charts:
import seaborn as sns
sns.set(style="white", font="Arial", context="paper")
Your first analysis
from analysistoolbox.data_processing import CreateDataOverview
import pandas as pd
df = pd.read_csv('your_data.csv')
CreateDataOverview(dataframe=df, plot_missingness=True)
CreateDataOverview returns a summary with data types, missing value counts, and distribution plots for every column — one function call instead of twenty.
Dependencies
Analysis Tool Box installs these automatically:
numpy,pandas,scipy,scikit-learn— core data stackmatplotlib,seaborn— visualizationstatsmodels,xgboost— modelingpymetalog— Metalog distribution fittingfuzzywuzzy,Levenshtein— entity matchingsympy— symbolic math (Calculus module)geopandas,folium— geospatial (Geospatial module)tableone— clinical summary tables
Next steps
Browse the module reference or jump straight to the Simulations and Probability modules for the intelligence analysis use cases.