I’ve been doing EDA for years and got tired of repeating the same code over and over.
So I built myself a Jupyter notebook that:
Automatically loads and summarizes any CSV Highlights missing values and duplicates Shows histograms, count plots, and correlation heatmaps Has an interactive scatter matrix using Plotly
Here’s a quick screenshot: (attach image)
I’m sharing it here because a lot of people ask for EDA templates.
If anyone wants the full version (notebook + sample dataset), I’ve uploaded it to Gumroad. Happy to DM the link. No spam — just trying to share something helpful I built.
submitted by /u/Akakro-1234 to r/learnmachinelearning
[link] [comments]
Laisser un commentaire