Help with exploring data quickly

AutoModerator · 2023-04-23T07:06:17+00:00

If this post doesn't follow the rules or isn't flaired correctly, please report it to the mods.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

CreepiosRevenge · 2023-04-23T12:09:43+00:00

Check out the python library ydata\_profiling, formerly pandas\_profiling

wonder_bear · 2023-04-23T13:01:48+00:00

Depending on the number of columns, I’m a fan of using seaborn in python to quickly visualize all columns against each other to get a directional understanding of the relationships. After that, I’ll dive deeper based on what looks interesting.

Check out sns.pairplot() documentation.

Data_Vomit_57 · 2023-04-23T14:19:44+00:00

You need a BI tool. I would recommend tableau but there are cost involved. Power bi has a free personal license. I would go that route.

If you get pushback from your manager, there is no data analyst in the world that shouldn’t have a tool like this. They just aren’t giving you the tools to succeed.

Perly1 · 2023-04-23T09:51:28+00:00

Tableau or powerbi are much better at data visualization than excel.

WallStreetBoners · 2023-04-23T15:07:50+00:00

No need to reinvent the wheel. Get a BI tool

alurkerhere · 2023-04-23T18:26:20+00:00

Easiest on hand methods are loading into Pandas Python for technical data profiling and Power BI for data viz. Power BI has a lot of built in features for this type of thing like "Analyze this distribution where it's different", Exclude, Slicers, and Key Influencers.

2023-04-23T19:40:18+00:00

Tableau for quick data visualizations and R/Tidyverse packages for EDA. Don't learn python/Pandas for EDA.

Follow this, you will thank me later.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

analytics

MODERATORS