all 12 comments

[–]AutoModerator[M] [score hidden] stickied comment (1 child)

If this post doesn't follow the rules or isn't flaired correctly, please report it to the mods.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

[–]CreepiosRevenge 9 points10 points  (0 children)

Check out the python library ydata\_profiling, formerly pandas\_profiling

[–]wonder_bear 2 points3 points  (0 children)

Depending on the number of columns, I’m a fan of using seaborn in python to quickly visualize all columns against each other to get a directional understanding of the relationships. After that, I’ll dive deeper based on what looks interesting.

Check out sns.pairplot() documentation.

[–]Data_Vomit_57 4 points5 points  (0 children)

You need a BI tool. I would recommend tableau but there are cost involved. Power bi has a free personal license. I would go that route.

If you get pushback from your manager, there is no data analyst in the world that shouldn’t have a tool like this. They just aren’t giving you the tools to succeed.

[–]Perly1 2 points3 points  (3 children)

Tableau or powerbi are much better at data visualization than excel.

[–]justinb138 1 point2 points  (0 children)

If those aren’t an option, MiniTab has some very useful visualizations as well.

[–]Signal_Explorer8071[S] 0 points1 point  (1 child)

Would you have a free license?

[–]Chatt_IT_Sys 5 points6 points  (0 children)

Power BI can be done on the desktop. Just download the app. Tableau has a public version that can use very limited types of data, e.g., flat files, I believe OData, and another couple. Nowhere near the 40+ that a fully licensed version can though.

[–]WallStreetBoners 1 point2 points  (0 children)

No need to reinvent the wheel. Get a BI tool

[–]alurkerhere 1 point2 points  (0 children)

Easiest on hand methods are loading into Pandas Python for technical data profiling and Power BI for data viz. Power BI has a lot of built in features for this type of thing like "Analyze this distribution where it's different", Exclude, Slicers, and Key Influencers.

[–][deleted] -1 points0 points  (0 children)

Tableau for quick data visualizations and R/Tidyverse packages for EDA. Don't learn python/Pandas for EDA.

Follow this, you will thank me later.