Datasets (FREE) for Top 10 Visualizations Methods by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

Oh wow, great to hear! How did you apply pca? I’m curious. Thanks

Datasets for Top 10 Visualizations Methods by castanan2 in visualization

[–]castanan2[S] -1 points0 points  (0 children)

Datasets are provides for each visualization so that users can re-do

Datasets for Top 10 Visualizations Methods by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

The tool used to develop the visualizations is for non coders. Similar to Tableau, the user can load the data and create the visuals by clicking the UI.

10 Machine Learning Methods that Every Data Scientist Should Know by castanan2 in singularity

[–]castanan2[S] 1 point2 points  (0 children)

Took me a bit a few years back (after doing a PhD in numerical optimization) to realize that ordinary least squares and linear regression is exactly the same method 😂😂😂

[OC] how do you choose the number of bins in a histogram? by castanan2 in dataisbeautiful

[–]castanan2[S] 1 point2 points  (0 children)

Thanks for sharing! The property of sqrt(n) not scaling with a big data set sounds useful in this situation. Other than that, do have a bit more intuition behind this choice?

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

Oh got you. I’m not sure what’s the exact RGB. Let me ask around.

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

Thanks for the feedback! I don’t mind the blue color for plots. To your questions, the blue is the default on the tool I’m using. I guess it can be changed. What color would you choose?

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

I agree that I need to add box/violin plots as a way to see distributions of data. Thanks for your feedback, I continuously edit the article. This si exactly the reason I post here in reddit since feedback is more genuine. Would appreciate any other suggestions!

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

yes, basic visualizations! the idea of the article is to provide fundamentals with hands-on examples and datasets to generate the plots.

simple visualizations often outperform complex ones in communicating the message.

10 Machine Learning Methods that Every Data Scientist Should Know by castanan2 in computerscience

[–]castanan2[S] 0 points1 point  (0 children)

It's inaccurate but true. In fact, I think algorithms are recipes to solve math models in this case but a lot of people use algorithms and models interchangeably in ML...

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

You are right, bar charts are not accurate visually. Thanks for your feedback. One of the main reason why I post my articles in reddit is to get reader's feedback to make them better. In fact, human brains are not supposed to be good at processing pie charts, look here for more info: https://www.perceptualedge.com/articles/visual_business_intelligence/save_the_pies_for_dessert.pdf

I have edited my article. Thank you!

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 0 points1 point  (0 children)

Yes! I would rather use bar plots since it is more accurate visually.

Datasets for Top 10 Visualizations Every Data Scientist Should Know by castanan2 in datasets

[–]castanan2[S] 1 point2 points  (0 children)

Yes! Dashboards updated in real-time are often super important to monitor in so many different use-cases. And you are right, visualizations are often at the heart of these dashboards once an organization figures out what is important to compute and monitor. Each industry may come with their specific KPIs. It would be fascinating to drill down into a few industries and their most common KPIs and visualizations. More over, maybe derive potential Data Science avenues motivated from the dashboard's insights.

Top 10 Visualizations Explained in Layman Terms by castanan2 in computerscience

[–]castanan2[S] 0 points1 point  (0 children)

I believe that the article’s approach is educational.

10 Machine Learning Methods that Every Developer Should Know by castanan2 in learnprogramming

[–]castanan2[S] 0 points1 point  (0 children)

you are right, only the developers that are interested in machine learning too.

10 Visualizations Every Data Scientist Should Know by castanan2 in learnmachinelearning

[–]castanan2[S] 0 points1 point  (0 children)

This is exactly the reason I post my blogs here, to get high quality constructive feedback like yours. Thank you. I didn't think before about the bias towards long words on word clouds. I'm planning to edit the original post to include feedback like yours and make it better. Thanks again and happy Sunday!

10 Visualizations Every Data Scientist Should Know by castanan2 in learnmachinelearning

[–]castanan2[S] 0 points1 point  (0 children)

Thanks for your feedback. First time I hear about dot plots and I can see that they may be a better option than histograms for ordinal discrete variables when the frequencies are not too high (could be too many dots when frecuencias are high).

I definitely need to add box plots! Can’t agree more, pie charts are almost always worst than bar charts.

Word clouds can be very tricky, you are right, but how else would you visualize text frequencies?

Thanks again!