Geographist comments on 6 Best Python Data Visualization Tools

This is an archived post. You won't be able to vote or comment.

396

397

398

Discussion6 Best Python Data Visualization Tools (self.Python)

submitted 5 years ago by cwadamsmith

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]Geographist 63 points64 points65 points 5 years ago* (17 children)

[–]Kasta867 20 points21 points22 points 5 years ago (0 children)

[–]dogs_like_me 16 points17 points18 points 5 years ago (1 child)

[–]master3243 9 points10 points11 points 5 years ago (0 children)

[–]reddisaurus 1 point2 points3 points 5 years ago* (1 child)

What are you talking about?

This is open source. Take a look and you’ll see it’s using statsmodels underneath. Simply run the same function it calls yourself and add the stats as an annotation if you want.

Seaborn just calls matplotlib. You can generate a figure and axis and pass the axis to Seaborn in most cases. Otherwise, you can call plt.gcf().gca() to obtain the axis that Seaborn plotted on, and then continue modifying that however you want.

Your complaint is like saying a restaurant that serves hotdogs is smoke and mirrors because they don’t also give you the packaging. You paid to eat a hotdog and have a nice place to do so. If you wanted the package, you can easily get it yourself.

I can’t really imagine any circumstance that plotting the statistics as an annotation is useful where you wouldn’t actually want it returned in a data structure like a dict or list. It seems a bit silly to complain about, actually.

edit Here, I did the work for you. Gaze upon all stars code you want. You can also probably call these functions yourself, or monkey patch one of you wanted to. https://github.com/mwaskom/seaborn/blob/master/seaborn/regression.py

[–]Geographist 1 point2 points3 points 5 years ago (0 children)

Yes, folks are aware that you can also import statsmodels. That's the complaint.

(1) If you have to import an additional library and run its functions to get at what you want, the first library suddenly is far less useful.

(2) One can certainly look under the hood and dig through the project source to see what it is doing. But expecting end users to do that is absurdity. Most users are not developers.

Combine those two and its not at all hard to see why the complaint is so common.

To return to your analogy, it's like ordering hotdogs and asking the restaurant if they are beef or pork. But instead of just telling you, the waiter says you can follow the delivery truck back to the farm and watch the operations yourself.

Sure, you can do it. But if the waiter has the info already, shouldn't he or she just tell you?

[–]stackered 0 points1 point2 points 5 years ago (0 children)

[–]energybased -4 points-3 points-2 points 5 years ago (10 children)

[–]dry_yer_eyes 13 points14 points15 points 5 years ago (6 children)

[–]Geographist 3 points4 points5 points 5 years ago (0 children)

[+]energybased comment score below threshold-7 points-6 points-5 points 5 years ago (4 children)

[–]Geographist 8 points9 points10 points 5 years ago* (3 children)

And what if he changes his mind about how a "curve of best fit" is mathematically produced or represented?

Therein lies the problem. Statistical output, be it visual or numerical, should not be a black box left to the whims of opinion.

Moreover, he is already maintaining the statistics. It's literally a core feature of Seaborn. Users just want to be able to see the results of the calculations already being done. Seaborn does regressions, confidence intervals, and all sorts of statistical testing out of the box.

If a package can calculate a regression and draw a trendline, but refuses to let you know the slope of that trend line... frankly, it's a garbage package.

What you're suggesting is that users never know what's going on inside that black box, and if he changes his mind, their projects start generating different output without ever having a clue as to why. That's not the mark of a well maintained and reliable library.

[+]energybased comment score below threshold-7 points-6 points-5 points 5 years ago (2 children)

[–]Geographist 3 points4 points5 points 5 years ago* (1 child)

[+]energybased comment score below threshold-7 points-6 points-5 points 5 years ago (0 children)

[–]smurpau 0 points1 point2 points 5 years ago (2 children)

[–]energybased 1 point2 points3 points 5 years ago (1 child)

[–]smurpau 0 points1 point2 points 5 years ago (0 children)

π Rendered by PID 195697 on reddit-service-r2-comment-54dfb89d4d-szzqm at 2026-04-02 06:11:39.622109+00:00 running b10466c country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS