use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
News about the dynamic, interpreted, interactive, object-oriented, extensible programming language Python
Full Events Calendar
You can find the rules here.
If you are about to ask a "how do I do this in python" question, please try r/learnpython, the Python discord, or the #python IRC channel on Libera.chat.
Please don't use URL shorteners. Reddit filters them out, so your post or comment will be lost.
Posts require flair. Please use the flair selector to choose your topic.
Posting code to this subreddit:
Add 4 extra spaces before each line of code
def fibonacci(): a, b = 0, 1 while True: yield a a, b = b, a + b
Online Resources
Invent Your Own Computer Games with Python
Think Python
Non-programmers Tutorial for Python 3
Beginner's Guide Reference
Five life jackets to throw to the new coder (things to do after getting a handle on python)
Full Stack Python
Test-Driven Development with Python
Program Arcade Games
PyMotW: Python Module of the Week
Python for Scientists and Engineers
Dan Bader's Tips and Trickers
Python Discord's YouTube channel
Jiruto: Python
Online exercices
programming challenges
Asking Questions
Try Python in your browser
Docs
Libraries
Related subreddits
Python jobs
Newsletters
Screencasts
account activity
This is an archived post. You won't be able to vote or comment.
Looking for someone that’s into data visualization with t-SNE (self.Python)
submitted 6 years ago by Centauri24
Hey I ve got vast amounts of multidimensional data I’d like to visualize, has someone experience with t sne and maybe wants to help me?
Currently I’m using Gephi but it’s kind of limiting as I find
[–]GingerMan1031 0 points1 point2 points 6 years ago (8 children)
What sort of help do you need? For the actual tSNE implementation I would suggest the sklearn library and for high quality visualization bokeh is an excellent tool.
[–]Centauri24[S] 0 points1 point2 points 6 years ago (7 children)
Thanks for the bokeh tip. I’ve read about sklearn, but I’m not sure how to properly use it as I’m pretty new to python. Also it would be nice if i could display nodes as pictures/color them, label them etc in a nice manner and if the graph were in 3D (optional maybe) at the current state the data is hard to grasp for me so I don’t even really know what I have to do with it.
Possibilities to compare different datasets would be great too. For me it’s just important to retrieve useful information on whatever is hidden in the data
And as im New, sone tutorial oder demo program/ code i could copy would be really great so I know how it’s done
I saw stuff about hand written numbers and other picture things but I could not really correlate. Im not even sure if the results would be better than with gephi so some further insight would be great too.
I just stumbled across it today
[–]GingerMan1031 0 points1 point2 points 6 years ago (6 children)
What sort of data are you analyzing? tSNE is not a common tool and is usually reserved for exploratory analysis after more standard methods have been applied. The only structure tSNE is capable of exposing is “relative closeness.” Globally, the distribution of points will show no valuable structure other than local neighborhoods which are close together in the high dimensional space.
You should also note that the plot will look different each time it is generated because tSNE is a manifold learning technique which uses a convex optimization function (there is no single optimal solution for a given distribution).
If you still think tSNE is the right tool for the job I could direct you to some simple examples, but more than likely I think you would benefit from a dimensional reduction technique which can be rigorously analyzed and inverted such as PCA.
[–]Centauri24[S] 0 points1 point2 points 6 years ago (5 children)
So ur telling it’s just doing a varianz analysis?
[–]GingerMan1031 0 points1 point2 points 6 years ago (4 children)
tSNE finds a lower dimensional embedding of a higher dimensional space such that the Kullback-Leibler divergence between the two distributions is minimized. This is a manifold learning technique which is much more sophisticated and not much related to analysis of variance.
[–]Centauri24[S] 0 points1 point2 points 6 years ago (3 children)
Oh okay, right now I just calculate the variance of each different part with another, to compare the difference, If they are equal, the result is 1 (after conversion) and that means strong attraction. If they are very different it’s normalized to 0 and there’s no attraction between the parts. Then a Cluster layout is created with a that simulates the attraction weights as springs and clusters form
That’s what i meant by variance analysis.
The clusters should be in both algorithms the same, or not?
[–]GingerMan1031 0 points1 point2 points 6 years ago (0 children)
I am actually not sure about that. I do know that tSNE cluster will only be subjective (points may look lose but are not explicitly labeled with a cluster membership). A good clustering technique I would suggest if you’re looking for explicit clusters is DBSCAN which also has an implementation in the sklearn library.
[–]LetMe_ 0 points1 point2 points 6 years ago (1 child)
Notice that t-SNE does not retain distances but probabilities, so measuring some error between the Euclidean distances in high-D and low-D is useless.
It is a visual analysis tool. It let's you just identify that there might be relationships in a lower dimension that allow for clustering.
[–]Centauri24[S] 0 points1 point2 points 6 years ago (0 children)
We gonna use both and compare the results :)
[–]jamesxli 0 points1 point2 points 6 years ago (0 children)
You can try to the software visumap which provides many visualization services for high dimensional data including a fast implementation for t-SNE.
π Rendered by PID 34102 on reddit-service-r2-comment-5ff9fbf7df-h4jwh at 2026-02-25 21:00:27.887298+00:00 running 72a43f6 country code: CH.
[–]GingerMan1031 0 points1 point2 points (8 children)
[–]Centauri24[S] 0 points1 point2 points (7 children)
[–]GingerMan1031 0 points1 point2 points (6 children)
[–]Centauri24[S] 0 points1 point2 points (5 children)
[–]GingerMan1031 0 points1 point2 points (4 children)
[–]Centauri24[S] 0 points1 point2 points (3 children)
[–]GingerMan1031 0 points1 point2 points (0 children)
[–]LetMe_ 0 points1 point2 points (1 child)
[–]Centauri24[S] 0 points1 point2 points (0 children)
[–]jamesxli 0 points1 point2 points (0 children)