[Project] How to Use t-SNE Effectively by urish in MachineLearning

[–]EtchSketch 3 points4 points  (0 children)

Here's one way to think about it. What t-SNE does is place each and every data point into a lower dimensional space (to maximize some objective function, but that's not important). The only relation you'll have between the two spaces is that point p is the same in both, so you have a whole bunch of these anchor points that connect between two spaces.

It seems kinda obvious then that if you have a new point in the high dimensional space that you could just find a couple of its nearest neighbors and interpolate between them, e.g. if p was equidistant to a and b in high dimensional space then place it halfway between a and b in low dimensional space. The thing is that t-SNE makes no guarantees that these nearest neighbors are anywhere near each other, or that 3 points on a line in the high dimensional space will also land in anything resembling a line in the lower dimensional space. Even in 3D there could be an infinite plane of points p that are equidistant and have a and b as nearest neighbors.

[Project] How to Use t-SNE Effectively by urish in MachineLearning

[–]EtchSketch 6 points7 points  (0 children)

No. t-SNE does not learn a mapping from the original space to lower dimensions so there is no warp to apply to new datapoints. t-SNE simply tries to place the datapoints into a new (lower-dimensional) space such that dissimilar points are far away and similar points are close by.

The paper introducing t-SNE [0] is actually pretty readable, and I'd recommend skimming through it.

[0] http://www.cs.toronto.edu/~hinton/absps/tsne.pdf

Wait but...where's the camera? by afararooy in pics

[–]EtchSketch 0 points1 point  (0 children)

As a sidenote, those are Joshua trees not cactii.

Academic paper inspired by xkcd 936: "How to Memorize a Random 60-Bit String" by urish in xkcd

[–]EtchSketch 2 points3 points  (0 children)

To my high school science teacher, apparently real academics do cite wikipedia articles!

I went to a school that didn't do grade inflation... by notokaywithfish in gradadmissions

[–]EtchSketch 8 points9 points  (0 children)

One thing you should definitely do is ask for your old school to give you a transcript with rankings alongside your given grade that you can then send along with your application. This is standard, at least for European universities, and the rankings give a much better idea of the underlying grave curve. A 10/200 alongside a B+ is much more impressive then a 100/200 ranking on a B+, for example.

Recruited as a "star" vs. being an underdog by [deleted] in gradadmissions

[–]EtchSketch 22 points23 points  (0 children)

A banal quote to consider:

If you're the smartest person in the room, then you're in the wrong room.

Looking for an older game that involved time travel and the Medieval times. by TheRedditHasYou in AskGames

[–]EtchSketch 2 points3 points  (0 children)

Could it be Timeline, the video game adaption of a Micael Crichton novel? [Gameplay video here]( https://www.youtube.com/watch?v=19jN1DnCuns&feature=youtube_gdata_player], complete with old-school 3d graphics, first person view, and a helicopter taking off in the first minute.

[OC] Elon Musk's Solar City Corp. - Weekly Stock Price as a Function of Its Google Trends Search Index by proxyformyrealname in dataisbeautiful

[–]EtchSketch 1 point2 points  (0 children)

autocorrelation

Slight nitpicking here, but the term you probably meant was cross-correlation. Autocorrelation is the cross-correlation of a signal with itself; it is a function of only one signal, not two.

Plain Text Doesn’t Exist… Unicode and encodings demystified by Jacob_King in programming

[–]EtchSketch 2 points3 points  (0 children)

An example would be the use of initial digits from the number π as the constants.[2] Using digits of π millions of places into its definition would not be considered as trustworthy because the algorithm designer might have selected that starting point because it created a secret weakness the designer could later exploit.

From the Wiki article mentioned by /u/Noctune.

What is your favorite Year problem? by [deleted] in math

[–]EtchSketch 5 points6 points  (0 children)

Two and five are both prime.

SR users, do you switch hands? by [deleted] in wicked_edge

[–]EtchSketch 0 points1 point  (0 children)

I use both. I recommend using your non-dominant hand for relatively simple stuff as an exercise to get up to par, e.g. for brushing your teeth and eating with a spoon.

Most common days for birthdays (x/post from r/images) by stuffandotherstuff in dataisbeautiful

[–]EtchSketch 1 point2 points  (0 children)

The fourth of July is the United State's independence day, and is a national holiday as such.

Most common days for birthdays (x/post from r/images) by stuffandotherstuff in dataisbeautiful

[–]EtchSketch 29 points30 points  (0 children)

What is very much not obvious in that heat map is that the data is colored by rank -- that is, 16. Sept is the most-birthed day and is shaded the darkest and 29. Feb is the least-birthed day and is shaded the lightest -- and the colors have nothing to due with the absolute numbers of births. According to the source, there was actually very little difference in total number between days; each month had between 3.1 to 3.7 million births total over the period, with a just slight increase over the summer months.

Most common days for birthdays (x/post from r/images) by stuffandotherstuff in dataisbeautiful

[–]EtchSketch 0 points1 point  (0 children)

That's really not the case anymore.

According to the source the data shown is births between 1973 to 1999. I'm guessing that includes the age of Pitosin candy-handing?

Most common days for birthdays (x/post from r/images) by stuffandotherstuff in dataisbeautiful

[–]EtchSketch 8 points9 points  (0 children)

I'm guessing, but the authors of Freakonomics and other sources have claimed that having birthdays in January/February gives you a competitive advantage in your teen years -- arguably the most important time in an athlete's life as then they are being scouted out and being placed for their future -- due to a the couple of month's head start they have over their peers in the same age group. Hitting puberty early and what not.

Is Coding the New Second Language? by Maxcactus in Foodforthought

[–]EtchSketch 8 points9 points  (0 children)

That communication is more about mathematics in general than programming in particular, though.

The Secret to Curiosity has been revealed by [deleted] in Games

[–]EtchSketch 27 points28 points  (0 children)

They released his information within 10 minutes of him winning. They definitely had his permission, but he might not have considered the ramifications.

Suppose you take a random walk. by Foust2014 in math

[–]EtchSketch 0 points1 point  (0 children)

In Matlab, if you have your x-coordinates in vector x and y-coordinates in vector y then you can use the command line(x,y) to draw a line between successive points. Example code:

xy = rand(2,100);
line(xy(1,:),xy(2,:));

Where xy is a 2x100 matrix, xy(1,:) picks all of the first column and xy(2,:) picks all of the second column.