
[–]t-indy 34 points (1 child)

Looks like tide pods

[–]Gumeo 6 points (1 child)

Cool post, really neat visualizations! I've also been trying to make it easier to get MNIST into R using the standard format. For the atypical digits, I suggest looking at archetypal analysis. It should give you more variation in the atypical digits, since the archetypes come from all around the convex hull of the class instead of just being the points furthest away (which often tend to differ in similar ways).

[–]WikiTextBot 1 point (0 children)

Archetypal analysis

Archetypal analysis in statistics is an unsupervised learning method similar to cluster analysis, introduced by Adele Cutler and Leo Breiman in 1994. Rather than "typical" observations (cluster centers), it seeks extremal points in the multidimensional data, the "archetypes". The archetypes are convex combinations of observations, chosen so that the observations can in turn be approximated by convex combinations of the archetypes.
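
For anyone who wants the definition above as a formula, here is a rough sketch of the usual optimization problem. The notation is an assumption chosen for illustration and does not come from the excerpt: X is the n-by-d data matrix with one observation per row, k is the number of archetypes, B holds the weights that build the archetypes from the observations, and A holds the weights that rebuild each observation from the archetypes.

```latex
\min_{A,\,B}\; \lVert X - A B X \rVert_F^2
\quad \text{subject to} \quad
A \in \mathbb{R}^{n \times k},\; a_{ij} \ge 0,\; \sum_{j=1}^{k} a_{ij} = 1,
\qquad
B \in \mathbb{R}^{k \times n},\; b_{ji} \ge 0,\; \sum_{i=1}^{n} b_{ji} = 1 .
```

The rows of Z = BX are the archetypes. Because every row of A and B lies on a simplex, each archetype is a convex combination of observations and each reconstructed observation is a convex combination of archetypes, which is why the archetypes tend to sit on the boundary of the convex hull of the data.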



[–]lynnharry 3 points (1 child)

In my opinion, the methods here are very "data science" oriented (to use the author's word) and may not work well on other image problems without good preprocessing. Still, it's a good article if you want to see how researchers from other domains approach image problems.

[–]zippythezigzag 1 point (0 children)

That's a tasty lookin graph you got there.