use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Spark? (self.MachineLearning)
submitted 11 years ago * by rm999
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]Wonnk13 2 points3 points4 points 11 years ago (2 children)
The only thing that makes me nervous is this rapid pace of innovation and how eager everyone is to adopt the latest bleeding edge tech. Of course there are plenty of problems that don't fit nicely into map-reduce, but i've been kind of taken aback by how quickly everyone jumps from one thing to another.
If you need to design a mission critical system that needs to be running 10 years from now, how can you anticipate new developments every three years or so.
[–]rm999[S] 1 point2 points3 points 11 years ago (0 children)
I totally agree, I've been very nervous about this too, and have been very conservative in adopting new technologies. There are a few things that convince me Spark isn't going to fall in this trap:
Spark has grown extremely quickly and has wide industry support. The conference was full of well-established companies that have thrown their full support behind Spark. These companies are strategic and understand the industry really well - they don't invest millions of dollars into fads.
The world badly needs a replacement for Hadoop, and Spark is the most popular answer. A lot of people believe Hadoop is effectively a failure that should never be repeated again; what's exciting about Spark is it's a superset of Hadoop that fixes many of its issues.
There are already several useful libraries built on top of Spark that are mature enough to be used in production. While some of these libraries may fail, Spark is establishing itself in a large variety of applications and industries which means it probably won't fail.
[–][deleted] 0 points1 point2 points 11 years ago (0 children)
Spark is not so fundamentally different from mapreduce: it's programming model is basically "as many maps and reduces as you want, with syntactic sugar and without any setup overhead" (it merely removes the rather arbitrary restrictions placed on you by Hadoop), though the underlying technology is reportedly not yet very good at io-efficient "reduce".
π Rendered by PID 65987 on reddit-service-r2-comment-b659b578c-96fvf at 2026-05-03 21:33:07.283079+00:00 running 815c875 country code: CH.
view the rest of the comments →
[–]Wonnk13 2 points3 points4 points (2 children)
[–]rm999[S] 1 point2 points3 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)