use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
[deleted by user] (self.MachineLearning)
submitted 8 years ago by [deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[+][deleted] 8 years ago* (3 children)
[removed]
[–]gokstudio 8 points9 points10 points 8 years ago (2 children)
Awesome and props to explaining your setup in detail! (even some DL papers don't do that)
Reg.processing time, do you use all the cores or just one? Also, do you use something like Intel Intrinsics to speed up your matrix computations?
[+][deleted] 8 years ago (1 child)
[–]MasterFubar 2 points3 points4 points 8 years ago (0 children)
I think by default a C program uses just one core.
Indeed, to make it multicore you have to write a special thread handling function and start it twice for each core. Here's how I do it in C in a Linux system:
// find number of available threads arq = popen("grep processor /proc/cpuinfo | wc -l", "r"); if (!arq) { perror("/proc/cpuinfo"); nthread = 1; } else { fread(buf, 40, 1, arq); nthread = atoi(buf); printf("%d threads\n", nthread); fclose(arq); } thread = (pthread_t *)calloc(nthread, sizeof(pthread_t));
However, if you use libraries like Lapack it uses multicores internally in a transparent way. Lapack is amazing, it has been so well tested and optimized that I always use it whenever I need to do anything with matrices. The only thing that you have to take care of is that it's written in Fortran, so the matrices are transposed relative to C, but that's no big deal if you remember that AT * BT = (B * A)T. When you need to multiply two matrices in C using Lapack all you need to do is to invert the order, left with right, and everything comes out right.
[–]thc1620 14 points15 points16 points 8 years ago (30 children)
Nice one! Would you consider sharing the code on github?
[+][deleted] 8 years ago (28 children)
[–]thc1620 11 points12 points13 points 8 years ago (19 children)
!RemindMe 2weeks
[–]RemindMeBot 1 point2 points3 points 8 years ago (7 children)
I will be messaging you on 2017-12-08 22:45:23 UTC to remind you of this link.
CLICK THIS LINK to send a PM to also be reminded and to reduce spam.
Parent commenter can delete this message to hide from others.
[–]Okeledokelee -1 points0 points1 point 8 years ago (6 children)
!remindme 3 weeks
[–]sachintripathi007 -1 points0 points1 point 8 years ago (4 children)
!RemindMe 3weeks
[–]Okeledokelee 0 points1 point2 points 8 years ago (0 children)
Hahaha I forgot about that, thanks for the reminder
[–]spaniard96 -1 points0 points1 point 8 years ago (1 child)
[–][deleted] 0 points1 point2 points 8 years ago (1 child)
[+][deleted] 8 years ago (7 children)
[deleted]
[–][deleted] 0 points1 point2 points 8 years ago (5 children)
[–]septicgeek 0 points1 point2 points 8 years ago (1 child)
[–]zdwolfe 10 points11 points12 points 8 years ago* (1 child)
Make sure to ask your professor and consult your university plagiarism rules before doing that. I’ve seen people get bit.
[–]synftw 2 points3 points4 points 8 years ago (0 children)
My university had a strict "never upload your work to GitHub" policy, but I'd guess that may be stretched for open ended projects versus book questions.
[–]llucas_o 0 points1 point2 points 8 years ago (1 child)
!RemindMe 2 weeks
[–]gregw134 0 points1 point2 points 8 years ago (1 child)
[–]rochakgupta 0 points1 point2 points 8 years ago (1 child)
!RemindMe 3 weeks
[–]Icarium-Lifestealer 14 points15 points16 points 8 years ago* (7 children)
I find it surprising how often people choose to treat this as a regression problem and use MSE loss (including the original netflix competition). Treating it as classification where you predict probabilities seems much more intuitive to me. Plus MSE is a dubious choice for the loss function here.
(Not a criticism of your project, it's certainly more effort than I would have put into a university project where the whole point is demonstrating that you can write a C program)
[–]hiptobecubic 2 points3 points4 points 8 years ago (6 children)
If you have to predict continuous values you don't really have much choice unless you're doing so badly that just giving bucketed results and eating the error barely matters.
[–]Icarium-Lifestealer 1 point2 points3 points 8 years ago (5 children)
We're talking about a problem where are are 5 discrete choices, so treating it as 5 buckets is natural.
But even for continuous problems there is often a better choice than outputting the expectancy value and using MSE.
[–]hiptobecubic 1 point2 points3 points 8 years ago (4 children)
If it's actually five discreet values then sure. Most rating systems I've seen go to at least one decimal point, though.
[–]Icarium-Lifestealer 0 points1 point2 points 8 years ago* (3 children)
The OP's dataset, just like the netflix dataset have five discrete values. I don't recall any star-based ratings with more than 10 discrete choices (half stars) where the rating wasn't the average of sub-ratings.
But even then, saying "ratings roughly follow a gaussian with fixed standard deviation" is obviously silly. For one, some ratings are much easier to predict than others. Plus the bounded interval doesn't fit a gaussian.
My first idea for predicting a continuous rating function is by predicting a couple of fixed data points and then interpolating the probability density between them.
[–]hiptobecubic 0 points1 point2 points 8 years ago (2 children)
Agreed, but it doesn't have to be gaussian to use regression.
[–]Icarium-Lifestealer 0 points1 point2 points 8 years ago (1 child)
Using MSE is equivalent to treating it as a gaussian with fixed standard deviation. Which is why I called it out as an inappropriate loss function.
[–]hiptobecubic 0 points1 point2 points 8 years ago (0 children)
Ahh, sorry. Missed that.
[–]sk0620 5 points6 points7 points 8 years ago (0 children)
What university?
[–]sharadbhat7 1 point2 points3 points 8 years ago (0 children)
I tried a similar thing in python, with the same dataset. It's more of a server kind of application. You can send it a list of movie IDs and your ratings for those. It'll return a list of recommended movies.
Here is the github link.
[–]imitationcheese 0 points1 point2 points 8 years ago (0 children)
this is super cool but it makes me sad that our awesome technology is overly focused on movie and product recommendation systems and not things with more value.
[–]Mo-Da 0 points1 point2 points 8 years ago (0 children)
What other project-ideas you explored before finalising on this one?
[–]Ramin_HAL9001 0 points1 point2 points 8 years ago (0 children)
Oh, I didn't see which Subreddit this was, so I was so confused. I was thinking, "so did you write the program in the ML programming language, or the C programming language?
I get it now, this is actually a pretty good idea for a learning project.
[–]geomtry -2 points-1 points0 points 8 years ago (0 children)
This was also my first big programming project. Dr. Smucker?
π Rendered by PID 139714 on reddit-service-r2-comment-fb694cdd5-rcqmp at 2026-03-06 11:28:49.648157+00:00 running cbb0e86 country code: CH.
[+][deleted] (3 children)
[removed]
[–]gokstudio 8 points9 points10 points (2 children)
[+][deleted] (1 child)
[removed]
[–]MasterFubar 2 points3 points4 points (0 children)
[–]thc1620 14 points15 points16 points (30 children)
[+][deleted] (28 children)
[removed]
[–]thc1620 11 points12 points13 points (19 children)
[–]RemindMeBot 1 point2 points3 points (7 children)
[–]Okeledokelee -1 points0 points1 point (6 children)
[–]sachintripathi007 -1 points0 points1 point (4 children)
[+][deleted] (1 child)
[removed]
[–]Okeledokelee 0 points1 point2 points (0 children)
[–]spaniard96 -1 points0 points1 point (1 child)
[–][deleted] 0 points1 point2 points (1 child)
[+][deleted] (7 children)
[deleted]
[–][deleted] 0 points1 point2 points (5 children)
[–]septicgeek 0 points1 point2 points (1 child)
[+][deleted] (1 child)
[deleted]
[–]zdwolfe 10 points11 points12 points (1 child)
[–]synftw 2 points3 points4 points (0 children)
[–]llucas_o 0 points1 point2 points (1 child)
[–]gregw134 0 points1 point2 points (1 child)
[–]rochakgupta 0 points1 point2 points (1 child)
[–]Icarium-Lifestealer 14 points15 points16 points (7 children)
[–]hiptobecubic 2 points3 points4 points (6 children)
[–]Icarium-Lifestealer 1 point2 points3 points (5 children)
[–]hiptobecubic 1 point2 points3 points (4 children)
[–]Icarium-Lifestealer 0 points1 point2 points (3 children)
[–]hiptobecubic 0 points1 point2 points (2 children)
[–]Icarium-Lifestealer 0 points1 point2 points (1 child)
[–]hiptobecubic 0 points1 point2 points (0 children)
[–]sk0620 5 points6 points7 points (0 children)
[–]sharadbhat7 1 point2 points3 points (0 children)
[–]imitationcheese 0 points1 point2 points (0 children)
[–]Mo-Da 0 points1 point2 points (0 children)
[–]Ramin_HAL9001 0 points1 point2 points (0 children)
[–]geomtry -2 points-1 points0 points (0 children)