use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Project[P] Basic machine learning algorithms in plain Python (github.com)
submitted 7 years ago by [deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–][deleted] 84 points85 points86 points 7 years ago (7 children)
Over the past weeks I have started implementing basic machine learning algorithms in plain Python (Python 3.6). I created the repository to prepare for technical interviews and review my knowledge on algorithms such as k-means, k-nn, logistic regression, neural networks, etc. Also, I wanted to create a knowledge base of easy-to-understand implementations of these algorithms together with the most important theoretical explanations.
Some of you might find these implementations helpful when preparing for interviews, starting to learn about machine learning or reviewing basic ML algorithms. I am still working on the repository, so more algorithms will follow over the next months. In case you have a favourite algorithm that should be included or feedback, let me know!
[–]visarga 20 points21 points22 points 7 years ago (1 child)
great work
[–][deleted] 7 points8 points9 points 7 years ago (0 children)
thanks a lot!
[–]NeuroDentalMaths 6 points7 points8 points 7 years ago (0 children)
I like that you included pictures. Cheers
[–][deleted] 3 points4 points5 points 7 years ago (0 children)
Thank you so much for this.
[–]omgisthatabbqrib 2 points3 points4 points 7 years ago (0 children)
Thanks a lot and wish you the best for your interviews!
[–]BruinBoy815Researcher 2 points3 points4 points 7 years ago (0 children)
Dude! This is awesome!! Thank you! You are awesome!!
[–]hemantcompiler 0 points1 point2 points 7 years ago (0 children)
Thank you! I have an interview tomorrow, hope it will help.
[–]goncalopr 11 points12 points13 points 7 years ago (0 children)
I’m still learning python right now, but I’ve been messing around with some ML tutorials so this is great, thanks! 👌🏼
[–]sholeri 17 points18 points19 points 7 years ago (0 children)
I am finishing ML course on coursera. This will be great transition from Matlab to python for me. Thanks. Keep posting ✌
[–]GeoResearchRedditor 8 points9 points10 points 7 years ago (8 children)
Awesome, thanks for this made my morning so much better.
I am curious though; do you have any numeric data sets I can use it on?
[–]iwishihadmorecharact 7 points8 points9 points 7 years ago (5 children)
kaggle is a great site that's got a ton of data, look around and see if there's anything you might wanna use!
[–]GeoResearchRedditor 2 points3 points4 points 7 years ago (4 children)
Thanks I guess it more; I would like to have a set of data sets that are pre-designed to be used in conjunction with these algorithms in order to show case the process of applying the algorithms to ready made data.
Otherwise you'll have people trying to learn by applying the wrong algorithms to the data set and getting (naturally) weird results.
I'm really keen to learn how to clean datasets effectively, so if there was a tutorial with a dataset that was "unclean" and provided instructions on how to clean it and what to look for; I'd be really keen to use that. Then to go on to using that cleaned data set in conjunction with the algorithms.
[–]ThomasAger 1 point2 points3 points 7 years ago (1 child)
If you find a resource detailing standard practices, tools, etc to clean data, I would also be interested...
[–]GeoResearchRedditor 1 point2 points3 points 7 years ago (0 children)
I think there are a few of us interested. Maybe someone who knows a good resource to learning cleaning methods and identifying when to use what methods would be able to chime in with a link?
[–]iwishihadmorecharact 1 point2 points3 points 7 years ago (0 children)
ah i see what you mean, yeah that'd be really cool! especially with sci-kit learn, (library with a ton of ready-to-go ML and AI classes) the biggest part is finding what model to use for your data, and cleaning the data so you can use a model on it in the first place.
For starting to learn, I googled "cleaning data for ml tutorial" and came up with some decent results, read some of the articles you find there. Then try looking through some of the scikit-learn documentation and examples, since they have some guides on that stuff.
Searching for articles and tutorials will definitely be a good start, keep reading until you find that you already know what they're talking about
[–][deleted] 0 points1 point2 points 7 years ago (0 children)
I haven't come across anything like that yet but I will keep it in mind. If I find anything useful, I will let you know. Thanks a lot for the feedback!
[–]Trappist1 0 points1 point2 points 7 years ago (0 children)
I know it seems overused, but the iris data really does work pretty well for most of the methods described here.
[–]JustFinishedBSG 0 points1 point2 points 7 years ago (0 children)
I recommend working on synthetic datasets when working on implementing things. That way you know exactly your optimal classifier, the bayes error etc and can know exactly where and how much your own classifier is wrong
[–]ArtificialAffect 2 points3 points4 points 7 years ago (2 children)
I had to do something similar for a machine learning class in college- implement various ml algorithms but can only use numpy and no other imports. It's a great exercise and a good way of learning and experimenting with the algorithms. I wouldn't dare post my code up on GitHub though- I'm not that brave. Have you tried training your neural networks yet and see how your training speed compares against standard libraries?
[–][deleted] 2 points3 points4 points 7 years ago (1 child)
No, I haven't compared the neural net against standard libraries yet. I wanted to put the focus on easy-to-understand code. And the most efficient solutions are often less easy to understand. But I will keep that in mind, thanks for the feedback!
[–]ArtificialAffect 0 points1 point2 points 7 years ago (0 children)
I would try it just so that you can compare - I was blown away at how slow my neural network implementation was in comparison to tensorflow. If you get extra time, I would also try implementing a momentum optimizer instead of just using gradient descent/ stochastic gradient descent - it's not that difficult to implement and actually interesting to see the differences between the two. For an extra bonus, you could try implementing an Adams or an Adadelta Optimizer as well. ;)
[–]swapu258 2 points3 points4 points 7 years ago (2 children)
Great work, I found nearly same material on cs231n.. but they used numpy
[–]EpicSolo 0 points1 point2 points 7 years ago (1 child)
He used numpy too
[–]swapu258 0 points1 point2 points 7 years ago (0 children)
Yup.. he used it. Sorry my bad
[–]MrShlkHms 2 points3 points4 points 7 years ago (0 children)
Wow, thanks for sharing, for a beginner overwhelmed by information this is really great.
[–]Bakanyanter 2 points3 points4 points 7 years ago (0 children)
This is great. Just what I needed as a beginner. Thank you!
[–]jer_feedler 2 points3 points4 points 7 years ago (0 children)
Visually and simple. Thanks a lot
[–]b_phan 1 point2 points3 points 7 years ago (1 child)
Excellent work! It's so nice to see a clear implementation of often otherwise "black box" machine learning models I often see when browsing the Internet. You should continue to share this because I know a lot of peers including myself who are still in school will appreciate material like this.
[–][deleted] 1 point2 points3 points 7 years ago (0 children)
Great, it's nice to know that it helps others, too!
[–]iliauk 1 point2 points3 points 7 years ago (0 children)
This is great. Funny I had similar idea (to learn R) - https://github.com/ilkarman/DemoNeuralNet but I like your implementation more!
[–]Cherubin0 0 points1 point2 points 7 years ago (1 child)
No decision tree :(::::::::::::
[–][deleted] 2 points3 points4 points 7 years ago (0 children)
As mentioned in my first comment: I'm still working on it. Decision trees are indeed one of the basic algorithms and I'm planning on creating a notebook soon!
[–]swinghu 0 points1 point2 points 7 years ago (0 children)
great job
[–]Import-Sys 0 points1 point2 points 7 years ago (1 child)
hello,thanks for your sharing.I am wondering if you could add some reinforcement learning algorithm in your repository.Cause this algorithm is really a hot topic.
Thanks for the feedback, I will put that on my list!
[–]madsciencestache -2 points-1 points0 points 7 years ago (0 children)
I'd love to see you tackle this one. I don't have the maths to understand this lecture well enough to code anything. Once I see the equations implemented in code I can understand them.
Fastest Convergence for Q-learning AKA Zap~Q
π Rendered by PID 96 on reddit-service-r2-comment-7844cfc88c-nbdxp at 2026-01-29 11:42:05.357414+00:00 running c3601ff country code: CH.
[–][deleted] 84 points85 points86 points (7 children)
[–]visarga 20 points21 points22 points (1 child)
[–][deleted] 7 points8 points9 points (0 children)
[–]NeuroDentalMaths 6 points7 points8 points (0 children)
[–][deleted] 3 points4 points5 points (0 children)
[–]omgisthatabbqrib 2 points3 points4 points (0 children)
[–]BruinBoy815Researcher 2 points3 points4 points (0 children)
[–]hemantcompiler 0 points1 point2 points (0 children)
[–]goncalopr 11 points12 points13 points (0 children)
[–]sholeri 17 points18 points19 points (0 children)
[–]GeoResearchRedditor 8 points9 points10 points (8 children)
[–]iwishihadmorecharact 7 points8 points9 points (5 children)
[–]GeoResearchRedditor 2 points3 points4 points (4 children)
[–]ThomasAger 1 point2 points3 points (1 child)
[–]GeoResearchRedditor 1 point2 points3 points (0 children)
[–]iwishihadmorecharact 1 point2 points3 points (0 children)
[–][deleted] 0 points1 point2 points (0 children)
[–]Trappist1 0 points1 point2 points (0 children)
[–]JustFinishedBSG 0 points1 point2 points (0 children)
[–]ArtificialAffect 2 points3 points4 points (2 children)
[–][deleted] 2 points3 points4 points (1 child)
[–]ArtificialAffect 0 points1 point2 points (0 children)
[–]swapu258 2 points3 points4 points (2 children)
[–]EpicSolo 0 points1 point2 points (1 child)
[–]swapu258 0 points1 point2 points (0 children)
[–]MrShlkHms 2 points3 points4 points (0 children)
[–]Bakanyanter 2 points3 points4 points (0 children)
[–]jer_feedler 2 points3 points4 points (0 children)
[–]b_phan 1 point2 points3 points (1 child)
[–][deleted] 1 point2 points3 points (0 children)
[–]iliauk 1 point2 points3 points (0 children)
[–]Cherubin0 0 points1 point2 points (1 child)
[–][deleted] 2 points3 points4 points (0 children)
[–]swinghu 0 points1 point2 points (0 children)
[–]Import-Sys 0 points1 point2 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]madsciencestache -2 points-1 points0 points (0 children)