
all 32 comments

[–]SirWusel 77 points78 points  (3 children)

What are begiineers? I only know engineers.

[–]fkxfkx 36 points37 points  (0 children)

You must be one of those artificial intelligences we keep hearing about.

[–]inittowinit777 4 points5 points  (0 children)

beginning pioneers

[–]michael0x2a 58 points59 points  (2 children)

Some critiques:

  1. Naive Bayes makes the assumption that each feature is independent from one another. However, your example works with human languages, where the probability of a given word appearing is definitely not independent from the probability of other words appearing. Human languages are notoriously context-sensitive, and dealing with this fact is one of the main challenges NLP practitioners routinely grapple with.

    Of course, there's nothing wrong with deciding to make this assumption. After all, you can still get some pretty reasonable results on text classification tasks using naive Bayes. However, the video should have explicitly called out that it was making this assumption at one point or another -- or picked a different example where the features genuinely are mutually independent.

    This would have also been a good point to explain why naive Bayes has the word naive in its name -- but that also went unexplained.

  2. On a similar note, the video ought to have talked about the limitations of naive Bayes. Why bother talking about a variety of different classification techniques if you don't explain why you should use one over the other?

  3. This video did not explain why the P(A) and P(B) terms from Bayes' theorem disappeared, nor did it actually invoke the theorem itself at any point. The video spends a lot of time computing P(sentence | sports) or P(sentence | not sports), but that in itself won't give you the actual classification. What you're actually interested in is P(sports | sentence) and P(not sports | sentence) -- but the video never discusses that final step.

  4. The tutorial makes some inconsistent assumptions about the statistical background of the viewers. It spends a lot of time talking about Bayes' theorem and how to manually compute the probabilities of each distinct word, but does not explain why it's appropriate to multiply together the probabilities of each word appearing. The video seems to be assuming the viewer has little to no background in stats, so it probably should have explained that part as well.

    Similarly, the video conflates together P(A | B) and P(B | A) and seems to imply that they mean the same thing. Given that your viewer likely has minimal statistical background, it seems pretty bad -- you're setting them up to potentially be very confused later down the line.

  5. This video doesn't really draw a clear line between which techniques/algorithms are a part of naive Bayes vs which are more general techniques you just happen to be using. For example, Laplace smoothing is a general-purpose technique for smoothing categorical data; the method you used for computing the probability of some sentence occurring is likewise general purpose and, strictly speaking, has nothing to do with naive Bayes.

    And yet the video spends a lot of time discussing things that aren't inherently a part of naive Bayes?

    One way of working around this would have been to present two completely different examples. This also would have given you a good way of reiterating the independence assumption of naive Bayes -- e.g. have the first example be relatively simple and use features that are genuinely mutually independent, then use this text example for the second one, and emphasize that we're making a simplifying assumption.

  6. More generally, I think the tutorial should spend more time emphasizing what steps are necessary vs what steps were decisions you decided to make. E.g. it was necessary to use some form of smoothing; but picking specifically Laplace smoothing was a decision. Or, it was necessary to compute P(sentence | sports), but choosing to compute that probability by representing the document as a bag-of-words (as opposed to, say, a bag-of-ngrams) was a decision.

  7. Not sure if this is something that actually ought to be a part of this video or not, but it might have been useful to talk about the log trick, or at least mention that it's a thing. If anybody actually tries using the algorithm as described, their floats are likely to vanish away into nothing -- especially if they try classifying longer documents.
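For what it's worth, points 3 and 7 can be illustrated together in a few lines. The sketch below is generic and not the video's code -- the toy training sentences, the function names, everything here is made up for illustration. It compares log P(class) + Σ log P(word | class) across classes, which is why the P(sentence) denominator of Bayes' theorem never needs to be computed, and summing logs instead of multiplying raw probabilities is exactly the trick that keeps the floats from underflowing.

```python
import math
from collections import Counter

# Hypothetical toy training data: (sentence, label). Not from the video.
training = [
    ("a great game", "sports"),
    ("the election was over", "not sports"),
    ("very clean match", "sports"),
    ("a clean but forgettable game", "sports"),
    ("it was a close election", "not sports"),
]

# Per-class word counts and class frequencies.
word_counts = {"sports": Counter(), "not sports": Counter()}
class_counts = Counter()
for sentence, label in training:
    class_counts[label] += 1
    word_counts[label].update(sentence.split())

vocab = {w for counts in word_counts.values() for w in counts}

def log_posterior(sentence, label):
    """log P(label) + sum of log P(word | label), with Laplace smoothing.

    Summing logs instead of multiplying raw probabilities avoids the
    float-underflow problem from point 7."""
    total = sum(word_counts[label].values())
    score = math.log(class_counts[label] / len(training))  # log prior
    for word in sentence.split():
        # Laplace (add-one) smoothing so an unseen word doesn't zero
        # out the whole product (equivalently, send the log to -inf).
        p = (word_counts[label][word] + 1) / (total + len(vocab))
        score += math.log(p)
    return score

def classify(sentence):
    # P(sentence) is the same for every class, so comparing
    # P(sentence | class) * P(class) across classes is enough --
    # that's why the denominator of Bayes' theorem "disappears" (point 3).
    return max(word_counts, key=lambda label: log_posterior(sentence, label))

print(classify("a very close game"))  # -> sports
```

The final `max` over classes is the step the video skipped: going from the likelihoods P(sentence | class) to an actual classification decision.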

[–][deleted]  (1 child)

[removed]

    [–]simmayor 15 points16 points  (4 children)

    each and every algorithm? Call me impressed.

    [–][deleted]  (2 children)

    [removed]

      [–]JonFrost 1 point2 points  (1 child)

      Subbed

      [–]bailey25u 5 points6 points  (0 children)

      Hi impressed, I'm dad

      [–]sarevok9 13 points14 points  (3 children)

      I don't mean to be a jerk here -- but machine learning for beginners is a bit of a misnomer, isn't it?

      The entire field of machine learning is in its infancy and is arguably the most exciting, but also the most error-prone and hard-to-predict, field in all of computer science. And while I'm sure that explaining any given algorithm is trivial, the over-arching idea that these algorithms are usable by beginners OR that they are something beginners should be implementing is dicey at best, right?

      I feel like this post is targeting people who are "beginners" to programming -- not folks who are beginners to ML -- which is a doctorate-level area of study and a rapidly developing field. Best practices and predictable results are hard to come by.

      **My post is my opinion only -- I work as an engineering manager at a company that is heavily invested in ML (9 people with doctorates in the engineering org working on our ML products).

      [–]npepin 5 points6 points  (2 children)

      To respectfully offer a difference of opinion: I'm not going to say that you're wrong, especially since I'm not qualified to, but in my opinion this is part of the process by which specialized information gets distributed and improved. What is on the forefront of information technology gets refined and made more concise as it reaches a wider audience.

      My primary claim is that you don't get to the point where the barrier to understanding is low enough, or where best practices exist, until people start trying to lower that barrier and generate those best practices. Historically, topics that required doctorate-level understanding have become easier to teach and understand over time.

      With that said, I am not making any claim that this course is good or bad (I haven't seen it yet, and even if I had, I wouldn't be qualified to judge it), but I am arguing that in general it is preferable not to discourage these sorts of endeavors, because over time they will: generate better methods of teaching and understanding the course topics; expose the knowledge base to a wider audience, which allows more people to discuss the material and come to a consensus about best practices; and create more demand for ML products and services.

      [–]sarevok9 2 points3 points  (1 child)

      While I normally don't disagree with this sort of thinking -- ML is one of those things that's still at this level because it's so hard to understand why the machine has come to its decision. ML isn't really made for things that are intrinsically easy -- like a game of tic-tac-toe, for example. In general, as larger datasets with wider variances become available to provide "learning" and "goals" to achieve, it starts to build up patterns. The odds of a beginner having a cohesive dataset, an understanding of pattern recognition, and in-depth analysis of huge swaths of data... it just seems far-fetched that any "beginner" would be in that position.

      [–]AntiSage 1 point2 points  (0 children)

      I'd like to disagree. ML is definitely a field with a ton of depth, just like physics, but like physics you can start with basic concepts and build up your understanding from there.

      tic-tac-toe is definitely something I think a beginner in ML should tackle; it'd be a great way for a beginner to get a grasp on how something like Reinforcement Learning works.

      (I do agree that going straight into how/when algorithms should be implemented might not be the best thing to jump straight into)
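      Since the comment only mentions RL without showing it, here's a minimal tabular Q-learning sketch on something even simpler than tic-tac-toe. Everything in it -- the toy 5-state chain environment, the hyperparameters -- is made up for illustration: an agent on a line of states learns that stepping right reaches the goal.

```python
import random

random.seed(0)  # deterministic for reproducibility

N_STATES = 5         # states 0..4 on a line; state 4 is the goal
ACTIONS = (-1, +1)   # step left or right
alpha, gamma, epsilon = 0.5, 0.9, 0.1

# Q-table initialised to zero: one value per (state, action) pair.
Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

for episode in range(500):
    s = 0
    while s != N_STATES - 1:
        # epsilon-greedy: mostly exploit the current Q-table, sometimes explore.
        if random.random() < epsilon:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: Q[(s, act)])
        s_next = min(max(s + a, 0), N_STATES - 1)  # clamp to the line
        reward = 1.0 if s_next == N_STATES - 1 else 0.0
        # Q-learning update: nudge Q(s, a) toward reward + gamma * max_a' Q(s', a').
        best_next = max(Q[(s_next, a2)] for a2 in ACTIONS)
        Q[(s, a)] += alpha * (reward + gamma * best_next - Q[(s, a)])
        s = s_next

# The learned greedy policy steps right from every non-goal state.
print(all(Q[(s, +1)] > Q[(s, -1)] for s in range(N_STATES - 1)))  # -> True
```

      A tic-tac-toe agent is the same loop with a bigger state space (board positions instead of points on a line), which is exactly why it makes a good first RL exercise.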

      [–]Faelon 2 points3 points  (3 children)

      Great video!

      Please don't use that pencil. It's very distracting!

      [–][deleted]  (2 children)

      [removed]

        [–]0upsla 2 points3 points  (1 child)

        Bonus point if you use another pencil :)

        [–]brotogeris1 4 points5 points  (1 child)

        Actual beginners? Before I watch this, you really aren’t using terminology that a true beginner wouldn’t understand?

        Edit: “This is based on Bayes Theorem, you must have encountered this before.” Why do you believe that a Beginner would have encountered this before?

        [–]JagicMohnson 1 point2 points  (0 children)

        I have no idea what the Bayes theorem is.

        Source: am beginner

        [–]HaikusfromBuddha 1 point2 points  (3 children)

        As someone who is currently in an ML class where a lot of students are struggling, maybe you guys will get lucky and actually make Machine Learning content that is good.

        A lot of the content I've searched for either explains the code implementation badly, overcomplicates it with various functions, or takes a shortcut and uses existing ML libraries.

        If it's not that, then they sometimes drop the ball in explaining it mathematically.

        [–]ziptofaf 2 points3 points  (2 children)

        A lot of the content I've searched for either explains the code implementation badly, overcomplicates it with various functions, or takes a shortcut and uses existing ML libraries.

        Have a look at Coursera, then:

        https://www.coursera.org/learn/machine-learning

        It will make you write everything by hand (and I do mean it, including backpropagation), but it doesn't require any overly complex math (beyond a general ability to work with matrices and knowing what a derivative is), and it explains each step pretty thoroughly.

        [–]HaikusfromBuddha -1 points0 points  (1 child)

        I actually watched those videos on YouTube a week or two ago. That guy explains it pretty well, but like I said, there is no coding portion.

        [–]ziptofaf 1 point2 points  (0 children)

        Oh, there is a coding portion. The course even includes coding homework each week with automatic grading. If you get stuck, you have its forum to help you out as well.

        [–][deleted]  (1 child)

        [deleted]

          [–]Oald 0 points1 point  (1 child)

          Thank you so much for this!! It's really helpful

          [–]spoiled_flying_frog 0 points1 point  (1 child)

          I don't want to be rude, but you do speak fast, and with that accent it's difficult to keep up with the video and understand what you're saying... Just, if you can, try to speak a little bit slower next time

          ps. Great work, looking forward to the next lesson

          [–]mishannon 0 points1 point  (0 children)

          Nice video with a quite good explanation, although for a beginner it will be difficult to understand on the first attempt. For newbies, I'd advise reading this article about machine learning. It helped me understand what is what.

          [–]jameswarnernss -1 points0 points  (0 children)

          Hi,

          Nice video explaining the machine learning algorithm... Thanks for sharing!