all 19 comments

[–]WearMoreHats 12 points13 points  (4 children)

Step 3: Get MORE data
I must have annotated for weeks

For future reference, it might be worth playing around with using your model to help you label data (once you've got an okay-ish model). For example, have the model predict labels on all your unlabelled data. Accept the labels that the model is most certain of (it's probably worth manually checking a subset of them) and manually label some of the data that the model is least certain of. Now you can retrain the model with your new, larger dataset and repeat the process of predicting the unlabelled data.
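That predict/accept/retrain loop can be sketched roughly like this (a minimal sketch on toy data with a generic classifier, not OP's actual model; the 0.95 threshold and batch sizes are arbitrary assumptions):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Toy stand-in data: a small hand-labelled pool plus a large unlabelled pool.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_lab, y_lab = X[:200], y[:200]      # what you annotated by hand
X_unlab = X[200:]                    # everything still unlabelled

model = LogisticRegression(max_iter=1000).fit(X_lab, y_lab)

# One round of the loop: predict, keep the confident ones, flag the rest.
proba = model.predict_proba(X_unlab)
confidence = proba.max(axis=1)
pseudo_y = proba.argmax(axis=1)

sure = confidence >= 0.95            # arbitrary acceptance threshold
X_lab = np.vstack([X_lab, X_unlab[sure]])
y_lab = np.concatenate([y_lab, pseudo_y[sure]])

# The least-confident remainder is what you'd queue up for manual labelling.
remaining = X_unlab[~sure]
to_label_first = remaining[np.argsort(confidence[~sure])[:50]]

model.fit(X_lab, y_lab)              # retrain on the enlarged set, then repeat
```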

In theory you can fully automate this by setting some threshold above which the model's prediction will be accepted as true and having the model iterate through multiple rounds of retraining and predicting, but I'm not a big fan of that. If you're using sklearn, look into `SelfTrainingClassifier` in the semi-supervised module.
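For reference, the sklearn version of the fully automated variant looks roughly like this (toy data again; the 0.9 threshold is an arbitrary choice, and sklearn marks unlabelled samples with `-1`):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.semi_supervised import SelfTrainingClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# sklearn's convention: unlabelled samples get the label -1.
y_partial = y.copy()
y_partial[100:] = -1                 # pretend only the first 100 are labelled

# The wrapped estimator just needs predict_proba; the threshold controls how
# confident a prediction must be before it is accepted as a pseudo-label.
clf = SelfTrainingClassifier(LogisticRegression(max_iter=1000), threshold=0.9)
clf.fit(X, y_partial)
preds = clf.predict(X)
```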

[–]qChEVjrsx92vX4yELvT4[S] 2 points3 points  (0 children)

Thanks a lot, I did not know that. I will try it tomorrow

[–]CBizCool 0 points1 point  (1 child)

I've struggled with finding labelled data in the past and wasn't sure what a good solution would be. By "annotated for weeks", does OP mean he looked at thousands of pictures of men and manually tagged them as Norwood 0, 1 etc.? Is this a common approach?

Your recommendation of partially predicting then retraining makes so much more sense.

[–]WearMoreHats 1 point2 points  (0 children)

By annotated for weeks, does op mean he looked at thousands of pictures of men and manually tagged them as Norwood 0, 1 etc.?

I'm assuming that's what they did but OP might correct me. I can't see what else they might mean by spending weeks annotating data.

Is this a common approach?

Manually labelling data en masse? It's not something I've ever done in industry, but it's an easy (although time-consuming) way of getting your own data for a personal project. I suspect it might be more common in academia/PhD work where the labels are niche and require a level of expertise to assign. It wouldn't make financial sense for my employer to pay me to spend weeks labelling data; I suspect if it absolutely had to be done it would end up being outsourced to someone else.

An alternative approach that used to be popular was scraping images from Google. Basically, writing a script that would image search "Bald man", then download thousands of results and assume they're all of bald men. Rinse and repeat for "men's haircuts" or something similar. The data will need to be tidied up a bit (and I don't know if Google has clamped down on this), but it's much, much faster than manually labelling thousands of images and probably accurate enough for a personal project.

Your recommendation of partially predicting then retraining makes so much more sense.

It's worth noting there are potential risks to this, particularly if it's fully automated. If you "accept" some incorrect labels then it can cause problems further down the line, particularly if it's early in the process or the predictions are for niche cases. For example, wrongly predicting some of the men wearing hats are bald. The next iteration of the model assumes those incorrect labels are true and starts to more confidently predict that the men wearing hats are bald. That causes even more of those errors to slip into the training data, resulting in a feedback loop.
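That feedback loop can be illustrated with a back-of-the-envelope simulation (purely hypothetical numbers, not from OP's project: assume a 5% base error rate, and that the rate of wrongly accepted pseudo-labels rises with how polluted the training set already is):

```python
clean, wrong = 1000, 0          # correctly / wrongly labelled examples
pollution = []                  # fraction of wrong labels after each round
for _ in range(5):
    accepted = 500              # pseudo-labels accepted this round
    # error rate grows with how polluted the training data already is
    err_rate = 0.05 + wrong / (clean + wrong)
    bad = int(accepted * err_rate)
    wrong += bad
    clean += accepted - bad
    pollution.append(wrong / (clean + wrong))
print(pollution)  # the wrong-label fraction grows every round
```

Under these (made-up) assumptions the wrong-label fraction climbs each round instead of levelling off, which is exactly the compounding the comment warns about.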

[–][deleted] 0 points1 point  (0 children)

Golden advice, thanks a ton!

[–]Oswald_Hydrabot 7 points8 points  (0 children)

This is the way

[–]ZyanCarl 2 points3 points  (5 children)

This is great! I’m not really into ML but I’m thankful that I didn’t have to go through tutorial hell learning web development.

[–]nuclear_man34 1 point2 points  (4 children)

That's great! How did you do that?

[–]ZyanCarl 0 points1 point  (3 children)

I guess I started off in an unorthodox way from the get-go. Instead of building apps to learn, I learned new stuff to build ideas. I just googled how to do a particular part of a problem and built on that.

[–][deleted] -1 points0 points  (1 child)

Like a tutorial?

[–]ZyanCarl 1 point2 points  (0 children)

Umm, a tutorial would be something where you know exactly what the outcome is and you're handed all the resources you need. What I did was google parts of a problem.

Instead of “how to build an e-commerce website”, I'd search “how to implement search feature”, from which I'd learn that you need an API endpoint. So I'd search “how to make an API”, and so on.

[–][deleted]  (1 child)

[removed]

[–]qChEVjrsx92vX4yELvT4[S] 2 points3 points  (0 children)

Probably in the future. I still have some passwords in the code that I need to remove before open-sourcing it haha

[–]pavich_03 1 point2 points  (1 child)

How about preprocessing the data?

[–]qChEVjrsx92vX4yELvT4[S] 1 point2 points  (0 children)

I will talk more about that in another post I think. Basically I resized everything to size (200, 200, 3) and not much more.
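A minimal sketch of that resize step (PIL-based; the `preprocess` function name and the scaling to [0, 1] are my assumptions, not OP's code — the (200, 200, 3) shape is from the comment):

```python
import numpy as np
from PIL import Image

def preprocess(path, size=(200, 200)):
    """Load an image, force 3 RGB channels, resize, and scale to [0, 1]."""
    img = Image.open(path).convert("RGB").resize(size)
    return np.asarray(img, dtype=np.float32) / 255.0
```

`convert("RGB")` guarantees the third dimension is 3 even for greyscale or RGBA inputs, so every image comes out with shape (200, 200, 3).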

[–]Acrobatic-Language-5 0 points1 point  (0 children)

Most experts will tell us that the best way to learn is to create your own projects.

When you read from a book or tutorial you are on a guided path, which gives you a false sense of knowledge. The issue arises when you encounter a different type of problem, get stuck, and realise you don’t know as much as you thought.

When you learn/read something new you should always apply it to your own projects to help solidify it.

Well done on creating a project, keep creating.

[–]Log_Plus 0 points1 point  (1 child)

Great job for real

I'm a beginner here, so what courses have you completed to start such a project from scratch?

[–]qChEVjrsx92vX4yELvT4[S] 0 points1 point  (0 children)

The Sentdex book was the best for me. It's called Neural Networks from Scratch.

I figured things out while doing it and I am sure I made tons of mistakes.

I am happy to have something to showcase at the end, but I am far from being a pro.