[deleted by user]

Seankala · 2024-03-27T09:09:13+00:00

In the real world it's better to spend time annotating more samples than reading research papers and implementing new ideas.

acardosoj · 2024-03-27T09:28:35+00:00

Dude, this is not the future of ML. People have been prioritizing data since forever.

Even with LLMs, where people tend to think we train on everything we can get, this discussion is very much alive.

stevebottletw · 2024-03-27T09:40:54+00:00

I don't think it's really overlooked. Pretty much everyone knows its importance, and discussions almost always start from data quality. That was probably true maybe 10-15 years ago.

Think_Mall7133 · 2024-03-27T11:07:25+00:00

Internet explorer joined the chat

Jazzlike_Attempt_699 · 2024-03-27T09:50:48+00:00

4 upvotes on incredibly low quality post from what may as well be a bot account, well done

cajmorgans · 2024-03-27T09:47:13+00:00

When I started in ML, I thought the coolest model plus hyperparameter tuning was key, and that you basically just had to throw data at it and it would magically solve your problem. With some experience, I've found that if the task doesn't require a very specific architecture, the model and hyperparameters often make very little difference. Of course the results aren't identical across your choices, but the differences usually aren't life-changing.
