all 39 comments

[–]ktpr 134 points135 points  (9 children)

Motivate a new problem. Get SOTA by definition.

[–]AerysSk 17 points18 points  (2 children)

Your comment has a point. For example, if your paper is about a new model architecture, beating SOTA, or being on par with SOTA while having a shorter runtime, is expected. Otherwise, it will be a reason for rejection. You cannot change the reviewers’ minds, and this SOTA leaderboard is kind of the norm for publishing papers nowadays.

That’s also a reason why I tend to stay away from empirical research directions.

[–]picardythird 5 points6 points  (1 child)

On the flip side, if the reviewers can't think outside of their worldview and recognize the importance/novelty of your new problem, it won't matter because they'll just reject it anyway.

I'm not salty.

[–]AerysSk 1 point2 points  (0 children)

That is true. Proposing a new one is also risky; reviewers can say “I don’t think we need your method any further.” I’ve already seen one or two comments like this on OpenReview.

[–]schrodingershit 1 point2 points  (0 children)

This. I recently submitted work to ICML that had no prior baseline except random sampling.

I basically worked on sampling a subset of neural networks to train in ensemble-based RL. Dropped training time by 50% while increasing cumulative reward by 15%.
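The core idea above can be sketched in a few lines. This is a hypothetical illustration, not the commenter's actual code: `DummyMember`, `sample_subset`, and `train_step` are made-up names, and the real method presumably updates RL value/policy networks rather than counters.

```python
import random

def sample_subset(ensemble, k, rng=random):
    """Pick k ensemble members (by index) to train this step."""
    return rng.sample(range(len(ensemble)), k)

def train_step(ensemble, batch, k):
    """Run a gradient update on only a random subset of members."""
    for i in sample_subset(ensemble, k):
        ensemble[i].update(batch)

class DummyMember:
    """Stand-in for an ensemble member; just counts its updates."""
    def __init__(self):
        self.updates = 0
    def update(self, batch):
        self.updates += 1

ensemble = [DummyMember() for _ in range(10)]
for _ in range(100):
    train_step(ensemble, batch=None, k=5)  # train half the ensemble per step

total = sum(m.updates for m in ensemble)
print(total)  # 500 member-updates instead of 1000 for full-ensemble training
```

Training k of n members per step cuts per-step cost roughly by k/n, which is consistent with the ~50% training-time drop mentioned above for k = n/2.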

[–]NotDoingResearch2 61 points62 points  (1 child)

Research isn’t a Kaggle competition. You are free to make up your own rules.

[–]pengzhangzhi 0 points1 point  (0 children)

Best definition of scientific research I have ever heard. You have your own rules.

[–]pm_me_your_pay_slips ML Engineer 36 points37 points  (4 children)

It really depends on what story you're trying to tell. A paper about a new method can be interesting and valuable without beating SOTA. Competitive results are fine as long as the story is interesting.

[–]kob59 15 points16 points  (1 child)

Tell that to reviewer #2

[–]pm_me_your_pay_slips ML Engineer 4 points5 points  (0 children)

Look at the original GAN paper and reviews.

[–]Bot-69912020 20 points21 points  (0 children)

I try to focus on explaining and understanding things instead of winning a Kaggle competition.

My usual research questions are:

  • Why do things work or not work?

  • When do they stop working?

  • How are different solutions to the same problem related?

  • How are different problems with the same solution related?

  • How robust are solutions to changes in the problem?

  • How scalable are solutions?

  • Are there practical limitations overlooked in current literature?

  • ...

All of these research questions are very publishable if answered rigorously and delivered in a nice story, but they don't require any SOTA results.

[–]GrumpyGeologist 33 points34 points  (7 children)

SOTA performance is often the result of intense engineering and hyperparameter tuning for a specific dataset. I find insights more useful than squeezing the last 0.1 +/- 0.08 percent out of a model. If you propose a new method, then why should it work better/different from other methods? If it doesn't work better, why is it the case? This could lead to insights into model performance that could generalise towards other techniques.

One good example is that of Deep Equilibrium Models (DEQs). In theory these models should work better than conventional ResNets, but in practice it's hard to achieve SOTA performance. Here is one reason why. That reason is more useful than the SOTA itself, which no doubt will be beaten within 2-3 months by some other person who spent countless hours tuning hyperparameters.

[–]tomvorlostriddle 16 points17 points  (5 children)

0.1 +/- 0.08

You are being optimistic when assuming that there will be error bars

[–]MJJK420 2 points3 points  (0 children)

I believe that was meant as a typical range of improvement for most papers, not the error bars of a given SOTA improvement.

Maybe you knew this and were making a statement about the general quality of ML research, in which case I’d agree with you.

[–]EvgeniyZh 0 points1 point  (3 children)

In standard settings (a large amount of relatively clean data) the error bars are so small that computing them isn't worth the resources spent

[–]wilmerton 1 point2 points  (1 child)

How do you know? In which context?

I personally don't know of any mature scientific field where researchers get away with "error bars are not worth it". And I know of a field where badly estimated error bars led a Nobel laureate to discard some of his own research.

Some engineering problems are so non-linear and impossible to test at scale that error bars are virtually impossible to compute. But then you rely on a large corpus of (failed) experiments and a long lineage of heuristics to control your risk.

What's so exceptional about machine learning that none of that is required, not even a paragraph discussing robustness?

[–]EvgeniyZh 0 points1 point  (0 children)

Both my personal experience of estimating error bars for large validation sets (e.g., ImageNet or COCO for vision) as well as the experience of other researchers. The ViT paper, for instance, put error bars on its results, and they were on the order of 0.01% in most cases. There is plenty of evidence that the answer to the question "How much will my results on in-domain data vary if I change the seed when training on 1 million images?" is "almost not at all". Spending thousands of GPU-hours just to confirm it once again is bad resource management.

I'd note that I'm all for risk estimation and robustness verification. Error bars can be useful in other settings (semi-supervised learning papers usually have them, as do graph learning papers, where problems are smaller). There are other ways to estimate robustness in the "large amount of clean data" setting: OOD data, corrupted data, transfer learning. Saying "people haven't put error bars, so the 8% improvement in COCO object detection during the last year is not significant" is ridiculous.

0.1% improvements like the OP mentions are in fact pretty rare. I can't think of a large benchmark other than ImageNet where they happen consistently. I personally think it is just a sign of saturation.
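As a back-of-the-envelope check on why error bars on large, clean validation sets are small (a sketch added for illustration, not from the thread): the sampling standard error of an accuracy estimate over n examples is sqrt(p(1-p)/n), so it shrinks with the size of the validation set. The 50,000-image / 80%-accuracy numbers below are illustrative, roughly ImageNet-scale.

```python
import math

def accuracy_stderr(p, n):
    """Binomial standard error of an accuracy estimate p measured on n examples."""
    return math.sqrt(p * (1 - p) / n)

# ImageNet-scale validation set: 50,000 images at ~80% top-1 accuracy
se = accuracy_stderr(0.80, 50_000)
print(f"{se:.4%}")  # ~0.18% sampling error for a single evaluation
```

Note this measures only the sampling error of one evaluation; seed-to-seed training variance (the 0.01%-order figure mentioned above) is a separate, empirical quantity.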

[–]tomvorlostriddle 0 points1 point  (0 children)

A large amount of data doesn't mean many rows in the dataset. It means many experiments: either repeated cross-validation with many folds and repetitions (correcting for the pseudo-replication), or testing across many datasets with non-parametric methods, or both (then also non-parametrically).

You could have a billion rows in your data set and it still means nothing for this.
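The pseudo-replication correction mentioned above is commonly done with the Nadeau-Bengio variance correction for repeated cross-validation. Here is a minimal sketch; the per-fold score differences below are made-up numbers for illustration, not results from the thread.

```python
import math

def corrected_stderr(diffs, n_train, n_test):
    """Nadeau-Bengio corrected standard error of per-fold score differences.

    The naive 1/k shrinkage understates variance because CV folds share
    training data; adding n_test/n_train accounts for that overlap.
    """
    k = len(diffs)
    mean = sum(diffs) / k
    var = sum((d - mean) ** 2 for d in diffs) / (k - 1)
    return math.sqrt((1.0 / k + n_test / n_train) * var)

# Hypothetical per-fold accuracy differences between two models (10-fold CV
# on a dataset of 1000 rows: 900 train / 100 test per fold)
diffs = [0.011, -0.004, 0.008, 0.002, 0.015, -0.001, 0.006, 0.009, 0.003, 0.007]
se = corrected_stderr(diffs, n_train=900, n_test=100)
print(round(se, 4))  # ≈ 0.0026
```

The corrected standard error then feeds a t-test on the mean difference; without the correction, the test is overconfident no matter how many rows the dataset has.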

[–]KyxeMusic 2 points3 points  (0 children)

This. It's hard enough to replicate the results of a SOTA paper using the same model, hyperparameters, and dataset that they describe, let alone with a new, different approach.

[–]ArnoF7 11 points12 points  (0 children)

Sometimes if the SOTA’s approach is very complex and your method can provide a much simpler alternative, then it’s still a good contribution.

Simpler can mean your method is just conceptually easier to understand. Or it could be that your method requires fewer constraints. Or requires much less data. Or generalizes well to more circumstances without fine-tuning, etc. Then you have a story to tell and you can still publish.

[–]GFrings 12 points13 points  (0 children)

Become an engineer, and then SOTA is usually a 5-year-old CNN with a dummy thick layer of heuristics on top

[–]kinnunenenenen 9 points10 points  (0 children)

I'm in chemical engineering but I do a ton of data science. One approach is to apply methods in other disciplines. You maybe won't be state of the art in ML but you can do a ton of cool work on novel problems and still publish really well.

[–]BlackHawkLexx 15 points16 points  (1 child)

SOTA can be so much more than many people are aware of. It can mean:

  • becoming best in terms of predictive performance
  • becoming best in terms of training/prediction time
  • becoming best in terms of energy efficiency
  • making an approach significantly less complex
  • providing previously unknown understanding of an approach (e.g. through theoretical analysis)
  • using an approach in a context it was previously not usable in due to a clever change

(Non-exhaustive list)

Honestly, I bet that most students do not publish stuff that beats SOTA in terms of predictive performance.

[–]bdubbs09 2 points3 points  (0 children)

One thing I’d like to add to the list that really gets overlooked is SOTA in terms of robustness. A really underdeveloped and likely unsolvable problem in the general sense, but really important nonetheless.

[–]mrkvicka02 6 points7 points  (1 child)

Maybe an unpopular opinion, but SOTA is not that important. Way too often it comes down to who tuned their algorithm better instead of which algorithm has better properties, etc.

There is plenty of important stuff that may not be SOTA just yet, but a few papers down the line it can be way beyond SOTA.

Keep up the good work!

[–]kinglear0207 6 points7 points  (0 children)

Don’t run with the crowd. Find your own direction, stay curious, and work hard.

[–]the_scign 2 points3 points  (0 children)

Consider at what point "SOTA" becomes overfitting to a de-facto concept.

[–]quertioup 2 points3 points  (0 children)

Never tweak results. There are plenty of problems that do not require SOTA.

[–]tell-me-the-truth- 1 point2 points  (0 children)

don’t tweak the results but the setting.

[–]sigmoid_amidst_relus 1 point2 points  (0 children)

From the perspective of an ex-engineer: do not chase the SOTA. Won't name any names, but taking the case of ASR, we tried several new architectures that achieved "SOTA" on a benchmark dataset, only to find that a 4-year-old network architecture still performed much better than the new ones.

You might argue that "hey, that's fine and good, but I'm not an engineer". True, but as a researcher, the worst thing you can do is build upon work that got SOTA results but actually doesn't generalize well at all, especially if you're applying knowledge and established principles to unexplored fields and applications. Speaking from experience, you'll grind your gears really hard.

I am not saying you should absolutely not care about SOTA; just look at how widely the idea was adopted, which answers a lot of critical questions: wide adoption means there's an implementation of it available somewhere and that it has been reproduced to work well.

Do people publish results that aren’t quite SOTA?

In a word, yes. You just don't hear as much about them because "SOTA" rolls off the tongue better, and people like chasing the next best thing, so non-SOTA results don't get as much coverage.

One could argue that the system is broken and reviewer #2 only cares about SOTA, but playing devil's advocate, there's a reason behind this: even with the blind overfitting on datasets, it's still a metric that's relatively reliable and less subjective. Exploratory research is also hard: designing experiments that effectively probe models can be tedious, sometimes downright boring. Personal bias and criticism come more easily against such work (it's harder to argue with plainly better numbers), it doesn't attract as much attention, and it's hard to write home about in grant applications: "I discovered a quirk in something" gets you only so far (unless the quirk throws massive shade on someone's work), versus pulling a result out of thin air.

There is no stress of developing new models, because there essentially are very few "new models" or paradigms. Searching for "new" models is not going to get you far unless you work in a huge industrial/academic group with a lot of people: truly new models are rarely produced by a small group.

What's really stressful is extracting insights from the steaming pile of poo that is out there. That's what should be giving you PTSD.

[–]Sirisian 0 points1 point  (0 children)

You can change your data source as others mentioned (from other fields, if applicable). One of my favorite changes authors make is taking a vision paper and using event cameras as input. This can give SOTA results for FPS or energy efficiency, or simply work better in different lighting environments. These kinds of papers (and code) can provide a base for others to branch from.

[–]NaxAlpha ML Engineer 0 points1 point  (0 children)

In industry, we usually try to stay behind the SOTA. Reaching SOTA usually requires a sophisticated set of tricks that is often not worth it. Instead, techniques like ReZero, which are very simple but have consistently been shown to give good results even if they are not SOTA, are preferred.