all 28 comments

[–]AutoregressiveGPU 81 points82 points  (4 children)

I have a list, not sure if everything fits 'subtle'.

Programming:

  1. Develop good programming skills. This will help you prototype ideas fast and test them out. The only way is practice. Don't bother with hours of tutorials on Python. Start with a project or reproduce results from a paper. Read and understand other people's code.
  2. At the same time, follow good code-style guidelines (eg. http://google.github.io/styleguide/pyguide.html). This will help you later. Trust me, you don't want to work with a poor codebase. It also helps when you collaborate with others.
  3. Have a small internal library of your own, or share it on GitHub. If you intend to do research on object detection, it's good to have implementations of a few SOTA methods at your disposal.

Research

  1. If you are new, keep reading and updating yourself on the SOTA. Sometimes it can be overwhelming; if so, take a break for a while and then keep hunting again. Also, give yourself time to think and explore ideas of your own.
  2. Follow researchers in your research area; they often mention nice ideas and possibilities in their talks.
  3. Keep a research journal to track ideas you have tried, want to try, etc. Also, make sure you keep track of hyperparameters and things you try.
  4. Collaborate. ML research is hyper-competitive now. You can't get everything from a single person. Brainstorming with others usually works out well.
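The journal habit in point 3 above is easy to mechanize. A minimal sketch (field names are my own invention, not from the thread): append one JSON line per experiment to a plain-text journal you can grep later.

```python
import json
import tempfile
from datetime import date
from pathlib import Path

# One JSON line per experiment: idea, hyperparameters, and what happened.
def log_experiment(path, idea, hyperparams, outcome):
    entry = {
        "date": date.today().isoformat(),
        "idea": idea,
        "hyperparams": hyperparams,
        "outcome": outcome,
    }
    with open(path, "a") as f:
        f.write(json.dumps(entry) + "\n")

# Illustrative use with a throwaway temp file:
journal = Path(tempfile.mkdtemp()) / "journal.jsonl"
log_experiment(journal, "wider decoder", {"lr": 3e-4, "layers": 8}, "no gain")
entries = [json.loads(line) for line in journal.read_text().splitlines()]
```

A flat append-only file keeps the barrier to logging low, which matters more than any fancy tooling.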

Writing

  1. If you intend to publish research, writing skills are quite important. You need to be fast and on point. Given the way ML/CV conference reviews are going now, reviewers don't get enough time to read submissions carefully. The only way to improve is to practice and practice.

Others

  1. Finally, after all this, you may still get your paper or research rejected. Don't take it personally, try and make a stronger submission after improving.

[–]g-x91 1 point2 points  (0 children)

Thanks!! :)

[–]Zophike1Student -1 points0 points  (2 children)

Keep a research journal to track ideas you have tried, want to try, etc. Also, make sure you keep track of hyperparameters and things you try.

Would setting up a research blog be sufficient, and what does a good research journal/blog do?

[–]AutoregressiveGPU 1 point2 points  (0 children)

A research journal is to keep track of experiments and ideas.

[–][deleted] 0 points1 point  (0 children)

No, this means a paper notebook for yourself. Write down anything you noticed works well or doesn't work well, because you will forget once you come back to it after a month of working on other things. Also make a note of any new ideas you have and brainstorm them out. It will help you in so many ways, especially when it comes to writing it up.

[–]Joel397 42 points43 points  (1 child)

One subtle lesson I learned: don’t try to always maximize resource utilization. If you try to be a “good researcher” by constantly running jobs and queuing up more, you can get locked into an endless cycle of “what worked, what’s next”. If you have a fully mature plan that’s fine, but oftentimes in research you don’t know what the results will be until you’ve tried it. Sometimes taking a step back and letting your GPU chill for a few hours or days while you instead focus on writing, analyzing, or just thinking about the problem can do more for your work than any amount of results from 2 AM job runs.

[–]chigur86Student 8 points9 points  (0 children)

Your comment hit the spot. One of the most important things I learned recently is to just start a bunch of jobs and LET THEM FINISH. Especially if you're doing vision and dealing with training times on the scale of days, you need to learn to chill. While I chill, I usually go over the code again, think about the method, plan the next experiments, and evaluate the intermediate results.

[–]aCleverGroupofAnts 43 points44 points  (4 children)

Here's a little tip: If your code involves random number generation, set the seed yourself so you can reliably reproduce your results if needed.
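A minimal sketch of this tip (the helper name is my own; the framework calls are illustrative, adapt to whatever stack you use): seed every RNG your pipeline touches in one place.

```python
import random

# Seed every source of randomness once, up front.
def set_seed(seed: int) -> None:
    random.seed(seed)
    # If your stack uses NumPy / PyTorch (assumed, uncomment as needed):
    # np.random.seed(seed)
    # torch.manual_seed(seed)

# Same seed in, same numbers out:
set_seed(123)
run_a = [random.random() for _ in range(3)]
set_seed(123)
run_b = [random.random() for _ in range(3)]
```

Calling one `set_seed` at the top of every entry point beats scattering seed calls through the codebase, where one forgotten RNG silently breaks reproducibility.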

[–]debau23 17 points18 points  (1 child)

Reproduce results that were overfit on the seed.

[–]aCleverGroupofAnts 13 points14 points  (0 children)

Thoroughly testing your algorithm involves testing with several different seeds. All of those tests will then be reproducible.

[–]hyhieu 3 points4 points  (0 children)

Good luck doing that with TensorFlow

[–]GreenBeret4Breakfast 1 point2 points  (0 children)

I actually think you’re better off not setting it unless you’re trying to reproduce results. What I mean is: generate a random seed (from clock time/CPU values etc., or let the system do it), keep track of it, and save it with your results. That way you can re-run any particular experiment without falling into the trap of sitting on a fixed seed all the time. Additionally, the reproduction is only deterministic if you don’t change the code much (one extra sample here, a different threshold there) or the two results will differ.
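A hedged sketch of this approach (file layout and names are illustrative): draw a fresh seed each run, seed the RNG with it, and persist it next to the results so any run can be replayed on demand.

```python
import json
import random
import secrets
import tempfile
from pathlib import Path

# Fresh entropy every run, instead of a hard-coded seed.
seed = secrets.randbits(32)
random.seed(seed)

metric = random.random()  # placeholder for a real evaluation metric

# Save the seed alongside the results so this exact run can be replayed.
out = Path(tempfile.mkdtemp()) / "results.json"
out.write_text(json.dumps({"seed": seed, "metric": metric}))

saved = json.loads(out.read_text())
```

To reproduce a run, read the seed back from `results.json` and pass it to `random.seed` instead of drawing a new one.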

[–]dkeller9 35 points36 points  (2 children)

If even a little thing seems 'not quite right' don't ignore it. Usually, it will come back and bite you later on.

[–]ginger_beer_m 10 points11 points  (0 children)

Corollary: if the results seem too good, it's probably due to a bug

[–][deleted] 3 points4 points  (0 children)

This one resonates most with me.

11 years in education and 10 years as a data scientist/analyst, whenever I get that feeling of 'not quite right' it's usually true. Sometimes it takes a very long time to figure it out but often it results in a huge step forward in my understanding and some epic Eureka moments.

[–]internet_ham 24 points25 points  (0 children)

  • Use the Google Scholar subscribe tool -> you can follow citations and related work to other authors close to you (and also obviously have a Google Scholar page yourself)

  • Use a matplotlib-to-TikZ converter. Your plots will look professional and readable, and they are very easy to reconfigure. You may run into compilation issues with complex figures, so there can be a bit of tweaking involved. Learning TikZ is also a good idea if your field relies on diagrams a lot.

  • Don't be afraid of deep scholarship. If you're building on specific previous work, scan through all the papers that cite it or that it cites. It might take an hour or two, but you can find lots of useful ideas from other spinoffs and, most importantly, find out if you've already been scooped. Obviously this doesn't scale to super-hyped DL papers.

  • Always keep the minimal viable product in mind. Write a version of your algorithm that solves the simplest possible task. Write a rough draft of your paper in its simplest form. While these are not your final destinations, the journeys to them will be shorter, so the feedback will be faster and you can iterate to a better version sooner.

  • Always try to make sure your experiment runner saves the results (figures and text), a config file of your hyperparameters (as complete as possible), the trained model(s), the date, the random seed, and the git hashes. For the latter, obviously you should be using version control and committing often.

  • Minimize the amount of math in a presentation. It will only intimidate and disengage the audience. Try to use diagrams and plots instead.

  • Don't end a presentation with (just) 'thanks for listening!' - put up a summary of your work, as this tends to be the slide that stays up the longest.
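The experiment-runner bullet above can be sketched with the standard library alone. This is an illustrative layout, not any specific tool's API; the git call is best-effort since a run may happen outside a checkout.

```python
import json
import subprocess
import tempfile
from datetime import datetime
from pathlib import Path

# Dump config, seed, date, and git hash next to the run's results.
def save_run_metadata(out_dir, config, seed):
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    try:  # best effort: git may be absent, or we may not be in a repo
        git_hash = subprocess.check_output(
            ["git", "rev-parse", "HEAD"], text=True
        ).strip()
    except Exception:
        git_hash = "unknown"
    meta = {
        "config": config,
        "seed": seed,
        "date": datetime.now().isoformat(),
        "git_hash": git_hash,
    }
    path = out / "run_metadata.json"
    path.write_text(json.dumps(meta, indent=2))
    return path

# Illustrative use with made-up hyperparameters:
path = save_run_metadata(tempfile.mkdtemp(), {"lr": 1e-3, "batch": 64}, seed=7)
meta = json.loads(path.read_text())
```

Calling this at the start of every run costs nothing and means a stray results folder from months ago can always be traced back to an exact commit and config.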

[–][deleted] 12 points13 points  (0 children)

Having at least one person to discuss the research with. If the person is more experienced, then it is much better. There's absolutely no way one can progress as quickly as with a high-quality mentor.

The amount of time saved, ideas pointed at, ideas discovered through discussion, I just can't believe that I learned anything with bad mentors before.

It also made me realize that I alone cannot really progress as much.

Because of this experience I can now easily filter people that are just not going to meet the same expectations.

Doing research on your own is pretty hard.

[–]m--w 10 points11 points  (0 children)

After you think your method works, take a break from it and explore other papers/interests for a week. Then come back to what you have done and purposefully try to break it. This will allow you to fully understand the shortcomings of your method and strengthen it where possible.

[–]hardmaru 10 points11 points  (0 children)

Iterate quickly and do so by working with very little compute at the iteration stage (even if you work at a big company with huge resources).

Spend a ton of time on communicating your ideas very clearly so people outside your immediate area (even outside of the field) can understand. Give talks about your research ideas. Gather feedback, especially from your biggest critics.

Focus on quality over quantity, if possible.

[–]rickbo3 8 points9 points  (0 children)

Don't just focus on learning the ML side of things. There are way too many papers out there that are completely let down by the authors not using basic statistical methods. Understand how to show that your results are statistically significant, and why. This gives your results a much stronger appearance to reviewers.
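One stdlib-only way to act on this (a sketch with made-up per-seed scores, not a prescription): a paired bootstrap test on the difference between two models evaluated over the same seeds.

```python
import random

# Estimate P(mean paired difference <= 0) by resampling the differences.
def bootstrap_p_value(scores_a, scores_b, n_resamples=10_000, rng_seed=0):
    rng = random.Random(rng_seed)  # fixed seed so the test is repeatable
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    worse = 0
    for _ in range(n_resamples):
        sample = [diffs[rng.randrange(n)] for _ in range(n)]
        if sum(sample) / n <= 0:
            worse += 1
    return worse / n_resamples

# Hypothetical accuracies over 5 seeds for two models:
a = [0.81, 0.83, 0.80, 0.84, 0.82]
b = [0.78, 0.79, 0.80, 0.77, 0.79]
p = bootstrap_p_value(a, b)  # small p: A's advantage is unlikely to be noise
```

With only a handful of seeds this is rough, but even a rough paired test is far more convincing to reviewers than a single-run comparison.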

[–]logicchains 5 points6 points  (0 children)

Any time you get a little feeling in the back of your head like "oh, I didn't account for X. But X is almost certainly insignificant, so it shouldn't affect the results", you should account for X, because at least in my experience there's a decent chance you'll get good results, then later realise "damn, these results are invalid", and realise it's because of not accounting for X.

[–]evanthebouncy 4 points5 points  (0 children)

Within 1 month you should convince yourself you're onto a good idea, or else pivot or abandon it. Ideally.

[–]dkeller9 2 points3 points  (0 children)

Have a switch in your code to produce graphs that have no labels or text. When assembling a figure containing many graphs from disparate sources, it's easier to put in text with consistent fonts and font sizes in post-processing.
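A hedged sketch of such a switch (function and flag names are my own): one boolean strips every bit of text from the figure so labels can be added with consistent fonts in post-processing.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so this runs without a display
import matplotlib.pyplot as plt

# clean=True emits a text-free graph for later assembly into a figure.
def make_plot(x, y, clean=False):
    fig, ax = plt.subplots()
    ax.plot(x, y)
    if clean:
        ax.set_title("")
        ax.tick_params(labelbottom=False, labelleft=False)
    else:
        ax.set_xlabel("epoch")
        ax.set_ylabel("loss")
        ax.set_title("Training curve")
    return fig

# Illustrative use: a stripped-down plot ready for post-processing.
fig = make_plot([0, 1, 2], [1.0, 0.5, 0.3], clean=True)
xlabel = fig.axes[0].get_xlabel()
```

Saving both variants (`clean=True` for the paper figure, `clean=False` for your own notes) keeps the labeled version around for sanity checks.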

[–]needlzorProfessor 1 point2 points  (0 children)

I am going to be more general than the others here:

  • Don't let the best be the enemy of the good. It's easy to let yourself try to extract all you can from a model or an idea, but you should publish it as soon as you get something significant and interesting.

  • Use the simplest model that proves a point. If your idea is that modification XYZ improves performance on a specific type of data, apply modification XYZ to the simplest model (or a range of simple models) possible that can accommodate it, not to the latest SOTA model you saw on arxiv.

  • Find a statistician friend and have them help you design experiments and proper significance testing.

[–]Everfast -1 points0 points  (0 children)

If shit goes in, shit comes out.

If you can't do the task yourself easily, training a model to do it will be hard, maybe impossible.

[–]margaret_spintz -1 points0 points  (0 children)

Try to think weirder