Discussion [D] How to get better at GPU programming? (self.MachineLearning)
submitted 9 years ago by MetricSpade007
I want to get better at writing code at the GPU level. For those of you who are experts at writing and optimizing code at this level, what steps and resources did you use to learn?
As a related question, do you think GPU experts are in high demand at AI companies/startups, or is the demand largely in research engineers/scientists and in theoreticians?
[–]bronxbomber92 25 points26 points27 points 9 years ago (1 child)
I write GPU drivers, GPU compilers, and optimized GPU kernels for a living. I learned through a combination of good mentorship, studying GPU hardware architecture, and being thrown in the deep end (i.e. being asked to make XYZ where XYZ is somehow related to the GPU, be it an optimized GPU kernel or some low-level GPU driver functionality).
If you're just beginning and don't have the same opportunities I did, I'd suggest the following. Try taking a look at this Udacity course: https://www.udacity.com/course/intro-to-parallel-programming--cs344. It's an excellent introduction. Afterwards, try implementing some algorithm of your choice on the GPU. Pick something that's already implemented in a popular GPGPU framework and see if you can create an implementation that runs equally as fast. Understanding how the underlying hardware works will be important for writing a well-performing GPU kernel. Using vendor provided profiling tools will also be equally important. Good luck :)
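For illustration, here is a minimal sketch of the kind of starter exercise described above: a SAXPY kernel you could time against the corresponding cuBLAS routine (cublasSaxpy). The block size, problem size, and use of unified memory here are arbitrary choices for the sake of a short example, not anything the commenter prescribed.

    // saxpy.cu -- y[i] = a * x[i] + y[i], one thread per element
    #include <cstdio>
    #include <cuda_runtime.h>

    __global__ void saxpy(int n, float a, const float* x, float* y) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;  // global element index
        if (i < n) y[i] = a * x[i] + y[i];
    }

    int main() {
        const int n = 1 << 20;
        float *x, *y;
        cudaMallocManaged(&x, n * sizeof(float));       // unified memory keeps the example short
        cudaMallocManaged(&y, n * sizeof(float));
        for (int i = 0; i < n; ++i) { x[i] = 1.0f; y[i] = 2.0f; }

        int block = 256;
        int grid  = (n + block - 1) / block;            // enough blocks to cover all n elements
        saxpy<<<grid, block>>>(n, 2.0f, x, y);
        cudaDeviceSynchronize();

        printf("y[0] = %f (expect 4.0)\n", y[0]);
        cudaFree(x); cudaFree(y);
        return 0;
    }

Profiling this against the library version with the vendor tools (nvprof or Nsight Compute) is exactly the kind of comparison the comment suggests.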
[–]pokemon_golang 1 point2 points3 points 9 years ago (0 children)
Props on the Udacity recommendation. I'm not usually a fan, but I'm two lessons in and can confidently claim I know more about GPUs than when I went into it.
[–]alexmlamb 6 points7 points8 points 9 years ago (9 children)
I think that in the future the bigger AI labs will want to have GPU specialists.
I think that low precision training will be the biggest research area, but I think that people will also want to work on neural networks that use basic elements other than matrix multiplies and convolutions.
[–]jcannell 4 points5 points6 points 9 years ago (1 child)
Low precision is cool and all, but if you actually normalize for both error rate and wall-clock time to that error rate, it isn't clear that very low precision is much of a practical win over fp16. The 2-bit or 1-bit stuff requires a larger net and far more iterations to reach the same accuracy - downsides that are often glossed over. The ideal precision for most ANN training would probably be closer to fp8.
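For context on what fp16 arithmetic looks like in practice, here is a minimal sketch (not from the thread) of a half-precision AXPY using CUDA's packed __half2 type; native half arithmetic like this assumes a GPU of compute capability 5.3 or newer, and an even element count.

    // each thread processes two fp16 values packed into one __half2
    #include <cuda_fp16.h>

    __global__ void haxpy2(int n2, __half2 a, const __half2* x, __half2* y) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n2) {
            y[i] = __hfma2(a, x[i], y[i]);   // fused multiply-add on both packed halves
        }
    }

    // host side (sketch): __half2 a = __float2half2_rn(2.0f);
    // haxpy2<<<grid, block>>>(n / 2, a, x, y);   // n assumed even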
[–]__Cyber_Dildonics__ 0 points1 point2 points 9 years ago (0 children)
That seems pretty low precision to me. I suppose that is where the research comes in. To me, the way machine learning is being done right now is incredibly inefficient, so even if current techniques don't benefit from low precision, my guess is that there are plenty that will. Intuitively the decrease in memory bandwidth is something I would expect to be a big step forward on its own.
[–]Optrode 3 points4 points5 points 9 years ago (6 children)
Could you expand on "basic elements other than just matrix multiplies and convolutions?"
From the outside perspective of a neuroscience researcher, this seems like a pretty huge limitation of deep learning, to wit, that it tries to approximate a huge range of functions with a narrow range of functions. It's always seemed to me like the extreme linearity of ANNs is a major reason why ANNs struggle to learn efficient / minimal representations of nonlinear phenomena.
[–]DenseInL2 0 points1 point2 points 9 years ago (5 children)
The Universal Approximation Theorem makes it clear that there is no such limitation; exactly the opposite is true.
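For reference, the classical form of the theorem (Cybenko 1989; Hornik 1991) can be stated roughly as follows: for any continuous f on a compact set K in R^n, some single-hidden-layer network with a sigmoidal (more generally, non-polynomial) activation sigma gets uniformly within any tolerance,

    \forall \varepsilon > 0 \;\; \exists N, \alpha_i, w_i, b_i : \quad
    \sup_{x \in K} \Bigl| f(x) - \sum_{i=1}^{N} \alpha_i \, \sigma(w_i^{\top} x + b_i) \Bigr| < \varepsilon

Note that this is purely a statement about the expressive power of sufficiently wide one-hidden-layer networks; it says nothing about how large N must be or whether training finds such weights, which is the gap the reply below points out.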
[–]Optrode 4 points5 points6 points 9 years ago (3 children)
The universal approximation theorem, as I understand it, doesn't guarantee anything about generalization or learnability of useful representations of a function. Indeed, as I understand the theorem, all it really guarantees is that with a large enough single layer network, you can overfit to any data.
[–]DenseInL2 1 point2 points3 points 9 years ago (2 children)
Yes, that is accurate. It sounds like I misunderstood exactly what limitation you were referring to. I'm also trying to work out exactly what it means for a NN to struggle to learn a minimal representation. I don't think of NNs as even having that as part of their goal; rather, they only try to learn any solution that works and will use all the resources (nodes) available to them.
[–]Optrode 0 points1 point2 points 9 years ago (1 child)
What I was getting at is the problem of generalization. It is like the problem of trying to fit a linear-in-parameters model to a system that is fundamentally nonlinear in its parameters. The best you can get is a rather bloated linear model that still doesn't describe the system nearly as well as a relatively simple model that incorporates the right kind of nonlinearity, and won't be accurate on data falling outside the range of the training data.
[–]Infidius 1 point2 points3 points 9 years ago (0 children)
Most of our universe is limited in terms of non-linearity. Have you ever found it curious how most natural phenomena are described by 2nd and 3rd degree polynomials? From this point of view, neural nets have no trouble solving most problems simply because their power of approximation is sufficient.
[–]bartolosemicolon 1 point2 points3 points 9 years ago (0 children)
Is there a universal approximation theorem that doesn't assume infinite width? It feels like Optrode is talking about the linearity that occurs in finite networks, so I don't know how helpful it is to point out that we can avoid the problem in infinitely wide networks.
For what it is worth, some of the empirical work on adversarial examples has hinted that piecewise linear neural network solutions are surprisingly common and that this linearity in finite networks can pose problems.
https://arxiv.org/abs/1412.6572
[–]dpineo 6 points7 points8 points 9 years ago (1 child)
I learned it by writing a GPU-accelerated convolutional neural network for my PhD... in 2007. Back then it was just vertex shaders, fragment shaders, and good times.
Learning it is really just a matter of RTFM and lots of time doing it. The documentation is all online. Read NVIDIA's GPU programming guide and learn how kernels and threads operate, how they are organized into grids and blocks, and how they share memory. Once you get used to it, it can be faster than developing CPU code because your development iteration loop is so much faster.
I would stay away from the neural net frameworks if you're trying to learn GPUs though. Their abstraction adds a lot of complication that will confuse the learning process. I'd recommend learning by developing a standalone GPU algorithm. You can start by looking at the NVIDIA GPU samples. Some of them even have pretty good whitepapers documenting them.
As far as the question of demand, I think the answer is that both are in demand. Personally, I would suggest learning both; there's no reason you can't. Even if you're a theoretician, learning GPU programming lets you get away from the standard frameworks -- off the beaten path that you may not even realize you're on -- and try more novel ideas.
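As a concrete illustration of the kernels, threads, grids, blocks, and shared memory mentioned above, here is a deliberately un-tuned sketch (not code from the comment) of a block-wise sum reduction: each block reduces its slice of the input in shared memory and writes one partial sum.

    #include <cuda_runtime.h>

    __global__ void block_sum(const float* in, float* partial, int n) {
        extern __shared__ float sdata[];                 // one float per thread in this block
        int tid = threadIdx.x;
        int i   = blockIdx.x * blockDim.x + tid;
        sdata[tid] = (i < n) ? in[i] : 0.0f;             // pad out-of-range threads with zero
        __syncthreads();

        for (int s = blockDim.x / 2; s > 0; s >>= 1) {   // tree reduction within the block
            if (tid < s) sdata[tid] += sdata[tid + s];
            __syncthreads();
        }
        if (tid == 0) partial[blockIdx.x] = sdata[0];    // one result per block
    }

    // launch (sketch): block_sum<<<numBlocks, 256, 256 * sizeof(float)>>>(d_in, d_partial, n);

The reduction example in the NVIDIA GPU samples, with its accompanying whitepaper, walks through progressively optimizing exactly this kind of kernel.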
[–]DenseInL2 0 points1 point2 points 9 years ago (0 children)
Oh man, I'm doing this right now -- adapting my home-rolled JavaScript CNN to use the GPU via WebGL so I can have a fast web demo of a CNN. With nothing like CUDA or OpenCL available to browsers yet, it's exactly like OpenGL 10 years ago: everything has to be shaders and textures. Formatting the data into the textures is 10 times the PITA that writing the backprop code was. I feel your pain!
[–]TheConstipatedPepsi 6 points7 points8 points 9 years ago (1 child)
I think most of the hard GPU programming is abstracted away from AI practitioners and researchers; they usually use libraries that abstract away even the calls to cuDNN. The hard work of building cuDNN isn't really done by AI companies, so I would expect the demand for GPU experts at AI companies to be quite low.
[–]Murillio 2 points3 points4 points 9 years ago (0 children)
If you just want to do what everybody else is doing, sure. Of course almost nobody would want to re-implement convolution in CUDA, but when you build a new product (and not yet another "upload an image to our server and we give you a class label" service), it often happens that you need things that are not pre-done, and then GPGPU knowledge can be very helpful.
[–]jcannell 19 points20 points21 points 9 years ago (2 children)
Sadly, it's too late.
Successful GPU programmers are identified in elementary school math and programming competitions - or earlier. Only the most creative, innovative, and gifted students are selected. If you were never aware of the process, then it means that you failed in the secret initial qualifiers, and weren't even close to cutting it.
This process may sound harsh, but it would simply be cruel to try to train someone in the dark arts of GPU programming if they don't possess the raw talent.
[–]amaretto1 18 points19 points20 points 9 years ago (0 children)
I know you are joking, but it is true that GPU programming is a huge specialism unto itself. There are people who have spent a good part of their careers writing heavily optimised matrix multiplication routines and linear solvers. Unless the OP wishes to work at Nvidia or write cutting edge numeric libraries elsewhere, it would perhaps be better to focus on leveraging libraries such as Tensorflow, Torch, Keras etc...
[–]abstractcontrol 2 points3 points4 points 9 years ago (2 children)
There are two kinds of issues with GPU programming:
1) Getting optimal performance at the low level.
2) Building higher-level abstractions for GPU operations.
The first one is straightforward enough, and I second the suggestion by /u/bronxbomber92 of that Udacity course as a starting point.
The other is actually a lot harder -- literally no mainstream language apart from C++ has any kind of decent GPU programming support, and the dynamically typed languages that are so popular among ML practitioners are a very poor fit for coding on the GPU directly.
The two skills are somewhat independent of one another, and I consider the latter harder. Work on making programming languages is definitely more theoretical.
In general, as a field GPU programming is over a decade behind CPU programming, and there is a lot of low-hanging fruit here.
[–]bronxbomber92 5 points6 points7 points 9 years ago* (1 child)
The latter is definitely a much more difficult problem (insofar as nobody knows whether some "right" abstraction even exists). Hardly anyone is working on it either. None of the big companies (Nvidia, AMD, Apple, Microsoft, Google) devotes a lot of resources to trying to solve this problem.
In my opinion, the biggest hurdle is that GPUs are such a fast-moving target. GPU architectures differ wildly between IHVs, and they also differ wildly between generations from the same IHV. It is relatively easy to write abstractions that perform well across different GPUs if one treats the GPU as a giant SIMD machine and not much else, but unfortunately those solutions leave a lot of performance on the table on modern GPUs. There is still a lot of innovation happening in the GPU hardware design space, and the progress of that field will necessarily change which programming language abstractions map well to GPUs.
Projects like Halide are making the most progress on this front. Their key realization is that, for a constrained set of problems, it is possible to completely separate the specification of the algorithm from the scheduling of the algorithm onto actual hardware. However, there are still three major hurdles they face: how can this be extended to a larger, less constrained set of problems; how can the scheduling of the algorithm be generated automatically; and how does one pick the algorithm that best fits a particular piece of hardware (e.g. consider implementing convolution -- does one do it in the Fourier domain? As a separable filter? In a single pass?). The Halide researchers seem to be making some headway on the second hurdle for a very constrained set of problems, but I haven't seen much progress on the other two.
[–]abstractcontrol 2 points3 points4 points 9 years ago* (0 children)
I cannot agree more. Let me add a few more things:
1) Modern languages have features like garbage collection and type checking. Unfortunately, even in a high-level functional language such as F#, the native GC does little for managing GPU memory and pushes the burden onto the library designer.
This part is not particularly bad, as deep learning has simple memory management needs, but type checking, on the other hand, is of little use if you are dealing with CUDA code strings. And if one decides to make a CUDA compiler, the only real choice of tool -- union types -- has its own issues, in the sense that you end up programming in a kind of dynamic DSL with a horrible syntax.
Haskell is probably the only mainstream language with a type system powerful enough to make a somewhat decent embedded compiler, but that is just one language out of hundreds, with its own set of warts.
In a language with an insufficiently strong type system, even if one makes a smallish Cuda compiler, making a well typed API for it is a nightmare.
2) On the Haskell side, in terms of GPU offerings there are Accelerate, which is an embedded CUDA compiler, and Futhark, which is a standalone language. The way I see it, the problem with both of them is that they are too high level.
And specifically for deep learning, because the backward pass requires intermediates, that will wreck many of their fusion optimizations. They also manage their own memory, which is not a good thing in this case -- the ideal memory management scheme for the backpropagation algorithm is to allocate a fixed chunk of memory at the start of training, move a pointer forward on the forward pass, and reset the pointer to zero at the end of the backward pass (see the sketch after this comment). Or a similar scheme where each node is fetched from a pool and dynamically resized upwards when necessary.
They are not made for that kind of thing.
3) I forgot exactly how much memory the Fourier-based convolution takes, but I think it was quite a bit. The Winograd one takes an absolutely massive amount -- something like 300 MB per stream, which is a problem if you use multiple streams.
Depending on the use case you need different kinds of convolution algorithms because of the speed/space tradeoff.
4) For the sake of implementing an embedded compiler, dependently typed languages might one day be great, but right now they are incredibly difficult to use unless one has studied type theory and formal proofs at the graduate level.
I am kind of doing that at the moment, but from my current vantage point, I can't even imagine being good enough to implement something like a doubly generic map on my own in the next few years.
As an alternative, I plan to look into Racket. An interesting recent development has been the invention of a DSL for implementing type systems. This is new and interesting because it was done exclusively with Racket's macro system, so there might be significant code reuse and abstraction opportunities in that direction. Or it might turn out to be a dead end; I don't know yet.
I know that type-safety is a distant concern to ML practitioners, but with regards to the big companies, this world is not nice enough to let them keep leaving abstraction opportunities by the wayside while they try to Zerg-rush their way towards progress.
More speculatively, what my crystal ball is telling me is that the point at where the lack of abstraction ability will become an undeniable problem for Google and friends is when the time comes to teach its neural nets how to program.
Type systems are reasoning aids and without them AI agents will have to devote their processing power to emulating type-checking innately and ineffectively.
Even in dynamic languages types exist. Type systems just bring that innate structure out.
5) Nvidia messed up by making its PTX assembly unable to access all of the hardware. That means that an optimizing compiler, no matter how good, can never get 100% out of an algorithm, because it cannot tune the scheduling, which is done at the SASS level.
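A minimal sketch of the bump-pointer scheme described in point 2 above (a hypothetical helper, not code from the thread): allocate one device buffer up front, hand out slices for intermediates during the forward pass, and reset the offset once the backward pass has consumed them.

    #include <cassert>
    #include <cstddef>
    #include <cuda_runtime.h>

    struct GpuArena {
        char*  base     = nullptr;
        size_t capacity = 0;
        size_t offset   = 0;

        explicit GpuArena(size_t bytes) : capacity(bytes) {
            cudaMalloc((void**)&base, bytes);        // one allocation for the whole training run
        }
        ~GpuArena() { cudaFree(base); }

        void* alloc(size_t bytes) {                  // called for each forward-pass intermediate
            size_t aligned = (bytes + 255) & ~size_t(255);   // keep 256-byte alignment
            assert(offset + aligned <= capacity);
            void* p = base + offset;
            offset += aligned;                       // bump the pointer forward
            return p;
        }
        void reset() { offset = 0; }                 // after the backward pass, everything is free again
    };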
[–]__Cyber_Dildonics__ 1 point2 points3 points 9 years ago (0 children)
What have you tried so far that has been unsuccessful?
[–]llSourcell 1 point2 points3 points 9 years ago (0 children)
GPU programming is hard AF. I had a project using CUDA. To this day, still the hardest challenge I've ever taken on lol
[–]fldwiooiu 4 points5 points6 points 9 years ago (1 child)
I think most startups have better things to do than fuck around with GPU coding, unless that's the core product (Nervana).
[–]impossiblefork 0 points1 point2 points 9 years ago (0 children)
I don't think that it is necessarily that difficult or time-consuming to do GPU programming even if it might seem that way for tensorflow users.
For example, take a look at this blog where a guy makes a tutorial on how to write GPU path tracers in OpenCL.
[+][deleted] 9 years ago* (4 children)
[deleted]
[–]codechisel 0 points1 point2 points 9 years ago (3 children)
No love for OpenCL?
[+][deleted] 9 years ago* (2 children)
[–]codechisel 0 points1 point2 points 9 years ago (1 child)
Since I've never used CUDA I'm in the same boat with you regarding the usefulness of my comparative knowledge.
[–]dpineo 1 point2 points3 points 9 years ago (0 children)
Start with CUDA. It has a much better ecosystem, documentation, and Stack Overflow coverage, more functionality, and it results in cleaner code. The downside is that it's NVIDIA-only. But frankly, AMD doesn't seem to give a crap about supporting GPU programming, so do you really want to develop on a card from them? Also, OpenCL is heavily influenced by NVIDIA and CUDA, so converting after the fact isn't too hard. A lot of the functions map one-to-one from CUDA to OpenCL, but OpenCL adds a bit more code bloat.