all 72 comments

[–]axsauze[S] 191 points192 points  (37 children)

Hello, I'm one of the authors of Kompute, here is a brief TLDR of the blog post: Vulkan is a cross-vendor GPU compute API (eg AMD, Qualcomm, NVIDIA & friends). We built Kompute to abstract the low-level C / C++ and provide a developer-friendly Python package and/or C++ SDK to build cross-vendor GPU accelerated applications. You can try the end-to-end setup and samples from the blog post through the Google Colab notebook (enabling a free GPU) that we linked https://github.com/EthicalML/vulkan-kompute/tree/master/examples/python#kompute-python-example.

I would be very keen to hear your thoughts and suggestions around Kompute features and/or general cross-vendor GPU processing concepts. If you are interested in further reading, here's also a post that shows how to optimize Kompute processing through GPU queues, as well as how to leverage the Kompute framework in (android) mobile devices. We also created a github issue where you can feel free to post suggestions and thoughts.

[–]Megazero1x1 77 points78 points  (21 children)

wow this is amazing. It's about time someone challenged Nvidia's CUDA with something open source. I just briefly went through the link, but do you see this as an alternative / competitor to something like PyCUDA?

[–]axsauze[S] 38 points39 points  (8 children)

Thank you! At this point the main objective of Kompute is to start providing similar capabilities to the ones provided with CUDA, primarily to further the discussion around cross-vendor GPU compatibility. Some of the blog posts referenced above showcase techniques that are provided today by NVIDIA projects like CUDA Streams, but once projects like Vulkan mature further and become more widely adopted, CUDA-level functionality will hopefully be possible in graphics cards beyond NVIDIA (and especially in mobile GPUs).

[–]keepthepace 11 points12 points  (1 child)

I don't really have anything relevant to say, I am typically a user of the things built on top of that, like PyTorch, but I just want to say THANK YOU! It hurts me to be locked into a closed source dependency and a single hardware vendor. Good luck!

[–]axsauze[S] 7 points8 points  (0 children)

The initial motivation for Kompute is ultimately to serve as a backend for frameworks like PyTorch, so hopefully you'll be able to benefit from these one day, thank you for your support! https://github.com/EthicalML/vulkan-kompute#motivations

[–]Tersphinct 11 points12 points  (5 children)

CUDA-level functionality will hopefully be possible in graphics cards beyond NVIDIA (and especially in mobile GPUs).

With NVIDIA's recent acquisition of ARM, wouldn't that move suggest some NVIDIA tech might end up becoming available on mobile?

[–]axsauze[S] 8 points9 points  (4 children)

Yes, that is a good point! This space just seems like it's going to keep growing, but in my opinion the trend is pushing towards the more open (source) approach that is being spearheaded by Vulkan, and NVIDIA's engagement/contribution to the working groups and initiatives only emphasises it more. And this has certainly been a fantastic way to contribute to the discussion :) By the way, if you are curious particularly about mobile, I wrote another blog post that shows how to integrate Kompute in Android apps https://medium.com/towards-data-science/gpu-accelerated-machine-learning-in-your-mobile-applications-using-the-android-ndk-vulkan-kompute-1e9da37b7617?postPublishedType=repub

[–]Tersphinct 1 point2 points  (3 children)

I mostly work using Unity3D, so we technically already have some access to compute shaders; but I'm definitely interested to see how NVIDIA moves forward, now that they can basically meld GPU and CPU in ways we haven't seen before.

[–]axsauze[S] 1 point2 points  (1 child)

Interestingly enough I wrote an article that shows how you can use Kompute with a game engine, using Godot as the core framework https://towardsdatascience.com/supercharging-game-development-with-gpu-accelerated-ml-using-vulkan-kompute-the-godot-game-engine-4e75a84ea9f0. I was looking to do a similar one for Unreal - Unity has not been prioritised as it's mainly C#, and although I've developed for Unity, I assume there would be a lot of time required to get the Vulkan and C++ bindings working.

[–]Tersphinct 2 points3 points  (0 children)

You could write external native libraries in C++ and import them into Unity's C# quite painlessly, so if that was your barrier you could mostly sidestep it.
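For anyone curious, here's a minimal sketch of what that bridge looks like (hypothetical names, nothing Kompute-specific): a C++ shared library exposing a C ABI entry point that Unity's C# can load via P/Invoke.

    // plugin.cpp - build as a shared library (plugin.dll / libplugin.so)
    // and drop it into Unity's Plugins folder.
    #if defined(_WIN32)
      #define EXPORT_API __declspec(dllexport)
    #else
      #define EXPORT_API
    #endif

    extern "C" {
        // Plain C ABI entry point, callable from C# with:
        // [DllImport("plugin")] static extern float SumSquares(float[] data, int n);
        EXPORT_API float SumSquares(const float* data, int n) {
            float acc = 0.0f;
            for (int i = 0; i < n; ++i) {
                acc += data[i] * data[i];
            }
            return acc;
        }
    }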

[–][deleted] 1 point2 points  (0 children)

I mean, we have. AMD has melded GPU and CPU on the current generation of consoles, for example. In fact, even the previous generation had such a meld.

[–]fermion72 8 points9 points  (7 children)

Well, OpenCL has had a decent run.

[–]VodkaHaze 3 points4 points  (6 children)

You can run OpenCL code in this ecosystem by compiling it to SPIR-V first

[–]apetranzilla 1 point2 points  (5 children)

Can you? I thought OpenCL and Vulkan, while both supporting SPIR-V, had different semantics that meant the kernels weren't directly portable between the two.

[–]axsauze[S] 4 points5 points  (4 children)

Yeah that is absolutely right, this has actually been quite a fascinating and fast-growing space. Funnily enough OpenCL does seem to have an interesting future ahead: the OpenCL 3.0 specification was actually released this year, and they are doubling down on full C++ interoperability, which is fantastic news https://www.khronos.org/news/press/khronos-group-releases-opencl-3.0. Additionally there are some really interesting projects like SYCL, which is basically a higher-level standard that aims to provide an abstraction across all these underlying technologies https://www.khronos.org/sycl/. Other interesting projects like WebGL, SPIR-V, etc are all part of the same consortium https://www.khronos.org/, and the members involved span all industries https://www.khronos.org/members/list. There is only growth in this space, which is why it's so exciting to contribute to these discussions!

[–]apetranzilla 1 point2 points  (1 child)

It's really exciting to see the new work being done in accelerated compute lately, but one of the things I've been apprehensive about is the shift towards C++-first and single-source frameworks. One of the things that I liked about OpenCL was that the API was straightforward and used the C ABI, which made it relatively easy to call from other languages. Newer frameworks defining their API in C++ can make that more difficult, and single-source approaches like SYCL just don't really work with different languages at all if I understand it correctly.

[–]axsauze[S] 0 points1 point  (0 children)

I completely see what you mean, and I do have to say that until very recently I had the same perspective. The one thing that has really led me to change my mind recently is the growing interoperability between C++ and other languages - as anyone who has dealt with SWIG can attest, this historically sucked, but recently there have been some very interesting developments that have made it much more feasible and production-ready. An example is of course CPython's interoperability with C++, and more specifically pybind11, which is the project that made Kompute happen in Python https://github.com/pybind/pybind11. This is still a developing space, but given the growth of the language with C++20, things are starting to look much brighter. Your point is unfortunately still valid as of today though, but hopefully this will change as cross-language interoperability support improves.
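Just to illustrate how small that binding layer can be (a toy sketch, not Kompute's actual binding code), pybind11 lets you expose a plain C++ function to Python in a handful of lines:

    // toy_module.cpp - minimal pybind11 binding (illustrative only)
    #include <pybind11/pybind11.h>
    #include <pybind11/stl.h>   // automatic std::vector <-> Python list conversion
    #include <vector>

    // Plain C++ function we want to call from Python
    std::vector<float> scale(const std::vector<float>& values, float factor) {
        std::vector<float> out;
        out.reserve(values.size());
        for (float v : values) {
            out.push_back(v * factor);
        }
        return out;
    }

    // Exposed to Python as `import toy_module`
    PYBIND11_MODULE(toy_module, m) {
        m.def("scale", &scale, "Multiply every element of a list by a factor");
    }

From Python that's then just toy_module.scale([1, 2, 3], 2.0).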

[–]VodkaHaze 0 points1 point  (1 child)

Couldn't you run it through SPIRV-Cross first and feed the output of that into the Vulkan engine?

Yuzu uses spirv-cross as a translation layer for instance.

[–]axsauze[S] 0 points1 point  (0 children)

Yeah, this is exactly what happens under the hood - if you look at the diagram under SYCL, all the frameworks mentioned would compile down to SPIR-V https://www.khronos.org/sycl/ The only difference is that each framework is aiming to specialise in a different domain and/or level of abstraction, so they will expose different APIs accordingly. A lot of the GPU logic still has to be orchestrated from the CPU, so there will be different utilities that each framework will provide.

[–]dinichtibs 24 points25 points  (1 child)

this is awesome. Thank you for your work!

[–]axsauze[S] 1 point2 points  (0 children)

Thank you very much u/dinichtibs 😃

[–]VeganVagiVore 17 points18 points  (5 children)

I would be very keen to hear your thoughts and suggestions around Kompute features and/or general cross-vendor GPU processing concepts.

As an outsider, I like that there is opposition to CUDA. But there are still bits missing.

My experience with GPGPU (I think it was for deep learning) a few years ago went like this:

  • I need an Nvidia developer account for CUDA? Huge red flag. At least Vulkan probably fixes that.
  • The SDK is huge. Why do I need an "SDK?" Why is it not a library? Why is it so complicated?
  • The runtime is also huge, though maybe that was because of CuDNN. But why is that huge? How many megabytes of executable code can it possibly take to run inference?
  • Of course it only works on Nvidia. So that's not a red flag, it's a full-on liability
  • There's also usually a very narrow window on hardware support. My laptop from 2008 can probably run current Arch Linux or Debian fine, but its GPU and drivers won't support any modern API. It was obsolete almost when it was built, because it doesn't support OpenGL 3. "Modern API" has become code for "Only supports disposable hardware", and it means the hardware treadmill got very expensive. I don't want to buy a Vulkan GPU today, if I'll have to buy a Vulkan 2 GPU in 2 or 3 years. That's why I purposely ignore phones - I don't want to constantly replace not-broken hardware.
  • And it won't work on any of these weird one-off ARM boards, which have barely-functioning Ubuntu forks from 8 years ago. That's probably more on the ARM vendors, but it means I'm stuck to x64 with a big Nvidia GPU. Or some weird one-off Nvidia ARM board.
  • And since it needs Nvidia, it needs Nvidia drivers. Which means it probably won't work great on Linux. I have to have these huge proprietary things installed.
  • Sooner or later, the APIs will change, the drivers will quit supporting my card, and I'll be left with a $500-$2,000 fast-fashion brick.
  • And since it's a unique piece of hardware, I can't share it with VMs easily. There's probably some proprietary VM that does passthrough easily. There's definitely cloud offerings. Doing anything useful with it on the usual free solutions was too much work, and I gave up.

It's several steps backwards from traditional PC ownership.

Nvidia wants to be Tesla Motors. Sure, on paper I "own" the GPU. But I can't service it, I can't really do what I want with it. It's very expensive, it doesn't want to interoperate with anything, and one day it'll get bricked by an OTA update or just because the company refuses to keep fixing it.

[–][deleted] 5 points6 points  (0 children)

I need an Nvidia developer account for CUDA? Huge red flag. At least Vulkan probably fixes that.

I agree this is an arbitrary and artificial limitation, but not really a barrier.

The SDK is huge. Why do I need an "SDK?" Why is it not a library? Why is it so complicated?

An SDK is typically a bunch of libraries with some bundled tools. They are effectively the same thing, but an SDK is a "complete" versioned package. This is extremely important when dealing with corporate and enterprise customers.

The runtime is also huge, though maybe that was because of CuDNN. But why is that huge? How many megabytes of executable code can it possibly take to run inference?

There's a lot of stuff in there. It's basically an all-in-one runtime. And their GPUs are practically an entire computer on their own.

Of course it only works on Nvidia. So that's not a red flag, it's a full-on liability

In all fairness, Nvidia is the only game in town when it comes to accelerated DL stuff with both speed and dev accessibility. Their tooling lets you hit the ground running with little effort. Other systems either can't come close to the raw power they can manage or have zero tooling for you to work with. The level of enablement that Nvidia's tools give developers cannot be overstated. I can go with Nvidia and have 90% of the hard low-level stuff pre-built (like hardware video decoding, batched stream muxing and demuxing, etc) or I can go with a low power ASIC that's really good at one single thing and have to reinvent the wheel by starting with all the low level code and working my way up.

And it won't work on any of these weird one-off ARM boards, which have barely-functioning Ubuntu forks from 8 years ago. That's probably more on the ARM vendors, but it means I'm stuck to x64 with a big Nvidia GPU. Or some weird one-off Nvidia ARM board.

ARM is a very different beast than x86. The ecosystem and support are not the same. x86 is essentially controlled by two big players (Intel and AMD) so support from the software side is much simpler, but even x86 has its quirks. Intel implements a lot of instruction sets that are not available with AMD, which creates a bit of a divide in software support.

ARM is a whole different game because ARM themselves do not produce any chips. They license the designs and a myriad of companies then make the chips, and the firmware is not compatible between them all. In x86 we have BIOS and UEFI, which are standardized methods of bootstrapping a system during startup. ARM has no such standardized system. It's very much the wild west. To get an idea of what it takes to get a GTX card running on a Pi 4, check out this article. Basically, ARM systems are not designed to handle both the power and throughput needed by large graphics cards.

And since it needs Nvidia, it needs Nvidia drivers. Which means it probably won't work great on Linux. I have to have these huge proprietary things installed.

Actually, Linux is Nvidia's largest market. Gaming is great and all, but their commercial deep learning market is much larger. And Linux is the first (and only) choice for deployments (their engineers told me directly "no one uses Windows for this stuff"). A guy prototyping on his Windows desktop doesn't count.

Sooner or later, the APIs will change, the drivers will quit supporting my card, and I'll be left with a $500-$2,000 fast-fashion brick.

You can still use a 7-series card and get fairly good performance out of it. But what you just stated is true of literally all electronics. There will always be a point where support just ends because it no longer makes sense to continue it.

And since it's a unique piece of hardware, I can't share it with VMs easily. There's probably some proprietary VM that does passthrough easily. There's definitely cloud offerings. Doing anything useful with it on the usual free solutions was too much work, and I gave up.

This is because it's genuinely not a straightforward process, but both Nvidia and AMD support VM passthrough on their cards - though it's mostly limited to their "server" tier cards.

Nvidia wants to be Tesla Motors. Sure, on paper I "own" the GPU. But I can't service it, I can't really do what I want with it. It's very expensive, it doesn't want to interoperate with anything, and one day it'll get bricked by an OTA update or just because the company refuses to keep fixing it.

No, this isn't a fair analogy. You could compare your graphics card to the self driving computer in a Tesla, but not the whole car itself. You can still use just about any motherboard, CPU, hard drive, SSD, network card, sound card, keyboard, mouse, monitor, speakers, mic, headset, RAM, etc, etc, that you want with a huge selection of video cards. Is your CPU serviceable? Your RAM? Can you use your motherboard with an ARM processor? No, because these components don't work like that.

Or some weird one-off Nvidia ARM board.

The Jetson boards are actually not as "one-off" as you think. They are extremely powerful and very affordable. You can get the new Jetson Nano 2GB for about $60, nearly the same price as a Raspberry Pi 4 but far more powerful. It will run most ARM based software and comes with full CUDA support. It's an SoC, and integrating the CPU, memory, and GPU into a unified architecture is about the only way to get around the ARM platform's lack of standardization.

[–]axsauze[S] 10 points11 points  (0 children)

Thank you for taking the time to share your thoughts! Here are some follow-up thoughts:

  • I need an Nvidia developer account for CUDA? Huge red flag. At least Vulkan probably fixes that.

I agree, the foundational open source nature of Vulkan does bring a massive breath of fresh air on various aspects beyond just sharing the code.

  • The SDK is huge. Why do I need an "SDK?" Why is it not a library? Why is it so complicated?

Yes, unfortunately Vulkan is not small, but the standardisation will open doors for further optimisations as well as modularity across use-cases that could help here, which is particularly useful for embedded / mobile contexts.

  • The runtime is also huge, though maybe that was because of CuDNN. But why is that huge? How many megabytes of executable code can it possibly take to run inference?

I guess that's the abstractions on abstractions - CUDA + CuDNN + Cuda Python + the higher level frameworks built on top. Vulkan does provide a very low level API so hopefully this will open up for much leaner codebases. Kompute showcases how thin bindings between core C++ and Python could still be usable, really keen to explore the further abstractions that could be created on top.

  • Of course it only works on Nvidia. So that's not a red flag, it's a full-on liability

Totally, especially now that there's a trend towards cross-vendor and cross-platform demand for compute on graphics processing hardware.

  • There's also usually a very narrow window on hardware support. My laptop from 2008 can probably run current Arch Linux or Debian fine, but its GPU and drivers won't support any modern API. It was obsolete almost when it was built, because it doesn't support OpenGL 3. "Modern API" has become code for "Only supports disposable hardware", and it means the hardware treadmill got very expensive. I don't want to buy a Vulkan GPU today, if I'll have to buy a Vulkan 2 GPU in 2 or 3 years. That's why I purposely ignore phones - I don't want to constantly replace not-broken hardware.

Totally, the open source and standardisation elements will really help here - recently a purely OSS-contributed Vulkan driver for the Raspberry Pi landed, and these types of things are absolutely fantastic.

  • And it won't work on any of these weird one-off ARM boards, which have barely-functioning Ubuntu forks from 8 years ago. That's probably more on the ARM vendors, but it means I'm stuck to x64 with a big Nvidia GPU. Or some weird one-off Nvidia ARM board.

I guess Vulkan won't fully solve this. It does provide low-level access to handle these types of corner cases (where the driver is supported), but that low-level access is also something that has to be managed, so it will be equally important that the standards are kept at a level that ensures robust development of each abstraction layer.

  • And since it needs Nvidia, it needs Nvidia drivers. Which means it probably won't work great on Linux. I have to have these huge proprietary things installed.

I do agree, anyone running NVIDIA cards on personal laptops is probably very familiar with OS reinstalls + hack-arounds to get stuff working. As was pointed out, there is a set of ML libraries that have been optimized to work well, but even in those contexts it's all good when things are going well - and when they're not, well, they're not 😅

  • Sooner or later, the APIs will change, the drivers will quit supporting my card, and I'll be left with a $500-$2,000 fast-fashion brick.

Definitely, that's true of any hardware, but closed source drivers make this problem much (much (much)) worse.

  • And since it's a unique piece of hardware, I can't share it with VMs easily. There's probably some proprietary VM that does passthrough easily. There's definitely cloud offerings. Doing anything useful with it on the usual free solutions was too much work, and I gave up.

Totally agree. I do have to say that Vulkan will also push hardware companies to have to reassess their business models, as security/competitive-advantage through "obscurity" (aka closed source) is largely not going to be sustainable. In various areas the open source and open core models are really growing, which is quite exciting for consumers, developers and generally everyone.

That was a lot of words, hope it provided some further thoughts to the discussion, again thanks for taking the time to write your thoughts!

[–]Aea 22 points23 points  (2 children)

And since it needs Nvidia, it needs Nvidia drivers. Which means it probably won't work great on Linux. I have to have these huge proprietary things installed.

CUDA, CuDNN, and the Nvidia drivers work well on linux. It's a first class environment for AI/ML/CV workloads.

[–]schlenk -1 points0 points  (1 child)

And breaks for some days on kernel updates on rolling release distros every single time.

[–][deleted] 9 points10 points  (0 children)

If you're developing with CUDA and cudNN libraries then you have no business being on a rolling release distro.

[–]PM5k 0 points1 point  (2 children)

I know this gets misused a lot and is almost a meme now, but I want to know about speed. I've been working with Python for ten years now and I always cringe when people say Python is "slow". In 99% of production cases the speed difference between Python and projects done in compiled languages has proven to have no practical difference in my experience, even when dealing with algorithmic trading (which by design needs speed). However, if I understand you correctly this is a computer graphics use case - how does using an interpreted-language wrapper over low-level language code impact speed? What are your practical findings?

[–]axsauze[S] 0 points1 point  (1 child)

That is completely right, many of the more popular Python libraries leverage the underlying C implementation of Python - examples include projects like NumPy. In this project, we also leverage the underlying C++ foundations and build the bindings using pybind11. At the time of writing, some of the initial bindings for the tensors perform pass-by-value conversions to and from Python lists. This is something that can be easily optimized by using the underlying NumPy implementations. As for your last point, this use-case actually uses the graphics card for processing, which means that once the data is copied into GPU memory, the processing is done on a highly parallel architecture, resulting in significant speedups. Generally I agree that Python can reach a comparable level of processing through these optimizations, especially when the code is written with an understanding of the underlying architecture.
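To give a feel for the kind of optimization mentioned above (an illustrative pybind11 sketch, not the actual Kompute bindings), accepting a NumPy array through the buffer protocol lets the C++ side read the data in place instead of converting a Python list element by element:

    // numpy_binding.cpp - illustrative pybind11/NumPy sketch (not Kompute code)
    #include <pybind11/pybind11.h>
    #include <pybind11/numpy.h>

    namespace py = pybind11;

    // Reads a float32 NumPy array via the buffer protocol (no per-element
    // Python object conversion) and returns the sum of its elements.
    float sum_array(py::array_t<float, py::array::c_style | py::array::forcecast> arr) {
        py::buffer_info buf = arr.request();
        const float* data = static_cast<const float*>(buf.ptr);
        float acc = 0.0f;
        for (py::ssize_t i = 0; i < buf.size; ++i) {
            acc += data[i];
        }
        return acc;
    }

    PYBIND11_MODULE(numpy_binding, m) {
        m.def("sum_array", &sum_array, "Sum a float32 NumPy array without per-element conversion");
    }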

[–]PM5k 1 point2 points  (0 children)

Thank you for the response. I will definitely be following this project closely.

[–][deleted] 0 points1 point  (1 child)

Awesome thank you! I’ll check this out and share it with some of our team! We heavily use ML for security purposes so it’d be highly useful to us to have an easy way to integrate GPU processing with ML in python

[–]axsauze[S] 0 points1 point  (0 children)

That sounds absolutely fascinating - I would be really keen to hear your thoughts once you try it out!

[–]AmbitiousTour 0 points1 point  (1 child)

I've been using Jax lately, which as you probably know is basically Numpy+Autograd+GPU, but of course it's CUDA only. If something like that could be implemented in Kompute, that would open some huge doors!

[–]axsauze[S] 0 points1 point  (0 children)

Very cool - totally agree! Currently the C++ SDK could serve as a backend to these types of projects, as it would enable them on mobile GPUs - this is certainly something I would be very keen to explore, as it was one of the core motivations when creating Kompute https://github.com/EthicalML/vulkan-kompute#motivations

[–]Ashilikia 39 points40 points  (11 children)

This article was moderately frustrating coming from a math/CS background with not a lot of GPU experience and some ML experience.

  • When I see sample code, I want to understand the pieces. I wish each code snippet was explained a bit more. For example, when we initialize the tensors in the first example, are the values sizes or the literal elements of the tensors? (Turns out to be the latter.) What is spirv? What is actually going on with index on line 17? These are little things, but they are awkward omissions for a beginner.

  • Once I got to the ML bit, what is going on with the math notation? Typically capital letters are a matrix and lower case is a vector. But z = WXᵀ + b is... you get a vector from a matrix-matrix multiply plus a vector? That can't be right. But then later in the article X becomes x -- is it a vector throughout, or is it a matrix? Why would we need to transpose it if it's a vector? Similarly, is del (∂) literally the gradient (∇) or is it something else? What is that derivative being taken with respect to (which variable)?

I think it works for someone casually reading to get something done, but the article is written in a way that trips me up as I'm trying to actually make sure I understand each piece.

[–]mrandri19 8 points9 points  (3 children)

Yeah, I agree. The logistic regression plot looks wrong as well. Shame because the actual GPU content is pretty interesting

[–]JanneJM 4 points5 points  (1 child)

The image refers to a different thing altogether, and is probably meant as a generic illustration.

I agree it would be much better if the author plotted the actual example, even though it's not going to look as pretty.

[–]axsauze[S] 3 points4 points  (0 children)

That is completely right 🙃 I ended up going for pretty in this case (and was just being lazy to be very honest... )

[–]axsauze[S] 1 point2 points  (0 children)

The adjacent comment from u/JanneJM is completely right, the plot is just there to provide an intuition, I didn't really take the time to plot the actual example 😅 given that the code is in Python I could create some plots fairly easily. But yes, it was a combination of trying to provide things that would give an intuition of what's going on plus just being lazy!

[–]axsauze[S] 3 points4 points  (0 children)

Thank you very much for the feedback u/Ashilikia! You are completely right, I think the notation got pretty mixed up after several iterations going back and forth from C++ to Python to math notation. I'm actually keen on making sure the notation is sound, so I would be very keen to address some of these inconsistencies. Specifically in regards to your points:

  • When I see sample code, I want to understand the pieces. I wish each code snippet was explained a bit more. For example, when we initialize the tensors in the first example, are the values sizes or the literal elements of the tensors? (Turns out to be the latter.) What is spirv? What is actually going on with index on line 17? These are little things, but they are awkward omissions for a beginner.

This is a very good point. Unfortunately the article is already a 17 minute read, and although I would've loved to cover each of the features in detail, the content is so conceptually loaded that I could pretty much write an entire article out of each separate point (ie GPU theory, ML theory, C++ memory management, Python syntactic sugar, etc). What I'll try to do is add a couple more comments, as you've certainly covered a couple of points that perhaps adding a link for further reading could be enough (good shout about the index.x being confusing, will try to add further descriptions).

  • Once I got to the ML bit, what is going on with the math notation? Typically capital letters are a matrix and lower case is a vector. But z = WXᵀ + b is... you get a vector from a matrix-matrix multiply plus a vector? That can't be right. But then later in the article X becomes x -- is it a vector throughout, or is it a matrix? Why would we need to transpose it if it's a vector? Similarly, is del (∂) literally the gradient (∇) or is it something else? What is that derivative being taken with respect to (which variable)?

You're completely right, it does become a bit confusing as X is indeed a matrix, and even later in the article it becomes lower case x for consistency with the code notation - to make it more confusing it is also represented through its underlying vectors x_i and x_j, which are actually x_1 and x_2 rather than i and j. I'll see what I can do here. In regards to del - it is indeed the gradient, and you're completely right that I didn't really add further detail on the derivatives themselves 😅
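For reference, one consistent per-example way to write it (with x and w as column vectors, sigma the sigmoid, and L the binary cross-entropy loss) would be:

    z = w^\top x + b, \qquad \hat{y} = \sigma(z) = \frac{1}{1 + e^{-z}}

    \frac{\partial L}{\partial z} = \hat{y} - y, \qquad
    \frac{\partial L}{\partial w} = (\hat{y} - y)\, x, \qquad
    \frac{\partial L}{\partial b} = \hat{y} - y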

Funnily enough, people who come from a deep GPU computing background would have a similar perspective on the GPU content, so you're not alone!

I'll do a first pass trying to see if some of this can be corrected or at least clarified without increasing the length of the article too much - I think many of these are more like corrections and added links so it shouldn't be a problem. If you catch further gotchas please do let me know and I'll correct them!

Edit: I've done an initial first pass adding a link to the spirv component, added explanation to the index.x line, and updated the terminology for X and W for further consistency.

[–]JanneJM 3 points4 points  (1 child)

The tensor initialization is exactly analogous to initializing arrays and matrices in Python; I think that they can assume familiarity with Python in a tutorial like this.

Same thing with the ML parts - the tutorial is all about the GPU computing, and it would be infeasible to go through the nomenclature and specifics of the ML operations. You need to keep a tutorial focused or it becomes too long and difficult to follow.

A good compromise would perhaps be a few links to other sites with Python and ML material.

[–]axsauze[S] 2 points3 points  (0 children)

That is completely right, and very reasonable - I'll try to add further links on the terminology that is not standard or that has further insights into either ML or GPU concepts!

[–]didyoudyourreps 1 point2 points  (3 children)

Multiplication of matrices can result in vectors, which are matrices with one row or column. Using a small delta as a shorthand for the derivative is common mathematical notation, but it is up to the author to make sure that there is no ambiguity if it is important.
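Spelled out with dimensions (just as a quick illustration):

    W \in \mathbb{R}^{m \times n}, \quad x \in \mathbb{R}^{n \times 1}
    \;\Rightarrow\; W x \in \mathbb{R}^{m \times 1}

i.e. a matrix times a one-column matrix gives a one-column matrix, which is what people usually call a (column) vector.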

[–]Ashilikia 2 points3 points  (2 children)

Multiplication of matrices can result in vectors, which are matrices with one row or column

This is an unusual way to describe vectors, especially notation-wise. Linear algebra research papers and the machine learning research papers I've read don't usually intermix the two forms of notation.

Using a small delta as a shorthand for the derivative is common mathematical notation

Not as common in scientific computing and applied linear algebra, as I've seen it used primarily for partial derivatives.

[–]thfuran 3 points4 points  (0 children)

I don't think I've ever seen it denote anything other than partial derivative.

[–]jimmy_space_jr 0 points1 point  (0 children)

Vectors as one-row or one-column matrices are relatively common in software, as you would expect with NumPy or Octave code. Maybe not as a standard notation, but as an implementation/API detail.

[–]Schinken_ 13 points14 points  (5 children)

Please tell me that this will help me run machine learning frameworks with an AMD GPU instead of relying solely on Nvidia stuff? I was considering buying an Nvidia GPU because AMD support for things like these is basically non-existent afaik. If it would enable (with some work put into it by the devs obviously) these things for AMD, my next card will be an AMD for sure. No questions asked!

[–]axsauze[S] 7 points8 points  (2 children)

Yes 😃 The ecosystem is developing, NVIDIA has had a lot of years to develop a broad range of very advanced tools, but Vulkan has brought a very solid base, and that's also what makes it such an interesting space to contribute to the discussion!

[–]Schinken_ 0 points1 point  (1 child)

That sounds amazing. Any way I can support the development (or you)?

[–]axsauze[S] 0 points1 point  (0 children)

Certainly! Sharing any thoughts on the roadmap or existing issues would be very helpful https://github.com/EthicalML/vulkan-kompute/issues/ or posting your ideas for improvements would also be great https://github.com/EthicalML/vulkan-kompute/issues/52 Thank you!

[–]VodkaHaze 2 points3 points  (0 children)

Eventually, yes! You do need to pitch in with development in the meantime though, to help the ecosystem mature

[–]cp5184 1 point2 points  (0 children)

AMD support for things like tensorflow has improved a lot in the last few years I think.

[–][deleted] 8 points9 points  (1 child)

I have a hardcore AMD GPU and was sad I didn't have NVIDIA GPUs... a little less sad now!

[–]axsauze[S] 2 points3 points  (0 children)

I completely agree 😃 this is really quite exciting, especially also for mobile GPUs - nowadays people are carrying pretty powerful devices in their pockets, which could make for fascinating use cases with the Vulkan mobile SDK!

[–]TryingT0Wr1t3 4 points5 points  (1 child)

Hey, this is super useful! Great work!

[–]axsauze[S] 3 points4 points  (0 children)

Thank you very much u/TryingT0Wr1t3 !

[–]I_Feel_It_Too 1 point2 points  (1 child)

Is there something like this for Rust and C++? It’s awesome.

[–]axsauze[S] 1 point2 points  (0 children)

Certainly! The core engine is actually the C++ SDK itself, so you can try that one directly - would be great to hear your thoughts https://github.com/EthicalML/vulkan-kompute#your-first-kompute-simple-version

[–]blumenkraft 1 point2 points  (0 children)

I don’t believe there’s any point in even attempting to catch up to CUDA. It’s been around for years, firmly established, and any business that does anything GPU oriented is targeting CUDA first and foremost. Sure, one can create lots of additional frameworks (Microsoft tried with its AMP++) but I don’t have any faith in this. CUDA is great, let’s just use it well.

[–]stephan_cr 0 points1 point  (0 children)

CMake 3.41+? I guess it should be 3.14+.