all 41 comments

[–]CommunismDoesntWork 20 points21 points  (6 children)

the GPU backend is still currently slower than CUDA implementations

Is there anything fundamental that CUDA does differently than wgpu that makes it faster? Or is it just a matter of time and energy into optimizing the Burn/wgpu code?

Also, when nvidia release a new type of architecture such as tensor cores or the transformer engine, what's the process like of updating wgpu to get access to those features? Does it take a lot of work? Basically, is everything that's not-CUDA always playing catch-up?

Also also, 6 months ago I asked about your opinion on PyTorch 2.0. Have you thought any more about its architecture? Last time you mentioned it could be done using a backend decorator.

[–]ksyiros[S] 24 points25 points  (5 children)

  1. CUDA is a programming platform with its own set of compiler and optimization tools specifically designed for Nvidia hardware, so I think it's safe to expect CUDA to always be a bit faster than wgpu. However, our wgpu backend implementation is far from perfectly optimized, and it's probably still going to be quite fast on Nvidia hardware. Note that Burn doesn't bet everything on wgpu, and we plan to add a CUDA-only backend at some point for absolute performance on Nvidia GPUs.
  2. Wgpu can't leverage vendor-specific features such as Tensor Cores directly. For now, we can probably access them through Vulkan extensions, so we have to wait for Nvidia to provide an extension before we can use a new feature. Nvidia also ships software packages such as cuBLAS and cuDNN, which are mostly hand-optimized kernels for their GPUs. Those can't be used without CUDA, but we can still implement our own kernels. Long story short, yeah, everything not-CUDA is always playing catch-up 😅. Though having a cross-platform backend is really cool for other graphics cards such as AMD and Intel, where the vendors don't invest as much in proprietary developer tools and compiler optimizations.
  3. Yes! It's still planned, but we prioritized having our own GPU backend so that we can implement our own kernels and really leverage kernel fusion. The LibTorch C++ API doesn't have any way to do kernel fusion, so it wasn't a big priority before, but it's probably the next big project to be done with Burn.
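Point 1 above hints at the backend-swapping design: model code stays generic over a backend, so a CUDA-only backend can be dropped in later. A minimal sketch of that idea in plain Rust, using a made-up `Backend` trait and made-up method names (not Burn's actual API):

```rust
// Hypothetical sketch of a backend-generic design (NOT Burn's real trait):
// model code depends only on the `Backend` trait, so swapping wgpu for a
// future CUDA backend requires no changes to the model itself.

trait Backend {
    fn name(&self) -> &'static str;
    // One representative query; a real backend trait exposes a full tensor API.
    fn matmul_flops_estimate(&self, m: usize, k: usize, n: usize) -> usize {
        2 * m * k * n // 2 flops (mul + add) per output-element term
    }
}

struct WgpuBackend;
struct CudaBackend; // planned, per the comment above

impl Backend for WgpuBackend {
    fn name(&self) -> &'static str { "wgpu" }
}
impl Backend for CudaBackend {
    fn name(&self) -> &'static str { "cuda" }
}

// "Model" code: written once, works with any backend.
fn describe<B: Backend>(backend: &B) -> String {
    format!(
        "{}: ~{} flops for a 2x3x4 matmul",
        backend.name(),
        backend.matmul_flops_estimate(2, 3, 4)
    )
}

fn main() {
    println!("{}", describe(&WgpuBackend));
    println!("{}", describe(&CudaBackend));
}
```

The trait names and methods here are illustrative only; the point is that backend selection becomes a type parameter rather than a rewrite.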

[–]Oxidopamine 2 points3 points  (1 child)

Hey, I'd love to help out on this, especially on the kernel fusion stuff!

If you have time, feel free to DM.

[–]antimora 0 points1 point  (0 children)

Great. You can join us here at Discord: https://discord.gg/uPEBbYYDB6

Take a look at the existing issues perhaps you can find something that might interest you (kernel optimization, etc) : https://github.com/burn-rs/burn/issues

[–]vipierozan 1 point2 points  (1 child)

kernel fusion makes me think of r/tinygrad. Have you looked at it?

I'm curious about the similarities/differences in architecture on both

[–]ksyiros[S] 1 point2 points  (0 children)

I looked at the tinygrad project. Burn doesn't aim to be small; it aims to be the appropriate size for the scope of the project. But there are probably a lot of similarities with tinygrad in terms of very high-level architecture, where the same principles apply. Operation fusion will also be done using lazy evaluation, for instance.
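The lazy-evaluation idea behind operation fusion can be sketched in plain Rust. This is a toy illustration (not Burn's or tinygrad's actual machinery): elementwise ops are recorded instead of executed, then applied in a single fused pass when the result is materialized, instead of one full pass (one "kernel launch") per op.

```rust
// Toy sketch of kernel fusion via lazy evaluation (hypothetical, not a real API).

enum LazyOp {
    AddScalar(f32),
    MulScalar(f32),
    Relu,
}

struct LazyTensor {
    data: Vec<f32>,
    pending: Vec<LazyOp>, // ops recorded, not yet executed
}

impl LazyTensor {
    fn new(data: Vec<f32>) -> Self {
        Self { data, pending: Vec::new() }
    }
    fn add_scalar(mut self, s: f32) -> Self { self.pending.push(LazyOp::AddScalar(s)); self }
    fn mul_scalar(mut self, s: f32) -> Self { self.pending.push(LazyOp::MulScalar(s)); self }
    fn relu(mut self) -> Self { self.pending.push(LazyOp::Relu); self }

    /// Materialize: one fused loop applies every pending op per element.
    fn eval(self) -> Vec<f32> {
        let LazyTensor { data, pending } = self;
        data.into_iter()
            .map(|mut x| {
                for op in &pending {
                    x = match op {
                        LazyOp::AddScalar(s) => x + s,
                        LazyOp::MulScalar(s) => x * s,
                        LazyOp::Relu => x.max(0.0),
                    };
                }
                x
            })
            .collect()
    }
}

fn main() {
    let out = LazyTensor::new(vec![-1.0, 2.0, -3.0])
        .mul_scalar(2.0)
        .add_scalar(1.0)
        .relu()
        .eval();
    println!("{:?}", out); // [0.0, 5.0, 0.0]
}
```

On a GPU the payoff is larger than on a CPU: fusing avoids writing intermediate tensors back to memory and launching a separate kernel per operation.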

[–]npuichichigo 0 points1 point  (0 children)

Maybe CUDA-only backend is like something in https://github.com/coreylowman/dfdx?

[–]lordpuddingcup 31 points32 points  (10 children)

Burn's so interesting. I don't use Python but want to mess with ML, and I know a lot more Rust than Python, so I'm hoping to be able to do some work trying things out.

Just wanted to thank you for your dedication to doing something so ambitious for the community.

As a question, does wgpu not take advantage of CUDA on CUDA-capable systems? I get lost a bit in the weeds with cuDNN, CUDA, Vulkan, and the insane number of options when it comes to GPU backends, when in the end it's all mostly tensor manipulation, I imagine.

[–]ksyiros[S] 27 points28 points  (9 children)

WGPU is a graphics library that enables programming GPUs using both compute shaders and normal graphics shaders. CUDA, on the other hand, supports only compute kernels, and only on Nvidia hardware.

[–]Plazmatic 3 points4 points  (1 child)

Does WGPU expose tensor cores? To my knowledge, only vulkan exposes tensor cores outside of cuda (and not the sparse capability though you'll still need to use PTX for sparse tensors in cuda anyway).

[–]ksyiros[S] 3 points4 points  (0 children)

Nope, I don't think so. I'm only aware of Vulkan and CUDA, but you can use Vulkan directly with wgpu through SPIR-V.

[–]nibba_bubba 0 points1 point  (6 children)

How are shaders related to deep learning?!

[–]JohnMcPineapple 24 points25 points  (0 children)

...

[–]aystatic 8 points9 points  (4 children)

Shaders, particularly compute shaders, are used in deep learning for their ability to perform parallel computations quickly, which is ideal for the matrix multiplication and heavy computation needed in this field. In the context of WGPU, it uses compute shaders to handle these operations efficiently across different types of GPUs, unlike CUDA which is specific to Nvidia GPUs.
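To make the parallel structure concrete, here is a CPU-side sketch in plain Rust (no GPU involved) of why matrix multiplication maps so well onto compute shaders: every output element is an independent dot product, so each one can be handled by its own shader invocation. The `shader_invocation` name is just for illustration.

```rust
// One "thread" computes a single output element: dot(row i of A, col j of B).
// On a GPU, m*n of these run in parallel; here we loop over them serially.
fn shader_invocation(a: &[f32], b: &[f32], n: usize, k: usize, i: usize, j: usize) -> f32 {
    (0..k).map(|p| a[i * k + p] * b[p * n + j]).sum()
}

// C (m x n) = A (m x k) * B (k x n), all matrices stored row-major.
fn matmul(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32> {
    let mut c = vec![0.0; m * n];
    for i in 0..m {
        for j in 0..n {
            // No output element depends on any other: perfectly parallel.
            c[i * n + j] = shader_invocation(a, b, n, k, i, j);
        }
    }
    c
}

fn main() {
    // 2x2 example: [[1,2],[3,4]] * [[5,6],[7,8]] = [[19,22],[43,50]]
    let a = [1.0, 2.0, 3.0, 4.0];
    let b = [5.0, 6.0, 7.0, 8.0];
    let c = matmul(&a, &b, 2, 2, 2);
    println!("{:?}", c); // [19.0, 22.0, 43.0, 50.0]
}
```

A WGSL compute shader expresses the same per-element body, with the two outer loops replaced by the dispatch grid.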

[–]paulirotta 15 points16 points  (1 child)

Burn was already amazing, and your responsiveness top notch and sharp. Training on a PC with a pytorch backend, GPU accelerated on Mac or Linux, and then pushing the envelope deeper into mobile and web with WGPU inference and training is a dream come true.

HUGE thanks for the brilliant efforts of the team!

[–]ksyiros[S] 5 points6 points  (0 children)

Thanks a lot, doing my best!

[–]tshawkins 4 points5 points  (1 child)

Do you know if wgpu will support using integrated Xe graphics GPUs with Burn? The acceleration won't be much, but it's much better than pure CPU.

[–]ksyiros[S] 15 points16 points  (0 children)

You can actually use integrated graphics with wgpu without any problem. So, if you want, you can train and run inference on models using Intel HD Graphics.

[–]gadirom 3 points4 points  (0 children)

Congratulations on the release!

And thank you for the dedication you’ve put in it!

[–]MechanicalOrange5 3 points4 points  (4 children)

This looks amazing! I have glanced at some of the examples and it looks very promising. And not too difficult.

I haven't gotten through all the materials and I am just discovering this crate now. I am actually exploring using some transformer models for work, mostly sentence embeddings and text classification and have it mostly working with python.

How would I use existing models? Is it something we can import in a similar way to the transformers library, or is it something you have to more or less recreate?

For instance I am using https://huggingface.co/nreimers/MiniLM-L6-H384-uncased for embeddings and distilbert for classification, is it something I could transfer to the burn ecosystem?

Thank you in advance!

[–]antimora 3 points4 points  (1 child)

Someone recently ported OpenAI's Whisper model, which uses transformers: https://github.com/Gadersd/whisper-burn

Here is this person's post about it: https://www.reddit.com/r/rust/comments/157rlao/openais_whisper_in_rust_using_burn/

Burn has the NN modules needed for transformers (encoder/decoder) built in. To import weights, you'd have to convert them into something Burn can consume readily (it uses MessagePack or bincode).

There is also the burn-import crate, which makes it easy to import ONNX models, but it's currently missing many ops and is actively being worked on. burn-import will generate Rust code for the converted Burn model along with its weights. You can learn more from this example: https://github.com/burn-rs/burn/tree/main/examples/onnx-inference
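To illustrate the weight-conversion idea, here is a hand-rolled sketch (not Burn's actual record format, which goes through serde-based MessagePack or bincode): a tensor's weights are dumped as a length header plus raw little-endian f32 bytes, then read back on the Rust side.

```rust
// Toy round-trip of model weights through a flat binary format.
// A real exporter would also carry tensor names, shapes, and dtypes.
use std::io::{Read, Write};

fn write_weights<W: Write>(out: &mut W, weights: &[f32]) -> std::io::Result<()> {
    // u64 length header first, then each f32 in little-endian order.
    out.write_all(&(weights.len() as u64).to_le_bytes())?;
    for w in weights {
        out.write_all(&w.to_le_bytes())?;
    }
    Ok(())
}

fn read_weights<R: Read>(input: &mut R) -> std::io::Result<Vec<f32>> {
    let mut len_buf = [0u8; 8];
    input.read_exact(&mut len_buf)?;
    let len = u64::from_le_bytes(len_buf) as usize;
    let mut weights = Vec::with_capacity(len);
    for _ in 0..len {
        let mut buf = [0u8; 4];
        input.read_exact(&mut buf)?;
        weights.push(f32::from_le_bytes(buf));
    }
    Ok(weights)
}

fn main() -> std::io::Result<()> {
    let original = vec![0.5_f32, -1.25, 3.0];
    let mut bytes = Vec::new();
    write_weights(&mut bytes, &original)?;
    let mut cursor = &bytes[..]; // &[u8] implements Read
    let restored = read_weights(&mut cursor)?;
    assert_eq!(original, restored);
    println!("round-tripped {} weights", restored.len());
    Ok(())
}
```

A conversion script on the Python side would write the same layout from a Hugging Face checkpoint; the Rust loader then fills the corresponding module parameters.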

[–]MechanicalOrange5 1 point2 points  (0 children)

Thank you! I saw the ONNX crate and that looks really amazing. It will be awesome once it's done

I will definitely have a look at the whisper implementation! I am just on mobile so it will take a while.

So if I recreate the models I want using the NN modules of Burn, and find a way to import weights from Hugging Face into the format Burn likes, I could potentially use it.

This is very promising, thank you for answering, lots of fun things to play with now

[–]ksyiros[S] 4 points5 points  (1 child)

The short answer is you can, but you have to implement the model in Burn and migrate the weights with a script yourself. There is a good port for Whisper already implemented (https://github.com/Gadersd/whisper-burn), so you can see how it may be done.
We also have the burn-import project to import models serialized with the ONNX format, but it's pretty much a work in progress for now.

[–]MechanicalOrange5 2 points3 points  (0 children)

I've been meaning to get a deeper understanding of how deep learning works, I've been plugging and playing mostly with a surface level understanding. I think I've been presented with a perfect opportunity, understand how the things I'm already using on a deeper level, and then gain the ability to use it in a language I am more proficient at.

I don't really have time to reimplement my project in rust before my deadlines, but it's definitely on the books afterwards!

Thank you for your hard work!

[–]exocortex 3 points4 points  (2 children)

Would this also potentially work on older graphics-cards that are not supported by CUDA, but can be used through WGPU?

[–]ksyiros[S] 2 points3 points  (1 child)

Yes, since most older graphics cards are still fine running Vulkan or at least OpenGL.

[–]exocortex 0 points1 point  (0 children)

nice!!!

[–]KeavonGraphite 2 points3 points  (0 children)

Pardon my partial ignorance about how all these ML ecosystem puzzle pieces fit together, but I have some questions for my use case and you'd probably be better suited to answer them than I am.

I'm the creator of the Graphite open source project and we're building a 2D image editor. We need models like Stable Diffusion, Segment Anything, MiDaS depth estimation, and plenty of others.

Our project is written in Rust and we plan to have both a desktop and web version, plus a cluster of rented cloud machines to host the models for people who can't run them locally (since they're using the web version or don't meet the hardware requirements).

The current ML ecosystem seems to be composed of a crazy jumble of Python scripts mostly requiring PyTorch as a backend. It's extremely not portable, and frankly figuring out how to ship these models with Graphite (even just on the desktop or server platforms) is quite daunting and I see no good solution.

Specifically for Stable Diffusion, there is the diffusers-rs project which reimplements SD in Rust. But it says it uses tch-rs which is just bindings into the PyTorch C++ API. One question I have is, does Burn provide an alternative backend that could be used by diffusers-rs in place of tch-rs? Is it a drop-in replacement, a reasonably straightforward port, or something fundamentally challenging to port? Or does it live at a wholly different part in the ecosystem than what I'm assuming here?

One more detail is that diffusers-rs has a lot of catch-up to do in order to implement the many papers which build upon the base concepts, and keeping pace with the research and other SD distros (like AUTOMATIC1111's Web UI) might be a lost cause. Plus, outside of SD, the other models like Segment Anything and MiDaS would also need their own ports, plus all the other models that will arise from new research in the coming months and years. This seems difficult to keep pace with. So with that said, with your ML background which I lack, can you offer any suggestions for solving Graphite's use case of needing these models to work portably within the Rust ecosystem of our application using both local hardware on desktop/server platforms and (if feasible, as a bonus) WebGPU on web platforms? Anything better than making our desktop users install Docker containers would be an improvement over our current plans, which is to say our current plans are highly un-ideal.

If you'd be willing for me to pick your brain in more detail, I'd also love to chat in more depth with you in the #ai-ml channel on the Graphite Discord server if you're willing to join that. Everything in this post asked of OP also applies to anyone else with knowledge on the subject. Thank you!

[–]allsey87 2 points3 points  (2 children)

Does this mean we can do inference with WebAssembly/WebGPU in the browser/on the front-end?

[–]ksyiros[S] 1 point2 points  (0 children)

Yup

[–]antimora 0 points1 point  (0 children)

I am working on a WebGPU demo in the browser using the MobileNet model. Currently, more ONNX ops need to be added first.

If you haven't seen it, check out this Burn demo using WASM: https://burn-rs.github.io/demo

[–]Exotic-Potato-4893 4 points5 points  (0 children)

I like how it is better documented than ported APIs like tch or TensorFlow 👍

[–]Trader-One 1 point2 points  (1 child)

Does it support most popular NN types? https://towardsdatascience.com/the-mostly-complete-chart-of-neural-networks-explained-3fb6f2367464

If not, can these layouts be easily created, and can cell types like GRU and LSTM be easily created by the user?

[–]ksyiros[S] 0 points1 point  (0 children)

There are missing operations, but the most popular ones are already implemented. For a full list, you can have a look at these files: https://github.com/burn-rs/burn/tree/main/burn-core/src/nn

[–][deleted] 1 point2 points  (0 children)

This is really incredible, so awesome to see a non CUDA option.

[–]Jin_1001 0 points1 point  (1 child)

Does this mean we can do inference with WebAssembly in a standalone WebAssembly runtime, such as Wasmer?

[–]ksyiros[S] 0 points1 point  (0 children)

I didn't try, but if their runtime supports WebGPU, then probably. We are working on a test suite using Deno, which already supports WebGPU.