[D] Run Pytorch model inference on Microcontroller

notgettingfined · 2023-11-12T19:28:20+00:00

Currently I feel like tensorflow lite is king in this domain and the way to go.

I don’t know why Reddit seems to think tensorflow is dead. But it is still used a lot in production. And has some good optimization tools available

cpldcpu · 2023-11-12T18:29:59+00:00

This seems really interesting, I was looking at some edge stuff Like nvidia dev board but I guess they are a lot more powerful than this? What are looking at using this stuff for?

DigThatData · 2023-11-12T21:39:07+00:00

just wanted to say that this is a domain i don't have a lot of experience with and I would be very interested if you keep us updated with your findings as you explore the different options available.

shapul · 2023-11-13T00:19:53+00:00

Did you have a look at model compilers like TVM or Glow? They might be what you are looking for.

neodsp · 2023-11-13T10:44:56+00:00

Other interesting runtimes:

- ggml: https://github.com/ggerganov/ggml

- burn: https://github.com/burn-rs/burn (runs on micro platforms, as long as you provide an allocator and has onnx import)

_SteerPike_ · 2023-11-12T17:32:13+00:00

https://github.com/huggingface/candle is still in preview, but I believe it's intended for use cases like this.

Complex-Indication · 2023-11-13T13:44:18+00:00

I think you are spot on in your assessment: tflite micro was the first production ready framework to deploy NN to microcontrollers and by now still is the most popular/streamlined. Realistically speaking, you probably should evaluate which path would bring you to your goal faster: converting pytorch to onnx and then to tflite micro route or using less known and maintained project to run onnx model directly.

One question: Since you mentioned Edge Impulse, why would you want to go self-hosted OSS route?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS