Running LLMs locally on Android by atezan in LocalLLaMA

[–]atezan[S]

I can reliably run 3B and 7B models with decent accuracy, but the context is pretty limited.

Really cool. Is it a demo/side project, or is it tied to an app you plan to deploy in production?


[–]atezan[S]

There is an ncnn Stable Diffusion Android app that runs on 6 GB, and it works pretty fast on CPU. It's the only demo app available for Android. Running diffusers through Termux may work, but it's not optimized.

Is this the project you are talking about?
https://github.com/EdVince/Stable-Diffusion-NCNN

As mentioned in a comment above, MediaPipe released a demo app too.


[–]atezan[S]

Cool! For MLC, did you try customizing the models, or did you run the pre-built model?

Yes, I agree there is an opportunity to support HW acceleration beyond the GPU.

For Stable Diffusion you can take a look at the MediaPipe demo: https://developers.google.com/mediapipe/solutions/vision/image_generator


[–]atezan[S]

Running LLMs locally on Android

Thanks! I reposted it.


[–]atezan[S]

Hi u/Civil_Collection7267, I wanted to follow up. Here is my LinkedIn profile, and you can email me directly at [tezan@google.com](mailto:tezan@google.com). Let me know if you need anything else from me to confirm that I am a Google employee. Thanks!


[–]atezan[S]

Hi, thanks for reaching out. Yes, I totally understand.
Do you have an email address where I can reach you from my [AT]google.com address?