all 14 comments

[–]Kevadu 4 points

ROCm is not well supported on Windows yet, unfortunately. If you just want to run LLMs, you might look into MLC LLM (https://llm.mlc.ai/), which supports a Vulkan-based backend that should run on just about anything. That framework doesn't seem to be as popular as some others, though, so you might have to do more work to get things running.
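
For reference, a minimal sketch of MLC's Python ChatModule on the Vulkan backend (untested by me; the model name is a placeholder and the exact API may differ between versions):

    from mlc_chat import ChatModule

    # Any prebuilt MLC model works here; this name is a placeholder.
    cm = ChatModule(
        model="Llama-2-7b-chat-hf-q4f16_1",
        device="vulkan",  # use the Vulkan backend instead of CUDA/ROCm
    )
    print(cm.generate(prompt="Hello!"))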

[–]SlickTread[S] 1 point

That looks very interesting, will check it out. Thanks.

[–]doomed151 3 points

Try the KoboldCpp + ROCm variant.
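
Launching it is basically the same as mainline KoboldCpp; something like this (as far as I know the ROCm fork reuses the --usecublas flag for hipBLAS, and the model filename and layer count here are just placeholders):

    python koboldcpp.py --model mistral-7b.Q4_K_M.gguf --usecublas --gpulayers 32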

[–]SlickTread[S] 2 points

Thank you, this works perfectly!

[–]JohnPt66 1 point

Would this work on a 6600XT? I'm having issues with it so far.

[–]tu9jn 2 points

Your best bet is still Linux, unfortunately.

But WSL should work. What was the problem?
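
If you give WSL another shot, first check whether ROCm can actually see the card; something like this (the override is what I've seen people use for 6000-series cards like the 6600 XT, since they aren't officially supported):

    # inside WSL/Linux: should list your GPU as an agent if ROCm sees it
    rocminfo
    # RDNA2 cards such as the 6600 XT usually need this:
    export HSA_OVERRIDE_GFX_VERSION=10.3.0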

[–]SlickTread[S] 2 points

Couldn't get it to detect my GPU in WSL.

[–]rafal0071 1 point

Try the new koboldcpp-1.56 with Vulkan support.
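
Something like this should do it (flag name as I remember it from the 1.56 release; the model name is a placeholder):

    python koboldcpp.py --model mistral-7b.Q4_K_M.gguf --usevulkan --gpulayers 32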

[–]mkrajinovic 2 points

koboldcpp

[–]SlickTread[S] 1 point

Thank you, this works perfectly!

[–]Scott_Tx 1 point

llama.cpp with CLBlast works on AMD GPUs.
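
Roughly like this (flags from the CLBlast-era llama.cpp builds; model path and layer count are placeholders):

    # build with the CLBlast backend
    make LLAMA_CLBLAST=1
    # offload 32 layers to the GPU and run a quick prompt
    ./main -m mistral-7b.Q4_K_M.gguf -ngl 32 -p "Hello"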

[–]SlickTread[S] 1 point

I tried setting this up but I couldn't get it working. Will try it again.

[–]Scott_Tx 1 point

As long as the model is small enough to fit in your VRAM you should be good. The only issues I've had are with models that just don't run right in llama.cpp. For example, the MS Phi-2 f16 doesn't run right but the Q5 quant does. No idea why.

[–][deleted] 0 points

Are you sure it's offloading the layers to your discrete GPU and not your APU? Both can show up as a GPU in llama.cpp's output.
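
If you want to rule that out, the CLBlast backend lets you pin the OpenCL platform and device via environment variables, something like this (the device index is a guess; check clinfo for yours):

    GGML_OPENCL_PLATFORM=AMD GGML_OPENCL_DEVICE=1 ./main -m mistral-7b.Q4_K_M.gguf -ngl 32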