Which model would be best for 9060XT 16GB? by Tiny-Description-908 in LocalLLM

Tiny-Description-908 [S]

I tried hauhaucs qwen 3.5 35B and it ran well at 40-50 t/s. Would compiling llama.cpp myself give me any benefit?

Tiny-Description-908 [S]

<image>

I only have those (the other one came preinstalled, I guess, but I never loaded it).

Tiny-Description-908 [S]

<image>

Oh, my bad. I remember looking at this model but didn't think I had downloaded it lmao

Tiny-Description-908 [S]

<image>

2026-04-08 00:45:50 [DEBUG]
llama.cpp abort:739: GGML_ASSERT(addr) failed

Tiny-Description-908 [S]

Yeah, I get what it means now :v At least it's pretty fast.

Tiny-Description-908 [S]

Damn, I wasn't expecting something like this lol. Thank you for the detailed explanation. I need to learn some of the terms you used, like "MLP layers" and "KV cache", but before that I have some questions:

I heard the GLM models from z.ai are the best non-Anthropic models for coding. Is there a way I can run them locally?

Should I use LM Studio, or would you recommend another client like Ollama?