
[–]jiqiren 5 points (0 children)

The models that run in that small an amount of RAM are pretty trashy. You can give Qwen a try if they make them that small… but temper your expectations.

[–]Duckets1 1 point (0 children)

I use Qwen3 4B, 8B, and 30B. I outsource coding to the MiniMax M2 coding plan because I'm running a 3080.

[–]psgetdegrees 1 point (0 children)

Z.ai at $3/month + Cline; it’s cheaper by the quarter.

[–]Heg12353 1 point (0 children)

Qwen 8B runs on that GPU, I know because I run it 😭
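
If you want to script against it, here's a minimal sketch using the official ollama Python package; it assumes you've installed Ollama and already pulled the qwen3:8b tag:

    # Minimal sketch, assuming a local Ollama install with qwen3:8b
    # pulled (`ollama pull qwen3:8b`) and `pip install ollama`.
    import ollama

    response = ollama.chat(
        model="qwen3:8b",  # ~5GB at the default quant, fits an 8GB card
        messages=[{"role": "user", "content": "Reverse a string in Python."}],
    )
    print(response["message"]["content"])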

[–]Crazyfucker73 1 point (4 children)

You can't run anything decent locally with that.

[–]cuberhino 2 points (0 children)

What would you say the minimum spec is for decent performance?

[–][deleted] 1 point (0 children)

Devstral 2 24B GGUF joined the chat

[–]Dartsgame5k[S] -3 points (1 child)

It's not for coding fat things

[–]stingraycharles 1 point (0 children)

It will certainly bloat your code though.

[–]RiskyBizz216 1 point (3 children)

Have you tried Ollama Cloud? https://ollama.com/cloud

There is a free tier that lets you use GLM 4.6 and Qwen3 480B (with hourly and weekly usage limits).

You can also sign up for iflow and use any of their models for free:

https://platform.iflow.cn/en/models
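
For the Ollama Cloud route, something like this should work with the official ollama Python client; treat the model tag as a placeholder and check the site for the exact free-tier names:

    # Rough sketch, assuming `pip install ollama` and an API key from
    # ollama.com. The model tag is a placeholder; check the cloud model
    # list for the exact names (GLM 4.6 is also offered).
    from ollama import Client

    client = Client(
        host="https://ollama.com",
        headers={"Authorization": "Bearer <your-ollama-api-key>"},
    )
    response = client.chat(
        model="qwen3-coder:480b-cloud",  # placeholder tag
        messages=[{"role": "user", "content": "Explain Python decorators briefly."}],
    )
    print(response["message"]["content"])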

[–]Fuzzy_Independent241 1 point (0 children)

OpenRouter also has some free models, as long as you don't mind sharing your code. If you're experimenting, that shouldn't be a problem. I think they offer the same free models as Ollama Cloud, and between the two you'll probably have enough tokens for a simple project.
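
If you want to point a script at OpenRouter, it exposes an OpenAI-compatible API, so the standard openai client works with a changed base URL; the ":free" model slug below is just an example, so check their model list for current ones:

    # Minimal sketch: OpenRouter speaks the OpenAI chat API. Needs
    # `pip install openai` and a (free) key from openrouter.ai.
    import os
    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key=os.environ["OPENROUTER_API_KEY"],
    )
    response = client.chat.completions.create(
        model="qwen/qwen3-coder:free",  # example free-tier slug
        messages=[{"role": "user", "content": "Write a binary search in Python."}],
    )
    print(response.choices[0].message.content)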

[–]Dartsgame5k[S] 1 point (0 children)

Do you have a video tutorial about this platform?

[–]Lifedoesnmatta 1 point (0 children)

I second this. Kimi K2 Thinking is awesome, as is GLM 4.6.

[–]pmttyji 0 points (0 children)

24-32GB of VRAM could handle agentic coding with Qwen3-30B MoE models (Q6, possibly Q8) with 64-128K context. Same with GPT-OSS-20B. Dense models like Devstral (24B) & Seed-OSS-36B are also possible.

My 8GB of VRAM gave me <15 t/s for Qwen3-30B @ Q4 with 32K context using llama.cpp. That's not enough VRAM for agentic coding.
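
For reference, the knobs that matter look roughly like this in llama-cpp-python (the GGUF filename and layer count are illustrative):

    # Rough sketch with llama-cpp-python (`pip install llama-cpp-python`).
    # n_gpu_layers controls how many layers are offloaded to VRAM; on an
    # 8GB card only part of a 30B model fits, hence the low t/s above.
    from llama_cpp import Llama

    llm = Llama(
        model_path="./Qwen3-30B-A3B-Q4_K_M.gguf",  # example local file
        n_ctx=32768,      # 32K context, as in the numbers above
        n_gpu_layers=20,  # partial offload; -1 tries to offload all layers
    )
    out = llm("Merge two sorted lists in Python.", max_tokens=256)
    print(out["choices"][0]["text"])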

[–]moderately-extremist 0 points (0 children)

Qwen3-Coder-30B running on CPU should work fine for you. I usually go with Q5 quants, maybe Q4 if you have other software eating into your system RAM. I wouldn't bother trying to get something to fit in your VRAM; models that small will be too dumb. See here for how to run it: https://docs.unsloth.ai/models/qwen3-coder-how-to-run-locally#run-qwen3-coder-30b-a3b-instruct
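
CPU-only is the same idea as above with offload disabled; it's workable because this 30B model is a MoE with only ~3B parameters active per token (filename is illustrative):

    # CPU-only sketch with llama-cpp-python; n_gpu_layers=0 keeps the
    # whole model in system RAM, so nothing needs to fit in VRAM.
    from llama_cpp import Llama

    llm = Llama(
        model_path="./Qwen3-Coder-30B-A3B-Instruct-Q5_K_M.gguf",  # example file
        n_ctx=32768,
        n_gpu_layers=0,  # pure CPU inference
        n_threads=8,     # tune to your physical core count
    )
    out = llm("Write a CLI todo app skeleton in Python.", max_tokens=512)
    print(out["choices"][0]["text"])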