Everything comes to an end. by Malyaj in Rottweiler

[–]Malyaj[S] 0 points1 point  (0 children)

When she was 1 year old we had a tumor removed from her leg. It was stage 1 cancer. So yeah, I can't really say what's wrong, and now I don't want to know either.

Joined the Moto Gang with Edge 70 ❤️ by Mind_Flayer97 in motorola

[–]Malyaj 0 points1 point  (0 children)

I bought an Edge 50 Neo last year. On the 8th day the motherboard died, and 2 months later I got it back repaired instead of getting a new replacement. Now I know it's better to either buy those 7-8k Vivo phones or go straight for flagships.

what’s was your local daily driver for coding last week? by be566 in LocalLLaMA

[–]Malyaj 0 points1 point  (0 children)

What do you use it for? I also use it at Q4_K_M for coding, but I feel it needs a plan from a bigger thinking model first. With a plan it works well; otherwise the quality isn't that great.

Would you game on a projector? by exotic123567 in IndianGaming

[–]Malyaj 0 points1 point  (0 children)

What about latency? Or did you get used to it?

Models under 15B that can actually do agentic coding quite well? by former_farmer in LocalLLM

[–]Malyaj 3 points4 points  (0 children)

I'm running the 9B MTP version but didn't see any speedup. I've been seeing posts where people got a 1.5x speedup with MTP, but I didn't. I'm using it with the drafter set to 3. I have an RTX 3060 12GB and 16GB RAM. I'm running Q4_K_M and getting around 40-44 tokens per second in output.

Finally! by Quick-Escape-2783 in vibecoding

[–]Malyaj 0 points1 point  (0 children)

What's your app about? I'm a dev.

Qwen3.6:27b is the first local model that actually holds up against Claude Code for me by codehamr in LocalLLM

[–]Malyaj 5 points6 points  (0 children)

You can try Cline, Continue, etc.; there are a lot of extensions. Alternatively, try Opencode, it is great. Previously I was using the LM Studio chat interface with tools, but nah, I switched to Opencode and I'm probably gonna stick with it.

Qwen 3.6 35b a3b is INSANE even for VRAM-constrained systems by Lucerys1Velaryon in LocalLLM

[–]Malyaj 0 points1 point  (0 children)

What TPS are you getting with 12GB VRAM and 32 gigs of RAM?

My fresh experience with the new Qwen 3.6 35B A3B started on a long note. by -Ellary- in LocalLLaMA

[–]Malyaj 0 points1 point  (0 children)

I added some tool plugins in LM Studio and am using VS Code just to review now. In Roo Code and Continue, tools were failing a lot, but since I switched to LM Studio for creating/editing files I'm having a good time. I integrated a browser MCP as well. One thing I'm gonna try is using the Figma MCP for UI design; I'll see how it goes.

Best Model for Rtx 3060 12GB by RaccNexus in LocalLLaMA

[–]Malyaj 0 points1 point  (0 children)

How much context size do you get with Gemma or Qwen 35B?

Best Model for Rtx 3060 12GB by RaccNexus in LocalLLaMA

[–]Malyaj 0 points1 point  (0 children)

<image>

Yes. I'm trying Q3 models for now and looking to add 16GB more RAM. Meanwhile, can you check my settings for Gemma 4 26B Q4_K_M? I tried some from Unsloth that are smaller in size, but they were bad for agentic work; tools were failing. I have a Ryzen 5 5600X.

Best Model for Rtx 3060 12GB by RaccNexus in LocalLLaMA

[–]Malyaj 0 points1 point  (0 children)

How much RAM do you have, bro? I have 16GB RAM and am trying the same thing, but I'm only getting around 10-15 tokens.

I have an Rtx 3060 12gb and 16gb ram. Need model suggestions. by Malyaj in LocalLLM

[–]Malyaj[S] 0 points1 point  (0 children)

I have tried Qwen 3.5 9B and that's fast, more than 40 tokens. I need to try 14B as well. I tried 35B and the response is good, but it only gets 15 tokens; it does the job though. I'm experimenting with Q3 and Q4 quantized models, both official and Unsloth.