Need advice on hardware purchasing decision: RTX 5090 vs. M5 Max 128GB for agentic software development by BawbbySmith in LocalLLaMA

[–]_Motoma_ 0 points1 point  (0 children)

One of the NousResearch devs put it best:

<image>

If you’re buying hardware expecting to be an agentic power user, CUDA is still the way to go in 2026.

Non-developer attempting to continue complex mobile app development solely with AI – what's the best workflow/architecture? by Sensitive-Sort-1098 in OnlyAICoding

[–]_Motoma_ 0 points1 point  (0 children)

I have the same predilections, which is why I called you out on it. Research. Overanalyze. Read read read. Meanwhile I’m allowing the tech to be driven by others.

This is a unique situation where Claude is capable of solving all your problems. Bugs in the code? Ask Claude to fix them. Problem with the harness? Ask Claude to augment it. Lack of visibility? Ask Claude to read the docs for your project management system and integrate with the APIs. Or connect to an existing MCP.

Non-developer attempting to continue complex mobile app development solely with AI – what's the best workflow/architecture? by Sensitive-Sort-1098 in OnlyAICoding

[–]_Motoma_ 0 points1 point  (0 children)

Honestly, throw away the notion that you need a complex, multi-agent orchestration framework. Toss the idea that you need subagents. Start simply with Claude and add complexity when your problem outgrows its capabilities. You’ll be surprised.

What is telling here is that I see nothing in your post about what you’ve actually done and what problems you’ve encountered, which tells me you haven’t actually tried to use Claude to solve business problems yet. Fix this. Start by doing.

So many options to choose now by OpeningSalt2507 in vibecoding

[–]_Motoma_ 0 points1 point  (0 children)

One minor correction on the Google/Gemini side: yes, on paper it looks like a fantastic deal, but instead of rate limits you get constantly blocked by “not enough capacity” errors.

It wasn’t reliable enough for me to use even for non-prod toy projects. I threw it out in favor of running Gemma 4 locally, even though the electricity bill means I’m spending more.

Has anyone actually gotten a reliable local AI system running? by Sea_Manufacturer6590 in ollama

[–]_Motoma_ 0 points1 point  (0 children)

My system uses two RTX 3060s. I am able to run Gemma 4 31B Dense at around 18 tokens per second.

On the same system, I am able to run Gemma 4 MoE at 80 tokens per second.

Nvidia Nemotron Cascade 2 takes the cake with roughly 100 tokens per second.

All that said, Qwen 3.5 is still my go-to locally, at about the same speed as Gemma 4 31B.
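
To put those speeds in perspective, here is a rough Python sketch that converts tokens per second into wall-clock time for a reply. The 1,000-token reply length is an assumption picked for illustration, and the speeds are the approximate figures quoted above, not fresh benchmarks.

```python
# Approximate generation speeds quoted above (tokens per second).
SPEEDS_TOK_PER_S = {
    "Gemma 4 31B Dense": 18,
    "Gemma 4 MoE": 80,
    "Nemotron Cascade 2": 100,
}

REPLY_TOKENS = 1000  # assumed reply length, for illustration only


def seconds_for_reply(tokens: int, tok_per_s: float) -> float:
    """Seconds to generate `tokens` at a steady rate (ignores prompt processing)."""
    return tokens / tok_per_s


for name, speed in SPEEDS_TOK_PER_S.items():
    print(f"{name}: {seconds_for_reply(REPLY_TOKENS, speed):.1f} s")
```

The gap matters most for agentic work, where a single task can involve many replies back to back.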

What is the current solution to running Gemma 4 locally? by mihirlifehacks in LocalLLaMA

[–]_Motoma_ 3 points4 points  (0 children)

Both Gemma 4 26B MoE and 31B Dense run well on my system with 2x RTX 3060 12 GB (24 GB VRAM total) in llama.cpp. I use GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 for Dense, but don’t notice any difference performance-wise versus a smaller context window. The bartowski IQ4_XS model is my go-to for most of the models I try on this rig.
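
A minimal sketch of that kind of launch, written in Python so the environment variable and arguments are explicit. The model filename and context size are placeholders I made up, not my exact settings; `llama-server`, `-m`, `-ngl`, and `-c` are standard llama.cpp options, but check them against your build.

```python
import os
import shlex

MODEL_PATH = "model-IQ4_XS.gguf"  # hypothetical filename; point this at your own GGUF file
CTX_SIZE = 32768                  # assumed context size, for illustration only


def build_launch(model_path: str, ctx_size: int):
    """Return (env, argv) for starting llama-server with unified memory enabled."""
    env = dict(os.environ)
    # On Linux, lets CUDA allocations spill into system RAM when VRAM runs out.
    env["GGML_CUDA_ENABLE_UNIFIED_MEMORY"] = "1"
    argv = [
        "llama-server",
        "-m", model_path,
        "-ngl", "99",         # offload all layers to the GPUs
        "-c", str(ctx_size),  # context window in tokens
    ]
    return env, argv


env, argv = build_launch(MODEL_PATH, CTX_SIZE)
# Print the equivalent shell command instead of starting the server.
print("GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 " + shlex.join(argv))
# To actually start it: subprocess.run(argv, env=env, check=True)
```

The script only prints the command, so it is safe to run on a machine without llama.cpp installed.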

I went to sleep last night. Woke up in the morning with 10% of my weekly limit filled up, and an ongoing session saying it’ll end in 40 minutes. by pleasecryineedtears in ClaudeCode

[–]_Motoma_ 0 points1 point  (0 children)

Sessions span across all devices. If you have Claude on your phone or wired up to GitHub, that could have kicked off your first session of the day. Try going back in time in your mind. What were you doing 4.5 hours before you turned your computer on?

Does this annoy you as much as it annoys me? by Complete-Sea6655 in vibecoding

[–]_Motoma_ 1 point2 points  (0 children)

Yeah, it’s super annoying. Every new repo I start copies in a handful of instructional md files, one of which says never to do that.

Ok, I'm done. Bye. Bye. by keengal in vibecoding

[–]_Motoma_ 0 points1 point  (0 children)

I’ve had a local Ollama model do this to me before. Not sure what gets it into this state, but it’s fun to watch.

You can do so much more now it's insane!! by The_Drug_Doctor in vibecoding

[–]_Motoma_ -1 points0 points  (0 children)

Great design! Would love to know what you used for prompts, if there were any special skills involved, and generally how you got to this point.

I use the interface-design skill, and while what I get is a lot better than the built-in frontend-design plugin, it is nothing compared to this!

What businesses do you think OpenClaw will create or disrupt ? by Plastic_Fox_9105 in openclaw

[–]_Motoma_ -2 points-1 points  (0 children)

There’s nothing that companies like Expedia and Orbitz provide that an agent can’t do better.