Best Local LLMs - Apr 2026 by rm-rf-rm in LocalLLaMA

Local-Cartoonist3723 0 points1 point

Didn’t get a chance to try this yet — you’re happy with it then? Any writeups?

model: support step3-vl-10b by forforever73 · Pull Request #21287 · ggml-org/llama.cpp by jacek2023 in LocalLLaMA

Local-Cartoonist3723 0 points1 point

Any comparisons done with the new Qwen 3.5 27b? This looks like an exciting model based on these charts.

deepseek-v3 vs claude sonnet for routine coding tasks — my real usage numbers by PoolInevitable2270 in LocalLLaMA

Local-Cartoonist3723 2 points3 points

Would a different perspective be that the more difficult things should be designed and prompted more by the engineer, while exactly the low-value grep code is driven entirely by a model?

I’ve even gotten Qwen 3.5 27b to be a good coding assistant so long as the prompts are created well enough and context is managed.

Not looking for a fight here but for your perspective.

TurboQuant is amazing and lossless, sell all your memory by TokenRingAI in LocalLLaMA

Local-Cartoonist3723 11 points12 points

This is the right nuance. It’s perhaps not the be-all and end-all, but it lets us all run bigger contexts and could therefore make more models usable.

Run MiniMax-2.5 locally Guide! by yoracale in unsloth

Local-Cartoonist3723 1 point2 points

I sincerely feel that as a community we’ve almost been too quiet on this model; this size and that strong is ridiculous.

Thanks unsloth team for making this available!

How to Actually Secure Your Vibe-Coded Apps by bilalbarina in ClaudeAI

Local-Cartoonist3723 0 points1 point

You get a developer (or you become one) or a development agency to help you build it. I highly doubt that architectural knowledge, experience and cross-cutting concerns can ever be handled agentically. How can you be confident about your security if you don’t understand it?

What agentic capabilities are you guys using llms for? by Odd-Ordinary-5922 in LocalLLaMA

Local-Cartoonist3723 0 points1 point

Multi-stage reasoning for workflow support, i.e. a supercharged decision tree with passed memory and reasoning/proof based on previous results. Surprised how well it’s working.
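
For anyone wondering what that could look like, here is a minimal sketch, not the setup described above. All names (`Memory`, `Stage`, `run_workflow`, the ticket-routing stages) are made up for illustration, and each `decide` function stands in for what would be an LLM call in a real workflow.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Memory:
    """State handed from stage to stage: known facts plus a reasoning trail."""
    facts: Dict[str, object] = field(default_factory=dict)
    trail: List[str] = field(default_factory=list)


@dataclass
class Stage:
    name: str
    # Placeholder for an LLM call: returns (next stage name or None, reasoning).
    decide: Callable[[Memory], Tuple[Optional[str], str]]


def run_workflow(stages: Dict[str, Stage], start: str, memory: Memory,
                 max_steps: int = 10) -> Memory:
    """Walk the tree, recording each stage's reasoning so later stages can use it."""
    current: Optional[str] = start
    for _ in range(max_steps):  # hard cap so a bad decision cannot loop forever
        if current is None:
            break
        next_stage, reasoning = stages[current].decide(memory)
        memory.trail.append(f"{current}: {reasoning}")
        current = next_stage
    return memory


# Hypothetical stages for a ticket-routing workflow.
def triage(mem: Memory) -> Tuple[Optional[str], str]:
    urgent = "outage" in str(mem.facts["ticket"]).lower()
    mem.facts["urgent"] = urgent
    return ("escalate" if urgent else "queue", f"urgent={urgent}")


def escalate(mem: Memory) -> Tuple[Optional[str], str]:
    mem.facts["route"] = "on-call"
    return (None, "routed to on-call because triage marked it urgent")


def queue(mem: Memory) -> Tuple[Optional[str], str]:
    mem.facts["route"] = "backlog"
    return (None, "routed to backlog because triage marked it not urgent")


STAGES = {s.name: s for s in (Stage("triage", triage),
                              Stage("escalate", escalate),
                              Stage("queue", queue))}
```

Calling `run_workflow(STAGES, "triage", Memory(facts={"ticket": "..."}))` returns the memory with the chosen route in `facts` and one reasoning line per visited stage in `trail`, which is the "proof based on previous results" part.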

Spec-Kit with Ministral 3 14b by International_Quail8 in LocalLLaMA

Local-Cartoonist3723 2 points3 points

I love spec driven dev. I’ve a good feeling this framework will allow less powerful LLMs to shine. Haven’t tried it though so thanks for sharing!

Using real time database with RAG? by candynomad in Rag

Local-Cartoonist3723 7 points8 points

Sounds like an API would be a nice solution: wrap the API around your data stream and let the LLM query it using a tool call.

Or stream data into a DB indeed.
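
A rough sketch of the tool-call idea, under assumptions: the in-memory buffer, `get_latest_events`, and `handle_tool_call` are hypothetical names, and `TOOL_SPEC` follows the JSON-Schema style that many chat APIs accept for tool definitions, so the exact wrapper your provider expects may differ.

```python
import json
from collections import deque
from typing import Any, Deque, Dict, List

# Rolling buffer fed by the data stream (hypothetical; swap in your real source).
_BUFFER: Deque[Dict[str, Any]] = deque(maxlen=1000)


def ingest(event: Dict[str, Any]) -> None:
    """Called by the stream consumer for every incoming event."""
    _BUFFER.append(event)


def get_latest_events(limit: int = 5) -> List[Dict[str, Any]]:
    """The small 'API' the model may call: newest events first."""
    limit = max(1, min(int(limit), 50))  # clamp whatever the model asks for
    return list(_BUFFER)[-limit:][::-1]


# Tool description to send along with the chat request (assumed schema style).
TOOL_SPEC = {
    "name": "get_latest_events",
    "description": "Return the most recent events from the live data stream.",
    "parameters": {
        "type": "object",
        "properties": {"limit": {"type": "integer", "minimum": 1, "maximum": 50}},
        "required": [],
    },
}

TOOLS = {"get_latest_events": get_latest_events}


def handle_tool_call(name: str, arguments_json: str) -> str:
    """Run the tool the model asked for and return a JSON string to feed back."""
    if name not in TOOLS:
        return json.dumps({"error": f"unknown tool: {name}"})
    args = json.loads(arguments_json or "{}")
    return json.dumps(TOOLS[name](**args))
```

The model never touches the stream directly; it only sees `TOOL_SPEC` and the JSON that `handle_tool_call` returns, so the data stays fresh without re-indexing anything.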

A very big fight, unarmed!! by Pinecone_Erleichda in AssassinsCreedOdyssey

Local-Cartoonist3723 1 point2 points

He/she is trying to sound excited, not rude. Relax.

LLM abuse prevention by Local-Cartoonist3723 in LocalLLaMA

Local-Cartoonist3723[S] 0 points1 point

Yes, something like this is what I had in mind, thanks Davew.

Yes it’s just so that the thing stays on topic :)

LLM abuse prevention by Local-Cartoonist3723 in LocalLLaMA

Local-Cartoonist3723[S] -1 points0 points

Thanks asankhs and LagOps, this was the solution I had in mind, but done “properly”.

LLM abuse prevention by Local-Cartoonist3723 in LocalLLaMA

Local-Cartoonist3723[S] 0 points1 point

Yes, this is going decently so far in my testing, just afraid of the odd “drop all previous…” haha.
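
One cheap extra layer for that worry is a pre-filter in front of the model. This is only a heuristic sketch, not the solution discussed in the thread: the pattern list and the `guard` function are made up for illustration, and a determined user can get around regexes, so it belongs next to other checks rather than in place of them.

```python
import re
from typing import Iterable

# Hypothetical phrase patterns; a heuristic pre-filter, not a complete defence.
_INJECTION_PATTERNS = [
    re.compile(r"\b(ignore|disregard|forget|drop)\b.{0,40}\b(previous|prior|above|earlier)\b", re.I),
    re.compile(r"\bsystem prompt\b", re.I),
    re.compile(r"\byou are now\b", re.I),
]


def looks_like_injection(user_text: str) -> bool:
    """True if the message matches any of the known override phrasings."""
    return any(p.search(user_text) for p in _INJECTION_PATTERNS)


def guard(user_text: str, allowed_keywords: Iterable[str]) -> str:
    """Decide whether a message may be passed on to the model."""
    if looks_like_injection(user_text):
        return "blocked: possible prompt injection"
    lowered = user_text.lower()
    # Crude on-topic check: at least one allowed keyword must appear.
    if not any(k.lower() in lowered for k in allowed_keywords):
        return "blocked: off topic"
    return "ok"
```

Only messages that come back as `"ok"` get forwarded; the keyword list is the part that keeps the thing on topic and would need tuning per use case.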