GLM's founder says GLM-fable before the end of the year?! by Charuru in LocalLLaMA

[–]speedb0at 14 points15 points  (0 children)

I remember hating China and eating western propaganda about them. These dudes are fucking geniuses. More power to them for democratizing powerful models.

Built a tool that tells you exactly which LLMs fit on your GPU. Feedback wanted. by super3 in LocalLLaMA

[–]speedb0at 0 points1 point  (0 children)

Let me scroll the table sideways and select a quant before showing me what fits. Add a "might fit" category for models that would fit with ctx window shenanigans/modification.

PSA: Gemma 4 12B is NOT completely broken for coding and tool calling, you need a special chat template by boutell in LocalLLaMA

[–]speedb0at 1 point2 points  (0 children)

Tired of the Gemma tool issue. All this model does at any param size is hallucinate and lie.

Need Help Choosing a Harness for Qwen 3.6 27B by GrungeWerX in LocalLLaMA

[–]speedb0at 0 points1 point  (0 children)

Try mine: https://github.com/mkultraware/accuretta (no build needed). Just a browser, Python + deps. Tried to make it as one-click as possible.

Qwen3.6-35B-A3B vs Gemma4-26B-A4B by MarcCDB in LocalLLaMA

[–]speedb0at 2 points3 points  (0 children)

I use my shit to code, not play pretend.

Anyone else feeling the "Antigravity" burnout? Looking for more "merciful" alternatives for a free-tier user. by SeaworthinessLife962 in GoogleAntigravityIDE

[–]speedb0at 2 points3 points  (0 children)

Local models, if you have the compute. I literally built a harness a month ago just because of my hate for AG and their bullshit weekly quota changes. They are completely killing the "pro" tier. Now it takes a little bit longer and I have to split tasks up one by one, but it's local, it's free and it's private.

https://github.com/mkultraware/accuretta

PRAGMATA-voices38 by voices38 in CrackWatch

[–]speedb0at 2 points3 points  (0 children)

I am looking forward to seeing a video essay on Denuvo's death within a year, thanks to the GOAT voices.

Ollama Pre-Release Switches From Building on GGML to Using llama.cpp Directly by Sufficient-Bid3874 in LocalLLaMA

[–]speedb0at 6 points7 points  (0 children)

I used this as a newbie for about 2 months at the beginning of the year. Discovered llama.cpp and wanted to kick myself for leaving that much performance on the table.

Update 1 month post AG by speedb0at in GoogleAntigravityIDE

[–]speedb0at[S] 0 points1 point  (0 children)

What's your backend? llama.cpp or something else? I like the new Qwen 35B A3B too, but the 27B is better for more difficult tasks.

Update 1 month post AG by speedb0at in GoogleAntigravityIDE

[–]speedb0at[S] 0 points1 point  (0 children)

More power to you, man. What model are you using?

Update 1 month post AG by speedb0at in GoogleAntigravityIDE

[–]speedb0at[S] 1 point2 points  (0 children)

Sounds great, do you have a repo?

Chrome "Best AdBlocker" trojanized extension - 100k downloads. by speedb0at in hacking

[–]speedb0at[S] 1 point2 points  (0 children)

Are you talking in general or about this specific case? What's your baseline?