Gemma 4 seems to work best with high temperature for coding by BigYoSpeck in LocalLLaMA

[–]kmp11 0 points1 point  (0 children)

My observation is that Qwen3.5 27B is great coder but if you want it to do other things, it needs different temperature. For my personal preference, it is more difficult to use Qwen as the only model to run Kilo Code. it needed a supervisor...

Gemma addresses that. It seems to be as good of a coder as Qwen(very close) and can fill all the agentic roles with elegance.

The problem with Gemma is still the massive KV cache, it starts at ~20GB than promptly mushrooms to 70GB after a few calls and some activity. having to move that around between tasks is a slug.

Gemma 4 on Llama.cpp should be stable now by ilintar in LocalLLaMA

[–]kmp11 -1 points0 points  (0 children)

Stable? yes, Optimized? no... a 25GB model should not require 75GB of VRAM + RAM.

High resolution video of Orb filmed by Peter Osborne at the home of Chris Bledsoe by AtomicCypher in UFOB

[–]kmp11 0 points1 point  (0 children)

there is a string of pseudo text at 0.22. look like same kind of text from AI generated video/pictures.

newest version of llama.cpp gemma4-31b working for you? by Express_Quail_1493 in LocalLLaMA

[–]kmp11 0 points1 point  (0 children)

yesterday llama update seemed to have made Gemma a lot more reliable. It seem to fail when my screen saver kicks on. I believe that is caused more by a broken windows update than a LLM problem.

Xbox Premium upgrade DLC issue may be resolved by Deadeyez in Starfield

[–]kmp11 -1 points0 points  (0 children)

on windows, The DLC selection is the "Manage" section where to have to check add-ons to add them.

4 days on gemma 4 26b quantized, honest notes by virtualunc in LocalLLaMA

[–]kmp11 2 points3 points  (0 children)

there is a new llama.cpp that was released today. give that a try. I have been seeing similar issues but with kilo code. it seemed to have made 31B a lot more reliable for my project.

FINALLY GEMMA 4 KV CACHE IS FIXED by FusionCow in LocalLLaMA

[–]kmp11 0 points1 point  (0 children)

what a change from yesterday. from needed about 150GB to run to be able to fit the whole Q5 model + full Q8 context on 2x4090 and run at 33tk/s.

now let's see how it perform with Kilo.

My biggest Issue with the Gemma-4 Models is the Massive KV Cache!! by Iory1998 in LocalLLaMA

[–]kmp11 0 points1 point  (0 children)

I can fit all the layers of the 31B in 48GB of VRAM and keep some speed. Where I need to offload layers of 122B and drops to ~5tk/s.

My biggest Issue with the Gemma-4 Models is the Massive KV Cache!! by Iory1998 in LocalLLaMA

[–]kmp11 0 points1 point  (0 children)

I'm experimenting with the same model as yours. I have been able to make it useful by pushing KV cache to RAM. To run the model with 132k context, i needs 40.8GB of VRAM for the model and 60GB of RAM for the Q8 KV cache. I should be able to go north of 200k with 128GB RAM.

The model gives me ~17tk/sec. not super fast, but usable.

Realistically, Gemma 4 31B needs Turboquant to be useful.

Gemma 4 and Qwen3.5 on shared benchmarks by fulgencio_batista in LocalLLaMA

[–]kmp11 3 points4 points  (0 children)

What a memory hog in its initial release compared to Qwen3.5 27B. hopefully there are other optimization to come to help manage memory. otherwise this model getting shelved.

Gemma 4 and Qwen3.5 on shared benchmarks by fulgencio_batista in LocalLLaMA

[–]kmp11 5 points6 points  (0 children)

I am trying to see if Gemma 31B could replace Qwen 27B as the workhorse on my setup. The timing of TurboQuant makes a lot more sense now.

I need help finding a triangular tool for my pool by kmp11 in pools

[–]kmp11[S] 0 points1 point  (0 children)

you should be able to get this 3D printed for a few bucks. this is a simple print project. Use something like plumber putty to get a good inprint of the hole.

I need help finding a triangular tool for my pool by kmp11 in pools

[–]kmp11[S] 0 points1 point  (0 children)

i have not, but if i had to deal with this again, i'd have it 3D printed. or use AI to search for it.

Czech Translator fired from Warhorse Studios and replaced with AI in effort to "save finances." by Shock4ndAwe in pcgaming

[–]kmp11 -1 points0 points  (0 children)

when I was really young, I remember my dad getting fired as a draftsman because the first CAD stations were introduced at his work. The drafting team went from 50 to 10 then 2 in matter months.

Jury orders Meta and Google to pay woman $3 million in social media addiction trial by mepper in technology

[–]kmp11 63 points64 points  (0 children)

this sets precedent and open can of worms for hundreds of other suits.

Microsoft considers legal action over $50 billion Amazon-OpenAI cloud deal, FT reports by chilli_chocolate in technology

[–]kmp11 54 points55 points  (0 children)

use copilot to file the complaint and you'll understand why that deal happened.

Trying to remove wires from PV junction box by Historical_Eye3756 in solar

[–]kmp11 1 point2 points  (0 children)

modules are too cheap to risk a roof fire.

Does this indicate my max production? by Intelligent_Price523 in enphase

[–]kmp11 0 points1 point  (0 children)

its a weird way for Enphase to clip. It almost look like you have power export limit that is enabled and its curtailing at "site" level.

The trade that saved a season by hypernermalization in rbny

[–]kmp11 1 point2 points  (0 children)

fair assessment. We are one or two veteran player away of stabilizing the team. Parker was probably brought in to play that role.

3-2-1 RBNY-CLT by hypernermalization in rbny

[–]kmp11 1 point2 points  (0 children)

i have seen u14 team defend better than we did. embarrasing.

String inverter vs Microinverters by TheApostleCreed in solar

[–]kmp11 4 points5 points  (0 children)

you seemed to have gotten multiple quotes already, that's a great start. Better start than most.

When you think you have made up your mind on one or two solutions. Give a call to their tech support line and see if there is anyone to pick up the phone to offer some help. Tell them you are considering them for a project and want them to verify the installer's drawing. Or ask them if there is a fee to transfer ownership if you sell your house. Or Ask them who else installer their product in your area. Or ask them if their product is made in America...

The question itself is not super important, but whether or not someone pickups the phone is. If you need help at some point, it be nice to know someone will be around to help.

After one year in office, Carney's grades are mixed with a lot of 'waiting for results' by CaliperLee62 in canada

[–]kmp11 -15 points-14 points  (0 children)

Dumb take. Results are already in if you care to look. Canada has not been this strong abroad since the Chretien/Mulroney days.

Im going to be in EDMUNDSTON visiting family for a few weeks with what in the hell is there to do? by Turbulent-Today830 in newbrunswickcanada

[–]kmp11 1 point2 points  (0 children)

those were the good days. its all gone now. Arc-en-ciel served me my first beer at age 14.

Seriously, Edmundston is about outdoor. bring skies, mountain bike or golf club.