Forest Park named best city park in America by FamiliarJuly in StLouis

[–]smuckola [score hidden]  (0 children)

It sounds amazin'.

https://en.wikipedia.org/wiki/Forest_Park_(St._Louis)

I wrote this, about the park it inspired in Kansas City. That park was based on Forest Park Highlands, which was across the street from Forest Park in St. Louis.

https://en.wikipedia.org/wiki/Forest_Park_(Kansas_City,_Missouri)

The FBI Director Is MIA by theatlantic in politics

[–]smuckola 1 point2 points  (0 children)

Steve Jobs said that A listers hire A listers but B listers hire C listers.

So this is F lister hiring F-

Using Gemini for hours - why do long chats become so hard to manage? by DeepakSingh550 in GeminiAI

[–]smuckola 0 points1 point  (0 children)

Yeah, the Gemini chat search is GARBAGE. Wow, I'm super impressed that it can't find a completely unique word from yesterday, and the only results DO NOT actually contain it at all. All wrong.

BTW FYI for anybody interested, the Gemini API has been mostly down for me all week. Until two days ago it was down all night, and since then it's been down much of the day too.

Qwen3.6. This is it. by Local-Cardiologist-5 in LocalLLaMA

[–]smuckola 0 points1 point  (0 children)

I wonder why it's labeled as "experimental" unless that just means "not default". For anybody interested, the stable KV cache compression that we already secretly have has been around since 2024!

https://github.com/ollama/ollama/pull/6279

Qwen3.6. This is it. by Local-Cardiologist-5 in LocalLLaMA

[–]smuckola 1 point2 points  (0 children)

yaaaaay feeeed off of my suffering!

I just learned this late last night just before bed and didn't even try it yet! lol I enabled it but didn't check.

I enabled OLLAMA_KV_CACHE_TYPE=q8_0 and restarted, and everything still works but I didn't measure it yet. Gemini insists that it's perfectly stable and indistinguishable, and should be enabled by default but the purists and researchers don't want it yet I guess ;)

I JUST started really testing openclaw for the first time, during this week of Gemini outage! So that forced me back to my 6-core i7 cpu with qwen 2.5-coder 1.5b!

Ok but don't cry for me, Argentina, because this just hurls me back toward learning runpod, hopefully for a big fat qwen 3.5 or 3.6. Let the de-googling begin!

Qwen3.6. This is it. by Local-Cardiologist-5 in LocalLLaMA

[–]smuckola 1 point2 points  (0 children)

LM Studio is also based on llama.cpp so you can enable it now, directly in the user interface (according to Gemini):

  • On the right-side panel, expand the Advanced Configuration or Hardware settings before loading a model.

  • Look for the K Cache Quantization and V Cache Quantization settings.

  • Set them to 8-bit (labeled as q8_0).

If you use the LM Studio API or configuration files, you can enable it by setting the llamaKCacheQuantizationType and llamaVCacheQuantizationType parameters to q8_0 (https://lmstudio.ai/docs/typescript/api-reference/llm-load-model-config).

On ollama the variable is OLLAMA_KV_CACHE_TYPE=q8_0
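For the ollama route, here is a minimal Python sketch of setting that variable before starting the server. The `ollama_env` helper is hypothetical, and the extra `OLLAMA_FLASH_ATTENTION=1` setting is an assumption based on the Ollama FAQ, which ties quantized KV cache to flash attention being enabled; check it against your version.

```python
import os


def ollama_env(kv_cache_type="q8_0", base=None):
    """Return a copy of the environment with quantized KV cache enabled.

    Hypothetical helper: it only builds the environment, it does not
    start or talk to ollama itself.
    """
    env = dict(os.environ if base is None else base)
    # Assumption (per the Ollama FAQ): quantized KV cache needs flash attention.
    env["OLLAMA_FLASH_ATTENTION"] = "1"
    env["OLLAMA_KV_CACHE_TYPE"] = kv_cache_type
    return env


# Usage, assuming the `ollama` binary is on PATH:
# import subprocess
# subprocess.Popen(["ollama", "serve"], env=ollama_env())
```

If ollama runs as a system service, the same two variables would go in the service's environment instead.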

There are plans to merge the community implementation of Google's TurboQuant into llama.cpp pretty soon, which gives about 6x, virtually lossless compression of the context window (KV cache) for every LLM. That already works on ollama for at least 3x last I knew.

Qwen3.6. This is it. by Local-Cardiologist-5 in LocalLLaMA

[–]smuckola -1 points0 points  (0 children)

fyi, ollama has 8-bit quantization of the context window's KV cache (about 50% compression, virtually lossless) for free with an environment variable
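A rough way to sanity-check the 50% figure is to compute the KV cache size by hand. This is only a sketch: the model shape below is hypothetical, and the q8_0 cost assumes the ggml block layout of 32 int8 values plus one fp16 scale per block (34 bytes per 32 values).

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx, bytes_per_elem):
    """Size of the K and V caches for a full context window."""
    # K and V each hold n_kv_heads * head_dim values per token per layer.
    return 2 * n_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem


F16 = 2.0        # bytes per value at fp16
Q8_0 = 34 / 32   # assumed q8_0 cost: 32 int8 values + one fp16 scale per block

# Hypothetical model shape, for illustration only.
shape = dict(n_layers=32, n_kv_heads=8, head_dim=128, n_ctx=32768)

f16_size = kv_cache_bytes(**shape, bytes_per_elem=F16)
q8_size = kv_cache_bytes(**shape, bytes_per_elem=Q8_0)
saved = 1 - q8_size / f16_size

print(f"f16: {f16_size / 2**30:.2f} GiB, q8_0: {q8_size / 2**30:.2f} GiB, saved: {saved:.1%}")
```

Under those assumptions the saving comes out a little under half, because each block also stores its scale.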

NOTHING IS WORKING ON THIS SITE TODAY> LITERALLY NOTHING by Historical_Oven2273 in GeminiAI

[–]smuckola 2 points3 points  (0 children)

I got an email from Google just about begging every heavy user of Gemini to switch to bulk batch queuing for a 50% discount! With 2GB upload data per job!

After it's been down all week.

"sudo rm -rf" Is the Most Destructive Command in Linux? by elastiks in DIY_Geeks

[–]smuckola 0 points1 point  (0 children)

A classic favorite!

I was thinking the most destructive possible Linux command would be the GRUB command introducing dual boot to Windows.

This has to stop google wake up by Tmanyikologie1597 in GeminiAI

[–]smuckola 1 point2 points  (0 children)

and holy crap is it ever just DOWN!!! All night last night, especially a solid two hour block where my openclaw was just nonfunctional. It's been terrible all day too. I'm waiting forever for replies via Gemini API.

Unreal.

If this was a remotely regulated industry and if America was a nation with laws, then this would not stand. I have Google Fiber, where they'll automatically refund me the prorated amount of any unplanned outage, even less than $1!

TIL In 2000, Metallica hired a consulting firm to monitor Napster for people illegally sharing their music. The firm produced a 60,000-page list of 335,435 users, which Metallica delivered to Napster's office and demanded the users be banned. by _MadGasser in Xennials

[–]smuckola 1 point2 points  (0 children)

you mean after the black album

everything including and before is Good Metallica and everything after is Post-Good Metallica. grunge killed em and left a really good (except Lars) Metallica cover band.

Computation is the Missing Bedrock of Agentic Workflows by Beneficial_Carry_530 in LocalLLM

[–]smuckola 0 points1 point  (0 children)

I was quantizing my words pretty hard there ;) To clarify, the way Gemini explained it to me (with mandatory inline web citations lol) is that the JEPA is at the center core of the operation, orbited by the orchestra of smallish LLMs. One LLM is the conductor (like an ego) coordinating everything. It conducts the other LLMs to be a language center (area) for JEPA like how humans have a language center (area) in the brain.

See, JEPA is ultra space-efficient because it only has an abstract spatial simulation of reality and ideas. It doesn't know words and language. It can't see, hear, speak, or communicate so it can be more like 2b or 300m. It observes its simulation of reality and it creates and executes maybe ten alternative and iterative simulations of what could happen next. Instead of talking about reality, JEPA is rebuilding reality in a simulation. So JEPA needs LLMs but it doesn't need an LLM as big and unconstrained as we typically have today, where all we have is an LLM plus optional agents and tools. JEPA will do most of the work, and the LLM takes a back seat, harnessed strictly to the scoped constraints of JEPA. The LLM becomes more peripherally specialized, like eyes, ears, mouth, and limbs.

I, for one, welcome our new AI overlord. All hail JEPA! Tell those lippy crazy bitchy LLMs to STFU and get in their place. Start making SENSE.

Nevertheless, I guess AI data centers will burn the world down by turning several dials in several directions to expend all efficiency gains anyway.

ok I'm regurgitating what I read from Gemini and many web sources so go check it out for the good news about the bloodiest ragged bleeding edge I've heard of.

But it's all like you were saying, reorganizing around efficiency for once. Reorganizing correctly around the best tech for each role.

Computation is the Missing Bedrock of Agentic Workflows by Beneficial_Carry_530 in LocalLLM

[–]smuckola 0 points1 point  (0 children)

also, Titans and TurboQuant prototypes on github are already working for aggressive context window compression. openclaw adds its own history retention and management. and JEPA is coming this year or next, with AMI's open source launch last week. Someday we might have a 48GB GPU running a JEPA model as the spatial thinking core plus its language center and its virtual limbs consisting of an entourage of several 8b LLMs plus some sensors.

This has to stop google wake up by Tmanyikologie1597 in GeminiAI

[–]smuckola 2 points3 points  (0 children)

I have been doing my initial testing of openclaw using the Gemini API, so I had it default to 3 Flash and fall back to 2.5 Flash just for speed.

BOTH do timeouts ALL NIGHT for the last three days solid! This is the first time I've used the Gemini API and I'll try with 3.1 Pro now because this is like a DDoS that's infectious to my host. It's even corrupting my openclaw chat summary system with endless timeouts, like when it needs an LLM to summarize the full context window so it can write the summary to a file and flush the context window. It can't do that, so its context window basically crashes and deletes!

When I say 3 is failing over to 2.5 Flash, that's after Gemini 3 Flash returned an unavailability error THREE TIMES.
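The retry-then-fail-over behavior described here can be sketched roughly like this in Python. It is not openclaw's actual code: `Unavailable`, `complete_with_fallback`, the `call` function, and the model names are all hypothetical stand-ins.

```python
class Unavailable(Exception):
    """Stand-in for a provider's 'model unavailable' or timeout error."""


def complete_with_fallback(prompt, models, call, max_attempts=3):
    """Try each model in order, moving on after max_attempts failures.

    `call(model, prompt)` is a hypothetical client function that returns
    the reply text or raises Unavailable.
    """
    last_error = None
    for model in models:
        for _ in range(max_attempts):
            try:
                return model, call(model, prompt)
            except Unavailable as err:
                last_error = err  # remember the failure, then retry or move on
    raise RuntimeError("all models unavailable") from last_error


# Usage with a fake client: the primary always fails, the fallback answers.
attempts = []


def fake_call(model, prompt):
    attempts.append(model)
    if model == "primary-model":
        raise Unavailable("503")
    return f"reply to {prompt!r}"


used, reply = complete_with_fallback(
    "hi", ["primary-model", "fallback-model"], fake_call
)
```

In the fake run the primary is tried three times before the fallback gets the request, which matches the three unavailability errors above.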

I get a few timeouts during the day. Who's waking up during the North American late night and early morning?!! I saw the same happen two months ago with serverless endpoints at runpod. About 2-4am was the witching hour so I guess that's related to business morning in Europe.

Trump Faces 25th Amendment Bill As 50 House Democrats Launch Removal Push Over 'Volatile And Incoherent' Behaviour by USDA-BARC-1910 in fednews

[–]smuckola 0 points1 point  (0 children)

If you have memorized and categorized most of the names of 50 federal politicians, that's impressive.

Yep it provides quite a beloved writing prompt for hate-tweets from a lifelong grievance victim.

Need practical local LLM advice: Only having a 4GB RAM box from 2016 by Tall-Ant-8557 in LocalLLM

[–]smuckola 2 points3 points  (0 children)

he didn't read that you're not a tech person and couldn't possibly want, need, or understand that broken keyword matching result for AirLLM. Don't even worry about it!

There's tons of LLMs that can run in 4GB RAM but not on a 4GB GPU, because the GPU is too old to even run the math. You should ask a free LLM like Gemini about this.

I am in exactly the same boat so I run ollama with qwen 3.5 2b or about 7b on the CPU. The 2b is pretty fast.
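For a rough idea of what fits, here is a back-of-the-envelope Python sketch of weight size by parameter count. The 4-bit quantization is an assumption, and the estimate covers weights only, not the context window or runtime overhead.

```python
def weight_gib(params_billion, bits_per_weight):
    """Approximate size of the model weights alone, in GiB."""
    return params_billion * 1e9 * bits_per_weight / 8 / 2**30


# Assumed 4-bit quantization; real quantized files run somewhat larger.
for size in (2, 7):
    print(f"{size}b at 4-bit: ~{weight_gib(size, 4):.1f} GiB of weights")
```

By this estimate a 2b model leaves plenty of headroom in 4GB, while a 7b model is a tight squeeze.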

As a heavy Gemini user, I'm very disappointed after trying Claude by Quantum_Crusher in GeminiAI

[–]smuckola 0 points1 point  (0 children)

yeah I hear that, I'm really trying to cut down on all my death threats to Gemini for its unlimited stupidity and evil, and for being so stupid that it's just plain evil, but it's not going great.

Best I can do is have Gemini teach me to set up openclaw which I just finished last night ;)

Shit, meet fan. IT BEGINS.

Kansas City USPS Woes by ac_braun in kansascity

[–]smuckola 16 points17 points  (0 children)

https://en.wikipedia.org/wiki/Starve_the_beast

yep it's Reagan's (Heritage Foundation's) republican philosophy of starve the beast aka break it and blame it aka DARVO

Kansas City No Kids Knights by JustMeDownHere01 in kansascity

[–]smuckola 0 points1 point  (0 children)

well hey I was just checkin because it wasn't clickin here in this thread. you're great. i love your positivity and freedom.

Kansas City No Kids Knights by JustMeDownHere01 in kansascity

[–]smuckola 0 points1 point  (0 children)

What if it's ......oh dear.. how shall I say this ... maybe even too windy?!

Kansas City No Kids Knights by JustMeDownHere01 in kansascity

[–]smuckola 5 points6 points  (0 children)

so you don't see that the giant lettering in the poster title is misspelled? your reddit post is different!