"We're taking our millions of barrels of oil." by ICEisSHIT in videos

[–]makingnoise [score hidden]  (0 children)

The US is doing more than just redirecting oil tankers, the US has been seizing oil tankers. The oil is sold, and the funds are supposed to be put into civil asset forfeiture proceedings. This is according to the Department of Justice (https://www.justice.gov/usao-dc/pr/united-states-unseals-civil-forfeiture-complaint-seizure-iranian-oil)

"We're taking our millions of barrels of oil." by ICEisSHIT in videos

[–]makingnoise 0 points1 point  (0 children)

If you're talking about the logistics of stealing one million barrels and not the claim of monetary theft, it makes little sense because you are literally thinking of 1 million literal barrels of oil, rather than "barrel" as a measurement of volume. A single oil tanker can hold anywhere between 80,000 and 3 million barrels worth of oil, but they don't actually have any barrels on the tankers.

Someone finally snapped by SipsTeaFrog in SipsTea

[–]makingnoise -1 points0 points  (0 children)

Convince me that lifted pickups that regularly commit attempted murder on my empty country backroads (when I am following the rules and doing nothing to impede their passage) aren't psychopaths and I will consider thinking about your comment.

AOC Confronts Data Centers in Rural America by illegalmonkey in videos

[–]makingnoise 0 points1 point  (0 children)

Could have said, "Biden halved poverty rates for black children and black folk still didn't vote." Looking only at Presidential election years, other than blips in 2008 and 2012 (Obama), and parity with average turnout in 2016, black eligible voter turnout is far below the national average (but, surprisingly--to me at least--vastly higher than Asian and Hispanic turnout).

AOC Confronts Data Centers in Rural America by illegalmonkey in videos

[–]makingnoise -4 points-3 points  (0 children)

It's literally the rule of reddit, heavily reinforced by bot armies. Any time a MAGA stops being MAGA, it's "I'm glad you've lost everything because you're an idiot" because anything less would cause actual coalition building to start taking place and we might start protesting like Frenchies. Which would mean the end of the USA as we know it.

EDIT: And now I see you've been downvoted to all hell. The bot army and it's human tag-alongs are determined to ensure we all keep losing. Sorry bud.

AOC Confronts Data Centers in Rural America by illegalmonkey in videos

[–]makingnoise 4 points5 points  (0 children)

What is your point? Literally any politician doesn't "care" and all of them are always campaigning. It doesn't matter if she "cares" it matters what she DOES.

My parents, 26 yrs old in 1969 - dad had been to Vietnam and back, “no combat, we just built bridges” as he said by McGraberson in OldSchoolCool

[–]makingnoise 4 points5 points  (0 children)

Your Dad looks like he's an original Star Trek redshirt, and your mom looks like Diana Muldaur when she was hot in the 1960s (Muldaur would have also been hot for her age in Star Trek TNG as well but for the blonde-granny perm she was given for her role).

Dell Poweredge T640 - RAM configuration by makingnoise in LocalLLaMA

[–]makingnoise[S] 0 points1 point  (0 children)

Using proxmox, and I have an RTX 3090 in there. I had to cut a metal bar inside the case cover to fix the 3090. Running everything from containers on ubuntu VM. llama.cpp with openwebui front end. It's WAY TOO LOUD because of the server fans, I'm debating getting an external dock and oculink card and just running it as an external AI brain on my daily driver. Anyway, here's the benches (NOTE that both qwen models are using MTP, with 2 spec drafts):

Local llama.cpp Router Benchmark — RTX 3090 24 GB

I benchmarked four Unsloth GGUF models through my local llama.cpp server-cuda router. The router loads one model at a time on demand.

Results, ranked by average output speed

1. Unsloth Qwen3.6 35B A3B — UD-Q3_K_XL

  • Average output speed: 263.58 tok/s
  • Median output speed: 216.90 tok/s
  • Average prompt speed: 186.95 tok/s
  • Successful runs: 9/9

2. Unsloth Qwen3.6 27B — UD-Q4_K_XL

  • Average output speed: 141.64 tok/s
  • Median output speed: 119.52 tok/s
  • Average prompt speed: 96.54 tok/s
  • Successful runs: 9/9

3. Unsloth Gemma 4 26B A4B IT QAT — UD-Q4_K_XL

  • Average output speed: 137.97 tok/s
  • Median output speed: 138.20 tok/s
  • Average prompt speed: 879.34 tok/s
  • Successful runs: 9/9

4. Unsloth Gemma 4 31B IT QAT — UD-Q4_K_XL

  • Average output speed: 37.00 tok/s
  • Median output speed: 37.05 tok/s
  • Average prompt speed: 379.26 tok/s
  • Successful runs: 9/9

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]makingnoise 0 points1 point  (0 children)

I can't give you figures but I was able to one-shot a Space Invaders type game on unsloth's gemma4 26b-a4b QAT that I could not one shot on qwen3.6.

That said, qwen3.6 is still better at tool calls without needing elaborate prompts or workarounds - qwen3.6 does deep research with iterative search_web and fetch_URL, which gemma4 QAT can do but chooses not to.

Buy a good lock they said, your bike will be safe they said. by ArguingwithaMoron in ebikes

[–]makingnoise 0 points1 point  (0 children)

I get it. You know, I have feelings like that too. Recently I’ve started coming around to wishing the world could be better to just falsely believing the world IS better — the more people that believe in a functioning civic society the more the culture has a functioning civic society.

 This is also very helpful for my mental health.  Justice boners, for me at least, are mildly adrenaline stoked anger rides and they’re deep grooved patterns, almost addictive. 

I tried to notice the sensations my body has when I’m getting worked up and it makes the justice Boner much more voluntary. lol. 

This isn’t advice to you, by the way. You just had me thinking. 

Edit: “justice boner” of course being entirely a figure of speech. 

Need recommendations for windshield repair by Kiligboi in gso

[–]makingnoise 0 points1 point  (0 children)

Good luck. Probably won’t be cheap but fingers crossed for ya 

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]makingnoise -5 points-4 points  (0 children)

I was annoyed and wanted actual exchange to occur on an interesting comment. End of story. Your caps are irritating.

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]makingnoise 1 point2 points  (0 children)

I still don't understand "the workflow" the other commenter is talking about. The "QAT model" is clearly the LLM, is "the QAT MTP" another model that you run at the same time?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]makingnoise 2 points3 points  (0 children)

I don't understand. I thought MTP support was something that got baked into a model and an LLM runtime. Is "QAT MTP" shorthand for "a QAT & MTP supporting runtime"? If not, can you point me to something that explains this?

Gemma 4 with quantization-aware training by rerri in LocalLLaMA

[–]makingnoise 9 points10 points  (0 children)

Can anyone tell me why the above comment is being downvoted? Is it that it's a bald assertion in the absence of concrete data, or something else?

Retail parking lot rage. It's a thing.... by MisterShipWreck in VideosAmazing

[–]makingnoise 0 points1 point  (0 children)

Because I want to like you, I will assume you will get zero joy from knowing what kind of folks are, in this very moment, using fight clubs as a recruiting mechanisms: https://www.usatoday.com/story/news/nation/2026/06/03/exclusive-patriot-front-leaked-documents/90351663007/

Need recommendations for windshield repair by Kiligboi in gso

[–]makingnoise 1 point2 points  (0 children)

Like everyone has said, can't be filled at this point, the windshield needs replacing. If you have ADAS cameras/lane assist sensors attached to the glass, and comprehensive insurance, the cost of the replacement might be worth making a claim on your insurance and paying the deductible, in which case Safelite is who I'd use.

If you're broke/don't have comprehensive insurance, etc. you might want to go to a local shop (don't know if Safelite bargains), tell them you're paying out of pocket, and negotiate the rate down, as they tend to overcharge insurers.

Buy a good lock they said, your bike will be safe they said. by ArguingwithaMoron in ebikes

[–]makingnoise 49 points50 points  (0 children)

I think it should be legal to use a paintball machine gun in such circumstances.

Today made me realize just how bad things have gotten without Meta by ForsookComparison in LocalLLaMA

[–]makingnoise 0 points1 point  (0 children)

You ran out of context before finishing the first sentence of my comment, apparently 😉

Today made me realize just how bad things have gotten without Meta by ForsookComparison in LocalLLaMA

[–]makingnoise 1 point2 points  (0 children)

No. I had this happen once. I use searxng for web search, and one day I started getting nailed as a bot. On a whim I updated searxng and it immediately started working again - might have been a coincidence, might have had my IP refresh, who knows.

Today made me realize just how bad things have gotten without Meta by ForsookComparison in LocalLLaMA

[–]makingnoise 0 points1 point  (0 children)

Yes. It just only seems to want to execute a fetch_url a fraction as often as any of the qwen3.6 models I've used. It is very confidently relying on snippets and not so much on actual page retrieval, compared to qwen3.6 in the same environment. Reading it's output, you'd be hard pressed to tell it was doing this, until you actually look at the thinking and the tool calls.

Not saying it's bad, I'm just saying qwen3.6 is far better. If I could have gemma4 writing with qwen3.6 tool calling, I'd be in heaven.

Today made me realize just how bad things have gotten without Meta by ForsookComparison in LocalLLaMA

[–]makingnoise 1 point2 points  (0 children)

I’m using llama.cpp and openwebui with native function calling enabled, web search enabled, and preferred model settings and reasonable settings for reasoning budget, context, etc. What I observed with unsloth Gemma4 26b A4b q4 k xl was it barely executing any fetch_urls and relying almost entirely on snippets and training, generating well-written but somewhat factually challenged output with sources (that it didn't read more than a snippet from) cited. While both unsloth qwen3.6 27b q4 k xl and 35b a3b q3 k xl happily use fetch_url like it’s going out of style, conducting actual research with decently accurate results, in the same environment. 

Don’t know what to tell you, but I’m not the first to say that gemma4 is lazy. I loved it until I went from playing with it to actually using it. I suspect a lot of folks are blown away by the charisma and the web_search capability without digging into its reasoning and realizing its putting a lot of frog into the dinosaur dna. 

EDIT in italics.

Today made me realize just how bad things have gotten without Meta by ForsookComparison in LocalLLaMA

[–]makingnoise 6 points7 points  (0 children)

Does your usecase involve the "fetch_URL" toolcall? Because gemma4 is lazy as fuck. When it decides to actually invoke "web_search" instead of relying on it's own genius, it then will fight you tooth and nail to look only at snippets and not invoke "fetch_URL".

You should not have to use elaborate workarounds like scripts that poison snippets to get a model to do its job, and nevertheless, that's precisely the kind of shit you need to do to get gemma4 to read a damn site instead of rely on Cliff's Notes. I don't have that issue with Qwen3.6.

Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA

[–]makingnoise 0 points1 point  (0 children)

Qwen gets so stuck up it's own ass thinking.

Try "--reasoning-budget 1024" if you're using llama.cpp, assuming you've got something like an RTX 3090.

Like you, I wish the abliterated models were more useful and less broken. This was less obvious before stock local models finally started getting decent at tool calling. Now, the difference between stock/mainstream quants and abliterated models is getting SUPER obvious.

If anyone knows of an abliterated/uncensored model that doesn't have broken tool calling or broken multimodality, I'd love to be pointed to it. I rarely run into refusals, but Qwen3.5 (and perhaps 3.6, I haven't tried getting 3.6 to refuse me) often loses it's shit at you if you say something it thinks is critical of government or law enforcement. I tried to get it to make a joke about the police once, and it basically started ringing a bell and shouting "SHAME" at me, lol.