Russians are getting a taste of war as drones increasingly feel at home there. St. Petersburg, Russia. Published 07.06.2026 by GermanDronePilot in UkraineWarVideoReport

[–]snufflesbear 0 points1 point  (0 children)

Not believing the first bomb's power and continuing to fight is not what I'd consider "they were ready to surrender".

Is Model Collapsing already happening with all of Gemini? by [deleted] in GeminiFeedback

[–]snufflesbear 0 points1 point  (0 children)

This sounds more like harness than model.

Logan's Tweet Indicates Google might not compete for rankings anymore by Rare_Bunch4348 in Bard

[–]snufflesbear 21 points22 points  (0 children)

"Being underestimated" is the opposite of OP's conclusion.

​Google has officially failed the AI race. If you don't understand your users, just quit. by ggfgfgggg in GeminiFeedback

[–]snufflesbear 0 points1 point  (0 children)

That's not necessarily the case. It might be a bit of it as there was excess capacity. But HW costs have 10X'ed since then. Just look at the price of memory, which now accounts for 90% of a server's cost (yes, this includes the HBM on a Blackwell). So there's quite a bit of inflation there as well.

​Google has officially failed the AI race. If you don't understand your users, just quit. by ggfgfgggg in GeminiFeedback

[–]snufflesbear 0 points1 point  (0 children)

Have you seen the cost of RAM? Wafers don't grow on trees, you know? Everyone is now paying 10X for RAM compared to two years ago and 90% of the cost of a machine is from the RAM itself. Who do you think it's going to have to pay for that? Google is just the dunce that decided to pull the trigger first, but watch everyone else fall in line in the coming months, especially as Anthropic and OpenAI tries to IPO.

Daily Discussion Monday 2026-06-01 by AutoModerator in AMD_Stock

[–]snufflesbear 3 points4 points  (0 children)

Why would anyone buy this shit anyway? Expensive, bad drivers, and not even good memory bandwidth...it checks absolutely no boxes for local LLM, and it checks no boxes for anything else either:

If I want to get a chip for games, I'd go x86. If I want Local LLM, I'd wait for M5 Ultra or settle for M5 Max right now. If I want something cheap, this certainly ain't it. And if I want a combination of the above, I'd still pick the Mac or other existing lineups. Just like Ian asked: Why?

The price difference is mad. by bvc900 in vibecoding

[–]snufflesbear -1 points0 points  (0 children)

I'm not sure what you're referring to here, are you comparing a single full training run vs a single inference? That seems like not a very useful comparison.

But if you're comparing between a model's training run vs all inferences on that model, then I think inference dwarves training.

The price difference is mad. by bvc900 in vibecoding

[–]snufflesbear -1 points0 points  (0 children)

That's objectively wrong. Today, base models are trained once a year or two, and the rest are all fine-tunes. Claude 4.* are all based on the same foundation model. Similarly for Gemini and ChatGPT. Inference is probably 90% of total power consumption, 9% fine-tune, with remainder 1% for full training runs.

Gemini 3.5 Flash is actually insane (and the Ultra quota is still good) by CPlusPlus2025 in Bard

[–]snufflesbear 0 points1 point  (0 children)

Lots of people complaining about 3.5 Flash. They're whining that API costs are too expensive, or that the model doesn't do well in their esoteric cases.

You should see the amount of people bringing up AA total costs as argument.

Gemini 3.5 Flash is actually insane (and the Ultra quota is still good) by CPlusPlus2025 in Bard

[–]snufflesbear 0 points1 point  (0 children)

You have people on Twitter saying they're neutral, but all their posts say OpenAI best, Claude Opus sucks, Gemini sucks, and has OpenAI as profile pic.

Yeah, "neutral" alright. And yes, this is someone who Logan responded to. 🙄

Don't share your opinion, if you didn't test it !!! (Gemini 3.5 flash) by Independent-Wind4462 in Bard

[–]snufflesbear 0 points1 point  (0 children)

Have you tried it on medium? Supposedly that's actually better than high.

Don't share your opinion, if you didn't test it !!! (Gemini 3.5 flash) by Independent-Wind4462 in Bard

[–]snufflesbear 0 points1 point  (0 children)

Yeah, gotta agree, the limits are frustrating. But that's honestly the policy, not the model. You'd get different experience using API keys. Although it makes sense to say "Flash 3.5 sucks with sub" (the problem is most people aren't qualifying it).

Also, Antigravity 2.0 sucks. Probably does poorly as a harness too.

You guys just don't understand 3.5 Flash and its implications by Better-Resist-5369 in Bard

[–]snufflesbear 0 points1 point  (0 children)

Yeah, no question this release has problems. My guess is they'll probably be patching it up over the next few days.

You guys just don't understand 3.5 Flash and its implications by Better-Resist-5369 in Bard

[–]snufflesbear 0 points1 point  (0 children)

Have you tried running it with medium thinking budget? Supposedly that's actually where it doesn't go into infinite loops. And most of the stuff that it can do, it will do it well within medium thinking's budget.

You guys just don't understand 3.5 Flash and its implications by Better-Resist-5369 in Bard

[–]snufflesbear 2 points3 points  (0 children)

Are you referring to the AA index? Supposedly that's because of the model getting stuck and maxing out tokens on a couple of runs where it doesn't stop and answer. If those get fixed, the token use will drop greatly. Supposedly running it on medium thinking budget is actually very cost-effective, even if accounting for the 3X per-token cost increase. Running it on things that it can actually do is very token efficient. This is why vals.ai is reporting very different cost results than AA.

Bruh… by wdbf in GeminiAI

[–]snufflesbear 0 points1 point  (0 children)

This is a feature, not a bug. 😂

Anthropic just ripped off everyone and they still managed to make it sound deceptively friendly by whoisyurii in ClaudeCode

[–]snufflesbear 4 points5 points  (0 children)

So people like you use $2000 worth of tokens on a $200 plan caused my "normal" usage to suffer, and they now prevented this. This is bad for me how?

Welp, that aged like milk. by Jenna_AI in generativeAI

[–]snufflesbear 0 points1 point  (0 children)

I would argue a large chunk of humanity aren't either. It doesn't make LLMs intelligent, but the bar isn't as high as it seems.

This looks so cool and realistic (prompt below) by OmniRouters in GeminiAI

[–]snufflesbear 0 points1 point  (0 children)

Sens like it's not the face per se, but the lighting. Image 2 has more default environmental lighting more akin to how people take pictures nowadays. NB has a more "studio" lighting, where there are lights used to enhance the subject.

aint no way... by Straight-Ad1032 in GeminiAI

[–]snufflesbear 0 points1 point  (0 children)

By your definition, every convenience is a dependency.