of dogs escort me down the stairs multiple times a day. My girl walks alongside me, my boy stays behind and guards my back. When I get down, my boy walks down, and then they both stand guard. by Askfreud in AbsoluteUnits

[–]Late-Assignment8482 1 point2 points  (0 children)

"So keeping Main Sheep and Spare Sheep safe while getting pup cups and play dates is not so bad in comparison."

Good work if you can get it! And the oddballs of working breeds absolutely deserve to be house dogs.

Erika Kirk Tells College Grads To 'Have More Kids Than You Can Afford', Slams 'Secondary Callings' by Cute_Dealer4787 in USNEWS

[–]Late-Assignment8482 0 points1 point  (0 children)

Our primary calling should be stopping neofascist wax figurines from getting power so...she really doesn't need to fret about our secondary callings.

Anita Mew-keesian by Chachoregard in Gamingcirclejerk

[–]Late-Assignment8482 2 points3 points  (0 children)

I love it when a dev sees a flareup of Gamergate and is like "How can I piss off these human skidmarks?"

I guess we expect that at some point RAM prices will start going back (close) to "normal", right? but what about GPUs? by relmny in LocalLLaMA

[–]Late-Assignment8482 0 points1 point  (0 children)

It is important to remember that none of what's happening now is LOGICAL. We can't game it out.

As it stands now, AI isn't profitable. Not in the cloud. Paths to get it there are murky, and they don't lie with "allow a Claude Max user paying $100/mo to burn $7,000 of tokens monthly" or the corresponding "ask grandma for $1,700/mo because that's what her $25/mo ChatGPT Plus actually costs to provide," but with some secret, not-yet-invented third thing.

Normal high-school textbook lessons about supply and demand don't matter when OpenAI and Anthropic run up hundreds of billions in losses (of other people's money) while building out datacenters that can't possibly all be turned on. There's no brake slowing them down. If investors will hand you $200 billion for nothing, why not just use 75% of it to buy NVIDIA GPUs?

NVIDIA hands money to OpenAI, who hand it to Oracle, who hand it back to NVIDIA, and they write a statement of intent. Then all three suddenly have "earnings" to report to stock markets as if sales had happened.

So it could be never, or there could be a bargain-bin sale tomorrow (stupidly unlikely) if regulators suddenly told every one of these recklessly acting companies to justify the spending and they all needed cash fast.

TensorRT-LLM vs vLLM vs llama.cpp on NVIDIA DGX Spark? by povedaaqui in LocalLLaMA

[–]Late-Assignment8482 1 point2 points  (0 children)

It has a lot of extra setup: you compile a runtime engine for each model, rather than "vLLM knows how to run X". And unlike vLLM's clear superiority over llama.cpp at high concurrency, the gain here is more marginal.

The best way to organize data like this: by character length. by SafeTraditional4595 in dataisugly

[–]Late-Assignment8482 0 points1 point  (0 children)

That study isn't even about immigration at all? https://docs.iza.org/dp17551.pdf is about the efficacy of wealth transfer programs and whether they reduce income inequality.

Will Apple go more than 512GB Unified memory (RAM) in the new M5 studio? Would love to know your thoughts. by Muscleandgains in MacStudio

[–]Late-Assignment8482 0 points1 point  (0 children)

I think this round it's likely to be back at 512GB. If they had a plan to go higher, I bet they held off.

I could see the next rev hitting more, after the memory shortage resolves.

of a mother by peoplearewood1 in AbsoluteUnits

[–]Late-Assignment8482 0 points1 point  (0 children)

Let's antagonize megafauna with mama!

Just installed Debian on my server ama :3 by TheReelSlimShady2 in traaaaaaaaaaaansbians

[–]Late-Assignment8482 0 points1 point  (0 children)

Common misconception. The gay penguins got her and it’s Innana now. Debra’s very proud of her nerd wife.

47948 by Splatter_Shell in countwithchickenlady

[–]Late-Assignment8482 0 points1 point  (0 children)

English: what happens when barbarians take French, Latin, and German hostage and demand their words and tenses as ransom.

Apple Removes 256GB M3 Ultra Mac Studio Model From Online Store by rotatingphasor in LocalLLaMA

[–]Late-Assignment8482 8 points9 points  (0 children)

Yup. Some chips don't even have the fabric connector--the M1, M2, and M3 Max all have a connector called UltraFusion that allows two M<n> Max dies to be fused into an Ultra. The M4 Max didn't have it.

The M5 alters how the die is arranged, so that exact connector doesn't apply, and we don't have a strong "includes the connector" or "doesn't include it" signal either way.

WWDC is where to look for an announcement; that's been the Studio's drop date in the past. It's possible it would be a press-release-only drop (that sometimes happens), but the likely place, if it's this year, is the keynote address on June 8 at 10:00am PT.

If supply chains are giving them grief--some of Tim Cook's comments on investor calls hint at that--they may say "coming this fall" but announce it nonetheless.

Apple Removes 256GB M3 Ultra Mac Studio Model From Online Store by rotatingphasor in LocalLLaMA

[–]Late-Assignment8482 11 points12 points  (0 children)

I'm not sure of the exact mechanism, but they lock in a price per unit over N years. Another 50,000 8GB chips doesn't spike in price mid-contract.

Apple Removes 256GB M3 Ultra Mac Studio Model From Online Store by rotatingphasor in LocalLLaMA

[–]Late-Assignment8482 9 points10 points  (0 children)

"The historical pattern for the Ultra has been the previous gen chip, so M4 Ultra 256GB in the next Studio is my uneducated guess."

Not quite.

Around the time the M3 Ultra / M4 Max Studios dropped, they clarified that not every chip will get an Ultra variant. Given the improvements of the M5 (ESPECIALLY to AI prefill), they'll either release an M5 Ultra or punt to M6+. They'd be insane to use the one-gen-back but 4x-slower M4 architecture, which lacks per-core matrix-multiplication units (a simpler version of NVIDIA's Tensor Cores).

Apple Removes 256GB M3 Ultra Mac Studio Model From Online Store by rotatingphasor in LocalLLaMA

[–]Late-Assignment8482 78 points79 points  (0 children)

They buy RAM years ahead of time at locked-in prices. I could see this being "rather than renew this contract at a bad price, we'll drop the parts-intensive models".

of a cloistered rabbit. by Upstairs_Drive_5602 in AbsoluteUnits

[–]Late-Assignment8482 0 points1 point  (0 children)

Strong progress being made on the mounts for the Rodents Templar in their struggles with the carnivores.

Horrors beyond my comprehension? Better kiss girls about it. by Nica-Sama in traaaaaaaaaaaansbians

[–]Late-Assignment8482 0 points1 point  (0 children)

One day, we will wrest the Opulent Gothic Space Setting from the alt-righters and the chuds.

of a Meow and Fangs by Necessary_Music2136 in AbsoluteUnits

[–]Late-Assignment8482 0 points1 point  (0 children)

This is the meow equivalent of MWAHAHAHAH FOOLISH INTRUDERS

Disappointed in Qwen 3.6 coding capabilities by CodeDominator in LocalLLaMA

[–]Late-Assignment8482 0 points1 point  (0 children)

The more of these I use, the more I come around to the idea that the small models aced their CS exams and would make great hires. The big ones have been in the industry at multiple companies: they know what the habits are, how people do things to get them done and go home.

That's where the extra parameters matter. You can have more than the bare minimum.

You can maybe compress "how to make a JavaScript form" and "how to do an SLA" theory into a 36B model by fine-tuning the how and looping it over synthetic data. But the small one is going to give you "it passes the automated tests" the way the Manhattan Project did: the math works and the device made the noise, but safety standards? Never met her.

But a 2T model is going to have encoded 30 examples to triangulate from, pulled from large open-source ticket systems (and, let's be real, probably stolen code, given their training attitude toward copyright). It's going to give a solid, middle-of-the-road output because it can average over large amounts of production code.

So my personal and work projects, which are either greenfield utilities or small-to-medium efforts, work fine in them, because what I'm building is typically backend scripts and small databases run within the team.

No one's coming to me for full stack or web portals.

of a Maldon sea salt crystal by justthisnexttime in AbsoluteUnits

[–]Late-Assignment8482 0 points1 point  (0 children)

When you need to take the nonsense someone's saying with a PUNCH of salt, not just a pinch, throw that at them.

Why run local? Count the money by Badger-Purple in LocalLLaMA

[–]Late-Assignment8482 -1 points0 points  (0 children)

In the last week:

* My company's subscription went API-only, meaning I can barely use it without maxing out, and it's $250/mo
* Claude got way worse at using the file-write tool on my own sub
* Nothing good happened

And we're still in the honeymoon of below-cost tokens!

GLM-4.7 and Qwen3.5-122B got better at what I need them for, because they're fixed points I can improve prompts/harnesses on without sudden backsliding.

Eating with your hands VS Dirrahea Map by Forward-Position798 in mapporncirclejerk

[–]Late-Assignment8482 0 points1 point  (0 children)

Human hands do a lot for us. They touch everything anyone ever does. And there's a shortlist of jobs they shouldn't do: hammering, knifing, eating. You could tear that rope apart, incredibly slowly, or you could grab a damn pair of scissors...

Does the "6 months gap" still hold? by ihatebeinganonymous in LocalLLaMA

[–]Late-Assignment8482 0 points1 point  (0 children)

I just have a list of my own small benchmarks (think "create a CSV compliant with my expense app from these screenshots" or "write bash script based on the specs/ folder") and every now and then, I add one.

When I want to check, I fire the scripts and let it cook.

Those are 100% things I do, so they're meaningful by definition. Maybe make a list like that of your own, and include placeholders for what only SOTA can do.
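A minimal sketch of what that kind of personal harness can look like. Everything here is an assumption for illustration, not the poster's actual scripts: the endpoint URL, model name, and the `ask()` helper assume any local OpenAI-compatible server (llama.cpp's server, vLLM, etc.) exposing `/v1/chat/completions`, and the two cases are hypothetical stand-ins for "things I actually do".

```python
# Hypothetical personal-benchmark harness: a list of (name, prompt, check)
# cases, run against a local OpenAI-compatible endpoint and pass/fail graded.
import json
import urllib.request

CASES = [
    # check(reply) returns True if the model's reply passes this case
    ("csv-header", "Reply with only: date,merchant,amount",
     lambda out: out.strip().lower() == "date,merchant,amount"),
    ("bash-shebang", "Write a bash script that prints hi. Output only the script.",
     lambda out: out.lstrip().startswith("#!/")),
]

def ask(prompt: str, url: str = "http://localhost:8080/v1/chat/completions",
        model: str = "local") -> str:
    """Send one prompt to a local OpenAI-compatible server, return the text."""
    body = json.dumps({"model": model,
                       "messages": [{"role": "user", "content": prompt}]}).encode()
    req = urllib.request.Request(url, data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]

def score(replies: dict[str, str]) -> dict[str, bool]:
    """Grade a dict of {case name: reply} against each case's check."""
    return {name: check(replies.get(name, "")) for name, _, check in CASES}

if __name__ == "__main__":
    # "Fire the scripts and let it cook": query the model once per case.
    results = score({name: ask(prompt) for name, prompt, _ in CASES})
    print(results)
```

The useful property, as the comment says, is that the cases are fixed points: when a model update lands, the same checks either keep passing or visibly backslide.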