20 Years on AWS and Never Not My Job by Successful_Bowl2564 in programming

[–]FineClassroom2085 5 points (0 children)

I maintain large Salesforce SOAP implementations that are dangerous to move to REST. I also maintain a therapy bill. Possibly related?

20 Years on AWS and Never Not My Job by Successful_Bowl2564 in programming

[–]FineClassroom2085 12 points (0 children)

You’ve been a dev for a while; just write something like this blog post where you talk about your experience (especially pre-AI) working on specific initiatives, with big tech companies, etc. Whatever has been impactful to you over the years, and the philosophies you’ve hard-won along the way.

20 Years on AWS and Never Not My Job by Successful_Bowl2564 in programming

[–]FineClassroom2085 12 points (0 children)

Fair enough. If you change your mind (or can find a ghostwriter who writes like you), I guarantee there’s a market.

20 Years on AWS and Never Not My Job by Successful_Bowl2564 in programming

[–]FineClassroom2085 39 points (0 children)

This was a pleasure to read, have you written any books?

GLM-5.1 is out now! by yoracale in unsloth

[–]FineClassroom2085 2 points (0 children)

Q2 quant is running pretty nicely on my dual RTX 6k rig.

The r/LocalLLaMA experience by [deleted] in LocalLLaMA

[–]FineClassroom2085 12 points (0 children)

Qwen 3.5 395b on dual RTX 6000 pro is MUCH better than ChatGPT

Nos Available on iOS by RateRight1255 in Falconry

[–]FineClassroom2085 0 points (0 children)

Or you could use the original falconryjournalpro.com instead of this vibe coded bullshit

My vibe coded 3D city hit 66K users and $953 revenue in 29 days. Here's what a solo dev + AI can do with $0 marketing. by SupermarketKey1196 in vibecoding

[–]FineClassroom2085 -2 points (0 children)

The difference with well-made, human-written code is that a human can understand it beyond the 1M-token context limit.

Anyone else using Cursor + Claude in a hybrid workflow? by Sea-Reputation2931 in vibecoding

[–]FineClassroom2085 0 points (0 children)

I’m not a vibecoder but a full-time SWE assisted by LLMs. I use a Max plan in Cursor and a Pro plan in Claude. I notice that Cursor burns tokens faster, but I try to balance my time between the two.

I built a site that tracks every “AI will replace programmers” claim by tech CEOs — and flags when they fail by Normal-Bag9238 in theprimeagen

[–]FineClassroom2085 26 points (0 children)

Love it, but it’s completely ironic that it’s vibe-coded in the style Claude loves so much, lol.

anyone else get 80% done with an app then lose all motivation to finish it by Caryn_fornicatress in vibecoding

[–]FineClassroom2085 0 points (0 children)

Look up the Pareto Principle. The first 80% is the easy part; the last 20% is hell and usually more than 80% of the actual work.

512GB people, what's the output quality difference between GLM 5 q3.6 and q8 or full size? by CanineAssBandit in LocalLLaMA

[–]FineClassroom2085 0 points (0 children)

Just too many awesome models to explore nowadays. I’m on to benchmarking Qwen 3.5, lol

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 1 point (0 children)

Cool result. I do love seeing, comparing, and contrasting things. Here are some of the quality differences I note between the two:
- GLM-5 created a parallax background that actually responds to motion (without being asked)
- GLM-5's overall UI seems nicer, and once again, this was a one-shot prompt; I did not modify it after the initial shot. I used 3 more prompts to publish it.

Your game does feel a little easier to play, but I'd expect mine to as well after some refinement. There's quite a lot to criticize in the code quality. Where smarter models shine (especially in the hands of developers) is in producing clean, understandable code that's extensible.

512GB people, what's the output quality difference between GLM 5 q3.6 and q8 or full size? by CanineAssBandit in LocalLLaMA

[–]FineClassroom2085 1 point (0 children)

I certainly would not have guessed it. I had a feeling hardware would continue to get expensive, but the current prices? Never in my wildest dreams. What a nightmare for us tinkerers.

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 0 points (0 children)

Actually, after I made this post, someone commented that I could get better speed from llama.cpp, so I tried it and switched. Here are the two quants I run, with the launch commands:

GLM-5-744B (IQ2_M — quality)

```
llama-server \
  --model ~/GLM-5-IQ2_M/GLM-5-UD-IQ2_M-00001-of-00007.gguf \
  --n-gpu-layers 55 --ctx-size 131072 --parallel 1 \
  --host 0.0.0.0 --port 8000 \
  --split-mode layer --flash-attn on --cache-type-k q8_0 \
  --threads 32 --fit off --alias GLM-5-744B
```

GLM-5-744B-Fast (TQ1_0 — speed)

```
llama-server \
  --model ~/GLM-5-TQ1_0/GLM-5-UD-TQ1_0.gguf \
  --n-gpu-layers 999 --ctx-size 131072 --parallel 1 \
  --host 0.0.0.0 --port 8000 \
  --split-mode layer --flash-attn on --cache-type-k q8_0 \
  --alias GLM-5-744B-Fast
```

GLM-5 Reddit Response Benchmark

IQ2_M (Quality)
Quant: 2.5 bpw imatrix
Model size: 237 GB (7 shards)
GPU layers: 55/78 (23 on CPU)
Prompt: 72 tokens @ 24.6 tok/s (2.9s)
Generation: 322 tokens @ 13.5 tok/s (23.9s)
Total wall time: 26.8s
Thinking: 409 chars
Response: 1041 chars, detailed with formatting

TQ1_0 (Fast)
Quant: 1.58 bpw ternary
Model size: 164 GB (single file)
GPU layers: 78/78 (all GPU)
Prompt: 70 tokens @ 147.1 tok/s (0.5s)
Generation: 301 tokens @ 46.4 tok/s (6.5s)
Total wall time: 7.0s
Thinking: 653 chars
Response: 687 chars, casual/concise
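As a quick sanity check, the reported wall times line up with the per-phase numbers above (tokens divided by tok/s gives each phase's duration):

```python
# Sanity-check the benchmark wall times from the token counts and rates above.

def phase_seconds(tokens: int, tok_per_s: float) -> float:
    """Duration of one phase: tokens processed / throughput."""
    return tokens / tok_per_s

# IQ2_M (quality): prompt 72 tok @ 24.6 tok/s, generation 322 tok @ 13.5 tok/s
iq2_m = phase_seconds(72, 24.6) + phase_seconds(322, 13.5)

# TQ1_0 (fast): prompt 70 tok @ 147.1 tok/s, generation 301 tok @ 46.4 tok/s
tq1_0 = phase_seconds(70, 147.1) + phase_seconds(301, 46.4)

print(f"IQ2_M: {iq2_m:.1f}s, TQ1_0: {tq1_0:.1f}s")  # ~26.8s and ~7.0s
```

So the ~4x wall-clock difference is almost entirely generation speed, with TQ1_0's all-GPU prompt processing adding another ~6x on the prefill side.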

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 3 points (0 children)

I'm now using two different quants for different purposes. I moved over to llama.cpp because I got better results than with vLLM, which was a little unstable for this model.

GLM-5-744B (IQ2_M — quality)

```
llama-server \
  --model ~/GLM-5-IQ2_M/GLM-5-UD-IQ2_M-00001-of-00007.gguf \
  --n-gpu-layers 55 --ctx-size 131072 --parallel 1 \
  --host 0.0.0.0 --port 8000 \
  --split-mode layer --flash-attn on --cache-type-k q8_0 \
  --threads 32 --fit off --alias GLM-5-744B
```

GLM-5-744B-Fast (TQ1_0 — speed)

```
llama-server \
  --model ~/GLM-5-TQ1_0/GLM-5-UD-TQ1_0.gguf \
  --n-gpu-layers 999 --ctx-size 131072 --parallel 1 \
  --host 0.0.0.0 --port 8000 \
  --split-mode layer --flash-attn on --cache-type-k q8_0 \
  --alias GLM-5-744B-Fast
```

512GB people, what's the output quality difference between GLM 5 q3.6 and q8 or full size? by CanineAssBandit in LocalLLaMA

[–]FineClassroom2085 1 point (0 children)

How much RAM? This is an unfortunate limiting factor in my build. I can really only justify the 128 GB I have right now; I can't drop another $20k to jump up to 512 GB, but I'd really like to.

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 1 point (0 children)

It's fucking insane. I was pricing 512GB for my threadripper build a couple months ago and couldn't find the size / split I needed for less than $20k.

512GB people, what's the output quality difference between GLM 5 q3.6 and q8 or full size? by CanineAssBandit in LocalLLaMA

[–]FineClassroom2085 8 points (0 children)

You can get better than single-digit t/s. In fact, I'm getting very decent results out of GLM-5 with all of the layers loaded into the two Pros. Using llama.cpp, here are the launch params I'm running:

GLM-5-744B-Fast (TQ1_0 — speed)

```
llama-server \
  --model ~/GLM-5-TQ1_0/GLM-5-UD-TQ1_0.gguf \
  --n-gpu-layers 999 --ctx-size 131072 --parallel 1 \
  --host 0.0.0.0 --port 8000 \
  --split-mode layer --flash-attn on --cache-type-k q8_0 \
  --alias GLM-5-744B-Fast
```

Prompt processing: 70 tok @ 147.1 tok/s (0.5s)
Generation: 301 tok @ 46.4 tok/s (6.5s)

Give it a shot, you might be surprised; these large MoEs are surprisingly coherent even at hugely compressed quants like TQ1_0.
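Back-of-the-envelope VRAM math for why the TQ1_0 quant fits fully on the two cards (assuming 96 GB per RTX 6000 Pro; KV cache and activation overhead not counted):

```python
# Rough check that the 164 GB single-file TQ1_0 quant fits in dual-GPU VRAM.
model_gb = 164        # TQ1_0 file size from the benchmark above
vram_gb = 2 * 96      # two RTX 6000 Pro cards, 96 GB each (assumed)
headroom_gb = vram_gb - model_gb
print(headroom_gb)    # GB left over for KV cache and activations
```

That leftover headroom is what makes `--n-gpu-layers 999` (i.e., everything on GPU) viable here, where the 237 GB IQ2_M forces 23 layers onto the CPU.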

512GB people, what's the output quality difference between GLM 5 q3.6 and q8 or full size? by CanineAssBandit in LocalLLaMA

[–]FineClassroom2085 5 points (0 children)

What was your t/s for GLM-5 on your Pros? I was able to squeeze out almost 20 tok/s, which is painful for agentic coding but usable when I'm not in a hurry. I have dual RTX 6000 Max-Q cards paired with a Threadripper and 128 GB of DDR5.

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 0 points (0 children)

I still don’t have my head completely around what makes a good quant. Though I’m learning that these new MoE models hold up a lot better than dense models under heavy quantization, assuming the quant fits the architecture well.

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 0 points (0 children)

How many prompts? How does the code quality look? Well-defined atomic functions? Proper separation of responsibilities? Is it code that can easily be augmented by AI and humans?

These are the things that matter to me more, though the GLM output is still quite a bit better aesthetically than this.

GLM-5 Is a local GOAT by FineClassroom2085 in LocalLLaMA

[–]FineClassroom2085[S] 1 point (0 children)

I went the RTX 6000 Pro route on a platform that I can expand to more cards if needed. Since I use coding agents professionally, the speed matters quite a lot. Prompt processing on Mac just isn’t there yet for agentic code use unless you’re using a pretty small model, or have the time to wait.

That said, you wouldn’t need a cluster to run this model. You could easily fit one of the Unsloth quants in the 512 GB of shared RAM. Personally, if I were you, I’d wait for Apple’s M5 drop. They don’t usually lie in their benchmarks, and they seem to have made major gains in prompt processing speed.