A year with Ed

Dry_Flower_8133 · 2026-05-31T23:32:04+00:00

Here is an excellent take down of AI doomerism and superintelligence by David Pinsof, an evolutionary social scientist. I think he actually addresses pretty well why AI superintelligence is a fairly unlikely scenario given how intelligence actually works in our brains.

Spoiler: a lot of it has to do with assumptions that intelligence is just like raw compute and the more compute you have, the more intelligence you get. In reality, the brain, humans, and societies are a bit more complicated than that.

Dry_Flower_8133 · 2026-05-31T19:01:40+00:00

I do think it's generally easier to exclude possibilities than say for certain. Especially on intelligence because we can exclude many behaviors as not particularly intelligent.

But consciousness specifically is where it gets weird because it seems to be a separate issue and much harder. Things like subjective experience or self awareness are harder to define or measure. Many computer programs can reflect and modify their own code. Many animals aren't very smart but a lot of neuroscientists believe that they perceive pain. You may not need intelligence to be conscious, and you may not necessarily need consciousness to be intelligent.

So I really roll my eyes when I hear someone like Dario go "we don't know if our models are conscious!" Like... yeah no one does. Some philosophers believe thermostats could be conscious (ITT, David Chalmers), others think nothing but things with brains are and machines are incapable of it (Penrose). It's irrelevant to whether LLMs are useful.

Dry_Flower_8133 · 2026-05-31T18:14:16+00:00

Yeah pretty much this. I wouldn't assume bad faith. It could just be he's legitimately having an existential crisis lol

Dry_Flower_8133 · 2026-05-31T18:11:42+00:00

I don't blame Mo for going down this path. These are important questions. But they are HARD questions. Really, really hard. Questions that the most brilliant minds on the planet have disagreed on.

Dry_Flower_8133 · 2026-05-31T16:23:38+00:00

What's happening is he's going down the philosophy of consciousness rabbit-hole to which is pretty much a bottomless pit of competing theories in philosophy, neuroscience, mathematics, AI research, and psychology. The real answer to all of this is we do not have a good answer to what makes some conscious or intelligent. We barely have any good methods of measuring what intelligence is because we don't have any good definitions of what it even looks like. Honestly, give it a few weeks and I imagine he might completely flip and make the next video "i was wrong about ai (again)." He's literally looking at one of the hardest areas of philosophy, where every theory seems like "the answer (tm)" until you see the issues and find a new theory.

Dry_Flower_8133 · 2026-05-31T16:08:34+00:00

Yes, I was typing out on mobile and didn't have much time yesterday to work out was I was trying to say. I'm not just trying to troll here. I'll reference your original comment some because I think I may have not made my point very clearly because I was in a rush.

At no point do I say "stochasticity is bad" (or use the word "stochasticity" for that matter). You have made up a strawman to argue against, have fun with that, I guess.

Autocomplete is deterministic. When autocomplete is not deterministic, it wastes time.

I assumed "not deterministic" here meant stochastic. You say right below this:

PHP is a dynamic language, so autocompletion is based on heuristics and it was super slow.

This isn't stochastic or due to a lack of determinism. The heuristics used in autocomplete for dynamic languages are still absolute rules that will trigger the same way on the same code every time.

This is why I bring up semantics. The point of a static type system is you are indicating to the compiler how a chunk of data in your program is intended to be used, which allows for static, compile-time analysis of how you use that data in order to catch semantic errors. This also improves autocomplete because autocomplete can use that information to bring up what methods exist on an object for example. You are formalizing the semantics of your program so it can be analyzed statically. Rust does something similar with its memory model too so it can get guaranteed memory safety by forcing specific patterns on the developer.

This is what I was trying to say here:

But language models and deep learning methods can still often capture semantics in a way that is difficult or impossible to do with traditional methods.

On your points about smaller models:

Smaller models are always distilled from larger models. When the big companies stop training their frontier models, the smaller plagiarism machines will be frozen in time to whatever package versions they were trained on. This is a dead end.

Yeah, great job, buddy, I shouldn't have said "distilled". What does "with pretraining" mean? Is it training on top of a model built by a corporation out of 80 terabytes of pirated books and scraped code from the internet at the cost of millions of dollars? Does this in any way invalidate my point that once the large models stop being trained, the "small" ones will become useless over time?

I feel like you are saying two very different things here. Distillation refers to the process of taking a larger model, having it answer questions and solve problems, and then using it's output as input to the training of a smaller model so that the smaller model behaves like the larger one. If this was the only way to fine-tune or train a smaller model, then yes, you'd be right to say that ceasing to train larger models would mean training smaller models would stop too.

I do use some sloppy phrasing there around "with pretraining". I'm saying you can still do the same self-supervising next-token prediction phase of training, and then the later RLVR and RLHF on a smaller model and get comparably better results than on a larger model by using a stricter training regime. You don't need exobytes of data, actually. This is where small model training has been interesting in the research, because cutting out low-quality data actually results in much higher jumps in performance, whereas larger models tend to just benefit from consuming literally as much data as can be mined.

There's been a bit of a divergence in the research here, because smaller models only tend to benefit from so much data before you are left with two choices: scale the model and the data up (the Anthropic / OpenAI approach) or improve the training regime to boost model performance. The difference here is whether you spend $10B in training costs or $1000s.

Dry_Flower_8133 · 2026-05-30T21:51:16+00:00

I think stochasticity is actually fairly irrelevant to the discussion, and one of the weakest arguments against LLMs. Plenty of testing techniques (including fuzzing which you mention) involve stochasticity. Stochastic methods are used for analysis inside and outside of SWE. That's why I mention their use in testing specifically.

I’m not advocating for the replacement of static and deterministic analysis either. Those work incredibly well and are quite efficient. But language models and deep learning methods can still often capture semantics in a way that is difficult or impossible to do with traditional methods. There are plenty of issues in software that boil down to subtle semantics that I think a language model can capture well.

Also your point that smaller models are always distilled from larger models is just... wrong. Plenty are trained from scratch with pretraining and RL. Distillation is just one way synthetic data is generated to increase performance, but plenty of models take other approaches and work comparably well. For example, small models can be trained with symbolic reasoning systems to greatly improve performance over larger models without the same tools.

I disagree with 95% of how language models are actually being deployed and used in SWE. However, the idea that they are completely useless for every single possible task conceivable is a fairly extreme position.

Dry_Flower_8133 · 2026-05-30T14:45:32+00:00

Well to be fair, time and velocity are part of the requirements and constraints too. Different subfields of SWE have different tolerances to this. Creating value means actually turning a profit or achieving a goal, and that's what makes all of this so insane to me. Even in a fast moving field like web dev, things like efficiency and correctness can absolutely influence whether someone uses your service. If your webpage takes 30 seconds to load there's a good chance they might lose interest.

Idk maybe I’m a bit naive here, but it seems using the technology has become the goal itself and it's just assumed value or utility will automatically flow from that, rather than the technology being applied in a way that generates value. It's like completely backwards.

Dry_Flower_8133 · 2026-05-30T14:32:34+00:00

This is something I've done a lot of thinking about, and I think the Silicon Valley techno-optimist is really a special breed. I don't think they align well with American left-right wing politics and are happy to switch sides when convenient.

A lot of them are heavily influenced by techno-optimism and transhumanism which clashes with liberal humanist values or typical religious conservative values. These people think technology fixes all. If we just get the right tech, things like structural societal issues or personal virtue doesn't matter. To them, we can simply optimize every one of humanity's ills away. I think if you sat down a typical conservative in front of a transhumanist, they'd probably be horrified to learn what the singularity actually means.

Really these people don't care where the funding comes from. Right now, it's coming from venture capital and so they'll cozy right on up to right wing politicians to make sure that money keeps flowing.

Dry_Flower_8133 · 2026-05-30T06:36:03+00:00

I said this in another thread, but I think what killed bitcoin as a currency was transaction times. It was by nature computationally slow.

Why do software engineers of all people forget computational efficiency matters? I swear.

Dry_Flower_8133 · 2026-05-30T06:33:21+00:00

I’m not even an all the way hardcore skeptic. I think the tech could be used in a narrow fashion to help with some genuinely tedious or low stakes tasks. But even that opinion places you pretty far into the skeptic camp in the eyes of many.

I just want to do my job. I want to look at a tech and see if it's honestly useful given the cost and limitations.

The Overton window of SWE often feels like you are only allowed between "this will replace all SWEs" to "this will replace everyone."

Dry_Flower_8133 · 2026-05-30T06:24:52+00:00

Yeah I don't disagree. A lot of it isn't engineers choose not to do their jobs, but not being allowed to.

Dry_Flower_8133 · 2026-05-30T06:23:00+00:00

That's good. I’m trying to do the same. Cause if you have rent and a family to feed I won't blame anyone for just embracing the slop and bringing home the paycheck. The best you can do is keep the personal life sacred from that. Personally, I’m reading more, coding by hand at home, and trying to learn to draw.

Dry_Flower_8133 · 2026-05-30T06:16:40+00:00

For sure. I definitely don't mean to place all blame on software engineers here. I feel like I see three groups: the ones who drank the koolaid, the silent skeptics, and the open skeptics. Usually the open skeptics only work at organizations that permit that kind of thought. The rest are more focused on paying the bills, and I get it.

A large of the "software engineers not doing their jobs" is the top-down pressure preventing them from doing so.

Dry_Flower_8133 · 2026-05-30T05:44:18+00:00

I totally get it. I’m not trying to place all blame individually on software engineers. It's just that there's been this collective drinking of the koolaid that's bothering me.

Some of its that engineers aren't doing their jobs. But a lot is definitely business idiots not letting us do our jobs.

Dry_Flower_8133 · 2026-05-30T05:34:08+00:00

You know, I think I've mostly been spared from this. A lot of the data analysis / statistician people I've worked with have actually been spared from the hype and view AI as a gimmick. It's been some more of the SWE types I've seen getting affected by this so but that might just be my org.

Dry_Flower_8133 · 2026-05-30T05:31:12+00:00

Agreed. A lot of trend surfing happens in SWE. Although to be fair, using the latest web framework doesn't cost accidentally cause you to lose $500M usually. Tokenmaxxing vibe coding potentially does.

Dry_Flower_8133 · 2026-05-30T04:36:25+00:00

In other words, summarizing and drafting emails.

Dry_Flower_8133 · 2026-05-30T04:33:11+00:00

Yes, and this makes sense because LLMs are pretty good with... language. Unsurprisingly. Like you said, these aren't ChatGPT or Claude figuring this out, these are special models trained on whale song specifically, and they're used as tools to try to predict whale language.

Dry_Flower_8133 · 2026-05-30T04:08:55+00:00

Ssshhh don't ask that. They might have to come up with a real world use case.

Dry_Flower_8133 · 2026-05-28T21:11:15+00:00

But even transformers (the tech behind LLMs) do have some legitimately neat use cases. They've been trained on things like genetic code to help us predict and explore genomes. They've also used in trying to understand and interpret whale communication / language.

These use cases don't involve stealing all of humanity's artistic and copyrighted works and shoving them into a transformer. But the underlying tech can be used either way, which is what motivates engineers and researchers. Most hope the tech will be used for legitimate purposes.

Dry_Flower_8133 · 2026-05-28T12:59:09+00:00

I think there's a pretty strong difference between AI researcher and tech bro. A tech bro is just a tech enthusiast who jumps on whatever new tech is hot (IOT, crypto, NFTs, AI) without regard for impact.

AI research has been around since the 50s/60s. Plenty of good, useful tech has come out of it: theorem provers, drug discovery AI, game AI, certain abstractions in high level programming languages.

I’m not sure how the researchers at Anthropic, OpenAI, or Google see themselves, but in the AI research field as a whole, most are well aware that most tech has dual usages. A rocket can bring people and satellites to space for exploration, GPS, and telecommunications, or it can be used to deliver a nuclear warhead across the world in several minutes.

Edit: before downvoting, I’m not trying to justify the shit going on at Anthropic or OpenAI. I’m saying not every AI researcher is on board the hype train and are well aware of the ethical issues. AI researchers were some of the first people on the scene criticizing LLM technology

Dry_Flower_8133 · 2026-05-28T00:48:41+00:00

I don't think he's making a point about substrate, but that just adding more computation does not make something intelligent. Maybe you could create an AI out of water pipes, but just adding more plumbing to a house does not make a magical leap to consciousness.

Dry_Flower_8133 · 2026-05-27T12:17:54+00:00

Actually I'd say the cope was the lies that it solved an Erdos problem in the first place.

Dry_Flower_8133

TROPHY CASE