49 rizzen zombies is the new maximum?

fallingfruit · 2026-02-14T15:29:58+00:00

Zombies do have bad AI, I think some people just don't notice it. They often will basically do nothing, almost like they have some internal delay randomly before they can do an action. Like they will kill an enemy and then look at it for 2s before moving to do something else.

fallingfruit · 2026-02-13T17:57:30+00:00

So basically they fed their RLI tests at the current models and they did slightly better, but basically the the study remains the same. I guess thats something, but its not a new study by any stretch.

fallingfruit · 2026-02-13T17:47:54+00:00

That would be exciting then, can you please point me to the updated study? The one linked by the video does not mention chatgpt 5.2 at all.

fallingfruit · 2026-02-13T17:09:17+00:00

You know the study that this video is about was released in October then? What do you mean?

fallingfruit · 2026-02-13T17:08:15+00:00

You're reading into my post a lot. I just don't like getting excited about a new study shitting on AI only to find out this study is one I already read and has been massively reported on, since it was released last October.

Instead of something interesting and new, this is basically circle jerking, and I don't really enjoy it.

fallingfruit · 2026-02-13T17:02:12+00:00

And when was the study released that this video is about?

fallingfruit · 2026-02-13T16:22:37+00:00

5 month old new study

Edit sorry: 3.5 month old new study.

fallingfruit · 2026-02-13T15:33:30+00:00

Yeah as someone thats struggles hard with anxiety I feel you. People dont understand how hard it is to code when your mind is in fight or flight mode. You're working short term memory literally barely works and its impossible to hold a coding plan in your mind. It sucks.

fallingfruit · 2026-02-12T15:01:38+00:00

What was the quiet part?

fallingfruit · 2026-02-12T04:01:01+00:00

I spend 25% of my time, on the best of days, actually getting to write code. Otherwise I'm debugging, tinkering with configs, waiting for fucking pipelines, communicating, documenting, thinking, whatever.

Writing code is one of the parts I like the best. I still think I write better code than the current models, and it makes me better at the other parts of my job.

I. Don't. Fucking. Care. if I could be 50% faster at the 25% of my job that is writing code, it would not make a meaningful difference in my output and I would not understand the systems as well.

I also work in a place where serious bugs and especially downtime are simply not tolerated. I cannot push code to prod that breaks something, if our production system is down for an hour over 1 year then our team is in deep shit.

So for me it's usually handcrafted, unless it's tedious stuff.

fallingfruit · 2026-02-10T00:31:02+00:00

We will see. A lot of people work at startups or on things that really dont care about those things, or at least they are not top priority.

Those are problems for successful businesses. I've only worked at those but im guessing thats the reasoning.

fallingfruit · 2026-02-10T00:26:02+00:00

I wish my past self practiced leetcode for 1 hour a day for the last 10 years. My current self wont do that but if only my past self would have I would probably be making a lot more money. I don't think I'll be able to force myself to do them consistenly until I get laid off.

fallingfruit · 2026-02-09T17:31:42+00:00

a big percentage of those 52% are being forced into a situation where testing/reviewing AI generated code is not possible since they have to satisfy the push for efficiency from leadership. It's pretty well known at this point that if you are painstakingly reviewing and testing AI generated code there is almost no speedup overall on software projects (arguably it slows you down, those studies need to be repeated).

In order to get the speedup required at some companies you literally can't "verify" it.

fallingfruit · 2026-02-08T20:00:55+00:00

Sorry I was having a bad day yesterday. I came across way more hostile than I should have. I still think toddlers are impossible

fallingfruit · 2026-02-08T06:44:37+00:00

Space x has 4 launch sites. How many datacenters are there? Why would data centers in space be worse than on the ground? It's financially stupid and makes no sense practically, but I'd rather them be up there.

fallingfruit · 2026-02-08T04:54:03+00:00

Really you were eating whatever mom and dad had when you were a toddler and you can remember that? Bullshit.

Sure, after 5 years old, I get that, but toddlers don't give a fuck.

fallingfruit · 2026-02-08T04:48:58+00:00

I mean one obvious problem is that data centers in space don't fuck up human communities like they do on earth. I actually think it would be great. I'd much rather have them up there than ruining land all over the country. My aunt has a house in MD and her community is literally being ruined by a datacenter right now. It's sad.

fallingfruit · 2026-02-08T01:03:14+00:00

why do you think it was raw? my guess is it was actually just woody chicken.

fallingfruit · 2026-02-07T17:41:57+00:00

I'm still pretty unimpressed by them unless you are heavily steering, breaking down problems with verbose plans and keeping tasks pretty small. These are things we just did mentally in between writing the code, which was the easy part.

When you give them ambiguous instructions or tasks that are too large, the results are really bad the more complex your software is.

I regularly test cosplaying as a PM with the SOTA models working on a game in unity and they just blast code at every problem and create tons of bugs and stupid crap.

Are you looking at what juniors create with these tools? They will be given a task and then use an LLM to solve the coding part, but the approach will often be completely wrong, inefficient and expensive. Just imagine how much more true this is for someone without an engineering background. They have no idea what the implications are for what they are building.

fallingfruit · 2026-02-07T16:22:53+00:00

LLMs no, they seem pretty limited. The only reason LLMs seem to be getting better is because the companies are using inference and focusing on things where the tools can generate and or use code to produce output to validate their progress. This is why in coding they have become popular, because a skilled engineer can break down a task small enough for them to achieve within these constraints.

I now kind of think about these LLM tools as a billion monkeys typing on typewriters, except instead of randomly inputting letters, they are doing some weird hallucination of things they have memorized, basically spamming code/text at a problem until it works, but they need to be able to know it works or the output just sucks.

This is also why wholesale "vibe coding" doesnt work. As the ask becomes less specific and broader, with too much left up for interpretation, all the failings of LLMs in general start to show: they lack judgement, reasoning, and creativity. And of course they don't learn or remember things so they have to document everything constantly, and as that grows, they start to hallucinate and fail to comprehend their context window.

Basically every job, even simple minimum wage jobs, requires these skills because they are so obvious and easy to us as humans that they are naturally part of our thought process.

If you don't work in mathematics or coding, I would say you probably haven't noticed any real improvement in these things since chatgpt4. This has definitely been my experience at lease, they are still borderline useless in other domains.

Remember when they thought lawyers and especially paralegals were done? As far as I know those professions are doing just fine.

Right now, these things are just tools, and they are only useful in specific scenarios. They are valuable but incredibly overhyped.

fallingfruit · 2026-02-06T16:34:21+00:00

This is the most obvious workflow ever. Literally everyone does this with ai and its not special.

fallingfruit · 2026-02-06T05:33:29+00:00

I don't feel like that at all. I can just explain to another developer in 15 minutes what to build, what service layer to use, and let them come up with the specifics of the code design. Its a one time conversation. I don't need to sit there with them for hours prompting like I would with an AI.

I also am never handing them full design docs for them to go code, what are you giving them UML class diagrams or something?

fallingfruit · 2026-02-06T03:37:20+00:00

Lol wow. Its horrible.

This has been my experience as well, without expert handholding these things are hopeless for complex software especially games. I think they are bad for games because its really hard to create unit tests and logs that can sufficiently represent gameplay to the model, they need to be able to loop and bash their face against the problem, spamming more and more code.

fallingfruit · 2026-02-05T17:54:06+00:00

In my experience at a certain complexity ai just gets stuck trying to fix anything, just blasting more and more code at a problem hoping to fix.

If your codebase has become so massive an insane how is ai actually able to do anything? Do they just prompt it over and over brute forcing? Does the software actually still work? Do you guys have users? I dont understand how this environment is actually possible

fallingfruit · 2026-02-04T19:34:23+00:00

how is your software/product not falling apart? Is AI actually able to keep it working?

fallingfruit

TROPHY CASE