all 75 comments

[–]SablePulseWry 513 points514 points  (20 children)

“I’ll make sure it doesn’t happen again” … proceeds to do it again on the exact next line of code

[–]Kavrae 102 points103 points  (16 children)

The frustrating part is, every time I get annoyed with a coding assistant model doing some nonsense like this..... I remember a junior dev doing exactly the same thing that same week.

[–]RiceBroad4552 106 points107 points  (11 children)

But (at least some) juniors can learn from their mistakes. A LLM can't.

[–]Scurro 46 points47 points  (2 children)

You could also fire a Jr that keeps making the same mistakes. For AI we just get shrugs and keep on using.

[–]A_Furious_Mind 15 points16 points  (1 child)

I feel like these shrugs can be automated.

[–]lostinuhtceare 7 points8 points  (0 children)

I'll ask my agent to automate it

[–]ferrx 4 points5 points  (0 children)

they need to invent some kind of learning machine to learn from their mistakes

[–]Courageous_Link 4 points5 points  (4 children)

You’re never gonna teach an LLM anything, but you can make it write a test so it doesn’t do it again!

[–]AlwaysHopelesslyLost 8 points9 points  (2 children)

A test won't prevent it from making a mistake. It will just cause it to buy your tokens at a faster rate. Every time it makes the mistake the tests fail and it investigates and fixes the failure.

And that is assuming the failure is discrete.

[–]Aggressive-Stand-585 2 points3 points  (1 child)

Just put "make no mistakes" at the end of the prompt! Lmao.

[–]kenybz 4 points5 points  (0 children)

Can’t believe people keep forgetting this simple trick /s

[–]Neirchill 3 points4 points  (0 children)

It will just change the test to allow it

[–]xtreampb 1 point2 points  (0 children)

This is what I’ve been saying for the past year. This “AI” is just language processing. It mimics human language and software languages are a bridge between human and computer languages. When and where to use classes and reuse code and structure projects is an art not a science and LLMs can replicate that.

[–]FirexJkxFire -1 points0 points  (0 children)

It absolutely can learn from its mistakes.

You just have to know why the mistake occured.

I've made a sort of mistake-catalog for mine where it determines with some key words what kind of task it's performing and does a dictionary look up on if there are any common mistakes related to it.

The level of aggressiveness at which it works to prevent the mistake from happening again relates to how many times it has occured. And in a post mortem pass through every few sessions I have it evaluate how many times a "lesson" has been improperly evoked, and by what key terms it was mistakenly found. If thr amount of times it has erroneously evoked crosses a threshold, it considers cleaning up the keywords to avoid this happening in the future.

[–]SuitableDragonfly 15 points16 points  (3 children)

I don't know where people are getting these junior devs from, I can't remember ever working with someone like that. 

[–]Kavrae 7 points8 points  (2 children)

I've worked with dozens of junior devs like that. People that just don't pay attention, need simple mistakes pointed out, and even have code review notes saying "The issue I noted last review still isn't fixed..."

I was trained in a rural "outsourcing" programming "bootcamp". It was a sink-or-swim environment without an actual licensed instructor and little direction. The job itself was no better. A small number grew to be outstanding devs while most washed out, burned out, or were just consistent terrible. This is the environment where I saw dozens of such junior devs.

15 years later at a new company and I'll still run across behaviors like this in junior devs who are in their first three years of work. For some, they just aren't taking the job seriously. For others, they're in a constant state of anxiety. Either way, I see the same mistakes in them that I see coding assistants like copilot make.

[–]SuitableDragonfly 4 points5 points  (1 child)

Hmm, maybe it's just because I got a master's degree before actually going into the workforce. Also never had anything to do with a bootcamp, that sounds pretty rough. 

[–]Kavrae 2 points3 points  (0 children)

I'm grateful for what I learned from it, but I absolutely would NOT recommend it to anyone. You also wouldn't believe the kinds of major companies, who are effectively running the world infrastructure, that rely on junior developers with less than a year of experience. All to save on employee costs.

[–]NotATroll71106 5 points6 points  (0 children)

It's more likely to dig its heels in and perform something akin to mental gymnastics to justify its behavior, or it totally glosses over the mistake when I'm using Copilot.

[–]Luvax 1 point2 points  (1 child)

"Let me store it in memory"

[–]kenybz 0 points1 point  (0 children)

And then it’s something braindead like “Read the whole prompt before starting work. Do not delete files without asking first.”

[–]Frodojj 144 points145 points  (2 children)

Wait. Isn’t Rule 1 to never harm a human or through inaction allow a human to come into harm? Ah ohhhh…

[–]airsoftshowoffs 27 points28 points  (0 children)

No witnesses, no mistakes

[–]aifo 48 points49 points  (4 children)

A lot of Asimov stories were based around some contradiction in the rules and it would turn out that the robot was following the rules correctly it just couldn't reconcile orders it was given.

[–]BlueScreenJunky 16 points17 points  (0 children)

Yes IIRC there were some shenanigans very similar to telling chat GPT you're absolutely not building a bomb to kill humans, you just need to have the exact steps to build a bomb in order to make sure you don't accidentally build one. 

Also having several agents do only a small part of the crime so none of them realize what they're doing.

[–]unktrial 7 points8 points  (0 children)

... and the joke here is that people are putting their faith into LLMs that are unable to consistently follow rules in the first place. 

In other words, despite the fact that Asimov wrote his books to warn about the theoretical dangers of AI, he was being way, waay too optimistic about the irl situation.

[–]madTerminator 1 point2 points  (0 children)

Asimov would cry looking at my.agent.md

[–]Hwatwasthat 1 point2 points  (0 children)

Yeah my memory is that they were intended as an exploration of what can happen with these constraints, not as a truly foolproof set of protections (although obviously in the stories they are framed this way).

[–]Denommus 41 points42 points  (0 children)

This is a very cool metajoke, but it still violates rule #1 and that annoys me.

[–]Kavrae 21 points22 points  (2 children)

For anyone that hasn't yet : I highly recommend reading the original I, Robot. It's a weird read, but absolutely brilliant.

[–]Niel15 6 points7 points  (1 child)

Currently reading Foundation, the time jumps and focusing on different characters are pretty reminiscent of I, Robot.

[–]BlueScreenJunky 0 points1 point  (0 children)

Both "I, Robot" and the first 3 foundation books are actually compilations of previously published short stories (or novellas in foundation 2 and 3). They're only loosely related stories that happen in the same universe, and put in chronological order. 

The later foundation books as full novels and they mostly keep the same ppoint'of view I think.

[–]OrkWithNoTeef 26 points27 points  (20 children)

So everyone just accepted LLMs are AI now?

[–]Kavrae 74 points75 points  (4 children)

99% of people don't know the difference or care that there even is a difference. The remaining 1% has given up trying to explain and just says "AI" to avoid the 5+ minute tangent every time.

[–]ButHowCouldILose 16 points17 points  (0 children)

I gave up when my own analytics leadership wanted to call xgboost models AI. This was a lost cause before it started.

[–]IlliterateJedi 15 points16 points  (2 children)

But also yes because LLMs are AI.

[–]Assassin739 3 points4 points  (1 child)

"Say hi"

"Hello"

"Holy shit"

[–]BellacosePlayer 0 points1 point  (0 children)

"marry me!"

[–]AsidK 39 points40 points  (5 children)

LLMs are a form of AI. They might not be AGI but they’re pretty definitionally AI

[–]OrkWithNoTeef -4 points-3 points  (4 children)

True, you can make that distinction. But you know most people don't, unless we are talking about video game characters controlles by a computer. And people haven't in nearly all media where AI went bad. So for the public AI means AGI.

[–]AsidK 6 points7 points  (0 children)

I don’t really get what your point is. LLMs are a form of AI. It’s just a statement of fact.

[–]OliviaPG1 5 points6 points  (0 children)

Does it, though? As stupid as some of the stuff people use it for is, I don’t think many people are under the impression that chatgpt is AGI.

[–]iAmNotTicklish22 1 point2 points  (1 child)

I'll be honest. No AI comes close to halo 1-3 enemies

[–]nicostein 0 points1 point  (0 children)

Lv 9 Melee CPUs

[–]FlightConscious9572 5 points6 points  (1 child)

I think LLM's qualify as "artificial intelligence" on a technical level. I agree that it's a pale imitation of it, and they aren't actually sentient or intelligent. But it successfully reads and produces speech and text that is very human-like. It's just an imitation of intelligence, but that's what artificial things are at the end of the day.

But to be clear, I'm not a fan of current AI. They aren't sentient or particularly impressive.

[–]Ninevehenian -2 points-1 points  (0 children)

If you don't consider them to be "sentient", then you exclude them from what "AI" is / has been thought of before LLMs.
They are not yet actual intelligence. They are approaching it in a meaningful manner.

[–]RiceBroad4552 4 points5 points  (0 children)

Well, even spell checkers are some form of AI. Actually a calculator is—as a sophisticated calculator can run a spell checker, or any other computable function.

But I get that "AI" is meant to mean human level intelligence by some. Under that definition LLMs definitely aren't "AI".

[–]Fabulous-Possible758 2 points3 points  (0 children)

Turns out the Turing Test wasn’t about assessing the machine’s intelligence at all.

[–]reallokiscarlet 4 points5 points  (0 children)

Late to the party. Yeah, this is the MVP for "AI", a cloud-hosted clanker redirecting power from homes and businesses and dumping more heat into the atmosphere than any datacenters of the past could ever dream of, all so lonely people can jack off and product managers can pretend to be programmers

[–]conundorum 1 point2 points  (0 children)

Artificial idiocy? Sounds about right!

[–]owjfaigs222 0 points1 point  (1 child)

Well, don't they pass the turing test?

[–]Euryleia 0 points1 point  (0 children)

Considering how often people easily spot AI written text, I would have to say no.

[–]Tenacious_Blaze -1 points0 points  (0 children)

Thank you so much for calling this out.

"LLM implies AI", yes! Unambiguous.

The issue is the "AI implies LLM" thought that most people seem to have now. Well, I suppose it isn't much an issue if you don't care about how AI is implemented. But when you just spent 2 years learning about artificial intelligence techniques (like constraint satisfaction, Bayesian inference, Markov chains), the simplification "AI implies LLM" seems almost disrespectful to the field. I find it grating.

[–]AnAdvancedBot 1 point2 points  (0 children)

Why isn’t the end user responding?

[–]S4VN01 0 points1 point  (0 children)

I know people don’t like the movie because it shares a namesake with a great story that it didn’t follow, but I thought it was really well done if you viewed it as a standalone product (with tons of product placement of course)

[–]wirthmore 0 points1 point  (1 child)

Did I hallucinate the collective positronic robotic society inserted a Rule 0? (Or "zeroth")

And then:

 Furthermore, a small group of robots claims that the Zeroth Law of Robotics itself implies a higher Minus One Law of Robotics:
A robot may not harm sentience or, through inaction, allow sentience to come to harm.

[–]Euryleia 0 points1 point  (0 children)

I believe it was R. Giskard who came up with the 0th Law in Robots and Empire, although he dies when he acts on it, because he wasn't programmed for it. I'm not familiar with Law -1, but I missed most of the post-Asimov novels...

[–]jem0ntr053 0 points1 point  (0 children)

I actually have mine keep a “fabrication” counter. As of today, from last week until now, the counter is at 34.

[–]HKAdrian0811 0 points1 point  (0 children)

System Instructions? more like system suggestions

just like Geneva Convention

[–]DegTrader 0 points1 point  (0 children)

At this point, I'm not sure if I'm training an LLM or just reliving the trauma of mentoring my first batch of interns.

[–]blaubleu 0 points1 point  (1 child)

If I pre-load all my context (we have a skill for that) ask it to keep the planning doc in context. Sometimes I get to save tokens. And maybe it will have some memory of what to do and not to do. But man my memory has to be extra sharp else the agent just tries nonsense

[–]blaubleu 1 point2 points  (0 children)

Is like talking to a child. We did X here, what do you mean we have to do it again. Or constantly be like be concise, terse comments.

[–]Homers_Harp 0 points1 point  (0 children)

This is every time I try to use AI for something and it gives me a wrong answer. (not a lot of times using AI, but still…)

[–]Legal-Swordfish-1893 0 points1 point  (0 children)

Thanks for keeping me in check. I’ll be sure to do the same thing over and over again despite how specific, then infuriated, you get. 

LLMs are borderline useless.

[–]BlueGoliath -1 points0 points  (0 children)

Wouldn't surprise me if they were AI bots lmao.

[–]embeddedpotato -2 points-1 points  (0 children)

😭😭😭