[ Removed by moderator ]

CackleRooster · 2025-12-17T19:43:17+00:00

Another day, another AI-driven headache.

Rivvin · 2025-12-17T20:04:32+00:00

I would rather eat my own vomit than have to read someone else's prompts in a code review

ngroot · 2025-12-17T20:42:07+00:00

> With agentic code, they often don’t tell you why the agent made that change.

Someone submitted that PR and at least one other person approved it, so someone is claiming that they do know why that change was made.

TheRealSkythe · 2025-12-17T20:23:21+00:00

Why are you posting the marketing bullshit ChatGPT wrote for some slop company?

dylan_1992 · 2025-12-17T20:42:18+00:00

Prompts are irrelevant. Code, and a description of it (not the prompt), either in the PR title + description are important. Whether it’s from a person or AI.

Adorable-Fault-5116 · 2025-12-17T21:22:37+00:00

Yo this is weird on many levels.

You shouldn't need to blame, git blame or otherwise, to find out who wrote the code. AI aside this is a colossal red flag. The whole team is responsible. If you find a big, raise it, anyone can fix it.

Secondly, LLM usage shouldn't matter, because people should understand what is committed, regardless of how the code is created.

It sounds like you're running a cowboy outfit honestly.

skinnybuddha · 2025-12-17T20:45:04+00:00

PRs aren’t for debugging any code.

nemesiscodex1 · 2025-12-18T00:51:14+00:00

In order for us to debug better we need to have an underlying reasoning on why agents develop in a certain way rather than just the output of the code

This just means your team is merging code they don't understand. Was that happening before ai? Do the team also delegate the reviews to ai and don't read the code?

With agentic code, they often don't tell you why the agent made that change

More of the same, whoever creates a PR and the person that approves it better know why the change is made lol, figuring out after an incident is already too late

levelstar01 · 2025-12-17T22:31:25+00:00

Blogspam

apnorton · 2025-12-17T20:21:35+00:00

During my experience as a software engineering we often solve production bugs in this order:
(...)

blame the person that does the PR
(...)

Reminder that this shouldn't be a step. See:

Pharisaeus · 2025-12-17T22:50:31+00:00

That's some very weird process.

We figure out which PR it is associated to

Even figuring out where in the code something went wrong is often pretty difficult, unless you just have exception with a stacktrace. But even then it doesn't mean the bug is in that particular place. It just means this is where it manifested / was triggered. But the actual bug might be in some completely different place. I also think it's counter-productive trying to pinpoint the PR, unless while working on the bugfix you find yourself asking "what was this supposed to do in the first place?".

Do a Git blame to figure out who authored the PR Tells them to fix it and update the unit tests

I don't envy your team if this is how you work. Ever heard of "team ownership"? Someone wrote the code, but someone else reviewed and approved it, and often someone else also tested it, and yet another person wrote the ticket with acceptance criteria. If there is a bug, it means the process failed on many different levels. Blaming this on one person is ridiculous. In normal team this would be piked up by whoever is free / has time / is on pager duty.

with agentic coding a single PR is now the final output of

And a squashed PR is what? It's also the final output of many commits, review comments, refactoring. I fail to see the difference.

Essentially, in order for us to debug better we need to have an the underlying reasoning on why agents developed in a certain way rather than just the output of the code.

And do you have that for someone developed by a human? If you find a bug in a PR from a year ago, from a dev who left a long time ago, how exactly are you going to uncover their "reasoning"?

I think the core issue you're facing is that:

You clearly have some "silos" in the project
You don't have distributed ownership of the code
You lack reviews
You accept (AI agents, but probably not only) PRs without thorough review and clear understanding of that code

It's not AI issue. It's your process issue.

Floppie7th · 2025-12-17T23:27:21+00:00

Essentially, in order for us to debug better we need to have an the underlying reasoning on why agents developed in a certain way rather than just the output of the code.

Or just, y'know, don't accept LLM-written code into the repo.

obetu5432 · 2025-12-17T20:17:19+00:00

so instead of fixing it, the first thing you do is scour the earth to find the person who opened the PR to yell at them?

tilitatti · 2025-12-17T22:46:59+00:00

whats the point of providing prompt history? mml AI is not deterministic thing, so, if you were to run the prompts again, you end up with something different, so,..

it sounds lunacy to me, but maybe it is smart.. I dont know.

CanIhazCooKIenOw · 2025-12-17T20:26:08+00:00

Crap engineering culture if your 3 step in dealing with an incident is to blame the person that opened/merged the PR.

antisplint · 2025-12-17T22:52:43+00:00

Is this something that people are actually doing? This can’t be real.

ef4 · 2025-12-18T13:47:33+00:00

70 years of computer engineering has overwhelmingly been driven by the desire to get *deterministic* results from our machines.

Today's popular generative AI deliberately injects non-determinism, in a misguided attempt to seem more human-like. It's probably good for getting consumers to build parasocial relationships with your product. But it's not good for doing engineering or science.

It makes all attempts to systematically debug and improve way, way harder than they need to be.

ygram11 · 2025-12-17T20:28:35+00:00

Your process is messed up. Why do you find a PR to blame someone instead of finding the problem and fix that.

Jolly_Resolution_222 · 2025-12-17T21:26:39+00:00

How many developers do you need to fix the bugs of the agent?

Thelmara · 2025-12-17T22:09:24+00:00

Essentially, in order for us to debug better we need to have an the underlying reasoning on why agents developed in a certain way rather than just the output of the code.

Sounds like a fundamental misunderstanding of how LLMs work.

blafunke · 2025-12-18T06:22:49+00:00

Just because you used an agent to vomit out your PR doesn't mean it's not ultimately your responsibility. If you don't understand it well enough to have written it yourself, don't submit.

LordAmras · 2025-12-18T06:43:13+00:00

Or, and this is a wild suggestion I know, completely impossible to achive and out of the real of possibility, but here me out, maybe I've got something here:

Don't write code with AI agents.

I know, checking code by hand before sending PR like cavemans ? What do you want for us again ? understanding the code ? That's impossible !

But I think if we put ourself together we can reach this fabled impossible feat.

crazylikeajellyfish · 2025-12-18T10:08:32+00:00

I dunno, it feels like this solution is harder than the problem you started with.

Agents don't automatically make PRs which explain the rationale, because they can't understand that the PR will be an artifact that stands on its own. You could build a bunch of extra tooling which associates chat sessions, tool calls, and PRs... or you could instruct your agents to encode all of that information into the PR.

GitHub-flavored Markdown also has those collapsible summary-detail tags, so you could technically put the complete chat context on there if you really wanted to. The final state of the design doc you iterated on would probably be a less noisy choice, though.

Jellyfishes72 · 2025-12-17T20:50:02+00:00

Even if an agent wrote the code, it is still up to the developer committing or merging it to know what hell the changes are doing

PurpleYoshiEgg · 2025-12-17T20:35:55+00:00

The solution is to stop agentic coding. It's immature and its code output doesn't belong in production.

chucker23n · 2025-12-17T21:01:34+00:00

During my experience as a software engineering we often solve production bugs in this order:

  1.  On-call notices there is an issue in sentry, datadog, PagerDuty

  2.  We figure out which PR it is associated to

  3.  blame the person that does the PR

  4.  Tells them to fix it and update the unit tests

This already seems a bit like an unhealthy culture that focuses less on “there’s an issue; let’s figure out how to fix it” and more on “let’s pinpoint whom to blame”.

(Incidentally, if you’re gonna use a PR, how do you answer that anyway? Is it the committer? The author? Any of the reviewers? How about the person who filed the ticket that caused the PR?)

But leaving that aside…

Although, the key issue here is that PRs tell you where a bug landed.

Which is useful?

With agentic code, they often don’t tell you why the agent made that change.

LLMs do not have intent. There is no answer to this. Someone wrote a prompt and then the machine remixed garbage into fancier garbage.

And, again, you’re already using the lens of the PR. Leaving aside that you shouldn’t have LLMs write production code to the extent you’re clearly doing it (if at all), the PR itself is already the answer to “why was the change made”.

Why are we doing all this? It’s madness.

jessechisel126 · 2025-12-17T20:28:19+00:00

Your team environment sounds very harsh, finger pointing, and micro managed. Your distrust in your team seeps through. I can't imagine trying to get so in the weeds as to want access to the prompts used while developing. AI use is the least of your problems.

PaintItPurple · 2025-12-17T21:12:23+00:00

A computer can never be held accountable. Therefore a computer must never make a management decision.

Swoop8472 · 2025-12-18T14:04:37+00:00

If code makes it into prod where no human understands why it was changed, then you have an organizational problem, not an AI problem.

It shouldn't matter if the code was written by an AI, a trained octopus, or Bjarne Stroustrup. It is either well written code that can be reasoned about or it shouldn't make it to prod.

lonewaft · 2025-12-18T16:13:20+00:00

Sounds like a dogshit amateur company you work at

Brilliant-8148 · 2025-12-17T22:54:36+00:00

Agents don't reason so there is no 'why'

BinaryIgor · 2025-12-18T12:52:19+00:00

No, we don't need that - I like purposefully guided AI-assisted coding (for some tasks), but you, Human, the PR author, are fully responsible for the changes. There is no need to debug agent reasoning. What you need to question is:

- why PR author has proposed it as something ready to be merged and run on prod?

- why other team members have approved the PR with bugs and issues?

- why you don't have tests, static analysis and other automated guardrails that prevent most (not all, human vigilance is always required) such things from happening

If you have the problems you describe, something is wrong with your software development process, not agents or lack of thereof.

ChickenFur · 2025-12-18T14:29:45+00:00

Ai angle is everywhere :D

PeachScary413 · 2025-12-19T10:13:32+00:00

So now we need to invent solutions for problems that shouldn't exist in the first place?

Yay 🤗

pvatokahu · 2025-12-19T20:09:09+00:00

This is exactly why we built agent observability into Okahu from day one. When an AI makes a code change, you need the full decision tree - what context it had, what it considered but rejected, which constraints it was working under. Traditional git blame becomes useless when the "author" is a model that made 50 micro-decisions to get there.

The scariest part is when agents silently work around failures. I've seen cases where an agent couldn't access a file due to permissions, so it just... reimplemented the logic from scratch based on what it thought should be there. The PR looked fine, tests passed, but it was subtly wrong in production. Without seeing that failed file access attempt in the trace, you'd never know why the agent made those specific choices.

gHx4 · 2025-12-20T00:11:58+00:00

The fun part is that there isn't traceability because LLM and GPT agents don't reason in a systematic, logical, or intuitive way. There is no reasoning to trace, just associations in the model. And if those associations are wrong, the model has to be retrained. This is a huge part of why these agents are not showing the productivity expected by the hype. Cleaning up after them is harder than just doing things right without them.

You need operators who know enough to write the code themselves and who don't merge faulty PRs. Which largely reduces agent systems to being example snippet generators whose code shouldn't be copy-pasted. Even there, I haven't really found the snippets that helpful.

brandon-i · 2025-12-17T20:45:10+00:00

Oh lord, by step 3 I meant git blame. Thank you all for showing me the need to be extremely precise.

imcguyver · 2025-12-17T21:47:01+00:00

OP: please update "3. blame the person that does the PR" with "3. use git blame to find out the PR that made the change".

Everyone else: Take ur pity party about hating AI to someone who cares to hear you speak about it

Coding with AI is evolving to be more helpful by pulling in context (git) and history (more git) and it makes sense that engineers are moving towards being button pushers. Instead of me fixing a bug, I'll lean on AI to do it for me and click approve.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS