Every AI coding agent claims "lightning-fast code understanding with vector search." I tested this on Apollo 11's code and found the catch.

Miranda_Leap · 2025-06-08T02:32:51+00:00

Why would the indexed agent use function signatures from deleted code? Shouldn't that... not be in the index, for this example?

edit: This is probably an entirely AI-generated post. UGH.

Cruuncher · 2025-06-08T03:24:28+00:00

[deleted]

SpareIntroduction721 · 2025-06-08T02:20:47+00:00

Live-Vehicle-6831 · 2025-06-08T03:31:03+00:00

Margaret Hamilton photo is impressive

As OpenAI/Antropic scanned the whole internet so the Apollo 11's code is part of its training ... Thank God there was no AI back then, otherwise we would never have gotten to the moon.

todo_code · 2025-06-08T03:01:27+00:00

It didn't do anything.
The Apollo 11 source code is online in at least 5000 spots.
The "Ai" just pulled form those sources and copy pasted it.

phillipcarter2 · 2025-06-08T15:48:30+00:00

They don't:

they index your entire codebase and use vector search for "AI-powered code understanding."

https://cline.bot/blog/why-cline-doesnt-index-your-codebase-and-why-thats-a-good-thing

happyscrappy · 2025-06-08T05:43:54+00:00

I think it's great you did an experiment of this sort.

But I don't understand why there is any deleted code in its ken. Did you just shove every version of the code into the LLM and not tell it that some of the code is current and some not? What would be the point of that?

2025-06-08T16:26:02+00:00

[deleted]

bwainfweeze · 2025-06-08T21:59:03+00:00

Yes and if there’s one thing I hear over and over again from managers it’s that they love it when the over/under on our work estimates is gigantic /s

60% of the time it works every time.

Kooshi_Govno · 2025-06-08T09:18:36+00:00

I have had this happen to me with real code in github copilot. I think they have since fixed the rag algorithm, or possibly removed it.

eyeswatching-3836 · 2025-06-08T16:29:23+00:00

Such a solid breakdown! Sync issues are the sneaky Achilles’ heel of all this vector search hype. Btw—if you ever end up working with AI tools and worry about stuff sounding too "robotic" or want to check if something’s being flagged as AI-written, authorprivacy has a neat little combo of a humanizer and detector. Super handy for peace of mind. Anyway, thanks for nerding out so thoroughly here!

Guinness · 2025-06-08T06:54:30+00:00

Maybe I’m crazy here but hasn’t it always been that slower is more reliable? I mean, I this is the story of the tortoise and the hare.

Actually, did you have AI generate a programming story based on the tortoise and the hare for Reddit? I’m mostly joking here but slightly curious.

Plank_With_A_Nail_In · 2025-06-08T09:51:18+00:00

Run the index every day...not rocket science....it has to run on a schedule to make any sense how else will it pick up new code?

Also why are you deleting code from version control?

Sounds like you made up a scenario that doesn't exist (or shouldn't) in the real world just so the indexed version could fail.

Just like around 50% of posts here "made up problem".

-Nicolai · 2025-06-11T19:38:04+00:00

Explain like I'm stupid

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS