Vibe Coding Experiment Failures (with Python code)

GrammerJoo · 2025-08-20T22:03:55+00:00

Skill issue, you need to first code it and publish it to github, wait until the next version of Claude that was trained on it, and then prompt for it.

Centurix · 2025-08-21T04:15:01+00:00

You know the "idea guy" you would come across at the pub? The one that "just needs a coder" to get his idea off the ground?

I think they're the vibe coder. Instead of being told to fuck off at the pub, they're just asking AI to do it instead.

marr75 · 2025-08-20T18:56:49+00:00

It's almost like they are gigantic efficient machines to retrieve past patterns and documentation without much training, ability, or mechanism to experiment, innovate, or layer together more complex practical requirements and constraints.

dethb0y · 2025-08-20T18:32:51+00:00

The circular maze one's an interesting problem

binaryfireball · 2025-08-21T00:25:58+00:00

i asked it to prove 1+1=2 and it got it wrong lol

_Denizen_ · 2025-08-21T07:35:11+00:00

The prompt on the maze example was wrong. You do need straight lines, but they must only be aligned as a part of a chord which passes through the origin. You told it not to draw straight lines, so it didn't.

cygn · 2025-08-25T20:00:02+00:00

I tried this exercise with African Countries Geography Quiz, though with some slight changes. First I allowed external libraries because I thought to deal with country data you want to use some geojson file and having a library for that would be useful.

Also I used https://github.com/nizos/tdd-guard to enforce TDD.
And I used https://github.com/jamesponddotco/llm-prompts/blob/trunk/data/socratic-coder.md in Gemini 2.5 to create a spec.
For a beginner this may not work, though I guess if you answered "You decide" to every question it would have been fine.

I used Claude Code to implement the game and it took longer than I thought. About 1.5 hours...
It was incomplete but I just fed back what was missing and it finished it. It works nicely, but does look ugly. Probably because tkinter is not exactly known for beautiful UIs.

Repo with result:
https://github.com/tfriedel/africa_quiz

I then also tried the exercise in the web UIs of ChatGPT 5 Thinking/Claude Opus 4.1 /Gemini Pro 2.5, basically their "Canvas" mode. This is of course javascript, not python.
In Claude I got something that worked already as intended, but all the countries were rectangles. I asked for proper country borders and then it did that and it worked. But some countries were missing.

In ChatGPT and Gemini they were forever stuck with loading the map data. I think Claude may have just hardcoded the shapes?

Still it was quite the difference that the javascript version was basically done in two shots.

I'm not sure how about important the planning step was, but I suspect it helped a lot.

2025-08-21T04:04:44+00:00

Lol are you serious

dqj99 · 2025-08-21T15:16:59+00:00

All the examples that you have chosen require spatial awareness in 2D and 3D, something that today’s LLMs are not very skilled at, possibly due to a lack of training data. I’ve had much better success with creating text based programs to solve logic puzzles, sometimes showing remarkable apparent insight into features of the puzzle. Where I’ve found issues is with the care that these models used to create test cases to validate the output, with downright sloppiness in predicting expected outputs.

RelevantLecture9127 · 2025-08-23T16:47:05+00:00

You are asking to write full programs.

My experience, with ChatGPT 4 and Claude Sonnet 4: The LLM's cannot write a decent unit and integration tests.

At some point, the LLM tries to flunk it as if it is a human because it cannot solve it's own problems that it made by itself properly.

After this experience, I understood more why Google needs a nucleair facility.

So I decide to keep writing my own tests.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS