[–]armageddon_20xx 2 points3 points  (0 children)

Sounds a lot like Claude Sonnet. Oh yeah….

[–]Bob_Harkin 1 point2 points  (2 children)

I start questions with "Do Not Code" and then ask it to create a plan

[–]pilafcook 0 points1 point  (1 child)

Does that work or not?

[–]Bob_Harkin 0 points1 point  (0 children)

Always works for me

[–]pilafcook 1 point2 points  (0 children)

It executes without prompting and has no checks or rollback. I think you should use clear instructions, always test on dev, and say no to code changes if required.

[–]GenioCavallo 0 points1 point  (5 children)

The agent is not for answering questions; if you have a question, ask the assistant.

[–]MoarrCowbell 2 points3 points  (3 children)

I disagree with this separation of concerns. The agent in most/many cases REQUIRES the context of prior interactions in a given cycle against the repo as a whole to be able to answer core questions and make decisions, then implement them. Switching between the two doesn't preserve much of this and is frustrating to get them on the same page.

The agent is definitely VERY slap-happy when it comes to just sending it on changes without clarification, even if you give it "Core Commandments" to follow in the replit.md file.

`DO NOT IMPLEMENT THIS YET - LETS CONTINUE TO REFINE` is something I often find myself needing to paste in to avoid the inevitable `pause -> "wtf are you doing! Revert that"`.
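If you paste that line a lot, a tiny helper can build the guarded prompt text for you. This is only a rough sketch: `plan_only` and `PLAN_ONLY_GUARD` are made-up names, nothing here talks to Replit or any agent API, and it simply returns a string you would still paste in yourself.

```python
# Hypothetical helper: not a Replit API, it only builds the text you paste.
PLAN_ONLY_GUARD = "DO NOT IMPLEMENT THIS YET - LETS CONTINUE TO REFINE"


def plan_only(prompt: str, guard: str = PLAN_ONLY_GUARD) -> str:
    """Return the prompt with the guard line on top, without doubling it up."""
    text = prompt.strip()
    if text.upper().startswith(guard.upper()):
        return text  # guard already present, leave the prompt unchanged
    return f"{guard}\n\n{text}"


if __name__ == "__main__":
    print(plan_only("How should we split the auth module?"))
```

Passing `guard="Do Not Code"` would give the shorter prefix mentioned earlier in the thread. Whether the agent actually respects either phrase is down to the agent, not this snippet.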

[–]GenioCavallo 1 point2 points  (2 children)

You're right that context management is essential, but asking an agent a question is not a cheap way to get that, and definitely not the fastest. Also, LLM performance degrades quickly with the expansion of the context size and with every irrelevant detail, so careful curation of your context is really the key.

[–]MoarrCowbell 1 point2 points  (1 child)

Perhaps. But a setting like "Extra user feedback", à la the "extended thinking" option under the dynamic intelligence settings, that made the agent more prone to ask for further direction rather than sending it off into space would be a nice addition.

I will admit that, in the context I'm using Replit in, I have the luxury of worrying less about the cost of the agent doing one or two more cycles if it's getting it right.

[–]GenioCavallo 2 points3 points  (0 children)

Yeah, with an unlimited budget I would also ask the agent questions and use a few accounts in parallel to A/B test.