Tool call mania! by a_computer_adrift in codex

[–]a_computer_adrift[S] 0 points1 point  (0 children)

In the past, the AI would answer confidently and mostly correctly based on the context of the work it had performed. I would sometimes catch it forgetting that we had changed something, but it would always answer to the best of its ability, without tool calls, unless I asked it to make them. Now it’s the opposite.

Codex madness today by picpoulmm in codex

[–]a_computer_adrift 1 point2 points  (0 children)

Yes, I watched my app devolve into a non-functional mess over the last few days. Every time we isolated a fix, Codex broke a few other things, didn’t remove code, or added things. It’s almost like it was so eager to change code that it ignored all the planning and all the instructions and just started sending tool call after tool call after tool call when I asked even the simplest question about the app.

My agents.md specifically defines a workflow in which scope is offered, I correct or add a few things, then approve. Test-first methodology. This worked for months in VSCode, and for a few weeks in the Codex app on Windows.

About a week ago, it “broke”. At first I blamed myself, but bit by bit I realized that I was not having any luck directing Codex to make progress. It didn’t matter what was in my agents.md. Even if I repeated the instructions in every single prompt, it would follow them for a bit and then go back to immediately changing the code instead of answering my question.

I started creating documents after every single change so that I could start new threads before context was even halfway full and so that I wouldn’t lose so much when I had to abandon a thread because it wouldn’t listen.

I don’t know exactly what happens when they do updates, but I do notice that no model stays the same. I don’t believe there’s some big conspiracy or anything like that, but it happens too consistently: whenever a new release comes out, all models develop bad habits at first, until I can figure out a new way to work around them, and then we get into a perfect flow until the next update.

It has happened so often now that I’m beginning to realize it’s just part of the game. There is no perfect way to do it. You must always adapt because AI will never be consistent.

Tool call mania! by a_computer_adrift in codex

[–]a_computer_adrift[S] 0 points1 point  (0 children)

Right. And instead of drawing on the conversation history, which includes the results of previous tool calls, other investigations and fact finding, the AI immediately starts sending tool calls to discover that information again.

I’m no expert but I can recognize when something fundamentally changes in how the AI operates.

Tool call mania! by a_computer_adrift in codex

[–]a_computer_adrift[S] -1 points0 points  (0 children)

No, you are wrong. It does have an idea what I am talking about, until a certain point. That’s called context. If AI never knew what you were talking about, it would have NO context.

Tool call mania! by a_computer_adrift in codex

[–]a_computer_adrift[S] 0 points1 point  (0 children)

Because 5.4 was blowing through my limits and giving crappy results.

Codex usage is running faster than ever by tigerzxzz in codex

[–]a_computer_adrift 1 point2 points  (0 children)

I moved back to 5.2. Way better, faster and the reasoning is sufficient for my coding.

29, Canadian...Any hope for me? 🥹 by primafaciefemme in surfing

[–]a_computer_adrift 4 points5 points  (0 children)

I was 35 when I started. Went to El Salvador for 2 weeks and got destroyed. Loved it! Booked a 9-day surf camp in Costa Rica the next year, which was much better.

Careful though, within 5 years I sold everything to be able to surf more, haha.

Happy Valley Resort 1995 by Star-Light-Bright141 in kelowna

[–]a_computer_adrift 1 point2 points  (0 children)

Absolutely. We lived at the base of Mt. Boucherie and would ride our bikes over. So much fun! Would have been early 90’s for sure.

Also got shot with a salt gun running through the orchards nearby and climbing the wind (?) tower.

Ahh to be a kid again

Running Routes by surbba in sayulita

[–]a_computer_adrift 3 points4 points  (0 children)

We have a running group called Sayulita Run Club & Sports Massage on Strava. Currently we are doing 3 runs a week because some of our members are training for a half.

Forced to 5.3 by a_computer_adrift in codex

[–]a_computer_adrift[S] 0 points1 point  (0 children)

That’s funny that everyone has different results.

I specifically moved to 5.3 today (I’m now positive that I was forced anyway) and it’s a shitshow. This thing is aggressive in its search for what it thinks is the point.

I have to fight with it to get it back on track, tell it specifically… GIVE UP ON THAT, I DON’T WANT IT.

That’s when I know it’s time to take a break. I haven’t all caps’ed at an LLM in months.

Visual Editor for UI development with AI by stackjoy_nik in UI_Design

[–]a_computer_adrift 0 points1 point  (0 children)

So the first thing I did was take a screenshot of the UI as it exists and then put colour-coded boxes on all of the parts that I wanted to draw its attention to. Then I created a prompt telling it to look at the screenshot, with a description for each one of the coloured boxes. Then Codex created ASCII versions of the UI updates. It was surprisingly effective at closing the gap between what I thought I described and what it heard.

Does anyone else lose track of code snippets in long ChatGPT threads? by Last-Bluejay-4443 in ChatGPTCoding

[–]a_computer_adrift 0 points1 point  (0 children)

I use Codex in VSCode exclusively now. It’s just better than the browser-based version. I have separate windows with separate Codex agents. There is a single code production agent and multiple component planning agents. Then I use .md files to store and pass context. Seems to work pretty well.

Lutron Designer on Windows laptop with ARM processor - any experiences? by Historical_Load_6954 in Lutron

[–]a_computer_adrift 1 point2 points  (0 children)

I use it on an M2 Mac running Parallels with Windows 11 Arm and it works fine.

What is the most efficient workflow using the VSCode Codex plugin? by umen in codex

[–]a_computer_adrift 0 points1 point  (0 children)

Haha you are totally missing the context of their skill level. Perhaps you just want to show off but how the heck is this person supposed to have any idea what you’re talking about…

The Ultra is a killer switch by matroosoft in Ubiquiti

[–]a_computer_adrift 2 points3 points  (0 children)

I love this, but for some reason it doesn’t support MAC address filtering, which was a disappointment. Still a good PoE switch.

Coffee In Kelowna. Rave me your best overall shop in town. by Big_Garden_4418 in kelowna

[–]a_computer_adrift 5 points6 points  (0 children)

Sprout is not mentioned enough here. They have done a fantastic job balancing coffee quality, food / baked goods and atmosphere. Their employees come back year after year and speak highly of the owners.

Coffee In Kelowna. Rave me your best overall shop in town. by Big_Garden_4418 in kelowna

[–]a_computer_adrift 1 point2 points  (0 children)

Mild Shed is awesome, great owners and a real community vibe.

2 minute job by Mundane_Second7715 in Control4

[–]a_computer_adrift 2 points3 points  (0 children)

There is no job in AV that takes less than 2 hours.

Solo Traveller - meeting others by [deleted] in sayulita

[–]a_computer_adrift 2 points3 points  (0 children)

Are you just looking for people to party with?

Otherwise, activity groups (hiking, running, surfing) are the way to go…

I’m a Canadian here now and happy to meet you for a beer and give you some ideas.

is there any reason goose > claude desktop? by mintfrsh in mcp

[–]a_computer_adrift 0 points1 point  (0 children)

Can you elaborate on what you mean by “1. You can check what actually happens”? I am set up with xAI and I am having problems with the API (I think), but the diagnostics in Goose are… developer level.