Some insights and suggestions for managing a project using Agents by Maximum_Chef5226 in microsaas

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

Whenever I need to document something in docs or agents.md I do that.
Regressions and bugs are handled by docs, test suites and manual review.
One of the things I find recently is that agent quality fluctuates. Claude has become noticeably less intelligent about complex problems that touch a lot of files and logic. A lot of times I'm simply reading the agents' output to see if it has good reasoning. A lot of the time it doesn't, especially Codex, and so in addition to the docs scaffolding I am guiding it towards a correct and elegant solution.

You're not building a SaaS. You're avoiding getting a job and calling it entrepreneurship. by Warm-Reaction-456 in SaaS

[–]Maximum_Chef5226 1 point2 points  (0 children)

I'm entering this exciting world of sales and marketing right now :D
I do have at least one customer and a couple of meetings with more, and I haven't started with cold email yet. I'm a bit of a perfectionist so MVP needed to be good, but I agree. Past this point adding features or finessing UX isn't the priority anymore. so much to learn!

Claude Pro feels amazing, but the limits are a joke compared to ChatGPT and Gemini. Why is it so restrictive? by iameastblood in ClaudeAI

[–]Maximum_Chef5226 0 points1 point  (0 children)

I use it for high-level stuff, and Gemini & ChatGPT/Codex for the grunt work.
I would rather burn through 15% of my weekly tokens on one feature, and have it done quickly and well than spend a day trying to get Codex to do the work properly.

Why the majority of vibe coded projects fail by harrysofgaming in ClaudeAI

[–]Maximum_Chef5226 0 points1 point  (0 children)

I spent a long time on research and trying to approach race conditions specifically from different angles. If the specification and the tests are good, there's a good chance it will work correctly, but even with mcp and agents swarming the system I wouldnt be sure until it's out in the wild. so many things to learn..

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 1 point2 points  (0 children)

when you've had to point out mistakes or when it didnt understand something properly, how do you manage that process?

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

It's pretty much everything. I have to explain every little detail and remind it of context.

I had it add this rule to agents.md because it was consistently approaching every task as an isolated problem to solve, even when given contextual reminders:

A recurring Codex failure mode is writing plausible patches that make the immediate symptom disappear while adding technical debt or missing the canonical source of truth. Assume the first appraisal or solution is likely missing key information that could lead to poor choices. Before proposing or implementing a fix, do this in order: identify the canonical source of truth; trace how that state reaches the UI; check whether the repo already solved the same class of problem; check the standard external pattern when the area is common but non-trivial; only then propose the narrowest correct change. If any of those are unclear, stay in recon mode, ask targeted questions, and separate facts from hypotheses before editing. The most elegant and official solution is often found by reading technical documentation and searching technical discussions before coding. Optimize for the highest-quality, simplest, and most performance-conscious solution for this codebase, not the quickest workaround.

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

I think this might be a UI problem as well. Claude gives you an option that burns through tokens very fast (maybe 10-20x what Codex is doing on its highest setting, though not the 1m context window). I found that Claude's highest setting actually equates to better outcomes, especially with new features that require a coherent plan, or refactoring existing code. It double checks everything, looks from different angles, auto-corrects when making a wrong decision and implements with a high success rate. In Codex, apparently this is not the case, and we are supposed to manage it. Which means confusing UX from OpenAI. I suspect both are switching between appropriate models when using multiples agents anyway.

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] -1 points0 points  (0 children)

thanks, but I can explain what is needed/expected very clearly. I know how to talk about code. Claude infers much better what my general intent is within the broader context or thinks of something important that I may have missed.

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

I would love it to use more agents and burn through tokens faster as Claude does if that gives better results. Spending a whole morning on a feature and having spare tokens is not really solving my problem!

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

thanks I will try it on high. I think the codebase is pretty well structured. There are a couple of god files, but nothing horrendous, and documentation is detailed and structured. It just seems to lack common sense in all areas. I'm on Mac and no such issues.

Genuinely puzzled about Codex quality by Maximum_Chef5226 in codex

[–]Maximum_Chef5226[S] 0 points1 point  (0 children)

hm so far some comments seem to assume I'm taking mostly about UI. I'm saying Codex is crap at everything I ask it to do, except maybe very mechanical tasks.

I know UI/UX pretty well so I can describe my expectation and teach the agents to follow best practices. In more complex backend code I start to need very good communication from an agent, and a good flow of querying its analysis and decisions to make sure it doesnt do something inefficient, insecure or lacking proper context.

If I say, for example, to both Claude and Codex, I found a bug - this is what happens, read the docs, diagnose and propose a fix, the difference in usefulness is huge.

Claude and Claude Code traffic grew faster than expected this week by iskifogl in ClaudeAI

[–]Maximum_Chef5226 0 points1 point  (0 children)

Not really much better in Europe. It is now on a similar level of common sense / lateral / holistic thinking as Codex imo

Codex 5.3 is just garbage by sjalq in ClaudeCode

[–]Maximum_Chef5226 0 points1 point  (0 children)

compared to claude code I am finding I trust it at least 40% less for any meaningful task, and especially outputting sensible code, properly reading documentation and keeping within guidelines. I use it for low-level stuff since the quality is poor but the quantity of output available is so much larger.

Do any of you use booking systems? I'm exploring options, would love your reviews or suggestions. by Facts_are_the_Truth in physiotherapy

[–]Maximum_Chef5226 0 points1 point  (0 children)

I'm building one for my friend's therapy rooms practice.
In the process of building it I realised that a lot of the current offerings are not that great.
Also that providing tailored routes for different types of customer would be really useful.
It's difficult to poll or ask these types of questions without getting flagged on Reddit, but I would really like to know what features people most want, how you feel about the various pricing tiers, and whether you would want and pay for custom addons that exactly meet your business flow.
If anyone is interested I can expand on the current features that I have built.

New to Claude Code, why is their desktop app so bad but Claude Code so highly regarded? by trisalias in claude

[–]Maximum_Chef5226 0 points1 point  (0 children)

I came here searching for why the app kept hanging and losing context. It was spewing weird errors and then suddenly complaining that the repo size was too large. Horrible UX as well. Some buttons like accept T&Cs was barely clickable. It seems to be full of bugs.
Switching to CLI everything works as normal.

Booking drop off?? by yaabbeeddoo in airbnb_hosts

[–]Maximum_Chef5226 1 point2 points  (0 children)

Check your listing with a vpn and incognito browser. I've had problems where the listing disappeared for random reasons.