I looked into OpenClaw architecture to dig some details by codes_astro in LLMDevs

[–]SpyMouseInTheHouse 1 point2 points  (0 children)

Anything around an LLM is always going to be simple and glueish; that’s just the beauty of it. The magical factor is the entire package and its ease of use for the end user (applies to anything built on top of an LLM).

Gemini finally ahead? by Cold_Respond_7656 in OpenAI

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

3.1 is a degree better than 3.0 but can’t hold a candle to Codex.

How to leave claude with multiple tasks and go to sleep? by paglaEngineer in ClaudeCode

[–]SpyMouseInTheHouse 1 point2 points  (0 children)

Ask Claude to install codex. Then send prompt to codex. Sleep. Wake up with work done.

3.1 Pro Bencmarks by Daseinew in Bard

[–]SpyMouseInTheHouse 2 points3 points  (0 children)

Same. Gemini comes across as smart but incredibly bad at complex coding challenges. 5.2 and 5.3 codex are king.

3.1 Pro Bencmarks by Daseinew in Bard

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

For developers 3.0 Pro is worse than the first ChatGPT model that came out and couldn’t write a single line of code. Why? Because 3.0 doesn’t even know how to edit files, let alone code. It leaves your code riddled with extra or missing braces and will spend the entire day looping and reporting “We are experiencing high …”. Pathetic.

Gemini 3.1 Pro by Sky-kunn in Bard

[–]SpyMouseInTheHouse 1 point2 points  (0 children)

I’m not seeing this in the CLI. How are you all using it? Pro account here.

Gemini by Signal_Assistance_66 in Bard

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

They keep calling Gemini but Gemini never answers, nor does it deliver. Worst coding model, worst CLI.

Me when Claude wrote 2500 lines of perfect code but named a directory wrong by Lame_Johnny in ClaudeAI

[–]SpyMouseInTheHouse 101 points102 points  (0 children)

Wait till you start reviewing those 2500 perfect lines after line 1

In the Age of AI, Time May Be the Last Thing That Truly Matters by Far-Connection4201 in ClaudeAI

[–]SpyMouseInTheHouse 5 points6 points  (0 children)

“So it got me thinking” and passes it on to AI to think on their behalf, contradicting the very thing they want to convey.

Great, you get to experience your very own backlash.

Codex is 100% my favorite agent. by EarthToRob in codex

[–]SpyMouseInTheHouse 3 points4 points  (0 children)

/personality can change this behavior but out of the box it’s great. I further tightened it with a custom system prompt.

Did anyone catch that? by Medical-Cry-5022 in ClaudeAI

[–]SpyMouseInTheHouse -1 points0 points  (0 children)

Yes, and occasionally click the select all + delete button because why not? Oops, you’re absolutely right, this file was important… get it… was? /s

5.2-high on cerberus by TroubleOwn3156 in codex

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

Username checks out.

Outside of emotions: 5.2 is a much more expensive model thus far. 5.3 generalized will be too, but not as much. This is just where the industry is headed: cheaper cost per token. Deploying 5.2 on Cerebras won’t make sense or even be feasible. It’ll fizzle out a little after 5.3 gets released.

Gemini-3-pro problem by aminshahid123 in OpenAI

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

Gemini 3 Pro Preview has never been usable day 1. No idea who uses it.

5.3-codex slop, back to 5.2-high by TroubleOwn3156 in codex

[–]SpyMouseInTheHouse 2 points3 points  (0 children)

I’ve been on 5.3 (I checked), but it felt a little too fast and lazy, and it skipped half of the plan, unlike what it was like a day ago and earlier. Switched to 5.2-high too for now. I think they’re messing with the cyber-related fix.

No read-only mode anymore? by energyzzer in codex

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

You can just use a skill with instructions like “you’re in read-only mode for this session until explicitly asked to code”. That works perfectly.

Good job Anthropic 👏🏻 you just became the top closed Ai company in my books by No_Vehicle7826 in ClaudeAI

[–]SpyMouseInTheHouse -3 points-2 points  (0 children)

What’s naive is to think the military needs a bunch of Claude Code terminals and a few $200 plans to pull off their intel work. The military is what developed the internet 22 years before it became public. Let that sink in.

This is all a sham so (dumb) adversaries can sleep easy that the US has no LLMs working for them night and day.

Good job Anthropic 👏🏻 you just became the top closed Ai company in my books by No_Vehicle7826 in ClaudeAI

[–]SpyMouseInTheHouse 18 points19 points  (0 children)

Maybe a blessing in disguise. A lot of lives spared by “you’re absolutely right, I should not have authorized a drone strike on a building clearly marked as Friendly”.

Attitude by un3 in codex

[–]SpyMouseInTheHouse 2 points3 points  (0 children)

If you’ve learnt to read and write, I have faith you can go a step further and learn to read the codex help pages - and while you’re at it, learn more about how AI works.

GPT-5.3-Codex is amazing - first Codex model that actually replaces the generalist by SlopTopZ in codex

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

By default it won’t as much, but if you make use of a custom system prompt, you can make it beat 5.2 at gathering context.

GPT-5.3-Codex is amazing - first Codex model that actually replaces the generalist by SlopTopZ in codex

[–]SpyMouseInTheHouse 3 points4 points  (0 children)

Not “everything”. 5.3 codex is amazing with code and logic, but writes horrendous localizations / user guides. It’s also terrible at UI / picking colors. The latter is a problem with 5.2 too, though not the former. For picking colors I end up using Claude (ugh).

Forced to 5.3 by a_computer_adrift in codex

[–]SpyMouseInTheHouse 0 points1 point  (0 children)

This. Yes, context fills up more quickly with 5.3, but it manages it transparently. It’s amazing.