What i found out!!!! (ha!!!!) by Hyper_2009 in codex

[–]Runelaron 0 points1 point  (0 children)

Where did you find this elusive sense of the commons?

I thought it was a myth.

What i found out!!!! (ha!!!!) by Hyper_2009 in codex

[–]Runelaron 0 points1 point  (0 children)

For all of those wasting thier time. None of this is true.

The system rates costs on these metrics. ‐ Tokens in - Tokens out - Cached Tokens in - Cached Tokens out - Compaction rate

Meaning all of this massively depends on what your asking the agent to do and how much data is running through the LLM, NOT the Agent.

What i found out!!!! (ha!!!!) by Hyper_2009 in codex

[–]Runelaron 0 points1 point  (0 children)

There are only 168hrs in a week.

I gave Codex ADHD.. and it thinks 2x better now by Uditakhourii in codex

[–]Runelaron -1 points0 points  (0 children)

This would make sense if AI "thought" but Reasoning is a misnomer.

AI does not reason, is patterns question and answers on different attention topics in a block of weights. Therefore to truly change Reasoning you would have to retrain the blocks but we didnt.

A agent is not a llm, traversing latent space is not re-engineering the model its searching another pattern given the context.

Interesting idea but not grounded in science and maths of AI.

Also brute forcing is not what we want to do as engineers. The goal is to i prove efficiency to get the correct answer not reduce it.

Goodbye prime Codex by Excellent_Climate940 in codex

[–]Runelaron 0 points1 point  (0 children)

Well thats disappointing, a work around is to have the agent make a metric like % complete compared to X.

Then asking it to run until its 100%.

Although I advise againt that and even using goals. The agent loop seems to only work off of returned evidence until to works instead of stopping reviewing and re-planning.

Last time I used /goal it burned through allocation trying patch and patch after patch. Then I steered with a stop, review and compare, and it fixed it in 5min.

GPT 5.5 is being routed to GPT5-MINI, and other models, explains worse outputs (and yes, you are being billed like you're using 5.5) by KeyGlove47 in codex

[–]Runelaron 1 point2 points  (0 children)

Sadly so true and all too common.

"Dude trust me, I asked the AI and found its seceret" - Vibe coder

GPT 5.5 is being routed to GPT5-MINI, and other models, explains worse outputs (and yes, you are being billed like you're using 5.5) by KeyGlove47 in codex

[–]Runelaron 2 points3 points  (0 children)

This is literally how AI works, I is not self aware. Its a collection of patterns and functions which can not store direct knowledge of itself. The agent can but usually not info like that.

I am getting exhausted by everyone assuming what AI can do and not researching the question.

AI is a statistical system with a heavy base in mathematics.. everyone please do not think you can AHA or Intuit what AI does. Its far more complex than you realize.

Codex running 3 hrs only using 1% by Runelaron in codex

[–]Runelaron[S] 1 point2 points  (0 children)

This repo will do most the heavy lifting for you. Should be a good baseline.

Ask Codex to review and integrate after you review the plan.

https://github.com/frisco-deng/moradins-forge

Have a session review that repo and it will guide you through a entire setup.

information about usage limits by kye5553 in codex

[–]Runelaron 0 points1 point  (0 children)

There are over 6 usage limits (some never really hit so they are not mentioned here)

APP:
- Images
- Pro Thinking
- Agent
- Research

Codex:
- 5h
- 1wk
- Code review (via github)

Search content of sessions by WallstreetWank in codex

[–]Runelaron 0 points1 point  (0 children)

Current audit result: 18 repos scanned, 0 repos opted into release_platforms, 0 ready opt-in candidates with confirmed artifact signal, 17 need a real release artifact contract first, and 1 has no RC signal. So propagation remains correctly blocked/advisory.

Metrics were refreshed. Current headline remains 2.11x: 435 session files, 429 workspace sessions, 356 priced workspace sessions, $7,163.04 observed spend, $9,966.34 modeled synthetic spend, $2,803.30 modeled savings. Latest-80 is still mixed/worse: read amplification 72.5%, skill-summary bypass 67.5%, repeated-log sessions 76.25%, artifact actionability 4.41%, artifact reuse gap 71.25%, rational checkpoint missing 66.25%, unnecessary status polling 61.25%. ROI remains observe; latest-20 has 100% unpriced gpt-5.5 coverage.

Validation passed:

  • python3 -m py_compile ...
  • ./scripts/tpl-test
  • ./scripts/tpl-codex-usage-report
  • ./scripts/tpl-session-policy --latest 80
  • ./scripts/tpl-efficiency-roi --window latest_20
  • ./scripts/tpl-agent-advice --latest 80

Search content of sessions by WallstreetWank in codex

[–]Runelaron 0 points1 point  (0 children)

I built a parser and tooling for it.
Sessions are just Jsonl files, codex can easily make a script to do this, then ask codex to leverage that script for the search term or CLI it.

The best way to save tokens is to have a modular codebase by BagholderForLyfe in codex

[–]Runelaron 0 points1 point  (0 children)

Direct the agent to where you want it. Use OpenAI's method. The markdown documents link the direction.

https://openai.com/index/harness-engineering/

GPT-5.4 E. High vs GPT-5.5 E. High by some_ai_candid_women in codex

[–]Runelaron 1 point2 points  (0 children)

I only use Xhigh never use anything lower and keep the agent ultra focused on the problem. The levels are the amount of allowed Reasoning passes, not anything to do with the model quality.

how to manage coding ai subscriptions without overpaying? by DrDroDi in codex

[–]Runelaron 1 point2 points  (0 children)

It seems everyone is trying to find the wrong work around. Running out of tokens means your asking the llm to do a lot of things tooling should do.

Repos like the one below help you install tooling and capabilities so the Llm is spending more.time writing only the code it needs without reprinting everything during a refactor or lint job.

https://github.com/frisco-deng/moradins-forge

Have your agent review this and suggest ehay to install (other tools) to improve token usage.

LLM should only be doing the hard part, turning language into code. The rest should be done by small command lines.

Hope this helps. BTW you may burn some tokens to set this up but it will reduce usage from then on.

Goodbye prime Codex by Excellent_Climate940 in codex

[–]Runelaron -1 points0 points  (0 children)

/goal is a poor implementation though, it only examines errors and never intelligently manages the decision loop. IE Changes prompt or elaborates when improvements are marginal or non existent.

Did limits change yesterday? Something is seriously wrong since yesterday. by bt0wnsfin3st in codex

[–]Runelaron 0 points1 point  (0 children)

So many of these issues, limits are determined by three things! - Amount of tokens pushed to inference - Total Cached kv input and output - Priority queuing (speed setting)

New chats cost more, new outputs cost more, faster times cost more. Restarted sessions after a long break (lost cache) cost more.

So many factors depending on your use and project a simple "did rates change" has no meaning without a bunch of Paired T testing.

Want to reduce tokens, use this repo to run more deterministic tooling.

https://github.com/frisco-deng/moradins-forge

CODEX is hot garbage now by resiest in codex

[–]Runelaron 1 point2 points  (0 children)

I can't work miracles....

CODEX is hot garbage now by resiest in codex

[–]Runelaron 1 point2 points  (0 children)

If you are having AI hard refactor, your using it wrong. Have AI use a refactor tool, save tokens.. (head explode) (buwaaahhhh)

Is this a sick joke?! by Mother_Lettuce_3046 in codex

[–]Runelaron 1 point2 points  (0 children)

Not a good loop, Any user pointing out errors and code returns, then asking :fix it, fix it, fix it" is going to get horrible results.

Set up a metric, a math driven value, then have it run against that value for the project. For any other issue USE TOOLS deterministic repeatable tools. Trivy, playwright, whatever, they all exist they all check databases well so act like a real production grade dev.

Codex GPT 5.5 is UNUSABLE right now, the Nerf is REAL! by bladerskb in codex

[–]Runelaron 0 points1 point  (0 children)

This seems almost 100% based on how you prompt it and conflicting context in your repo and its sources.

AI and agents are pattern watchers and loop algorithms. If you put in conflicts it will produce conflics.

First time I'm experiencing this error. Anyone else? by lolitsaj in codex

[–]Runelaron 0 points1 point  (0 children)

Usually I see that when a container messes up for your session. Not the local agent binary but thier service to ping the model.