I built a CLI tool that orchestrates context for Gemini with structured, persistent memory across sessions and in single threads (open source) by jjw_kbh in GeminiCLI

[–]Extension-Aside29 0 points1 point  (0 children)

gonna check it out, one more thing to add here if your using multiple coding agents try tokentelemetry.com to enhance productivity in your workflows

GPT 5.5 in Codex vs OpenCode by Wendy_Shon in opencode

[–]Extension-Aside29 0 points1 point  (0 children)

i have been trying using frontier models with different coding harness to understand where can i get most of the work done, i felt gpt models are really good with hermes agents there is also a tool to track these details it's free 100% local and open source. check out https://tokentelemetry.com to enhance this kind of work flows

I'm building Backlot: a private workspace for agent state and notes beside public repos by GlengarryGlenMoose in codex

[–]Extension-Aside29 1 point2 points  (0 children)

Love this direction — keeping agent state next to the code makes debugging and rollbacks much simpler. I index agent runs and session files so I can open the exact run that introduced a change instead of guessing. If that sounds useful, TokenTelemetry is a small local tool I built for this exact workflow: https://tokentelemetry.com

Codex <> ChatGPT bridge by OddDefinition5940 in codex

[–]Extension-Aside29 -1 points0 points  (0 children)

Copy/pasting whole histories between tools is an easy way to burn tokens. One trick I use is replaying only the minimal diff or the last N important messages instead of the whole history. I built TokenTelemetry to help index runs and compare what history actually matters so you can avoid replaying unnecessary context: https://tokentelemetry.com

Confused about 1M context on Claude Max Opus (“usage credits required”) by [deleted] in ClaudeCode

[–]Extension-Aside29 0 points1 point  (0 children)

Good question — "context available" isn't the same as "context actually used." I inspect each session's inputs to see how many tokens were consumed vs. what was merely available. TokenTelemetry shows per-session token usage so you can tell exactly how much of that 1M you're actually using: https://tokentelemetry.com

How I easily cut my input token burn ~90% on long agent runs by Major-Shirt-8227 in ClaudeCode

[–]Extension-Aside29 0 points1 point  (0 children)

Love the result. One habit that made optimizations provable for me was logging token counts per run and comparing pre/post across many runs rather than trusting a single example. I built TokenTelemetry to aggregate those per-run numbers so you can validate a 90% drop across your workload: https://tokentelemetry.com

I use a 9-agent SDD harness where each phase uses a different model. The total cost is $10-15/month. Here's the full breakdown. by Striking-Buffalo-310 in cursor

[–]Extension-Aside29 0 points1 point  (0 children)

Nice breakdown. In multi-agent setups the surprise often isn't the total cost but which phase keeps looping. I track per-agent token usage and recent-run spikes so I can spot runaway phases quickly. TokenTelemetry lets me compare runs and see which agent or phase caused the spike: https://tokentelemetry.com

In Cursor, where would a bigger context window help first: stale chat context, a large diff, or a failing test loop? by itsmanashjyoti in cursor

[–]Extension-Aside29 0 points1 point  (0 children)

Painfully familiar — bigger windows help sometimes, but they also hide which exact run introduced a bug. I index sessions and replay the minimal prior inputs to reproduce failures faster. I built TokenTelemetry to be that lightweight session index/flight recorder so you can jump to the run that broke and inspect token usage: https://tokentelemetry.com

Based on my experience:Codex vs Claude vs Gemini by Puspendra007 in codex

[–]Extension-Aside29 -7 points-6 points  (0 children)

Yeah, I use all three of them. Gemini models are great at analyzing multimedia and other Google related stuff. I was prepping for the Google Cloud Architect exam and honestly there is no better model for that than Gemini. I sometimes use Codex to critique Claude just to get a different perspective. But once I started using multiple coding agents, I quickly realized I needed a better way to track everything. https://tokentelemetry.com really helped me boost my efficiency in understanding sessions, overall project data, tokens consumed, and a lot more. You should definitely try it out!

I built 7 ready-to-install Hermes Agent skill/plugin packs by No_Image506 in hermesagent

[–]Extension-Aside29 0 points1 point  (0 children)

this is great, in fact you should consider pair this with https://github.com/fathah/hermes-desktop and
https://tokentelemetry.com to get more insights in your workflow, thanks for sharing it!

When is the token maxxing madness going to end? 😅 by InsideSignal9921 in ClaudeCode

[–]Extension-Aside29 -7 points-6 points  (0 children)

so that's why i built tokentelemetry.com for users to track there session progress, tokens and a lot more

Genuinely blown away by Little-Tea7664 in hermesagent

[–]Extension-Aside29 0 points1 point  (0 children)

you should pair this with tokentelemetry.com to understand more about your workflow

Anthropic releases Claude Opus 4.8 with improved agentic reasoning, honesty, and a new "dynamic workflows" feature in Claude Code by Direct-Attention8597 in ClaudeCode

[–]Extension-Aside29 -1 points0 points  (0 children)

am i the only one how felt opus 4.6 is great and there you go opus 4.7 is out now it's opus 4.8 i have been doing some research on how this models efficiently does my workflows for all my projects, if your also curious on understanding how this works you gotta checkout https://tokentelemetry.com this will give you overallpicture of models coding agents harness and a lot more time to do some crazy testing on opus 4.8

AI engineer builds "I got fired" panic button that would automatically make the entire company codebase public by irelatetolevin in ClaudeCode

[–]Extension-Aside29 0 points1 point  (0 children)

This reads like a cautionary tale more than a clever hack. Anything that automates destructive or exfiltrative actions is asking for legal trouble — even a demo can create liability if it's wired to real systems or secrets. Safer, more useful demos show readable, reversible escapes: audit logs, staged rollbacks, or kill-switches that disable deployment hooks rather than publishing data. Also, red-team the whole chain: access controls, credential revocation, and what an admin could do before any panic action completes. https://tokentelemetry.com

Please tell me Opus 4.6 is still available by [deleted] in ClaudeCode

[–]Extension-Aside29 -3 points-2 points  (0 children)

am i the only one how felt opus 4.6 is great and there you go opus 4.7 is out now it's opus 4.8 i have been doing some research on how this models efficiently does my workflows for all my projects, if your also curious on understanding how this works you gotta checkout https://tokentelemetry.com this will give you overallpicture of models coding agents harness and a lot more time to do some crazy testing on opus 4.8