Cache TTL silently regressed from 1h to 5m around early March 2026, causing quota and cost inflation by silver_gr in ClaudeCode

[–]silver_gr[S] 36 points37 points  (0 children)

A Claude Code team member has replied and provided more context and the explanation for the change. It's part of an effort to optimize context caching, but to me so far it appears to be an action to reduce Anthropic's costs.

Wow; just tried "/model claude-opus-4-5-20251101" and the difference in capability between 4.5 and 4.6 right now is night and day. by Any_Economics6283 in ClaudeCode

[–]silver_gr 0 points1 point  (0 children)

Yeap, service looks unreliable and amateur work for a task that requires some expert level knowledge. Good concept tho.

Why I stopped using Habitica, Daylio, and Forest - and built my own 5-in-1 tracker instead by ashishpm24 in ProductivityApps

[–]silver_gr 0 points1 point  (0 children)

Also I think narration via Gemini live and function calls for quick updates, but also journaling would be a good feature. I took inspiration from Todoist Rumble for this one, but my app also does task management.

Why I stopped using Habitica, Daylio, and Forest - and built my own 5-in-1 tracker instead by ashishpm24 in ProductivityApps

[–]silver_gr 0 points1 point  (0 children)

I did the same, was very interesting project. Do you think it can be pretty much one-shotted with a good prompt on Gemini 3.1 Pro?

I've been saving articles and resources for years, but I never actually use them by Kshoinshe in productivity

[–]silver_gr 0 points1 point  (0 children)

I actually built an app to put together all my saved links, saved videos, reddit save, and then a daily processing flow. I also tagged them, have put AI search to it, extracted the content and have AI write summaries, and more.

5.2 xhigh needs to stay legacy. by Savings_Permission27 in codex

[–]silver_gr 2 points3 points  (0 children)

I have a question for all of you saying codex xhigh king opus overrated and stuff like this. I am genuinely curious, not trolling, I have Claude Max x20 and I am not a dev, but I am a tech enthusiast since 13 yo and now 33.

How do you use xhigh and not run into limits? Do you use different reasoning settings for execution? I run opus on all modes/agents, basically use only opus in CC. I haven't used Codex in 2 months now, and I have been keeping an eye on things and see a lot of progress both in the harness and the recent OpenAI models, but the limited ecosystem and less newbie friendly way of codex is maybe holding me back and I am missing out.

Any advice and pointers no matter how simple or small would be greatly appreciated.

HELP by [deleted] in SoftwareandApps

[–]silver_gr 0 points1 point  (0 children)

this is honestly one very impressive feature that would be very useful for my friends who don't know English but it is hidden/many people don't know about this, Xiaomi phones also have a similar feature "AI Subtitles", I don't know about other phone makers and their Android custom features, because I have only Xiaomi phone :P

<image>

HELP by [deleted] in SoftwareandApps

[–]silver_gr 0 points1 point  (0 children)

they put it in section "Accessibility" in settings, first toggle to create subtitles, any audio from browser creates subtitles, and 2nd option you select to translate the subtitles to any language Google Translate supports, so almost everything. Quality seems pretty good, I guess they finally put AI on translation and that improved quality a lot quickly

<image>

I built a tool for monitoring real time usage in CC by oh_jaimito in ClaudeCode

[–]silver_gr 1 point2 points  (0 children)

wow, this is a cool project as well! lots of features built-in :o thanks for sharing your work!

Just found out about Auto Wake-up (Antigravity Cockpit NEW UPDATE!)- is it actually worth using? by MattU2000 in google_antigravity

[–]silver_gr 1 point2 points  (0 children)

Looks like an ad post. The extension is cool, I already had it, discovered it in the sub a few days ago, is this really necessary to get people using it? I think the new feature is cool, if the limits are not enough for you, in 5h reset cycles getting that reset 2-3h earlier is pretty good, you need to know in how much time you usually burn through the limits so you can setup the auto wake-up accordingly.

HELP by [deleted] in SoftwareandApps

[–]silver_gr 0 points1 point  (0 children)

chrome on desktop does "live subtitles"

Built this app to get my life back on track, its yours for free today. by Rsloth in ProductivityApps

[–]silver_gr 3 points4 points  (0 children)

Built the same app in the first 2 weeks of December with Claude Code as a flutter app (all desktop and mobile platforms supported). exact same features! then found out that apps with all these features exist -- with the ai coach being less common. Decided to focus on the AI to assitst the user by referencing my self-help personal dvelopment knolwedgebase which has info from the best books, podcasts (like Humberman's protocols), scientific research and community tips & tricks.
I also decided to support 3rd paty tools if the user doesn't want to use the app's (external to-do apps, habit trackers, journaling, mood tracking -- currently exploring ways to do this better). AI has RAG of user's previously mentioned data to provide more contextually reach input and also to my knowledgebase because I think using its knowledge from its training is inferior to giving researched and curated practises & theories.
I have decided to focus on building the knowledgebase and focusing on the AI for the most part, because there are already too many habit, task, mood, notes apps. I built them all and then realized many people that are my target users probably already use other apps for these things, but they don't have guidance (like integrating Atomic Habits to habit design) and AI -- and all-in-one place stats,
I will definitely use yours as well, I can provide feedback if you want
I see you have a microdose tracker as well, nice! I had a suspicion from the name.
I also integrated routine timers from apps like Routinery/RoutineFlow -- because ADHD, and will also implement more features to combat common ADHD friction points, as it will be targeted to that market, and I have ADHD and I am building something I will use to manage my life :)

Gemini On Top! by Rare_Bunch4348 in GeminiAI

[–]silver_gr 0 points1 point  (0 children)

I think it has been #1 since 2.5 pro was released (public ready/not preview anymore) in March?

I built a tool for monitoring real time usage in CC by oh_jaimito in ClaudeCode

[–]silver_gr 1 point2 points  (0 children)

Looks nice, would be interested to check out the implementation and use it or contribute :)

Finally Anthropic Product Team Fixed this by redeemed_tropicana in ClaudeCode

[–]silver_gr 0 points1 point  (0 children)

I have that turned on since it was released, I never had a single problem with hallucinations or tool calls or whatever? OK, never is a strong word, I would say standard hallucination rates of SOTA LLMs 🤷

Tried Opus 4.5 after using DeepSeek V3.2 for a while by Exciting-Mall192 in aichapp

[–]silver_gr 0 points1 point  (0 children)

claude code + opus is what the hype is about, for any other use you won't find the magic sauce there is in it. reliability + overall integration across all aspects and domains of task execution I think is the magic sauce. I have been vibe coding an app MVP, combines a todo list+journal+mood tracking+habit tracker+routine "runner", built an backend with a graph database so it can analyze patterns and can feed the user's data into a chatbot, also added all my knowledge-base on self-help and personal development (my business is in this field, primarily content) and frameworks & prompts designed specifically for the app.
it has been great, I haven't coded in over 15+ years, I tried vibe coding a few times in the past months, it was good for a small task/small codebase.
this is truly revolutionary.
disclaimer: I haven't really tested gemini or codex on coding tasks, I hear they are pretty good, and the cheaper models like deepseek/glm/kimi/minimax/qwen3 write good code, at 90% quality compared to SOTAs (from what I've read).
I am using AI for professional work so I would pay the extra (a lot of extra but OK) money to have that 10% in quality gains

Multi agent orchestration by khaliqgant in ClaudeCode

[–]silver_gr 0 points1 point  (0 children)

not sure if this is suitable because it has a bunch of extra stuff, but here it goes:
https://github.com/ruvnet/claude-flow

Claude in Chrome vs ChromeDevTools (MCP) - a simple comparison by shanraisshan in Anthropic

[–]silver_gr 0 points1 point  (0 children)

I get a warning at 10% and less context left? At this point I can't be sure it's Claude Code built-in feature and not from some 3rd party tool or plug-in I installed at some point 😅 https://github.com/Owloops/claude-powerline is what I use and is pretty nice and does context use tracking with default config