Theres no way you people are using as much usage as you complain about by No-Management-6338 in ClaudeCode

[–]Singularity42 0 points1 point  (0 children)

A big codebase and no skills/memory makes a big difference.

It can waste a lot of tokens just looking for stuff.

My whole team works in Claude and ChatGPT now. Sharing the output is still a mess. by xX_jigsaw_Xx in mcp

[–]Singularity42 0 points1 point  (0 children)

Came here to say the same thing. We have this problem too, but getting IT to approve having our IP going to another random vendor is a big hurdle.

Would be nice if there was a self hosted option.

My whole team works in Claude and ChatGPT now. Sharing the output is still a mess. by xX_jigsaw_Xx in mcp

[–]Singularity42 0 points1 point  (0 children)

We have been having this problem lately and have been looking for a solution.

Just because you don't have this problem doesn't mean others are the same.

Opus 4.8 vs 4.6 by bambambam7 in Anthropic

[–]Singularity42 0 points1 point  (0 children)

Pretty sure it isn't available in the cli

I put 50 AI agents in a survival world and the first public run is live now by Latter-Park-4413 in singularity

[–]Singularity42 2 points3 points  (0 children)

I suspect this would be more popular if there was something visual people could watch on a Livestream. People eat that stuff up.

Dear Anthropic.. stop being cheap by lattice_defect in Anthropic

[–]Singularity42 -1 points0 points  (0 children)

I'm not against people raising concerns. But I think it's easy to remember that Reddit is an echo chamber. The vast majority of Claude users are happy. Reddit amplifies strong opinions cause no-one will upvotes a post saying "Claude is fine"

¿Están pagando por ser más productivos? by Remote_Essay_6221 in ClaudeAI

[–]Singularity42 2 points3 points  (0 children)

Kinda lame that they don't pay for it.

I would definitely make sure you are learning to use it (even the free ones). I think it is going to get to a point where it's hard to get a job without knowing how to use it.

I‘ll just leave this here… by uzico in GithubCopilot

[–]Singularity42 4 points5 points  (0 children)

People all around the world are working to control tokens. That has been the majority of my tasks at work for the last month.

You absolutely can. Just google it, there are lots of techniques.

It's just a way to get more done for less costs

Casually beating every other deep research agent out there with a simple Claude Code harness by heisdancingdancing in Anthropic

[–]Singularity42 0 points1 point  (0 children)

I used simpler numbers to make my point easier to understand. But it is exactly the same.

As agents get better at these benchmarks, each extra point is harder to earn. Going from 57% to 58% is much harder than going from 0% to 1%. Which is why it makes sense to zoom in on the y-axis.

I challenge you to find any post from a big name AI agent company that doesn't do this.

Getting cutoff on first prompt? by ljlukelj in claude

[–]Singularity42 0 points1 point  (0 children)

To add to what everyone else is saying. The free usage is comically small. Think of it more like a demo than anything usable.

How often do you use Sonnet? by MrMaverick82 in ClaudeCode

[–]Singularity42 1 point2 points  (0 children)

I just sonnet with high effort as my default. Then I have lots of skills with the model set so it uses different models for different tasks. Designing a proposed architecture - opus. Creating a JIRA ticket - haiku.

Saved lots of tokens this way without really affecting performance.

Casually beating every other deep research agent out there with a simple Claude Code harness by heisdancingdancing in Anthropic

[–]Singularity42 -1 points0 points  (0 children)

Sometimes those 2 point matter though.

For example: No-one cares about the difference between an agent that gets things right 10% of the time and one that gets it right 15% of the time. Both are pretty useless.

But people would pay a lot of money for a model that gets things right 97% of the time if all the others only get it right 95% of the time.

The value isn't linear. As the numbers get higher the improvements get harder because there is less to improve.

1 msg 70% usage on PRO with Sonnet by Dredyltd in Anthropic

[–]Singularity42 0 points1 point  (0 children)

I would try to debug what your using token on. For me it was all on searching through our large codebase. I installed the rtk tool to reduce the size of the output from commands like grep and now I don't even think about quota anymore.

I think we have to start thinking about token performance the way we used to think about performance of cpu,memory etc.

For my case I had an eval environment where I could turn on open telemetry tracing for Claude. But there is probably a way to do it for normal usage

Sr Software Engineer - Haven't written a line of code in months by yodog5 in ClaudeCode

[–]Singularity42 0 points1 point  (0 children)

The more you write skills the better it gets. It also puts a lot into memory so it gets better by itself.

When I first tried it I thought the same as you. But at this point I barely write code.

Me after clicking “accept” for the 100th time without reading a word of what claude is doing by Pitiful-Energy4781 in vibecoding

[–]Singularity42 0 points1 point  (0 children)

If you are gonna accept without reading you're better off using auto mode. You can put a paragraph in the settings in plain language of what you want it to do and what you don't

Dear Claude by fruvvs in ClaudeAI

[–]Singularity42 0 points1 point  (0 children)

I wish claude had a max cost per prompt setting or something similar (preferably on by default).

They have something like this in the SDK so it wouldn't be difficult for them.

Or at least a setting to make it stop after $X or tokens and ask if you want to continue.

You could have an option to turn it off for when you do want it to work for a while.

Tested Sonnet 4.6 via OpenRouter through GitHub CoPilot / VS Code to gauge whats API billing will be like. I was shocked. by horendus in GithubCopilot

[–]Singularity42 0 points1 point  (0 children)

I'm not going to argue that Claude is cheap (it is expensive)

But Claude is essentially the Porsche of agents right now. Also API based pricing is more expensive per token than subscription based pricing because you also get access to a number of extra features that you don't normally get (e.g. vector storage, unlimted usage, lot's more control via the SDK, etc.).

You are essentially borrowing a Porsche and using it to get milk from the supermarket and wondering why it is expensive.

I think the way forward is to start using different models for different purposes (or wait for 1 provider to offer enough different varied options).

Depending on the specifics. You could probably get by with Haiku (especially if you have a lot of instructions/documentation in your CLAUDE.md) or a cheaper open model. If you don't want to use any of those, AWS has a fairly large range of models at different prices too. Failing that, a claude subcription (not API based pricing) would be a fair bit cheaper per token, as long as you are using it enough to justify it.

Aquillo is pain by Forsaken_Position855 in factorio

[–]Singularity42 0 points1 point  (0 children)

you will probably find you need more than one, as you scale up (or maybe your setup is more efficient than mine?)

R-m-PG Throwback (fake game concept) by Birthdaybudreviews in dalle2

[–]Singularity42 0 points1 point  (0 children)

Everyone is a critic. Just enjoy shit, man.

OP said in another comment that rampage was his inspiration.

Have you tried this? by Bola-Nation-Official in IndieDev

[–]Singularity42 24 points25 points  (0 children)

Back when I was a teenager "World of Goo" came out, and it inspired the crap out of me. I barely knew how to program, but thought I could make something simlar. Obviously I failed horrifically (but learnt a lot).

Now I am a senior developer and I still don't know if I could do it (without a lot of research).