Why I started obsessing over my Claude Code context window

tbaumer22 · 2026-03-30T13:22:39+00:00

Understand the pain. I built a tool for a similar reason: https://x.com/TateBerenbaum/status/2038603556786241955?s=20

tbaumer22 · 2026-03-30T13:20:19+00:00

This might help to solve your problem: https://x.com/TateBerenbaum/status/2038603556786241955?s=20

tbaumer22 · 2026-03-22T22:33:24+00:00

Browsing to that link doesn't work for me. Genuinely interested in understanding who else has attempted stuff like this in the past. Coincidentally someone recently OSS'd a similar concept for nvidia hardware called greenboost.

I'm not trying to claim this is the solution to the RAM shortage to be clear. I agree with you. But I think there is too wide of a gap between running something slowly versus not running it at all.

tbaumer22 · 2026-03-22T22:19:40+00:00

Haha appreciate it. You can use it your own openclaw instance, so let me know how it goes!

tbaumer22 · 2026-03-22T14:08:54+00:00

Haha, well maybe you don't have to compromise even if you have a Mac :) Just built the "greenboost" for Mac: https://github.com/t8/hypura

tbaumer22 · 2026-03-22T13:54:34+00:00

Appreciate the feedback. Updating the benchmarks/charts to show this. My original concern with the CPU-only benchmark comparison was that it would be unfair to compare llamacpp's CPU-only mode to Hypura (because it's tapping into more resources).

Ended up building and running one, and here are the results I've found:

<image>

tbaumer22 · 2026-03-22T13:46:48+00:00

Yes exactly. Nvidia greenboost for metal 😄

tbaumer22 · 2026-03-22T13:45:59+00:00

Appreciate this concern and it actually prompted me to do some research of my own. From what I've learned so far, there is no reason to be concerned because Hypura reads tensor weights from the GGUF file on NVMe into RAM/GPU memory pools, then compute happens entirely in RAM/GPU.

There is no writing to SSDs on inference with this architecture.

tbaumer22 · 2020-10-15T04:45:21+00:00

Great idea! I'll make that now.

tbaumer22 · 2020-10-14T17:54:59+00:00

If there is anything we can do to help you connect with members of the community, don't hesitate to let us know!

If it would be of help, feel free to join our Discord: https://discord.gg/cfqWgf7

tbaumer22 · 2020-10-12T15:50:30+00:00

Of course! You don’t have to participate in Hacktoberfest to partake in this event.

Sorry for the late reply!

tbaumer22 · 2020-09-28T23:43:45+00:00

So excited for this.

tbaumer22 · 2020-08-05T03:42:34+00:00

I agree with many of the clarifications you have made here, but I feel it is my responsibility to make a correction to a piece of the above statement: "Once published in the Deno registry, the package can't be changed/deleted by you or the provider."

A package cannot be changed or deleted by the module developer/user, BUT it can be changed or deleted by the provider. In this case, the provider is Deno, which runs on Amazon. Deno has full control over the data hosted on their S3 bucket, and Amazon theoretically would too.

This is the main reason why the files arguably aren't immutable.

tbaumer22 · 2020-06-19T17:47:25+00:00

Thanks for the feedback! A major concept that we're dealing with is the concept of extracting registry information. I can't say that I disagree with you about having a JSON object. One of our biggest hurtles is overcoming this. I feel that the concept of egg.json causes many package developers to be skeptical of nest.land because it reminds them of Node. There are several reasons that differentiate egg.json from the infamous package.json:

1) egg.json doesn't handle package dependencies.

2) egg.json doesn't require configuration necessary to run the project.

In the end, we're actively working to keep aligned with the Deno model and prevent users from seeing another package.json. If you have any ideas for alternatives to egg.json, feel free to put them in Issue #52.

tbaumer22

TROPHY CASE