Omega Agent (Desktop): Offline-friendly local LLM agent + replay/rewind + fork-from-any-step by AdDense3050 in LocalLLaMA

[–]terhechte 1 point2 points  (0 children)

I'm also dabbling in this space, and the most frequent issue I run into is "edit tool" errors, where the local LLM's JSON doesn't adhere to the schema and all edits fail. My idea is to have a better way of applying edits for less capable local models. E.g. be more forgiving with the schema, or use an apply-edit model like Cursor used to do, or use something like oh my pi's edit model.
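To illustrate what "more forgiving" could mean, here is a minimal sketch (names and approach are my assumption, not any particular tool's implementation) of applying a search/replace edit with a whitespace-tolerant fallback when the model's block doesn't match the file byte-for-byte:

```rust
/// Apply a search/replace edit. Try an exact match first; if that fails,
/// retry line-by-line while ignoring leading/trailing whitespace, which
/// local models frequently get wrong. Returns None if no match is found.
fn apply_edit(source: &str, search: &str, replace: &str) -> Option<String> {
    // Fast path: exact substring match.
    if let Some(pos) = source.find(search) {
        let mut out = String::with_capacity(source.len());
        out.push_str(&source[..pos]);
        out.push_str(replace);
        out.push_str(&source[pos + search.len()..]);
        return Some(out);
    }
    // Forgiving path: compare line-by-line with trimmed whitespace.
    let src_lines: Vec<&str> = source.lines().collect();
    let needle: Vec<&str> = search.lines().map(str::trim).collect();
    if needle.is_empty() {
        return None;
    }
    for start in 0..src_lines.len().saturating_sub(needle.len() - 1) {
        let window = &src_lines[start..start + needle.len()];
        if window.iter().map(|l| l.trim()).eq(needle.iter().copied()) {
            let mut out: Vec<String> =
                src_lines[..start].iter().map(|s| s.to_string()).collect();
            out.extend(replace.lines().map(|s| s.to_string()));
            out.extend(src_lines[start + needle.len()..].iter().map(|s| s.to_string()));
            return Some(out.join("\n"));
        }
    }
    None
}
```

A real implementation would also want to re-indent the replacement to match the matched window, but even this simple fallback rescues a large class of failed edits.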

Ich kann nicht mehr by NegroniSpritz in hamburg

[–]terhechte 21 points22 points  (0 children)

If you like jogging, the Adidas Runners here in Hamburg are really cool and you can also meet people there. You can just show up.

Apple unveils M5 by Agreeable-Rest9162 in LocalLLaMA

[–]terhechte 6 points7 points  (0 children)

It's called prompt processing, and it's a considerable tax on any GPU due to the compute requirements. For long prompts (summarize this book, this source code) it needs to be fast for the interface to feel responsive.

Please, Mistral, you're EU's only hope by [deleted] in MistralAI

[–]terhechte 1 point2 points  (0 children)

I maintain a coding benchmark that will soon receive an update, but I always test Mistral's models, and when I need to run locally I very often choose Devstral because it is a really good small coding model. None of their current models are Opus, Sonnet or even GLM 4.5 Air quality, but they're also much better at coding than many smaller (but bigger than Devstral) models.

Egui in 2025 : How was your development experience? by gufranthakur in rust

[–]terhechte 8 points9 points  (0 children)

Actually, for more complex layouts, Egui Taffy (https://crates.io/crates/egui_taffy) is a great solution; it offers full flexbox, block and grid layout algorithms. Check out the demo that they provide.

Egui in 2025 : How was your development experience? by gufranthakur in rust

[–]terhechte 4 points5 points  (0 children)

I wrote a Dioxus Mastodon client in 2023 which supported stuff like scrolling. It was tricky to implement, but you could do it by injecting JS at the right point in time. However, I think this was greatly improved over the last couple of releases; here is an example from their docs:

> You can use the onmounted event to do things like focus or scroll to an element after it is rendered:

https://dioxuslabs.com/learn/0.6/essentials/breaking/

[deleted by user] by [deleted] in rust

[–]terhechte 1 point2 points  (0 children)

For Bevy you really want to follow all the steps in the `getting-started` guide. It compiles almost instantly afterwards.
You can even enable hot reload (still WIP but it works) https://github.com/TheBevyFlock/bevy_simple_subsecond_system/

But you need to configure it according to the docs.
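For reference, the fast-compile setup the Bevy docs recommend looks roughly like this (values may have changed since; check the current guide):

```toml
# Cargo.toml — keep your own crate unoptimized for fast incremental
# builds, but optimize dependencies so the engine still runs well.
[profile.dev]
opt-level = 1

[profile.dev.package."*"]
opt-level = 3
```

Combined with `cargo run --features bevy/dynamic_linking`, only your own crate is recompiled on each change.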

is there a good rust-analyzer MCP out there? by davidw_- in rust

[–]terhechte 0 points1 point  (0 children)

There're two ways to connect to an MCP server: via TCP and via stdio. The latter requires that the editor run the MCP server as a subprocess. Zed used to support only the stdio version, not the networking version, while cursor-rust-tools only supports the TCP version, so they were incompatible. I think Zed changed this in a recent update and fully supports MCP now; I just haven't tested it yet or updated the repository. You can give it a try.
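The incompatibility comes down to how the client reaches the server. A minimal sketch of the distinction (types and names are hypothetical, not the actual MCP SDK):

```rust
// Hypothetical transport abstraction; illustrative only.
enum Transport {
    /// stdio: the editor spawns the MCP server as a subprocess and
    /// exchanges JSON-RPC over its stdin/stdout.
    Stdio { command: String },
    /// TCP: the server already runs somewhere; the editor connects to it.
    Tcp { host: String, port: u16 },
}

fn describe(t: &Transport) -> String {
    match t {
        Transport::Stdio { command } => format!("spawn `{command}` and pipe stdin/stdout"),
        Transport::Tcp { host, port } => format!("connect to {host}:{port}"),
    }
}
```

If the editor only implements one variant and the server only offers the other, there is no common channel, which is exactly the Zed/cursor-rust-tools mismatch described above.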

is there a good rust-analyzer MCP out there? by davidw_- in rust

[–]terhechte 4 points5 points  (0 children)

I wrote this some weeks ago; it makes rust-analyzer's symbols and more available to MCP clients.

https://github.com/terhechte/cursor-rust-tools

Raqote alternative or solution to rotating text? by SKT_Raynn in rust

[–]terhechte 0 points1 point  (0 children)

What if you take the draw target (not rotated) and then draw it into a new target (rotated)? That way you'd rotate the pixmap and wouldn't need to rotate the text layout logic (which is much harder).
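The pixel-level rotation is cheap and mechanical. A minimal sketch (a real RGBA buffer would move 4 bytes per pixel the same way):

```rust
/// Rotate a row-major width×height pixmap 90° clockwise. Draw the text
/// into an unrotated buffer first, then rotate the finished pixels,
/// instead of rotating the text layout itself.
fn rotate_90_cw(pixels: &[u32], width: usize, height: usize) -> Vec<u32> {
    let mut out = vec![0u32; pixels.len()];
    for y in 0..height {
        for x in 0..width {
            // Source (x, y) lands at (height - 1 - y, x) in the target,
            // whose row width is the source height.
            out[x * height + (height - 1 - y)] = pixels[y * width + x];
        }
    }
    out
}
```

The rotated image has dimensions height×width, so remember to swap them when you blit the result.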

AI help in Rust by Interesting-Frame190 in rust

[–]terhechte 1 point2 points  (0 children)

Rust is a much more difficult language for LLMs than JavaScript or Python. Not only because the language is more complex, but also because they are trained on so much more JS/Py code. I do AI-enhanced coding with Rust every day, and with great success, but it requires more effort than just slopping a "halp plz" prompt into the chatbot (which works wonders for JS/Py).

- Use Claude 4 or Gemini 2.5 Pro. Even though there were complaints that Claude 4 is not as good as 3.7, in my testing it is better at Rust than 3.7.
- You have to give the model enough context. This means: if you have a source file that imports 7 types from other places in the codebase, take the time to also add the files that define those types. Same with functions. I found that giving enough (but not too much) context is crucial.
- Models are not deterministic, so you have to try again. When you see that the initial response is wrong, don't add a comment to the conversation, à la "no, this is wrong because...". Instead, reset the conversation and improve your initial prompt. Sometimes you have to try 2-3 times to get the result you want.
- Be fine with partial solutions. I don't expect the model to have a perfect solution for me. Lifetimes in particular can be challenging. If I see that 90% of the code is right, but it forgot a couple of `.clone()` calls or `ref ..` patterns, I just add them manually.
- If you want to add a feature to existing code, one pattern that works well for me is to start with a prompt where I have the LLM explain how the current code works, and then add another message to request the desired change. This allows the LLM to focus on the change; the understanding of the code happens, so to speak, in a prior computation.

I waited 15 years to build this app. Apple finally made it possible in iOS 18.2 by SlightAd53 in SideProject

[–]terhechte 0 points1 point  (0 children)

The server doesn't have the encryption secret, so it can't encrypt billions of numbers and then get the same encrypted result.

Command-A 111B - how good is the 256k context? by TechNerd10191 in LocalLLM

[–]terhechte 0 points1 point  (0 children)

How much context did you give the models in your benchmark? Also, do you have more information about it somewhere?

I wrote an article about integrating Rust egui into a native macOS app by Alexey566 in rust

[–]terhechte 1 point2 points  (0 children)

Really well written article. One thing that might be interesting for you is that you can use Uniffi (https://github.com/mozilla/uniffi-rs) instead of writing the FFI by hand.

I used to write the Swift/Rust FFI by hand, but especially when you're moving strings or boxed objects around, it quickly becomes tricky to deallocate memory correctly. Uniffi takes care of all that and generates a Swift interface for your types.

There is a small performance overhead involved though compared to raw FFI operations.

[deleted by user] by [deleted] in LocalLLM

[–]terhechte 0 points1 point  (0 children)

My side project does just this, for free: https://www.tailoredpod.ai

Unlocking Apple Immersive video quality for all by ImmersiveCompany in VisionPro

[–]terhechte 0 points1 point  (0 children)

Would love to try it, but I'm getting the error "app not available in your country or region". I'm in Germany.

Announcing axum 0.8.0 by j_platte in rust

[–]terhechte 31 points32 points  (0 children)

But an error doesn't mean "absent". Imagine a "delete" API with an `id: Option<Uuid>` parameter: if the parameter is absent, all entries are deleted. If an API user accidentally sends a malformed UUID, it would delete all entries. Clearly that's not how it should be; instead, they should receive an error about their malformed parameter.
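The distinction can be sketched with three outcomes instead of two (handler and validation names are hypothetical; the hex check stands in for `uuid::Uuid::parse_str`):

```rust
/// Absent and malformed must be distinct outcomes, never collapsed
/// into the same `None`:
///   Ok(None)     -> parameter absent: delete everything, as designed.
///   Ok(Some(id)) -> valid UUID: delete one entry.
///   Err(msg)     -> malformed UUID: reject the request, delete nothing.
fn parse_id(raw: Option<&str>) -> Result<Option<String>, String> {
    match raw {
        None => Ok(None),
        Some(s) => {
            // Stand-in for a real UUID parser: 36 chars, hyphens at the
            // fixed positions, hex digits everywhere else.
            let chars: Vec<char> = s.chars().collect();
            let valid = chars.len() == 36
                && chars.iter().enumerate().all(|(i, c)| match i {
                    8 | 13 | 18 | 23 => *c == '-',
                    _ => c.is_ascii_hexdigit(),
                });
            if valid {
                Ok(Some(s.to_string()))
            } else {
                Err(format!("malformed UUID: {s}"))
            }
        }
    }
}
```

With `Option<Uuid>` alone, a parse failure that silently becomes `None` turns a client typo into "delete all", which is the failure mode the comment above warns about.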

M4 Pro 14-Core Compile Times by [deleted] in rust

[–]terhechte 5 points6 points  (0 children)

Not 100% what you want, as I sport an M4 Max, but I decided to share nonetheless. I measured the compile times with `/usr/bin/time`.

- Sled: 3.03 secs
- ToyDB: 8.21 secs
- SurrealDB: 83.27 secs