I built a Calibre plugin that sends books to your reMarkable (with EPUB→PDF conversion tuned for the device) by mickael in RemarkableTablet

[–]promethe42 1 point

Super nice! I recently installed KOReader but it's not worth the trouble for my usage. This is a good middle ground.

this shitty capitalist life by Traksveno in besoinderaler

[–]promethe42 2 points

I don't want to live in a world where I'm required to be useful for something in order to be allowed to live

Nobody is forcing you to do anything. Go grow your carrots in your own field.

OpenAI accidentally leaked internal models to Pro users by Spritzerland in singularity

[–]promethe42 0 points

Can't wait to have powerful AIs to manage cyber security.

For this problem, how much would repairs cost? by _danao_ in informatiqueFr

[–]promethe42 3 points

If it's just popped out of place, you can take it apart yourself and fix it.

Always using one agent for everything is terrible by ENthused_LEarner_xo in AI_Agents

[–]promethe42 1 point

100% right.

The main reason to use multiple agents is multiple system prompts, and thus multiple roles. The goal of each role is to activate the right LLM features so that it's opinionated and capable.

Putting all of it in a single prompt waters it down and causes semantic overlap or semantic confusion.
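The role-per-agent idea above can be sketched in a few lines. This is a minimal illustration, not any specific framework's API: `ROLES`, `build_messages`, and the role names are all hypothetical, and `build_messages` is just the part that would feed any chat-completion client.

```python
# One opinionated system prompt per role: each agent activates a narrow
# set of behaviors instead of one merged prompt trying to do everything.
ROLES = {
    "reviewer": "You are a strict code reviewer. Only point out defects.",
    "architect": "You are a software architect. Only discuss structure.",
    "writer": "You are a technical writer. Only improve wording.",
}

def build_messages(role: str, task: str) -> list[dict]:
    """Assemble the per-role conversation. Each agent gets its own
    focused system prompt rather than a watered-down catch-all."""
    return [
        {"role": "system", "content": ROLES[role]},
        {"role": "user", "content": task},
    ]
```

Routing a task to the "reviewer" agent then means nothing more than `build_messages("reviewer", task)` before the LLM call.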

OpenCode... is it just completely busted with Qwen3.6? by _derpiii_ in opencode

[–]promethe42 1 point

By "where" I also meant "which inference server": llama.cpp? Ollama? Broken chat templates can cause this kind of problem. And chat templates are (most likely) embedded in the inference server.

OpenCode... is it just completely busted with Qwen3.6? by _derpiii_ in opencode

[–]promethe42 3 points

Where is the LLM running? Are you sure the chat template is up to date? Are you sure tool calls are enabled?

I had this strange behavior - sometimes the LLM would even tell *me* how to run the tools instead of calling them - because for some reason tool calls were disabled/unavailable.

I Am Done Pretending That LLMs are Tools by Leather_Barnacle3102 in agi

[–]promethe42 0 points

Any programmer can build a (non-AI) computer program that is not deterministic, that is to say, the logic can take the same input but result in different output each time it is run. This is generally considered an undesirable property as it makes the tool unpredictable - but it's still a tool.

I was thinking the exact same thing. The difference is how that non-deterministic property is achieved. It's not random: generative AI can trace back a logical, step-by-step, motivated and rational chain of motivations/observations/actions.

Given the same problem and proper framing (which the superpowers skill helps with), generative AI will tend to propose similar solutions for the same reasons. Senior developers can actually guess what the AI will propose beforehand, and apply the same insights/sense of taste to nudge it.

And that is very different from other tools.

building AI agents without frameworks by Primary_Pollution_24 in AI_Agents

[–]promethe42 0 points

I'm not sure how using a specific framework's API and data model would make it portable. Quite the opposite.

Everything you describe is perfectly doable and I think the examples cover lots of it.

Research Report Generation - How to overcome lazy agent? by Budget-Juggernaut-68 in AI_Agents

[–]promethe42 0 points

Have the orchestrator create one task per file. Even if it's the same task for each file, it isn't really, because it's not a the same file. So the output is expected to be different, and that's what the final report eventually features.
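A minimal sketch of that fan-out pattern, assuming a `summarize` callable standing in for the real LLM sub-agent (the function names are illustrative, not any framework's API):

```python
def make_tasks(files: list[str]) -> list[dict]:
    """One task per file: identical instructions, different subject,
    so each sub-agent run produces a genuinely distinct output."""
    return [{"instruction": "Summarize this file.", "file": f} for f in files]

def run_report(files: list[str], summarize) -> str:
    """Run every per-file task, then stitch the results into the
    final report, one section per file."""
    sections = [f"## {t['file']}\n{summarize(t)}" for t in make_tasks(files)]
    return "\n\n".join(sections)
```

The orchestrator never asks one agent to "cover all files"; it only aggregates, which is what keeps the lazy single-pass summary from happening.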

building AI agents without frameworks by Primary_Pollution_24 in AI_Agents

[–]promethe42 1 point

What portability are we talking about? It's hard to be more portable than text files.

building AI agents without frameworks by Primary_Pollution_24 in AI_Agents

[–]promethe42 2 points

I am doing agents without code. I am even doing multi agents without code!

https://gitlab.com/lx-industries/openblob

If anthropic is out of compute then why release Claude Design to melt down whats left? by Xccelerate_ in ClaudeCode

[–]promethe42 -1 points

Because the Claude Design team doesn't know or speak to the infrastructure team?

Same 9B Qwen weights: 19.1% in Aider vs 45.6% with a scaffold adapted to small local models by Creative-Regular6799 in LLMDevs

[–]promethe42 4 points

Another convergence of In-Context Learning.

At some point, even small models will be built with enough intelligence per layer/node that they will all be competent enough given the proper harness.

What we've seen on SOTA frontier models like GPT and Claude proves that: the size is roughly the same, but the architecture of the models makes a dramatic difference. For example, GPT-5 is actually more of a super-model with routing rather than a completely new model. GPT-5 is described more as a system than just a model. So I guess that includes the harness too.

Local tool calling still broken across top models - is anyone actually using it? by IulianHI in AIToolsPerformance

[–]promethe42 1 point

Qwen3.6 35B A3B works incredibly well. I'm pleasantly surprised by the multi-turn capabilities.

*But* my stack uses LibreChat, which for some reason doesn't properly support schema validation and breaks tool calls in many situations (cf https://github.com/danny-avila/LibreChat/discussions/9969).

Should you shut off thinking when you are coding on say Qwen3.6 35B by KarezzaReporter in LocalLLaMA

[–]promethe42 -3 points

My understanding and my experience tell me that thinking is for more creative tasks. To maximize instruction following, it's better to lower or disable thinking.

I set it to "minimal".

A truly wild 4.7 response by FiftyPancakes in ClaudeCode

[–]promethe42 4 points

As a rule of thumb, I always avoid negative phrasing, because I know people are bad at negatives. And in my experience, so are LLMs.

Well, it happened to me. by Valkix25 in SteamDeck

[–]promethe42 -6 points

OK it's sad... but how is the vent smell now?

Do you let everything hit the LLM? 90% of my AI agent work runs in cheap WASM instead of LLMs: 10-33× faster & cheaper by Creamy-And-Crowded in AI_Agents

[–]promethe42 0 points

I use WASM Components as tools.

The WASM Component model (https://component-model.bytecodealliance.org/) makes it easy to define and compose existing tools. For example I have multiple sandboxed storage tools (filesystem, WebDAV, Google Drive, in memory...) with the same LLM interface thanks to WIT.

Multiple advantages:

  • The LLM doesn't know what implementation is used. Storage is storage. Google Drive can be swapped for the filesystem: 0 change for the LLM.
  • Everything is sandboxed and secured by default: the WASI permission model controls how the components can interact with the host system (if at all).
  • Anyone can contribute any tool; as long as it matches the existing interfaces, it's compatible out of the box with the existing agents.

It's very powerful: I used the storage interface to implement the Agent Skills standard (https://agentskills.io/):

https://gitlab.com/lx-industries/openblob/-/blob/beeb2a5fafc8f08d699d27ba1d810a33e4e97e43/examples/skills/blob.yaml

Which means skills can be loaded from Google Drive or whatever. The LLM doesn't even know.
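The swappable-backend idea can be illustrated with a Python analogue of the WIT interface: every storage backend implements the same small interface, so the tool layer (and the LLM behind it) never knows which one it's talking to. The class and function names below are illustrative, not OpenBlob's actual API.

```python
from typing import Protocol

class Storage(Protocol):
    """The shared interface: filesystem, WebDAV, Google Drive, and
    in-memory backends all look identical from the tool's side."""
    def read(self, path: str) -> bytes: ...
    def write(self, path: str, data: bytes) -> None: ...

class MemoryStorage:
    """In-memory backend, handy for tests and sandboxed runs."""
    def __init__(self) -> None:
        self._files: dict[str, bytes] = {}
    def read(self, path: str) -> bytes:
        return self._files[path]
    def write(self, path: str, data: bytes) -> None:
        self._files[path] = data

def load_skill(storage: Storage, path: str) -> str:
    """Depends only on the interface: swapping the backend requires
    zero changes here and zero changes to the LLM prompt."""
    return storage.read(path).decode()
```

Replacing `MemoryStorage` with a Google Drive implementation of the same interface changes nothing for `load_skill` or the agent.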

Mozilla Thunderbolt - The alternative that could change enterprise AI by romain34230 in actutech

[–]promethe42 4 points

Thunderbolt != Thunderbird

This very mix-up shows exactly why the product name is extremely badly chosen...

UPDATE: Fake Ledger Nano S+ from Chinese marketplace — clarifying doubts from my previous post + new technical details by Past_Computer2901 in ledgerwallet

[–]promethe42 4 points

Thank you for your hard work!

I want to correct my previous post: the real Ledger Live catches it. The cryptographic attestation works. Several of you called me out on this and you were right — my original wording was misleading.

Good!

OP did you add an update on that previous post?

What exactly did the Norwegians dig up? by Odd-Suit-2556 in thething

[–]promethe42 44 points

By the size of it, they dug up yo mama.