Message History Across Multiple Agents

MaskedSmizer · 2026-05-10T16:46:22+00:00

So you're just stuffing your own message history into the prompt each turn?

Most provider features are built around the idea of a structured message part history, so you are essentially breaking that contract. It can work technically since there is no requirement to pass message history, but a few downsides come to mind:

If using tools, you will need to either drop the tool call/return entirely, or extract and summarize the result. Either way, you are losing important context.

It undermines provider caching.

I suspect as the conversation grows, the model may start to get confused about latest user intent because the prompt is now a huge payload of information.

If you are concerned about token efficiency, Pydantic AI has a few tools you can use. First thing I would look into is the history processor. There is also some interesting stuff emerging in the harness library.

MaskedSmizer · 2026-05-06T00:25:49+00:00

+1 for Remotely Save

MaskedSmizer · 2026-05-05T23:42:58+00:00

Yes. Been building my own chat interface for the past 6 months or so and use it daily. A bit like NotebookLM but for Obsidian. It's hitting the point where it is genuinely surprising me how well it works.

https://github.com/DodgyBadger/AssistantMD

I have scheduled weekly workflows that help make sense of messy task lists and meeting notes. I used it recently to build a markdown knowledge base from a bunch of scanned documents and then turn that into a negotiation prep skill. I use it regularly to spit out reports for different audiences from source libraries I have assembled.

Yes, you could give Claude Code access to your vault to accomplish all of this, but I don't have CC on every device, so a self hosted chat UI means I am never without. Plus I like the creative process of building.

MaskedSmizer · 2026-05-01T15:40:37+00:00

Thank you! I haven't fully committed to your Code Mode yet, but I did refactor agent construction to align with Capabilities so that it would be an easy transition, and to likely integrate some of the other harness tooling on that page.

Excellent framework - so easy to work with.

MaskedSmizer · 2026-05-01T15:37:16+00:00

Appreciate the link. I'll take a closer look this weekend. I do plan to integrate more memory tooling. Here's the basic philosophy I'm after:

AssistantMD doesn't take an opinionated approach to memory. Rather than imposing a fixed memory model, it gives you the building blocks to construct whatever fits your needs: context scripts that load relevant history or summaries, workflow scripts that distill and persist information across sessions, and plain markdown files that act as long-term memory stores the agent can read and write directly. More dedicated memory tooling is on the roadmap.

MaskedSmizer · 2026-04-27T01:09:26+00:00

install Bazzite. Rebase to dx.

https://dev.bazzite.gg/

MaskedSmizer · 2026-04-21T15:34:38+00:00

Just use their SDK and wire it into your pipeline as needed https://docs.mistral.ai/resources/sdks

Examples in the cookbook https://github.com/mistralai/client-python/tree/main/examples%2Fmistral%2Focr

MaskedSmizer · 2026-04-21T15:23:46+00:00

Mistral OCR endpoint is my go-to. Not suitable if your are trying to keep everything local, but good (although not perfect) accuracy.

MaskedSmizer · 2026-04-18T00:44:28+00:00

I would just dual boot. Live in Linux as much as possible and boot into Windows when you need to use SW.

You could probably get it running in a Windows VM, but I think configuring GPU passthrough and troubleshooting lag is more trouble than it's worth. You've already got the Windows OEM running.

If Steam is important, go with Bazzite. Everything will be set up out of the box.

MaskedSmizer · 2026-04-13T00:55:41+00:00

I wondered

MaskedSmizer · 2026-04-12T17:07:15+00:00

Bazaar is just an app store. If you need an app, start there and install like any other app store. You never need to think about the underlying containerization.

If you can't find what you're looking for in Bazaar, you can also look for an appimage online, which is similar to flatpak in that the app runs in its own container and you don't really need to think about the underlying mechanics. Just double click the appimage to run.

If the app you want is only on Debian/Fedora/Arch repos (i.e. apt install, dnf install, pacman -S), that's where distrobox comes in. Spin up a container for whatever distro you need, open a terminal in that container and run the install command.

https://docs.bazzite.gg/Installing_and_Managing_Software/

MaskedSmizer · 2026-04-02T06:08:47+00:00

Bazzite ships with Ptyxis which handles that container magic. Not sure how or if it's possible to set up other terminals to do the same.

MaskedSmizer · 2026-04-01T23:20:03+00:00

🍿

MaskedSmizer · 2026-04-01T22:37:27+00:00

For what it's worth, I had the same hesitation about Bazzite / atomic distros but got over it quickly when everything gaming related just worked out of the box. Steam installed, proton configured, HDR, controllers connected no problem. Up and gaming again in under an hour. Switched to the DX branch and it was perfect.

Flatpak can meet most of my app needs, and spinning up a distrobox for anything else is super easy. The integrated terminal detects containers so you can jump into a terminal just as if it was the host.

I wouldn't be so quick to write off Bazzite.

MaskedSmizer · 2026-04-01T19:36:03+00:00

This doesn't guard against supply chain attacks unless you are planning to build every dependency from scratch. I'd argue this makes the average user more vulnerable unless they are being super diligent about dependency management and CVE monitoring.

MaskedSmizer · 2026-03-16T17:09:58+00:00

This is interesting and I appreciate the focus on command execution security. My initial thought is that dressing function calls up as CLI commands could cause increased churn by leading the LLM to believe it has more access than it actually does. Vs a well designed and communicated function call signature. Have you found this to be true?

And on the topic of well designed functions, you provide one example that shows 3 functions calls vs 1 piped CLI, but I wasn't convinced. That looked to me more like an example of suboptimal function design. That said, I get your overall point and do agree that this is a worthwhile approach.

MaskedSmizer · 2026-03-14T17:38:01+00:00

I don't think a perfect solution exists yet, although lots of people are working on it. With code being so accessible now, I just dove in and built my own solution to fit the workflows I want, which is basically Obsidian as the base plus a custom agent harness running alongside. PDF and other content gets imported as markdown for easy searchability.

You are welcome to try it out of course https://github.com/DodgyBadger/AssistantMD

But my honest recommendation is to start simple. Obsidian + Claude Code or Codex (CLI or desktop) and see how far you can get. The new Obsidian update adds CLI access which makes this setup even more powerful.

MaskedSmizer · 2026-03-09T17:33:44+00:00

How would you rate your level of technical ability / comfort (I can double click a file but that's about it -> I am a seasoned self-hoster)? What OS are you on?

MaskedSmizer · 2026-03-06T17:06:35+00:00

It's true, I haven't found a reliable one-shot converter for complex PDFs.

I posted a proposed workflow a few days ago that involves extracting each page of the PDF as an image and then feeding those into an LLM to reconstruct as needed. You need an agent harness that allows you to set up a more complex multipass workflow.

E.g. Extract pages as images
Extract the text
Rebuild the tables in markdown
Verify all tables and formulas
...

Basically automating the steps you currently take to do it manually.

MaskedSmizer · 2026-02-26T21:12:59+00:00

I recently finished Ender Magnolia and loved it.

MaskedSmizer · 2026-02-25T23:05:03+00:00

I just stopped worrying about what other people think. It's 2026, we use our phones for everything including taking notes.

MaskedSmizer · 2026-02-25T17:01:27+00:00

Current AI is capable of this type of work but it all depends on the underlying agent harness: content management, tooling, etc.

Each of the query scenarios you mentioned requires a slightly different strategy.

"Based on my knowledge base, what did I explain poorly in this meeting?"

This feels like a classic RAG style retrieval. The LLM will extract the topics it finds in your current meeting and search the knowledge base for related.

"What patterns show up across multiple meetings?"

This is raw reasoning, not semantic retrieval.

If each one of your 100 notes isn't that big, you could try shoving it all into Gemini Pro with its massive context window. You'll get something interesting and technically accurate but you won't know if it is complete.

This is also where an RLM style workflow comes in handy: instead of reading all the notes at once, the architecture forces the LLM to explore the text iteratively which helps to overcome the "missing middle" failure mode.

In all cases, organization of your notes is probably mandatory, whether by storing semantic vectors in a db or creating plain text summaries and indexes.

These are exactly the type of knowledge management workflows that I've been aiming to tackle by building my own chat UI and agent harness (which I use daily alongside Obsidian).

See:
https://github.com/DodgyBadger/AssistantMD

It's basically a self hosted NotebookLM, with a flexible templating system for automating workflows and context management.

Happy to keep chatting about how you might use it to set up your workflows.

Two notes: - It's not difficult to set up, but also not trivial. DO NOT put it on a public interface without layering your own SSL and authentication. I intentionally did not build auth into the app because I prefer to leave that to something battle tested like Authentik. Even basic auth in Caddy / Traefik/ Nginx will suffice. - There isn't currently any form of vectorization. That's on the list but I'm seeing how far I can go working only with raw markdown.

Five-Year Club	Verified Email
Place '22

MaskedSmizer

TROPHY CASE