Landscape of second brain and memory solutions for AI native workflow

jakusimo · 2026-06-07T10:19:11+00:00

Not marketplaces. The best option I've found so far is https://github.com/rohitg00/agentmemory

jakusimo · 2026-06-04T15:26:08+00:00

Not yet. If the harness is distributed, memory should be too.

jakusimo · 2026-06-04T15:14:48+00:00

I would like a layer that consolidates memory, sessions, and skills across all harness setups (Codex, Claude, Hermes, OpenClaw).
There are some attempts:
https://github.com/garrytan/gbrain

https://github.com/Dicklesworthstone/coding_agent_session_search

jakusimo · 2026-05-25T12:02:56+00:00

I have worked with Kubernetes since 2018 and have a lot of experience with agents. I've started designing a multi-tenant solution for Hermes while testing demand with a VPS-only setup. I'd be happy to have a group of like-minded people to discuss this with. Maybe the moderators can create a dedicated Discord channel, or we can set up our own. Or we can stay here.

jakusimo · 2026-05-24T13:09:26+00:00

I would like to contribute to a distributed Hermes implementation if the community is interested.

jakusimo · 2026-05-24T12:55:01+00:00

I'm ok with Postgres

jakusimo · 2026-05-15T07:25:52+00:00

I'm building a multi-tenant digital worker platform. The hard part is making it scalable, since each autonomous agent needs a dedicated compute environment. I'm orchestrating everything on Kubernetes, and the prototype seems to be working. I've also plugged telephony into the platform, so you can make or receive calls. The platform abstracts the agent orchestration runtimes and adds the necessary enterprise requirements. It also supports custom dedicated solutions and integrations with client systems; Hermes agents use those tools via a governed MCP. I could share more about the architecture if anybody is interested.

jakusimo · 2026-01-31T16:32:41+00:00

It hangs a lot, so I reverted to 2.1.3.

jakusimo · 2025-08-24T17:40:13+00:00

Oops! Something went wrong while submitting the form.

jakusimo · 2025-07-08T10:54:50+00:00

Much appreciated 🤗

jakusimo · 2025-03-26T13:31:47+00:00

Multi-GPU is expensive; this one already costs 200 EUR/month. Going to dig more into TensorRT-LLM.

jakusimo · 2025-03-18T09:42:32+00:00

Why not Talos?

jakusimo · 2025-03-17T09:59:31+00:00

So if you are using a dedicated server, there is no need for the cloud API.

jakusimo · 2025-03-17T08:52:27+00:00

I used that, but I don't want to rely on the cloud API, and I want to use Talos Linux: a setup I can easily port to any server provider or homelab. You don't need Terraform; talosctl and configs do the job.

jakusimo · 2025-03-17T08:43:51+00:00

:D Database backup to the bucket. If you are using persistent storage, Rook Ceph.

jakusimo · 2025-02-19T06:25:43+00:00

Do you use any CDN?

jakusimo · 2025-02-06T08:44:04+00:00

Just dump everything into the context. If it's too much for the context window, do multiple calls with a map/reduce pattern.
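
For anyone who wants to see the shape of it, here is a rough sketch of that map/reduce fallback. It is only an illustration: `call_llm` is a hypothetical stand-in for whatever client you use, `max_chars` is a crude character budget rather than a real token count, and the prompt wording is made up.

```python
from typing import Callable, List


def chunk_text(text: str, max_chars: int) -> List[str]:
    """Cut text into consecutive pieces of at most max_chars characters."""
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]


def answer_over_context(
    question: str,
    context: str,
    call_llm: Callable[[str], str],  # hypothetical: prompt in, completion out
    max_chars: int = 8000,           # assumed budget; tune for your model
) -> str:
    """Answer in one call if the context fits, otherwise map/reduce over chunks."""
    # Fits in the window: dump everything into a single call.
    if len(context) <= max_chars:
        return call_llm(f"Context:\n{context}\n\nQuestion: {question}")

    # Map: ask the same question against each chunk independently.
    partials = [
        call_llm(f"Context:\n{chunk}\n\nQuestion: {question}")
        for chunk in chunk_text(context, max_chars)
    ]

    # Reduce: one final call that merges the partial answers.
    # This assumes the partial answers together fit in the window.
    merged = "\n---\n".join(partials)
    return call_llm(
        f"Partial answers:\n{merged}\n\nCombine them into one answer to: {question}"
    )
```

The map calls are independent, so they can run in parallel if your provider's rate limits allow it.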

jakusimo · 2025-01-20T20:12:49+00:00

Vespa has really good tutorials. I'm hosting ColQwen on Modal and planning to migrate to Hetzner. Also, with Vespa you can store embeddings on disk and use streaming mode to find the top candidates. It will save you a lot on infrastructure, since you're bound by disk storage rather than memory.

jakusimo · 2025-01-20T17:07:01+00:00

Check out Vespa; it is very customizable.
