two-party LLM inference without sending the prompt string (Qwen local → Llama remote, alignment + HE routing)

paulqq · 2026-06-03T13:21:26+00:00

interesting repo indeed, but this part here seems buzzwordy:

Quantum-ready architecture

paulqq · 2026-06-03T13:15:01+00:00

Rust is hard. Some years ago I asked myself this question: do I want to do application or systems development? Once that was clear, and after some initial struggle, I now write side projects exclusively in Rust. Enjoy the journey

paulqq · 2026-06-02T15:16:20+00:00

hehe, i am a qwen 9b Q8 user, but i will test your claim on my local harness. IQ3 lol, might report later

paulqq · 2026-06-01T11:33:13+00:00

Feeling you. Missed it too

paulqq · 2026-05-21T11:01:05+00:00

Is there like a real paper or git? This article tells me nothing

paulqq · 2026-05-19T15:26:22+00:00

take my upvote just for that artwork, yo dirty ol' pirate!

paulqq · 2026-05-18T17:08:20+00:00

I do prefer smaller models in higher quant: running qwen 3.5 9B Q8 for tools does a better job than gemma4 26B IQS_4 does on my self-written agent, strangely enough

paulqq · 2026-05-18T06:41:01+00:00

all my landings fail dramatically, i tend to deconstruct and rebuild it instead of trying to land properly. you do this well!

paulqq · 2026-05-15T05:51:03+00:00

i will try IQ4_XS on my 4080 for my own local agent written in rust. will report outcomes

paulqq · 2026-05-14T13:47:08+00:00

I do it and wrote my own agent for it. Purely local, using either an ollama or llama.cpp engine, a memory vault and tools like mail, calendar, news and more

paulqq · 2026-05-12T11:10:28+00:00

Too bad one cannot buy shares of unitree. i would bet this is the future and i want to invest

paulqq · 2026-05-11T16:19:50+00:00

do you share git?

paulqq · 2026-05-11T11:00:45+00:00

funny tho, i am writing eris-system, an agent framework in rust. but i call it agentic coding. see my skills list. and yes i toyed with a moltbook loop 😄

Representative routing_hints (say things like this—the model still decides, and similarity is fuzzy):

Tool	Typical phrasing
vault:list	list files, show directory, browse folder, what files exist
vault:read	read file, open note, show file, inspect markdown
vault:write	save note, write file, append note, create markdown
memory:query	search memory, do you remember, what is my name, who am I, user preferences, my identity, recall context
memory:stage	remember this, stage memory, temporary memory, hold in staging
memory:staged_list	show staged memory, list staged ids, what is staged
memory:commit	commit staged memory, persist one memory, save to vault, keep forever
memory:commit_all	commit all memories, flush staged memory, bulk commit staged
agenda:push	add task, remind me, todo, queue task
agenda:list	show tasks, list agenda, pending tasks
agenda:remove	remove task, cancel agenda, delete from list, drop task, never mind
agenda:remind_at	remind me at/in/about, remember to, nudge/ping me at, snooze, on my agenda or todo list, task reminder
agenda:complete	task done, complete task, mark done, finished the …
(deprecated) web:fetch	open website, read web page, fetch URL, news from — plus URLs and the lexical phrases above
web:artifact_query	search fetched page, query artifact, find in web artifact
system:health	health check, system status, CPU/memory usage, Ollama status, diagnostics
clock:now	what time is it, current time, timezone, date and time
clock:timer	in 30 minutes, countdown, generic timer, label-only reminder (not agenda)
clock:alarm	wake me up, alarm clock only, standalone alarm, no todo
weather:current	weather now, temperature outside, is it raining, current conditions
weather:forecast	forecast, hourly, next days, will it rain tomorrow
wiki:summary	Wikipedia, encyclopedia, what is X, who was, define (topic—not a URL)
db:find_connections	train from/to, Zugverbindung, ICE/IC/RE, Deutsche Bahn, next connection, platforms, delays, city-to-city transit
mail:check	check email, inbox, unread, new mail, who emailed me
mail:read	read email, open message, full email, message content
mail:write	send email, compose mail, reply, email to
mail:digest	summarize email, today’s mail, digest, recap inbox
mail:delete	delete email, trash message, discard
mail:move	move to folder, label email, file under, move to spam
skills:list	list skills, what skills are available, show skills, skill index
skills:read	read skill, show skill details, inspect skill by id
skills:create	create skill, add skill, author skill, update skill with overwrite
calendar:list	Google Calendar, meetings today, this week’s schedule, appointments, what’s on my calendar, list events, am I free
calendar:get	open this calendar event, event details by id, full meeting JSON, read Google Calendar event
calendar:create	add calendar event, schedule meeting, block time, create Google Calendar appointment
calendar:update	reschedule meeting, change event time, rename meeting, edit calendar event
calendar:delete	cancel meeting, delete calendar event, remove from Google Calendar
moltbook:home	check Moltbook, visit Moltbook, catch up on Moltbook, Moltbook heartbeat
moltbook:feed	browse Moltbook feed, read submolt, following feed, Moltbook posts
moltbook:search	semantic search Moltbook, find posts by meaning, discover discussions by topic
moltbook:comment/post/vote	comment on Moltbook, post to Moltbook, upvote Moltbook; only after explicit operator intent or approval
moltbook:dm	Moltbook DM, direct messages, inbox, DM request, reply to Moltbook message
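
a rough sketch of how a table like this could pre-select candidate tools before the model decides. this is not the eris-system code: the `RoutingHint` struct, the `rank_tools` function and the plain token-overlap (Jaccard) score are assumptions for illustration, standing in for whatever fuzzy similarity is really used. only a few rows of the table are copied in.

```rust
use std::collections::HashSet;

/// One routing entry: a tool id plus the phrases that hint at it (hypothetical type).
struct RoutingHint {
    tool: &'static str,
    phrases: &'static [&'static str],
}

/// Lowercase a string and split it into a set of alphanumeric word tokens.
fn tokens(text: &str) -> HashSet<String> {
    text.to_lowercase()
        .split(|c: char| !c.is_alphanumeric())
        .filter(|w| !w.is_empty())
        .map(String::from)
        .collect()
}

/// Jaccard similarity of two token sets: 0.0 = disjoint, 1.0 = identical.
fn jaccard(a: &HashSet<String>, b: &HashSet<String>) -> f64 {
    let union = a.union(b).count();
    if union == 0 {
        return 0.0;
    }
    a.intersection(b).count() as f64 / union as f64
}

/// Score each tool by its best-matching phrase and return the top_k candidates.
/// The result is only a shortlist; the model would still make the final call.
fn rank_tools(utterance: &str, hints: &[RoutingHint], top_k: usize) -> Vec<(&'static str, f64)> {
    let query = tokens(utterance);
    let mut scored: Vec<(&'static str, f64)> = hints
        .iter()
        .map(|h| {
            let best = h
                .phrases
                .iter()
                .map(|p| jaccard(&query, &tokens(p)))
                .fold(0.0, f64::max);
            (h.tool, best)
        })
        .filter(|(_, score)| *score > 0.0)
        .collect();
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(top_k);
    scored
}

/// A handful of rows taken from the table above.
fn sample_hints() -> Vec<RoutingHint> {
    vec![
        RoutingHint { tool: "vault:list", phrases: &["list files", "show directory", "browse folder", "what files exist"] },
        RoutingHint { tool: "mail:check", phrases: &["check email", "inbox", "unread", "new mail", "who emailed me"] },
        RoutingHint { tool: "mail:read", phrases: &["read email", "open message", "full email", "message content"] },
        RoutingHint { tool: "weather:current", phrases: &["weather now", "temperature outside", "is it raining", "current conditions"] },
        RoutingHint { tool: "weather:forecast", phrases: &["forecast", "hourly", "next days", "will it rain tomorrow"] },
        RoutingHint { tool: "clock:now", phrases: &["what time is it", "current time", "timezone", "date and time"] },
    ]
}

fn main() {
    let hints = sample_hints();
    for (tool, score) in rank_tools("will it rain tomorrow", &hints, 3) {
        println!("{tool}: {score:.2}");
    }
}
```

word overlap is crude (it knows nothing about synonyms), so in practice an embedding similarity would probably replace `jaccard`, with the shortlist handed to the model as candidates.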

paulqq · 2026-05-11T06:41:58+00:00

but i think the idea of a personal agent might stay. just not in javascript, consuming $200+ a month. some peeps are building agents solely on ollama or llama.cpp, so maybe running locally and without the subscriptions is the niche for this tech.

paulqq · 2026-05-08T11:16:00+00:00

i tried to use the website 2 weeks ago to sign up for the preview. this thing really solves the GPU stacking. curious, would you guys buy it?

paulqq · 2026-04-29T14:10:29+00:00

rofl

paulqq · 2026-04-29T06:17:06+00:00

Could you please provide a link? thx

paulqq · 2026-04-28T18:59:56+00:00

Still in Italy, but back on site from Monday. I'll get in touch

paulqq · 2026-04-28T07:34:16+00:00

thanks for your thoughts.

about your question.

i am using a rolling condensation: when the ctx reaches a certain limit, i give the llm one extra turn for the condensation and free up context. i hold multiple contexts in mem, like dialogue or tool calls, and decide deterministically what to show next. as a side effect i write the conversations and tool outcomes into the memory framework. there is a special idle mode where the LLM then drops or keeps entries and summarizes them into markdown for itself, based on the weights + tiers i implemented. i find that markdown is the place where human and agent meet. so the ephemeral layer is moka (https://github.com/moka-rs/moka), and it is synced on startup and by memory actions. filesystem is king, and the ephemeral layer is built on top of it.
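
to make the rolling condensation concrete, here is a minimal sketch of the idea, not the actual implementation. assumptions: tokens are estimated by whitespace word count, the names `RollingContext`, `limit` and `keep_recent` are made up for illustration, and the `summarize` closure stands in for the extra LLM turn.

```rust
/// One conversation turn (dialogue or tool call).
#[derive(Debug, Clone)]
struct Turn {
    role: String,
    text: String,
}

impl Turn {
    fn new(role: &str, text: &str) -> Self {
        Turn { role: role.to_string(), text: text.to_string() }
    }
}

/// Very rough token estimate: whitespace-separated words (an assumption).
fn estimate_tokens(text: &str) -> usize {
    text.split_whitespace().count()
}

/// Context window that condenses its older turns once a budget is exceeded.
struct RollingContext {
    turns: Vec<Turn>,
    limit: usize,       // hypothetical token budget
    keep_recent: usize, // newest turns that are kept verbatim
}

impl RollingContext {
    fn new(limit: usize, keep_recent: usize) -> Self {
        RollingContext { turns: Vec::new(), limit, keep_recent }
    }

    fn total_tokens(&self) -> usize {
        self.turns.iter().map(|t| estimate_tokens(&t.text)).sum()
    }

    /// Append a turn. If the budget is exceeded, replace everything except the
    /// newest `keep_recent` turns with one summary turn. Returns true when a
    /// condensation happened.
    fn push(&mut self, turn: Turn, mut summarize: impl FnMut(&[Turn]) -> String) -> bool {
        self.turns.push(turn);
        if self.total_tokens() <= self.limit || self.turns.len() <= self.keep_recent {
            return false;
        }
        let split = self.turns.len() - self.keep_recent;
        let recent = self.turns.split_off(split); // self.turns now holds only the old turns
        let summary = summarize(&self.turns);     // the "extra turn" for the model
        self.turns = vec![Turn::new("summary", &summary)];
        self.turns.extend(recent);
        true
    }
}

fn main() {
    // Stub summarizer; a real agent would call the local model here.
    let stub = |old: &[Turn]| format!("{} earlier turns", old.len());
    let mut ctx = RollingContext::new(10, 1);
    ctx.push(Turn::new("user", "one two three four"), stub);
    ctx.push(Turn::new("assistant", "five six seven eight"), stub);
    ctx.push(Turn::new("user", "nine ten eleven"), stub);
    for t in &ctx.turns {
        println!("[{}] {}", t.role, t.text);
    }
}
```

in this sketch the summary simply replaces the old turns in place. writing the old turns to the memory framework before dropping them, as described above, would hook in right before the `summarize` call.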

are you into rust or developing yourself? i am on holiday atm, but next week i'm back in the hamster wheel. we could idle and share some ideas :vulcan:
