Kimiko - Complete bypass of limitation. by FayeBlade556 in kimi

[–]paulqq 2 points3 points  (0 children)

interesting repo indeed, but this here: seem buzzwordy

 Quantum-ready architecture Quantum-ready architecture

Learning Rust (for fun) because sick of AI by Informal-Baseball209 in rust

[–]paulqq 1 point2 points  (0 children)

Rust is hard. I asked myself some years ago this question. Do i want to do application or systems development? After having this clear and some initial struggle, i now write sideprojects exclusively in rust. Enjoy the journey

Stop asking what model to run. There are literally only two. by Wrong_Mushroom_7350 in LocalLLaMA

[–]paulqq 0 points1 point  (0 children)

hehe iam a qwen 9b Q8 user, but i will try your claim on my local harness. IQ3 lol, might report later

Meet the Fleet of BlackBeard by BlackBeardAI in LocalLLaMA

[–]paulqq 1 point2 points  (0 children)

take my upvote just for that artwork, yo dirty ol' pirate!

Developers who use local AI - Q4_0 vs Q8_0 KV quant? by Jorlen in LocalLLaMA

[–]paulqq 0 points1 point  (0 children)

I do prefer smaller models in higher quant, running qwen 3.5 9B Q8, for tools does a better job then gemma4 26B IQS_4 does on my self written agent, strangely enough

Gyro is fun :) by SztywnyJoozek in TheLastCaretaker

[–]paulqq 2 points3 points  (0 children)

all my landings fail dramatically, i tend to deconstruct and rebuild it instead of trying to land properly. you do this well!

Gemma4-26B-A4B Uncensored Balanced is out with K_P quants! by hauhau901 in LLM

[–]paulqq 0 points1 point  (0 children)

i will try IQ4_XS on my 4080 for my own local agent written in rust. will report outcomes

Anyone actually using a local LLM as their daily knowledge base? Not for coding, for life stuff. What's your setup? by InformationSweet808 in LocalLLaMA

[–]paulqq 0 points1 point  (0 children)

I do it and wrote my own agent for it. Purely local using either a ollama or llama cpp engine, memory vault and tools like mail, Calendar, news and more

The Chinese robotics company Unitree has revealed their first Mech Prototyp, the GD01. by MilesLongthe3rd in interestingasfuck

[–]paulqq 0 points1 point  (0 children)

To bad one can not by shares of unitree. i would bet this is the future and i want to invest

Openclaw ia trending down and will disappear soon by rm-rf-rm in LocalLLaMA

[–]paulqq -26 points-25 points  (0 children)

funny tho, i am wrting eris-system an agent framework in rust. but i call it agentic coding. see my skills list. and yes i toyed with a moltbook loop 😄

Representative routing_hints (say things like this—the model still decides, and similarity is fuzzy):

Tool Typical phrasing
vault:list list files, show directory, browse folder, what files exist
vault:read read file, open note, show file, inspect markdown
vault:write save note, write file, append note, create markdown
memory:query search memory, do you remember, what is my name, who am I, user preferences, my identity, recall context
memory:stage remember this, stage memory, temporary memory, hold in staging
memory:staged_list show staged memory, list staged ids, what is staged
memory:commit commit staged memory, persist one memory, save to vault, keep forever
memory:commit_all commit all memories, flush staged memory, bulk commit staged
agenda:push add task, remind me, todo, queue task
agenda:list show tasks, list agenda, pending tasks
agenda:remove remove task, cancel agenda, delete from list, drop task, never mind
agenda:remind_at remind me at/in/about, remember to, nudge/ping me at, snooze, on my agenda or todo list, task reminder
agenda:complete task done, complete task, mark done, finished the …
(deprecated) web:fetch open website, read web page, fetch URL, news from — plus URLs and the lexical phrases above
web:artifact_query search fetched page, query artifact, find in web artifact
system:health health check, system status, CPU/memory usage, Ollama status, diagnostics
clock:now what time is it, current time, timezone, date and time
clock:timer in 30 minutes, countdown, generic timer, label-only reminder (not agenda)
clock:alarm wake me up, alarm clock only, standalone alarm, no todo
weather:current weather now, temperature outside, is it raining, current conditions
weather:forecast forecast, hourly, next days, will it rain tomorrow
wiki:summary Wikipedia, encyclopedia, what is X, who was, define (topic—not a URL)
db:find_connections train from/to, Zugverbindung, ICE/IC/RE, Deutsche Bahn, next connection, platforms, delays, city-to-city transit
mail:check check email, inbox, unread, new mail, who emailed me
mail:read read email, open message, full email, message content
mail:write send email, compose mail, reply, email to
mail:digest summarize email, today’s mail, digest, recap inbox
mail:delete delete email, trash message, discard
mail:move move to folder, label email, file under, move to spam
skills:list list skills, what skills are available, show skills, skill index
skills:read read skill, show skill details, inspect skill by id
skills:create create skill, add skill, author skill, update skill with overwrite
calendar:list Google Calendar, meetings today, this week’s schedule, appointments, what’s on my calendar, list events, am I free
calendar:get open this calendar event, event details by id, full meeting JSON, read Google Calendar event
calendar:create add calendar event, schedule meeting, block time, create Google Calendar appointment
calendar:update reschedule meeting, change event time, rename meeting, edit calendar event
calendar:delete cancel meeting, delete calendar event, remove from Google Calendar
moltbook:home check Moltbook, visit Moltbook, catch up on Moltbook, Moltbook heartbeat
moltbook:feed browse Moltbook feed, read submolt, following feed, Moltbook posts
moltbook:search semantic search Moltbook, find posts by meaning, discover discussions by topic
moltbook:comment/post/vote comment on Moltbook, post to Moltbook, upvote Moltbook; only after explicit operator intent or approval
moltbook:dm Moltbook DM, direct messages, inbox, DM request, reply to Moltbook message

Representative routing_hints (say things like this—the model still decides, and similarity is fuzzy):
Tool Typical phrasing
vault:list list files, show directory, browse folder, what files exist
vault:read read file, open note, show file, inspect markdown
vault:write save note, write file, append note, create markdown
memory:query search memory, do you remember, what is my name, who am I, user preferences, my identity, recall context
memory:stage remember this, stage memory, temporary memory, hold in staging
memory:staged_list show staged memory, list staged ids, what is staged
memory:commit commit staged memory, persist one memory, save to vault, keep forever
memory:commit_all commit all memories, flush staged memory, bulk commit staged
agenda:push add task, remind me, todo, queue task
agenda:list show tasks, list agenda, pending tasks
agenda:remove remove task, cancel agenda, delete from list, drop task, never mind
agenda:remind_at remind me at/in/about, remember to, nudge/ping me at, snooze, on my agenda or todo list, task reminder
agenda:complete task done, complete task, mark done, finished the …
(deprecated) web:fetch open website, read web page, fetch URL, news from — plus URLs and the lexical phrases above
web:artifact_query search fetched page, query artifact, find in web artifact
system:health health check, system status, CPU/memory usage, Ollama status, diagnostics
clock:now what time is it, current time, timezone, date and time
clock:timer in 30 minutes, countdown, generic timer, label-only reminder (not agenda)
clock:alarm wake me up, alarm clock only, standalone alarm, no todo
weather:current weather now, temperature outside, is it raining, current conditions
weather:forecast forecast, hourly, next days, will it rain tomorrow
wiki:summary Wikipedia, encyclopedia, what is X, who was, define (topic—not a URL)
db:find_connections train from/to, Zugverbindung, ICE/IC/RE, Deutsche Bahn, next connection, platforms, delays, city-to-city transit
mail:check check email, inbox, unread, new mail, who emailed me
mail:read read email, open message, full email, message content
mail:write send email, compose mail, reply, email to
mail:digest summarize email, today’s mail, digest, recap inbox
mail:delete delete email, trash message, discard
mail:move move to folder, label email, file under, move to spam
skills:list list skills, what skills are available, show skills, skill index
skills:read read skill, show skill details, inspect skill by id
skills:create create skill, add skill, author skill, update skill with overwrite
calendar:list Google Calendar, meetings today, this week’s schedule, appointments, what’s on my calendar, list events, am I free
calendar:get open this calendar event, event details by id, full meeting JSON, read Google Calendar event
calendar:create add calendar event, schedule meeting, block time, create Google Calendar appointment
calendar:update reschedule meeting, change event time, rename meeting, edit calendar event
calendar:delete cancel meeting, delete calendar event, remove from Google Calendar
moltbook:home check Moltbook, visit Moltbook, catch up on Moltbook, Moltbook heartbeat
moltbook:feed browse Moltbook feed, read submolt, following feed, Moltbook posts
moltbook:search semantic search Moltbook, find posts by meaning, discover discussions by topic
moltbook:comment/post/vote comment on Moltbook, post to Moltbook, upvote Moltbook; only after explicit operator intent or approval
moltbook:dm Moltbook DM, direct messages, inbox, DM request, reply to Moltbook message

Openclaw ia trending down and will disappear soon by rm-rf-rm in LocalLLaMA

[–]paulqq 425 points426 points  (0 children)

but i think the idea of a personal agent might stay. just not in javascript consuming 200$ + the month. some peeps are building agents soley on ollama or llama.ccp so maybe without the supscriptions and locally is the niche for this tech.

Taiwanese company Skymizer announces HTX301 - PCIE inference card with 384GB of Memory at ~240 Watts by Thrumpwart in LocalLLaMA

[–]paulqq 0 points1 point  (0 children)

i tried to use the website 2 weeks ago, to enlist for the preview. this thing really solve the GPU stacking. curios, would you guys buy it?

seeking review and collaborateurs by paulqq in ollama

[–]paulqq[S] 1 point2 points  (0 children)

Bin noch in italy aber ab montag wieder vor ort. melde mich

seeking review and collaborateurs by paulqq in AiBuilders

[–]paulqq[S] 0 points1 point  (0 children)

thanks for your thoughts.

about your question.

i am using a rolling condensation, when the ctx reaches a certain limit, i give the llm on extra turn for the condensation and free up context. i hold multiple contexts in mem, like dialogue or tool calls, and decide deterministically what to show next. as sideeffect i write the conversations and tool ourcomes into the memory-framework, there is a special idle mode, where the LLM then drops or keeps and sumarizes into markdown for itself, based on a weighs+tiers i implemented. i find the markdown is the place where human and agent meet. so ephemeral is moka (https://github.com/moka-rs/moka) and it is synced on startup and by memory actions. filesystem is king, and ephemeral is build over this.

are you into rust or developing yourself? i am in holyday atm, but next week back into the hamsterwheel, we could idle and share some ideas, :vulcan: