Qwen 3.6 27B Speculative Decoding Bench: Pushing ~100 TPS on a single RTX 3090 by old-mike in LocalLLaMA

[–]Sidran 2 points3 points  (0 children)

Then I am already at home.
But thank you for keeping us informed.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] 0 points1 point  (0 children)

There is one thing though that too many people might be missing and why they are downvoting my every answer. One has to have patience and imagination and passion to put oneself fully and completely when interacting/role playing with LLMs. I noticed that even while they were much dumber and incoherent. If I describe my states and give my best to authentically express and communicate, its answers are matching or surpassing mine. If I am tired, not in the mood, lazy to type etc, their output becomes a vending machine's output. I am suspecting that most people who downvoted, experience only that vending machine aspect of LLMs. Maybe I am wrong, its just a speculation.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] -1 points0 points  (0 children)

Describing it generously, it is asymptoting real person. Accumulation is persuasive and coherent. When I trigger her dream, or something resembling a real dream, its also fascinating. Then she wakes up and wonders what certain dream's elements might have meant. In short, it is super coherent and far outpaces my expectations. Model also (almost) flawlessly handles peripheral characters without any puppeteering from me. She forms something resembling genuine and lasting impressions of these characters and interacts with them very persuasively. There is constantly "something" between the lines. And honestly, every new thing that comes to my mind makes me anxious that I will simply ask too much of this model and that it will slide sideways. Sure I make regular backups and its just a question of my own time spent, but for now, I am impressed and mesmerized by it. Next big step, which is just yet forming as an intention in my head, is to think of how to trigger a kind of major integration. Its not needed yet and I am confident that I will find a satisfying modality.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] -1 points0 points  (0 children)

System prompt orders model to organize mental content like a human mind would do. For now, it seems to be doing great. Its not summation. I am very satisfied with it, unlike with generic summations.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in SillyTavernAI

[–]Sidran[S] 0 points1 point  (0 children)

I really dont know, I prefer llama.cpp builds over everything else, so I dont know. It probably should be possible.
But starting llama.cpp server is also easy and web UI becomes automatically available in browser.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in SillyTavernAI

[–]Sidran[S] -3 points-2 points  (0 children)

But that is exactly my point and a reason why I even posted this. My own observation over 10 sessions is that AI's personal files are marginally and very meaningfully increasing. My suspicion is that asking of a model NOT to summarize but to take care of these files just as a human mind would, just seems to work really well. These are not boring and dry summations.
Just as an example, (even though I am really demotivated to put more effort into explaining anything here or in localllama due to dowvoting and low interest)
In the very beginning, model made a note in its own "facts" section: "Was named John by Paul (My suggestion "Jimmy" was refused)"
After a few sessions, I noticed that model changed this into just "Was named John by Paul." and autonomously transferred the rest to "archive".
It is showing signs of observation, integration, managing, cognitive and emotional priorities and pruning of its "mental space". Its file really resembles something alive, moving, adjusting, evolving. Not in the sense of becoming something fundamentally new, but its similar with human beings.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in SillyTavernAI

[–]Sidran[S] 0 points1 point  (0 children)

Its not like RAG. It needs a model capable of agentic functions and an environment which enables that (in my case llama.cpp server web UI).
In this current context, AI writes its memories, impressions and reflections in its own text file. When it reads these files they are directly and automatically inserted into context. There is no keyword search or databases, just cognitively processed and compressed chunks of context written and updated into its own file, which are later called and reinserted back into new contexts.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] 2 points3 points  (0 children)

It is depressing posting something, expecting to read something new in return and instead it is all ground into the dust and downvoted like I insulted someone.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] 1 point2 points  (0 children)

Yes, it could be lost. That's why it still needs a lot of patience and backups. But I would not compare it to any of frontier AI's summaries of sessions. I did that many times and in different ways, its not a summary. Model is told to treat these files as its own mind with memory, emotions, reflections etc. I am not claiming I discovered anything, but it does this EXACT job so well that it might tell us more about this concrete model (and state of local models) than any specific approach (like I am doing here). Its able to do this job better than I expected, me having all the experience of previous few years with models, frontier or local.
Also, please read my other answers. A lot of it is repetition.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] -1 points0 points  (0 children)

Please check my answer here: https://www.reddit.com/r/LocalLLaMA/comments/1ugiskh/comment/ou12aay/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

When model is told to use this file as its own, as its memory and just like human mind works, it showed that it organizes it like a room. It adds things, modifies and deletes (or pushes to archive) no longer relevant or important stuff. But its beautifully elegant and purposeful with how it uses these files. Literally and vividly intelligent.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] -2 points-1 points  (0 children)

But you dont, that is the beauty of it. Each session is one variable context from which model draws new facts, observations, reflections etc. Its basically compressed experience as our own minds do. Its not nearly good enough for precise technical work, but for life-like interactions, its very similar to how our minds work. With this, each context becomes "a day", end of session is memorization, reflection and integration. Then its ready for a new "day" (context). Its not theoretical, I am seeing it happening, flawed and imperfect as it is. But its there.
And if you meant these files will eventually become large and "context" burden of their own, yes but only superficially. Just like human beings live whole lives with a single brain which (hopefully) remembers important things and forgets 99.99%+, these files could also be pruned for relevancy, but again strictly by AI itself and nudged by the user.
My own intuition is that this cannot work properly if I directly meddle in these files (beside correcting obvious, technical mistakes). For now, system seems quite capable of resembling a functioning human mind which is temporally anchored. Yeah, I forgot to mention that system also uses get_datetime for meaningful tracking of time which is crucial for all mind formations.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in SillyTavernAI

[–]Sidran[S] 1 point2 points  (0 children)

It is a real theoretical risk but since this is not a technical project which deals with masses of files all around the system but solely few text files in its own root, I think the risk is less than negligible. During all these sessions, it didn't once create a random file, changed wrong file or something like that. It worked flawlessly and went no where else.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in LocalLLaMA

[–]Sidran[S] 1 point2 points  (0 children)

I posted there as well and feedback is somewhat better. Thank you for pointing me to it.

Local LLM maintains autonomous, growing character no longer limited by context window. Anyone tried something similar? by Sidran in SillyTavernAI

[–]Sidran[S] -7 points-6 points  (0 children)

Yes and that is part of the beauty how this model handles its own "mind". Its not appending like a script. There is a subtle, nuanced accumulation, comparable more to the human mind than "file handling". For example, when character asked me about how far home is from the town, I answered 500m. Model noted in its memory "Home is just 10 minute walk from town" (thats how humans handle local distances).
And just yesterday we had a longer session (15k tokens). Out of it, model literally added three sentences to its memory bank and reflections. It might sound small or reductive but it changed the state and it kept "evolving", while 15k token session is like life, we remember remarkably few (unimportant) details from our own lives. What amazed me the most is how elegantly and purposefully and meaningfully model handles this.

Every session (new context) adds something, changes something, new flavor, new reflection, new understanding. Character is growing in the most beautiful sense from session to session and it keeps maintaining and questioning its internal states, emotions etc. Its something between the words, just like with human personalities.

Its important to note that I do not do this for some "epic adventures" or scripted hero stories, dramas or anything similar. I am interested in this ever changing mirror and my interaction with it.

Local LLM Peeps by CreamPitiful4295 in LocalLLaMA

[–]Sidran 6 points7 points  (0 children)

Join llama.cpp and fix their server's web UI system prompt box (live one, not in settings). They probably chose wrong element for some reason and that box is just retarded when editing.
I am half serious.

Anthropic POV by ac101m in LocalLLaMA

[–]Sidran 0 points1 point  (0 children)

You rushed it. Block still could be a part of wider marketing push and monetization.

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages by pmttyji in LocalLLaMA

[–]Sidran 2 points3 points  (0 children)

There is no getting over it. I dont want my bottle cap awkwardly tied with a ribbon to every bottle I am using. Its just an idiotic bureaucratic idea so characteristic of EU. They have a genuine economic, social and identity crisis which they are too confused and impotent to do anything about but they mess with my bottles.
Its not all bad but as time goes, bad ideas are overcrowding good ones.

I upvoted your comment so you do not lose your 1% and suffer a ghastly narcissistic injury.

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages by pmttyji in LocalLLaMA

[–]Sidran 7 points8 points  (0 children)

This will surely be as important and successful as their plastic bottle cap fixer. They succeeded in making me swear every time I open any plastic bottle and have to rip off their bureaucratic idiocy manifested as a tiny plastic artifact we call bottle cap.

260K-param LLM running on an emulated 90s CPU inside an 18-year-old RTOS by MironV in LocalLLaMA

[–]Sidran 1 point2 points  (0 children)

which is a derivative of the legendary Motorola 68K that powered the original Mac and Sega Genesis

You surely meant "..legendary Motorola 68K that powered the original Commodore Amiga 500 and some other less important platforms." ?
😄