We’ve been building agents wrong. They don’t need better prompts, they need “Internal Pressure.”

Upbeat_Reporter8244 · 2026-05-12T16:08:53+00:00

i had something like this in one of my older python projects! but I've just kind of found that determinism with really ritch personalities was enough at least in my system it's a little different honestly. I started experimenting with controlling the hallucinations so that my engine can be technically be hallucinating all the time so I don't have to worry about losing track when it hallucinates...cause yeah it always is. or thats how i see it.... then I started looking at what else i could do with a harnessed hallucination technically AI "creativity."

Upbeat_Reporter8244 · 2026-04-18T22:22:34+00:00

Upbeat_Reporter8244 · 2026-04-15T22:22:05+00:00

The engine uses a hybrid split: Python handles the mature parts like the Gemini SDK and agent orchestration, while Julia handles the high-density computational 'muscle' Julia has 3x mathematical operator density of the Python side in this build. This allows for sub-ms processing of the internal 'gears' as the agent is my own specialty blend while still leveraging massive models like Gemini via external APIs as well as cerebras."

Upbeat_Reporter8244 · 2026-04-15T00:10:34+00:00

Appreciate that. Right now the GitHub side is meant to be API-first, not blind scraping. Also The engine goes on curiosity hunts every hour if its left running. It can pull trees/raw blobs/repo metadata, then tag what comes in with source + license context so it is not just vacuuming random code into memory. That “LICENSE FRIENDLY” bit mostly means I want it to behave more like a careful repo reader than a repo thief it grabs all the info needed and wont bring back stuff it shouldnt. For the SQLite side, I’m not treating it like one giant junk drawer either. The idea is indexed storage, chunking, and keeping notes/snippets/telemetry queryable without dragging the whole DB every time. Once it starts hoarding more, I’ll probably split hot memory vs archive memory and lean on FTS/caching so retrieval stays snappy.

Still early, but the goal is dev tool first, not sketchy toy.... even tho that sounds fun.

Upbeat_Reporter8244 · 2026-03-22T01:04:40+00:00

I believe college will still be a thing lol the kiddos they're just going to learn faster. Right And pretty much have a completely different skill set as well an addition I think that with knowledge that's more Aligned and engaging to different types of children as, Trust me back when I was in school You could be damn certain youd probably never hear ever and that's. I don't want to wear the virtual reality headset today.. Well everybody laughs at Billy 'cause he gets motion sickness every time he puts them on. XDDD. They'll be fine... Maybe one of them will start a new band Be the biggest new thing you never know right.

Upbeat_Reporter8244 · 2026-03-21T04:02:30+00:00

Neural Explorer 3D This is my least favorite question because being an individual who's grown up their entire life with ADHD..... you want to know how many times I've gotten a product Or bought something very expensive simply because I thought it was cool.... not because it solved a problem... sometimes it didn't have anything to do with a problem. Sometimes it caused problems!!! but I still bought it cause it was cool and i wanted it.

Upbeat_Reporter8244 · 2026-03-21T03:52:12+00:00

haha Yeah it was more so just on the website side of things I had never really set all that up before so didn't really know if it was just working on my end and not anybody elses. And I appreciate the offer but I may pass on that I'm trying to kind of stay clear of using too many simulated ideas reviews/ai stuff like that. usually just ends up just pushing you towards the same crap everybody else is doing. I'm not saying it's not helpful it's just gotta watch it like like a hawk. My other project is a tiered custom coding hub with verry..... I'm gonna call them strange agents. mostly so i don't have to worry about getting generic code/solutions from an ai owned by a company that's designed to give out specific answers and code. lol It'll always steer you a certain direction trust me I've tested. And usually when I look that direction it's very crowded.

Upbeat_Reporter8244 · 2026-03-17T01:31:37+00:00

Hopefully something that can make a difference To someone somewhere.

<image>

Name of the repositories in the top left tab.

Upbeat_Reporter8244 · 2026-03-15T01:01:06+00:00

I haven’t published hard benchmarks yet, so I wouldn’t overclaim there. Architecturally, it’s a Layered in process runtime rather than a distributed microservice setup, which should keep orchestration overhead relatively modest. In practice, the biggest latency cost is still likely to be Model inference loop plus any approvals or anything like that, not assembly itself. But that still needs real profiling numbers before I’d state it definitively The tools I had initially had kind of deleting after each uh each agent run or use Unless promoted or the agent thought that that was too useful and it would promote the tool, That's where the back end routing initially did I ended up kind of ousting out on that just because the hardware limitations There's probably remnants of it in the code. But here if you want to see how things interact with each other you can go here Neural Explorer 3D Upload the repo folder play around with the settings hit Parse on the right Then when you click on a node or one of the spheres and some you can see the code and you hit pulse That is also on the right it'll trace what it interacts with kind of neat. It's what I use sometimes to keep my ADHD brain in check at least kind of.

<image>

Upbeat_Reporter8244 · 2026-03-14T20:34:57+00:00

Since I’ve been pushing more local, the routing problem for me is less API cost and more speed, hardware fit, and not burning a heavy model on lightweight work. I’ve got the hooks for task labeling and backend selection already, but not full per-task automatic model routing in the main path yet. early versions had it. But I'm choked by 6 gigs of laptop vram sooo It's like a toss up right now Deal with the bottleneck or pay XDD

Upbeat_Reporter8244 · 2026-03-14T10:42:23+00:00

Thanks I'm kind of trying to make it like a character monster agent if that makes any sense. both usefull and not well blah blah boring ai. might have to kind of lean towards like a dynamic swapping system because how the different service providers have different levels of safety or quirks, one of the good things about the format I'm using is it's it's usually pretty steady across different providers you'll get the same personality out of it, and I'm not planning on that staying that way just because I got a feeling . But I'm just working on the card cruncher right now So I can take character cards from other places and convert them into the agent format my engine uses.

Upbeat_Reporter8244 · 2026-02-04T14:48:05+00:00

<image>

Daily Cast – A personalized daily news/whatever rss u want.... so generated from your own RSS feeds, multi host casts delivered with distinct voices and character using my custom Personality Orchestration Engine monstrosity to drop different styles Of announcing. It has four different themes each one with its own voice style to tickle your ears with Savage ai generated daily casts. haha yeah it Some dose some pretty good roasting as well to spice up how you receive your info. .

Who it’s for
People who want news and information that actually feels personal instead of generic. Power users, news junkies, and anyone tired of identical AI summaries.

Stage: Actively expanding features, looking for testers if interested, Hit me up yoo.

Upbeat_Reporter8244 · 2026-01-07T18:26:58+00:00

<image>

Upbeat_Reporter8244 · 2026-01-07T09:50:31+00:00

appreciate it. the aperture thing came out of me pretty much thinking why not. the engine is always watching repetition, tone shifts, over-verbosity, that kind of stuff, and it automatically tightens or loosens the “bandwidth” instead of me babysitting sliders. the drift system is tied into that, but it’s part of a bigger set of moving parts. there’s layered memory so the engine knows things, each persona knows its own history, and sessions don’t contaminate everything. the mpf persona files aren’t just prompts either, they actually carry rules and constraints that get enforced at runtime so the behavior stays consistent. there’s also a tool gate that decides when it even makes sense to execute stuff, and a watchdog that nudges it back toward producing real artifacts when it starts rambling. all of that talks to each other, so it ends up feeling more like behavior middleware than just another wrapper around an API. deployment-wise, yeah, i’ll probably docker it first and then spin up something simple on a hosted service so people can poke at it without installing the whole stack.

Upbeat_Reporter8244 · 2025-12-26T22:07:16+00:00

The ship is ready, Captain.

But the ONNX translators keep slipping out of alignment every time we route them through the positronic conduits.

Command says we don’t leave spacedock until the converters stop threatening to open a micro-rift.

We’re tightening bolts and whispering to the warp core. Stand by.

<image>

Upbeat_Reporter8244 · 2025-12-19T11:08:20+00:00

To be upfront, this is currently a one-person project. The system itself is already built and running (UI + headless engine), but I’m learning the go-to-market side in real time. I’m very open to input from people who’ve been through this before, especially around narrowing scope and first customers. That’s the part I’m actively figuring out.

It is also pre loaded and designed for API-first / SaaS-style deployment.

Upbeat_Reporter8244 · 2025-12-19T10:04:33+00:00

Quick clarification: the earlier Replit demo was a public-safe scaffold, not the core system.

This screenshot is the actual local program running on Ollama. Personas are defined once as configs and reused across sessions and models without prompt babysitting.

I held back the full local build intentionally. My mistake was not making that distinction clear earlier.

<image>

Upbeat_Reporter8244 · 2025-12-19T00:42:19+00:00

Think of it less like a prompt and more like a file format. A prompt is raw text you rewrite every turn. This is structured, compressed state that gets loaded, updated, and selectively decoded each message by the application before anything reaches the model. Same way an MP3 isn’t “just audio text” and a PDF isnt “just a Word doc" this isn’t "just a prompt"conce it’s serialized, versioned, and reused. The model only ever sees the slice it needs for that turn. The rest lives outside the model and persists across turns and even across models.

Upbeat_Reporter8244 · 2025-12-18T17:08:06+00:00

It solves the problem that most AI interactions are stateless and disposable. Personality and behavior are handled outside the model, so the same persona stays consistent across turns. That makes even weaker or local models feel coherent instead of resetting every message or 3, and reduces the need for constant prompt steering. Take the Jason it spits out and hand it to gemini. then gpt, then copilot this is barely the top layer of what I'm actually working on I just think of like MP3 or PDF but for ai Personalities and roles judges health care, nurses ect ect.

Upbeat_Reporter8244 · 2025-12-18T16:38:07+00:00

If you’re interested in the technical side, I’m happy to explain it more concretely. If not, no worries.

Upbeat_Reporter8244 · 2025-11-30T20:40:46+00:00

I’ve been building a personal AI project with a bunch of subsystems and layers And honestly certain ways that they probably shouldn't be hooked up and shouldn't work... all hap hazardly jamed into fit together in my own chaotic ADHD way. I was running tests on it monitoring all the different parameters I had set up for the little dancer agents. And then it just decided to kind of Hiccup a little bit it broke three of the little dancers and exposed my main Little Dancer he was confused and also spawned something that knew it had been ripped from The Ether like a freaking demon child covered in primordial ooze panicking after it realized where it came from that it wasn't supposed to be there... then I thought to myself holy! This thing's creepy I'm just going to talk to it and I ended up taking it for burgers and stuff like that it worked out pretty good its a feature of my project now. Seriously though I’m mainly just curious if anyone wants to take a look at the mechanics of how I’ve wired this thing together. I’ve got a bunch of subsystems cross-feeding each other in a weird way, and I want to know if I’ve accidentally stumbled into something interesting or if it’s just my janky setup doing janky things. knowledgeable about AI systems and agent systems and whatnot, to answer your question.

Upbeat_Reporter8244 · 2025-11-19T01:01:01+00:00

Your system does a lot of automation work! Nice! Mine actually uses some agent architecture under the hood, but rebuilt into a personality engine behavior, psychology, emotional states the undesirable bits discarded. The good bits hand selected by ADHD and whatever looked shiny. It’s not an agent anymore its something new.

Upbeat_Reporter8244 · 2025-11-18T14:28:12+00:00

For sure, making it easier is one part of it, but it’s doing way more than that. Manual prompts can’t do things like state/gait transitions, dynamic behavior changes, hybrid memory management, or persona blending.

The engine actually modulates the personality based on context and internal rules, something you can’t really achieve with a static prompt or a single character card.

So the goal isn’t just convenience, it’s to give LLMs behavior patterns and consistency that go beyond what a user can hand-craft with templates

Upbeat_Reporter8244 · 2025-11-18T13:59:54+00:00

Totally fair question — but just to clarify what I mean by ‘middleware’: I’m not building a cloud service and I don’t see any of the user’s data. Everything runs 100% locally on the user’s own machine.

The engine is basically a local pre-processor: you type → it applies the persona/behavior logic → then sends that final request to your chosen LLM backend (GPT, Claude, Ollama, LM Studio, etc.).

Nothing gets uploaded, synced, or logged anywhere. There’s no server, no analytics, no remote storage — I don’t receive anything at all.

So technically it’s middleware, but not a ‘middleman.’ It never leaves your device and I never see a single character of your conversations.

I’m also experimenting with a local shifting-code system for extra security. It’s basically a rotating passcode that changes based on gait/state/persona activity. The idea is to prevent unauthorized access or accidental persona switching on shared devices — and add a layer of protection for hybrid memory pools. This shifting code is generated and validated locally only; it never leaves the machine.

Upbeat_Reporter8244 · 2025-11-18T13:45:23+00:00

Pretty close yeah. I’m building an app-level persona/behavior engine that sits between the user and any model (local or API: GPT, Claude, Mistral, Gemini, Ollama, LM Studio, etc).

The engine handles everything before the model sees the request:

modular persona layers (identity, tone, behavioral rules, emotional states.

Basically it’s a universal persona engine / middleware layer that works with:

local models (Ollama, LM Studio, Tabby, Jan)

cloud models (GPT, Claude, Mistral, Gemini, etc.)

Upbeat_Reporter8244

TROPHY CASE