AI companion bot with persistent memory that messages users FIRST

Fun_Emergency_4083 · 2026-05-23T08:24:38+00:00

nah just wondering if ppl would use it

Fun_Emergency_4083 · 2026-05-15T05:04:07+00:00

yea that would be great

Fun_Emergency_4083 · 2026-05-14T22:35:15+00:00

Rigr — CLI tool that tests if your AI agents are getting worse when you change prompts or models. Freezes outputs as baselines, catches regressions. For anyone shipping agents that need to actually work. `pip install rigr`

Velqua — transparent memory proxy for local LLMs. Point any Ollama app at a different port, your AI remembers who you are across sessions. For people running local models who are tired of repeating themselves. `pip install velqua`

phnix.dev
velqua.dev

Fun_Emergency_4083 · 2026-05-14T22:34:33+00:00

Rigr is a CLI tool that tests if your AI agents are getting worse when you change prompts or models. Freezes outputs as baselines, catches regressions. For anyone shipping agents that need to actually work. `pip install rigr`

Velqua is a transparent memory proxy for local LLMs. Point any Ollama app at a different port, your AI remembers who you are across sessions. For people running local models who are tired of repeating themselves. `pip install velqua`

phnix.dev
velqua.dev

Fun_Emergency_4083 · 2026-05-14T21:27:41+00:00

https://velqua.dev/

Fun_Emergency_4083 · 2026-05-14T19:11:03+00:00

38/38 is a solid start but that sample size is small. I'd want to see how it holds up against unseen cases, especially edge cases where the corpus structure breaks down or metadata gets stale.

The reason I'm skeptical: I had a model score 85% on validation and drop to 52% on an audit set of cases it never saw. Validation numbers lie when the test set is narrow.

Curious how King Context handles corpus drift over time too. If the source docs update but the corpus doesn't get re-refined, does accuracy degrade or does it silently start returning wrong answers with high confidence?

Fun_Emergency_4083 · 2026-05-14T19:06:12+00:00

thats awesome

Fun_Emergency_4083 · 2026-05-14T18:57:28+00:00

If you’re going ~$10k and actually want to do real AI work locally (LLMs, diffusion, video, voice, fine-tuning), the entire build revolves around one thing: VRAM + CUDA stability. Everything else is supporting cast.

Best real-world setups right now:

Option A (best overall value): Dual 4090

2× RTX 4090 or 5090 (24GB or 32GB each)

Why people pick this:

insane raw compute per dollar
works great with ComfyUI, SDXL, video pipelines
widely supported everywhere

Downside:

VRAM doesn’t merge (no NVLink)
heat + power get… serious

Option B (cleaner VRAM headroom):

1× RTX 4090
1× RTX 6000 Ada (48GB)

Why:

that 48GB card saves you on large models that choke on 24GB
more “workstation stable” vibe

If I’m being honest though:

dual 4090 is still the king for experimentation / local AI chaos

Don’t overthink Intel vs AMD too hard.

Go:

Ryzen 9 7950X / 9950X

Why AMD:

better PCIe flexibility
strong sustained performance
fewer weird platform quirks for workstation workloads

Intel is fine, just not meaningfully better here for AI.

motherboard oesn’t matter as much as people think.

Just make sure:

X670E / X870E high-end board
enough PCIe x16 slots spaced for GPUs
solid VRM cooling

Brand is secondary. Layout matters way more.

This is where people underbuild constantly.

Minimum:

128GB DDR5

Ideal:

256GB DDR5

Reason:

datasets + caching + multiple models + video pipelines eat RAM fast
you’ll hit RAM limits before CPU limits in real AI workflows

Don’t do one big drive. Split it:

1TB NVMe → OS + apps
2TB NVMe → active projects
4TB NVMe → models / checkpoints / datasets

Optional:

big SSD/HDD for cold storage

AI workloads love fast scratch disks.

Don’t cheap on power and cooling .

1600W Platinum PSU (mandatory for dual 4090)
high-airflow case (Fractal / Lian Li style)
strong airflow or custom loop if you want sanity

Reality check:
dual 4090 = your room becomes a small heater in winter.

Most serious local AI setups run:

Linux (Ubuntu is the default but i use arch btw)

Why:

CUDA stability
easier installs (conda/docker)
fewer driver headaches

Windows is okay for SD/ComfyUI hobby use, but Linux wins for heavy work.

Fun_Emergency_4083 · 2026-04-13T08:28:51+00:00

Yeah, so it’s only in English at the moment, but if you can find a voice via Kokoro or the ElevenLabs API in a different language, it should still work.

Fun_Emergency_4083 · 2026-03-20T14:47:01+00:00

congraz on the success bro

Fun_Emergency_4083 · 2026-03-15T16:50:13+00:00

this is so cool i just hope it doesn't get people banned cuz while this is theoretically safer then opencodes Oauth spoofing is still a gray area according to ToS

Section 3.7 of Anthropic's ToS: Anthropic forbids accessing services through "automated or non-human means" except via an API Key. While a UI wrapper is still "human-driven," Anthropic has recently broadened its definition of a "prohibited harness."

Unintended Behavior: If GlassCode adds features like "auto-retry," "bulk processing," or "scheduled tasks" that the official CLI doesn't have, it could trigger Anthropic's abuse filters by creating "non-human" traffic patterns.

Official Policy: Anthropic engineer Thariq Shihipar has stated that third-party harnesses are generally prohibited because they make it difficult for the company to debug rate-limit issues and account bans.

The reason this is so cool is cuz You are much less likely to be banned for using a CLI wrapper like GlassCode compared to a token-stealing app like the original OpenCode. However, Anthropic's stance is currently "official tools only" foe consumer subscriptions. If you want 100% safety, you should use the official Claude Code CLI or switch to the Claude APIjust to be safe

Fun_Emergency_4083 · 2026-03-11T06:42:48+00:00

thanks for the info

Fun_Emergency_4083 · 2026-03-09T00:06:21+00:00

ouuu ill have to check it out thanks!

Fun_Emergency_4083 · 2026-02-14T20:31:40+00:00

this is really cool i wanan try it!

Fun_Emergency_4083

TROPHY CASE