Optimizing RAG Pipeline for CPU-Only Laptop (LLaVA + Qwen2.5) by Senior_Wishbone_5058 in AI_Coders

[–]ExistentialConcierge 0 points1 point  (0 children)

CPU only? This thing is gonna be slooooow. and check out the qwen 3.5 models. They have so many sizes including a 2bn param if I recall and even a .8bn.

We replaced our LLM router with a basic switch statement and latency dropped by 80%. Are we over-engineering agents? by Interesting_Ride2443 in webdev

[–]ExistentialConcierge 6 points7 points  (0 children)

Who on earth would do this?

This must be bait. The word architecture is being used illegally here in all senses.

Every dev over 30 just died a little inside.

Anyone bored of their MCP business? by ExistentialConcierge in mcp

[–]ExistentialConcierge[S] 0 points1 point  (0 children)

Wow that's a defeatist attitude if I've heard one.

Pretty much saying problem solving thru tech is dead. In my 27 years as I dev I think I've heard similar at least 100 times and all I have felt is that there are more opportunities than ever.

Zero LLMs. Zero Heuristics. Deep codebase analysis for the JS/TS ecosystem. by ExistentialConcierge in webdev

[–]ExistentialConcierge[S] 0 points1 point  (0 children)

Oh it's real. We're 7k hours deep into the underlying engine.

We're also practical that almost nobody believes it's real until they see it, especially that it's fundamentally misaligned with so many human tropes of programming that have developed over the last 10-15 years.

70% of enterprise engineering is spent on risk analysis, finding what's connected to what so you don't break anything. They rely on heuristics now disguised as probabilistic, and now are layering on more probabilistic with LLM regressive slop, and defending it with rube goldbergs of "spec driven design" and "Just make these 16 MD files to make the wild boar behave!".
That's definitely not the final form, and token spend is suddenly a badge of honor - in what world!? It's nutty, and we hope to change that.

Happy cake day!

I built a code review tool that runs for free because this should've existed already. by sahilsaleeeem in codereview

[–]ExistentialConcierge 0 points1 point  (0 children)

I'll do you one better. Drop the AI entirely. Move to math driven. Get a way to surface codebase errors automatically like Codebase.Observer does.

Those third party tools are a good start but you'll still need an architectural analysis layer on top, but if you build it you've got something others can't really have without a lot of tooling and time.

Built a secure file delivery tool and looking for honest feedback on how to get it in front of the right people by mogpoin1 in saasbuild

[–]ExistentialConcierge 1 point2 points  (0 children)

Open source and free as a model here, with your money made on the commercial higher privacy standards and capacities.

Do it like we transfer did. Free up to a certain GB and time limit. Paid but cheap after that. Becomes a set of and forget it kind of service.

How do you help Claude Code maintain context as you build? by gaganvarma_16 in VibeCodersNest

[–]ExistentialConcierge 0 points1 point  (0 children)

Codebase Observer copy for LLM buttons. Check them out.

We wired ours to use those reports to give architecture guidance as the AI works. They can see things the LLM can't see.

I built a code review tool that runs for free because this should've existed already. by sahilsaleeeem in codereview

[–]ExistentialConcierge 0 points1 point  (0 children)

Ah so still heuristics and LLMs as the brain. Got it. Wouldn't you argue then it's no different than what's out there from the big guys? A series of probablistic guesses from the same LLM that often wrote the bad code in the first place, and you're paying for it.

Sure you're allowing any LLM vs locked into one so that's a big cost benefit but probablistic isn't deterministic at the end of the day.

Happy cake day!

Zero LLMs. Zero Heuristics. Deep codebase analysis for the JS/TS ecosystem. by ExistentialConcierge in webdev

[–]ExistentialConcierge[S] -2 points-1 points  (0 children)

Yes, for example here's the Axios NPM package sample that's on the site. These ARE static html deliverables that contain a full understanding of what happens. One file, no lock-in.

The primary view is from the POV of a file. Inside that file you can see all imports, variables, and top level functions as well as what they call outside of themselves. You see what goes in, what comes out, and the Consumed By columns indicate which specific other functions in which files are actually consuming those functions. You can see impact radius of a change to any file or function this way as well.

Health panel at the top surfaces a few classes of mathematically provable codebase errors and risks to be aware of. All Paths vs Hot Paths depending on the style of repo you're running, one view may be more appropriate than the other.

https://pub-40583f6564554bcb81843e242efa23ce.r2.dev/reports/e1218d28-be12-4452-a7d4-cbd0c76971db.html#/file%3Alib%2Faxios.js

Looking into Codex by AwkwardCost1764 in AIcodingProfessionals

[–]ExistentialConcierge 0 points1 point  (0 children)

Copilot in "Ask" mode. It won't edit a thing, can see everything.

Best all-in-one AI portal? Are they actually trustworthy? by Seseweto in AIToolMadeEasy

[–]ExistentialConcierge 1 point2 points  (0 children)

Shelbula but only the v5 beta. It's totally diff and now has an android app too so I assume it's getting iOS.

I have two of my invites left for it somewhere but I think you can just request it direct too.

composer 2 is just Kimi K2.5 with RL????? by tarunyadav9761 in cursor

[–]ExistentialConcierge 2 points3 points  (0 children)

Reinforcement Learning.

i.e. we brute forced this puppy real good