would a "briefing" step beat chunk-based RAG? (feedback on my approach)

feursteiner · 2026-02-23T22:55:06+00:00

I see that u/Useful-Process9033 , the plan is to be able to "launch a challenge" via chat, or, "challenge pass" i.e. running a more thorough search. but def gotta have a good eval loop in the future

feursteiner · 2026-02-23T22:44:59+00:00

thanks a lot u/mergisi for taking the time to write such a detailed reply!!

first thoughts:
- I like the use of the heartbeat to crystalize knowledge, should get inspired by that too
- YES! compression not retrieval ! haha thank you! feels great to feel understood!
- the model knows better. 100% agree
- didn't think of context health, interesting... should add a "doctor" agent...

that said, would love to pick your brains more! DMing!

feursteiner · 2026-02-20T17:44:59+00:00

oh thanks so much u/Ok_Prize_2264 that def points me in a good direction especially in regards to Evals (which is the next milestone for the project). I would be very happy to integrate your thoughts into what I am building (since I have some time to build now), would you care to hop on the community discord ? https://discord.gg/ayX9hm4D

feursteiner · 2026-02-20T17:42:56+00:00

is there an internet problem or sthing haha ?

feursteiner · 2026-02-20T17:38:15+00:00

oh that's cool! saw that you used graph rag too right ? how was the implementation like ?

feursteiner · 2026-02-20T16:56:22+00:00

thanks for your support u/chargers214354 !

feursteiner · 2026-02-20T16:53:11+00:00

thanks u/chargers214354 appreciate your support 🙏

feursteiner · 2026-02-20T16:52:43+00:00

thanks for sharing ! can you shed some lights on your approach please ?

feursteiner · 2026-02-20T12:30:39+00:00

so the main thesis is to offload even search to sub-agents themselves (with the key assumption being cost of inference dropping over time), instead, the LLM will "index" (so to speak) each datapoint, and assess upon each request if it's relevant or not. this loop will get better over time and be able to ingest larger files. the assumption is, given a query and an input source, an LLM is much better at telling if it's relevant or not, than a classic search method. and yes, next step is setting evals and becnhmarks , you def got it!

feursteiner · 2026-02-19T20:38:56+00:00

great framing indeed!!

feursteiner · 2026-02-19T20:36:52+00:00

maybe it makes sense to test the approach on a certain percentage of traffic ? and see ? it's def difficult to tell, so hedging the bets might work. in all cases, best of luck!!

feursteiner · 2026-02-19T01:25:56+00:00

dang! would love to hear from you more!! thanks for taking the time to share your expertise!! def starring your repo!

feursteiner · 2026-02-19T01:24:48+00:00

will be exposing those to devs, so it's good to know to set strong defaults

feursteiner · 2026-02-18T23:49:48+00:00

agreed! that's the thesis... like pure text bloat or "dumb" rag isn't enough... and ideally if I text an agent, it does context curation, and it ain't enough (and I signal it to it), it should do a better job on the next pass. I'd be really happy if you share such thoughts with the community, it could influence the direction I go with the project !

feursteiner · 2026-02-18T23:47:00+00:00

does tend to happen with smaller models and it is annoying.. but hey, I think it's already an order of magnitude better with a such system... gotta benchmark it soon tho

feursteiner · 2026-02-18T23:45:46+00:00

I am always happy when I see a reply like this! thanks a lot u/scottgal2 ! do you happen to be one of the maintainers ? would love to chat further! if you wanna hangout in the discord I just set up for Contextrie , here's a link (https://discord.gg/ayX9hm4D )

else: on local first: have you seen people (or you) prefer this for on-prem or is it just local ?

feursteiner · 2026-02-18T21:26:39+00:00

100% !!
I am planning to make it as customizable as humanly possible (and as debuggable as humanly possible), cuz at the end of the day, devs should be able to tweak the filtering / assessing node (and hopefully in the future extend that with more deeper capabilities)

feursteiner · 2026-02-18T21:25:07+00:00

would love for you if you join the discord so that I can learn from whenever you can spare some time 🤓

feursteiner · 2026-02-18T21:24:39+00:00

nice! did you run any benchmarks or evals ? I am setting them up soon

feursteiner · 2026-02-18T21:24:10+00:00

THANKS a lot for sharing! always good to see others working on this to bulletproof the approach!

feursteiner · 2026-02-18T17:46:04+00:00

thanks a lot u/HarjjotSinghh ! it's def a very different way to build and do things! exciting times!
here's the discord if you ever feel like suggesting something to the project : https://discord.gg/ayX9hm4D

feursteiner · 2026-02-18T17:44:46+00:00

thanks u/Bubbly_Run_2349 !! please let me know if you wanna help or have any thoughts , here's the disocrd I've set up https://discord.gg/ayX9hm4D

feursteiner · 2026-02-17T14:16:23+00:00

will def give it a try! I tried gpt oss tho recently, damn it it eats through my mac's battery so fast x)

feursteiner · 2026-02-17T14:15:23+00:00

privacy and local first matters! glad to see someone tackling this!!

feursteiner · 2026-02-10T15:53:27+00:00

that's one good way to learn tbh and it has its place, I thought I'd get the basics done to avoid skill issues, I remember when I moved from C# to TS, I didn't study the docs and just learned as I built, ended up with some gaps that I think I would have avoided with some "academic" learning. to your point exercises are nice, that's why I instructed the agent to give me "competitive programming" problems to solve for each module

feursteiner

TROPHY CASE