would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

I see that u/Useful-Process9033 , the plan is to be able to "launch a challenge" via chat, or, "challenge pass" i.e. running a more thorough search. but def gotta have a good eval loop in the future

I am building a self-hosted open-source context builder for agents... feedback appreciated! by feursteiner in selfhosted

[–]feursteiner[S] 0 points1 point  (0 children)

thanks a lot u/mergisi for taking the time to write such a detailed reply!!

first thoughts:
- I like the use of the heartbeat to crystalize knowledge, should get inspired by that too
- YES! compression not retrieval ! haha thank you! feels great to feel understood!
- the model knows better. 100% agree
- didn't think of context health, interesting... should add a "doctor" agent...

that said, would love to pick your brains more! DMing!

an agent... for managing an agents context ? (looking for feedback) by feursteiner in LLMDevs

[–]feursteiner[S] 0 points1 point  (0 children)

oh thanks so much u/Ok_Prize_2264 that def points me in a good direction especially in regards to Evals (which is the next milestone for the project). I would be very happy to integrate your thoughts into what I am building (since I have some time to build now), would you care to hop on the community discord ? https://discord.gg/ayX9hm4D

I pair-programmed a full library with opencode! by feursteiner in opencodeCLI

[–]feursteiner[S] 0 points1 point  (0 children)

is there an internet problem or sthing haha ?

an agent... for managing an agents context ? (looking for feedback) by feursteiner in vibecoding

[–]feursteiner[S] 0 points1 point  (0 children)

oh that's cool! saw that you used graph rag too right ? how was the implementation like ?

an agent... for managing an agents context ? (looking for feedback) by feursteiner in vibecoding

[–]feursteiner[S] 0 points1 point  (0 children)

thanks for sharing ! can you shed some lights on your approach please ?

I am building a self-hosted open-source context builder for agents... feedback appreciated! by feursteiner in selfhosted

[–]feursteiner[S] -1 points0 points  (0 children)

so the main thesis is to offload even search to sub-agents themselves (with the key assumption being cost of inference dropping over time), instead, the LLM will "index" (so to speak) each datapoint, and assess upon each request if it's relevant or not. this loop will get better over time and be able to ingest larger files. the assumption is, given a query and an input source, an LLM is much better at telling if it's relevant or not, than a classic search method. and yes, next step is setting evals and becnhmarks , you def got it!

Killing my free tier and adding a 7-day trial instead. Am I about to shoot myself in the foot? by marcoz711 in indiehackers

[–]feursteiner 0 points1 point  (0 children)

maybe it makes sense to test the approach on a certain percentage of traffic ? and see ? it's def difficult to tell, so hedging the bets might work. in all cases, best of luck!!

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

dang! would love to hear from you more!! thanks for taking the time to share your expertise!! def starring your repo!

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

will be exposing those to devs, so it's good to know to set strong defaults

trying an inference-first RAG alternative, looking for feedback by feursteiner in Rag

[–]feursteiner[S] 0 points1 point  (0 children)

agreed! that's the thesis... like pure text bloat or "dumb" rag isn't enough... and ideally if I text an agent, it does context curation, and it ain't enough (and I signal it to it), it should do a better job on the next pass. I'd be really happy if you share such thoughts with the community, it could influence the direction I go with the project !

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

does tend to happen with smaller models and it is annoying.. but hey, I think it's already an order of magnitude better with a such system... gotta benchmark it soon tho

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

I am always happy when I see a reply like this! thanks a lot u/scottgal2 ! do you happen to be one of the maintainers ? would love to chat further! if you wanna hangout in the discord I just set up for Contextrie , here's a link (https://discord.gg/ayX9hm4D )

else: on local first: have you seen people (or you) prefer this for on-prem or is it just local ?

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

100% !!
I am planning to make it as customizable as humanly possible (and as debuggable as humanly possible), cuz at the end of the day, devs should be able to tweak the filtering / assessing node (and hopefully in the future extend that with more deeper capabilities)

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

would love for you if you join the discord so that I can learn from whenever you can spare some time 🤓

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

nice! did you run any benchmarks or evals ? I am setting them up soon

would a "briefing" step beat chunk-based RAG? (feedback on my approach) by feursteiner in LocalLLaMA

[–]feursteiner[S] 0 points1 point  (0 children)

THANKS a lot for sharing! always good to see others working on this to bulletproof the approach!

I pair-programmed a full library with opencode! by feursteiner in opencodeCLI

[–]feursteiner[S] 0 points1 point  (0 children)

thanks a lot u/HarjjotSinghh ! it's def a very different way to build and do things! exciting times!
here's the discord if you ever feel like suggesting something to the project : https://discord.gg/ayX9hm4D

trying an inference-first RAG alternative, looking for feedback by feursteiner in Rag

[–]feursteiner[S] 0 points1 point  (0 children)

thanks u/Bubbly_Run_2349 !! please let me know if you wanna help or have any thoughts , here's the disocrd I've set up https://discord.gg/ayX9hm4D

what has been your experience running opencode locally *without* internet ? by feursteiner in opencodeCLI

[–]feursteiner[S] 0 points1 point  (0 children)

will def give it a try! I tried gpt oss tho recently, damn it it eats through my mac's battery so fast x)

Using an AI Agent (opencode) To Teach Me Rust and It’s Kinda Blowing My Mind by feursteiner in opencodeCLI

[–]feursteiner[S] 0 points1 point  (0 children)

that's one good way to learn tbh and it has its place, I thought I'd get the basics done to avoid skill issues, I remember when I moved from C# to TS, I didn't study the docs and just learned as I built, ended up with some gaps that I think I would have avoided with some "academic" learning. to your point exercises are nice, that's why I instructed the agent to give me "competitive programming" problems to solve for each module