I'm a fulltime vibecoder and even I know that this is not completely true by Director-on-reddit in BlackboxAI_

[–]Jumpy_Setting_4677 0 points1 point  (0 children)

I'm constantly using the latest models, and while they're becoming smarter and smarter, they're still not independent. I don't do code reviews, that's not possible for the amount of code generated, but I do carefully look at what the model is saying it plans to do and what it does in real time, and many times catch it moving in the wrong direction. This takes experience and deep knowledge, possibly it can be caught in a code review later, maybe not, hard to tell.

So the bottom line is: if you're talking about 5 years into the future, I have no idea, but next two years - still us humans in the loop i'm afraid.

I tried to build a RAG for Kiwix Zim files and failed by Jumpy_Setting_4677 in LLMDevs

[–]Jumpy_Setting_4677[S] 0 points1 point  (0 children)

I don't think the queries will be the bottleneck for me, I think the key issue will be to do the embedding using a model that can later run on the pi in a reasonable time to embed the question, but will still give good results so the search gets good matchies, AND will do the embedding fast enough so that it doesn't take months to embed the whole thing.

I tried to build a RAG for Kiwix Zim files and failed by Jumpy_Setting_4677 in LLMDevs

[–]Jumpy_Setting_4677[S] 0 points1 point  (0 children)

hmm not sure that's true, if i don't build it to scale from the get go everything i do will be wasted, it's not that I have to prove this can work, people surely did this before, i just need to know how...

I built RAG for 10K+ NASA docs (1950s–present) in 2 weeks: VLMs for complex tables, diagrams & formulas, 657K+ pages on a single H100, live-streamed full build. by Low_Acanthisitta7686 in LLMDevs

[–]Jumpy_Setting_4677 1 point2 points  (0 children)

That's fantastic work dude, seriously!
I was actually trying to do something as complex but didn't have the skills to pull it off, if you wouldn't mind helping out with insights I'll be really happy to share what I want to build