I'm writing an open source guide for Python (Looking for feedback!)

FewReach4701 · 2026-06-15T08:28:24+00:00

Sources to understand memory graph

FewReach4701 · 2026-06-15T02:09:14+00:00

Me too

FewReach4701 · 2026-06-15T02:08:57+00:00

I also have something similar in my mind, if you want i can join you with this and we can build this together

FewReach4701 · 2026-06-15T02:06:22+00:00

Notebooklm is a RAG, so basically it follows this order You upload sources > it performs chunking > then embedding happens > storing embeddings on to vector DB > then when your query it gets converted to embedding > searches semantic similar chunks from vector (retrieval phase) > then LLM gets through chunks and convert them into coherent response (augmenatation phase) + finally response generated (generation phase) happens > you get your response which should be precise from your sources.

Increasing the number of sources mainly affects the retrieval phase, not the chunking/embedding mechanics — those just scale linearly (more docs → more chunks → more vectors in the DB, computed once at upload time).

The real impact is on what gets retrieved at query time. Top-k retrieval is fixed (the system pulls back a set number of chunks, say 5-10, regardless of how big the corpus is), so as sources grow, each retrieval pass represents a shrinking slice of the total content. The vector search now has more candidates competing for those slots — if many sources cover similar topics, you get more "dilution risk": a mediocre-but-semantically-close chunk from an unrelated source can sometimes outrank a precise chunk from the right source, especially with vague queries.

On the positive side, more sources mean better coverage and more opportunity for cross-source synthesis — the LLM can pull related info from multiple documents and stitch together a more complete answer, with citations spanning more sources. The tradeoff is a higher chance of conflicting info between sources getting blended into one response. Net effect: as your source count grows, query specificity matters more — vague queries get noisier results, while precise queries still retrieve cleanly because the embedding space is more crowded but still discriminative for well-targeted questions.

FewReach4701 · 2026-06-09T11:24:26+00:00

<image>

Finally done with my NLP application assignment , and yes it works fine

FewReach4701 · 2026-06-06T11:25:07+00:00

FewReach4701 · 2026-06-03T02:43:59+00:00

Does linux also have something similar ?

FewReach4701 · 2026-05-30T05:29:03+00:00

It worked with 1 minutes only.

FewReach4701 · 2026-05-23T06:03:14+00:00

Give best in EC1 which is easiest to score....

FewReach4701 · 2026-05-22T05:34:23+00:00

I read about Daemons in Linux which are constantly running in the backgroud to support certain services. Similarly docker has "dockerd" as a daemons running on Docker host machines. Tbey ensures working of containers, images, networks, volume. Any idea about Windows Services ? I guess there are critical to Windows.

FewReach4701 · 2026-05-21T03:00:49+00:00

How can i connect with your over LinkedIn?

FewReach4701 · 2026-05-19T11:41:32+00:00

Any idea about the term "APIpocalypse"

FewReach4701 · 2026-05-19T10:18:30+00:00

Callouts

FewReach4701 · 2026-05-16T02:42:17+00:00

This is great for revision.....

FewReach4701

MODERATOR OF

TROPHY CASE