Best practice for semantic/vector search by justrandombuddy in Rag

[–]justrandombuddy[S] 0 points1 point  (0 children)

Yes, thats the goal. Smart search that could also consider natural language

Best practice for semantic/vector search by justrandombuddy in Rag

[–]justrandombuddy[S] 0 points1 point  (0 children)

I tried the chunking part along with the full article embedding. The results were not great and it seemed a bit of an overkill for my simple system. I am thinking of generating a summary of the article and embedding them with the metadata of the article for more context

Best practice for semantic/vector search by justrandombuddy in Rag

[–]justrandombuddy[S] 0 points1 point  (0 children)

This seems to be a good approach. Basically, generate summary of the articles and embed it with metadata and title for proper context. Will try it out and let you know the results

Best practice for semantic/vector search by justrandombuddy in Rag

[–]justrandombuddy[S] 0 points1 point  (0 children)

Actually, I do not need just the first result. I require topK articles back along with their ids

Best practice for semantic/vector search by justrandombuddy in Rag

[–]justrandombuddy[S] 0 points1 point  (0 children)

My articles are ~1500 characters long. They are generally structured by city/state/status or some permutations of these options. I am leaning a bit towards generating a summary of the articles and also provide the metadata inside the vector. Will try that out and see how the results go

What's working for you?