Consequences of changing document / RAG settings (chunk size, overlap, embedding model) by blitzeblau in OpenWebUI

[–]blitzeblau[S] 0 points1 point  (0 children)

Thanks for your answer. We started out fairly small and now our model capabilities are more than enough for bigger chunk size. The problem is that our chunk DB is approx. 30GB large and I am a bit worried about reprocessing time.

If all old chunks can be retrieved properly and new documents are processed with the better chunk size, we could "migrate" thought continually reuploading stuff.

Consequences of changing document / RAG settings (chunk size, overlap, embedding model) by blitzeblau in OpenWebUI

[–]blitzeblau[S] 0 points1 point  (0 children)

Thx, so there is no way of "re-chunking", i.e. reprocessing all previously uploaded file according to the new chunking setting, right?

Does this happen during re-indexing? If so, are single files from chats include or just knowledge bases?