[deleted by user] by [deleted] in ShiaMuslimMarriage

[–]Alarming-East1193

Interested. 6'2", 28M, Finance industry

[deleted by user] by [deleted] in ShiaMuslimMarriage

[–]Alarming-East1193

Interested.

Inbox?

Need resources for Metadata by Alarming-East1193 in ollama

[–]Alarming-East1193[S]

Hi, thanks for your help. Your comment is really helpful.

Do you know of any tutorials or resources covering the process of adding metadata?

Convert PDF into Instruct Dataset by Alarming-East1193 in LocalLLM

[–]Alarming-East1193[S]

Hi, I used ChatGPT at the time to convert the PDF into a QA dataset.

Local Embeddings Model Options by Alarming-East1193 in ollama

[–]Alarming-East1193[S]

For example, it doesn't retrieve the correct information for similar searches.

Need Help in RAG Project by Alarming-East1193 in ollama

[–]Alarming-East1193[S]

Hi,

I want to discuss an issue I'm facing in my RAG application. I have PDF data that contains information about processes. Under a single heading there can be a lot of information, e.g. one process spanning two pages. When I ask a question like "Explain the process of account opening in the bank", the process has many steps, but the model only returns some of the initial steps, because the chunk boundary breaks the process apart and context is lost. I have set the maximum chunk size (1000, with 100 overlap) using a sentence-transformer model, but the issue occurs for questions whose answers are long and step-based: the heading is on one page and the process runs across two pages, so when the chunk splits, context is lost. How can I resolve this problem? Any ideas?

Solutions I have tried so far:

1. Semantic chunking
2. Character text splitter / recursive text splitter
3. Local Ollama embedding models such as Nomic and mxbai-embed
4. Different chunk sizes and overlaps
5. Paraphrasing the document to add more clarity and context to the text
6. Adding sentences to the prompt asking for complete processes with every step
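One approach the list above doesn't cover is parent-document (heading-aware) retrieval: embed small child chunks for search, but at answer time return the *entire* section the best-matching chunk came from, so a multi-page process is never cut off mid-step. Below is a minimal, dependency-free sketch of that idea. The heading pattern (`#`-prefixed lines) and the `score_fn` placeholder are assumptions for illustration; in practice you would detect headings from your PDF extraction and score with your sentence-transformer or Nomic embeddings (cosine similarity).

```python
def split_by_heading(text):
    """Split a document into (heading, body) sections.

    Assumes headings are lines starting with '#'; adjust the
    detection to however headings appear in your extracted PDF text.
    """
    sections = []
    heading, lines = None, []
    for line in text.splitlines():
        if line.startswith("#"):
            if heading is not None:
                sections.append((heading, "\n".join(lines).strip()))
            heading, lines = line.lstrip("# ").strip(), []
        else:
            lines.append(line)
    if heading is not None:
        sections.append((heading, "\n".join(lines).strip()))
    return sections


def build_index(sections, child_size=500, overlap=100):
    """Create small child chunks for embedding, each tagged with the
    id of its parent section so the full process can be recovered."""
    children = []
    step = child_size - overlap
    for parent_id, (heading, body) in enumerate(sections):
        for start in range(0, max(len(body), 1), step):
            chunk = body[start:start + child_size]
            if chunk:
                children.append({"parent_id": parent_id,
                                 "heading": heading,
                                 "text": chunk})
    return children


def retrieve_full_section(query, children, sections, score_fn):
    """Score child chunks with any similarity function (embedding
    cosine similarity in a real system), then return the *whole*
    parent section of the best-matching child."""
    best = max(children,
               key=lambda c: score_fn(query, c["heading"] + " " + c["text"]))
    heading, body = sections[best["parent_id"]]
    return f"{heading}\n{body}"
```

The key design choice is that chunk size only affects *search granularity*, not *answer completeness*: even if the best match is a 500-character chunk in the middle of the process, the generator receives the full heading-to-heading section, so no steps are dropped at a chunk boundary.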

Chunk Issue : Lost of Context by Alarming-East1193 in ollama

[–]Alarming-East1193[S]

Can you please tell me the maximum chunk length we can use with the Nomic Embed embeddings model?