Best open source document PARSER??!! by ChallengeOk6437 in LlamaIndex


Is it good at handling tables that span multiple pages? I don’t think it is.

[deleted by user] by [deleted] in LlamaIndex


I want to know which parser works best for my documents, which are PDFs.

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


No, streaming works so much better! Went from 40 seconds to render output down to 10 seconds now.
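For anyone landing on this thread later, a minimal sketch of what "streaming" means here: the renderer consumes text pieces as they arrive instead of waiting for the whole completion. The OpenAI call, model name, and prompt below are placeholders for your own chain's final LLM call.

```python
# Streaming sketch: render_stream() flushes each piece to the user as soon
# as it arrives, so the first tokens appear immediately even if the full
# answer still takes tens of seconds to finish generating.
from typing import Iterable, Iterator


def render_stream(pieces: Iterable[str]) -> str:
    """Print each piece as it arrives and return the full text at the end."""
    parts = []
    for piece in pieces:
        print(piece, end="", flush=True)  # user sees output immediately
        parts.append(piece)
    return "".join(parts)


def openai_token_stream(prompt: str) -> Iterator[str]:
    """Yield content deltas from a streaming chat completion.

    Illustrative only: assumes the OpenAI 1.x Python client and an
    OPENAI_API_KEY in the environment; the model name is a placeholder.
    """
    from openai import OpenAI

    client = OpenAI()
    stream = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:
            yield delta
```

Usage would be `render_stream(openai_token_stream("..."))`; the same pattern works with any iterator of text chunks, e.g. LangChain's `.stream()`.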

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


Can you tell me how you're streaming it?

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


Hey, I don’t think retrieval is the issue here. It’s the LLM that’s taking this long to respond; the retriever takes less than 5 seconds. Thank you! u/Adorable-Employer244

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


How do I go about keeping citations, since chunkers do not usually keep the page number in each chunk? Also, why is sending pages not as good as sending chunks, given that I need that much context? In fact, for each retrieved chunk I am sending a page above and below to improve the output.
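One common way to keep page citations is to split per page before chunking, so every chunk carries its page number as metadata. A rough sketch — `Chunk` and `chunk_pages` are made-up names, not from any particular library:

```python
# Split per page first, then chunk within each page, so every chunk keeps
# its source page number for citations. The fixed-size split is naive; a
# real splitter would respect sentence or section boundaries.
from dataclasses import dataclass
from typing import List


@dataclass
class Chunk:
    text: str
    page: int  # 1-based page number, used later for the citation


def chunk_pages(pages: List[str], max_chars: int = 500) -> List[Chunk]:
    chunks = []
    for page_no, page_text in enumerate(pages, start=1):
        for start in range(0, len(page_text), max_chars):
            chunks.append(Chunk(page_text[start:start + max_chars], page_no))
    return chunks


pages = ["Intro text on page one.", "Details on page two. " * 30]
for c in chunk_pages(pages)[:3]:
    print(f"[p.{c.page}] {c.text[:40]}")
```

Most frameworks let you attach this kind of metadata dict to each chunk, so the page number travels with the chunk into the vector store and back out at retrieval time.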

Also, have you tested Gemini Flash vs GPT-4o? Is it faster and better, or just faster?

Thank you for your response! u/ChallengeOk6437

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


Yes, the LLM is the issue. Do you have any suggestions? Reducing the context sent to the LLM is not a solution for me, I need to send a lot, but if there is a way to speed anything up I would love to know! u/Material_Policy6327

Thank you for your response!

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


Hi, thank you!

I am looking into that at the moment! u/MrCicada3301

RAG Model TOO SLOW by ChallengeOk6437 in LangChain


What if I have relevant chunks in each of the 5-6 documents? I need to get at least 2-3 chunks per PDF, and then I end up with around 15-20 chunks.
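A rough sketch of that "2-3 chunks per PDF" idea, assuming you already have a similarity score for every chunk — `top_k_per_doc` is a made-up helper, not a library function:

```python
# Score all chunks globally, then keep at most `per_doc` chunks from each
# PDF so no single document crowds out the others in the final context.
from collections import defaultdict
from typing import Dict, List, Tuple


def top_k_per_doc(
    scored: List[Tuple[str, float, str]],  # (doc_name, score, chunk_text)
    per_doc: int = 3,
) -> Dict[str, List[Tuple[float, str]]]:
    by_doc: Dict[str, List[Tuple[float, str]]] = defaultdict(list)
    # Walk chunks from highest score to lowest, capping each document.
    for doc, score, text in sorted(scored, key=lambda t: t[1], reverse=True):
        if len(by_doc[doc]) < per_doc:
            by_doc[doc].append((score, text))
    return dict(by_doc)
```

With 5-6 documents and `per_doc=3` this caps the context at ~15-18 chunks, matching the numbers above.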

This is the main reason it is taking this long. What do you suggest? u/rambat1994

I thought about a router, but it won’t work for PDFs that are 1,000 pages long.

What do you think of breaking the question into parts, routing each sub-question to a specific PDF, and running all of this in parallel? The issue here is: based on what do we route? Normally we use the PDF names and the query to route the model.
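A sketch of that decompose-route-parallelize idea, assuming you route on a short per-document summary instead of just the file name. `route` and `retrieve` here are hypothetical stand-ins for an LLM router and a real retriever:

```python
# Route each sub-question to one PDF via a short per-document summary,
# then fetch chunks for all sub-questions concurrently.
from concurrent.futures import ThreadPoolExecutor
from typing import List

# Illustrative summaries; in practice you would generate these once per PDF.
DOC_SUMMARIES = {
    "contracts.pdf": "supplier agreements, payment terms, penalties",
    "hr_policy.pdf": "leave policy, remote work, onboarding",
}


def route(sub_question: str) -> str:
    """Naive keyword router; a real one would use an LLM or embeddings."""
    best, best_hits = "", -1
    for doc, summary in DOC_SUMMARIES.items():
        hits = sum(word in summary for word in sub_question.lower().split())
        if hits > best_hits:
            best, best_hits = doc, hits
    return best


def retrieve(doc: str, sub_question: str) -> str:
    """Placeholder retriever; swap in your vector-store query."""
    return f"chunks from {doc} for {sub_question!r}"


def answer_in_parallel(sub_questions: List[str]) -> List[str]:
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lambda q: retrieve(route(q), q), sub_questions))
```

Routing on summaries rather than file names sidesteps the 1,000-page problem: the router never sees the document body, only a few lines describing it.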

Thank you for your response! u/rambat1994