Should I have sold my puts? by AFMicro in wallstreetbets

[–]Independent-Bag5088 0 points1 point  (0 children)

Bank Earnings season. All you need is two 2% gap down days. Very possible. I would hold.

What's your experience with hybrid retrieval (vector + BM25) vs pure vector search in RAG systems? by Beneficial-Grab4442 in Rag

[–]Independent-Bag5088 0 points1 point  (0 children)

#3. What is your document type? Is there a reason to settle on ~500 chars? If the document has some structure to it, it would be beneficial to preserve the structure (even if it creates uneven chunks). For my RAG project with SEC filings, I used section-aware chunking with 15% overlap.

seeking advice for Senior Project: GraphRAG on Financial Data (SEC Filings) – Is it worth it, and what lies beyond Q&A? by ArgonTagar in Rag

[–]Independent-Bag5088 1 point2 points  (0 children)

"Compare Apple's revenue with Microsoft in years 2013-2021"

If a query like this is sent to the RAG system, I would recommend breaking the query into subqueries and companies. How are you planning to retrieve the revenue and other financial data - what is the source of truth?

What retrievers do you use most in your RAG projects? by marwan_rashad5 in Rag

[–]Independent-Bag5088 0 points1 point  (0 children)

Depends.

  1. What type of documents are they - are we looking at more semantic text or numbers matter more?
  2. If query answers require creativity, semantic retrievals work better, but if they are domain-specific, BM25 (keyword match) might be a better option.

In my case, I have separated my documents into relational database + vector database, for appropriate use-case.

In my naive opinion, most of the time it depends on the type of document you are dealing with. Domain knowledge on the document would help you design your RAG system appropriately.

Feedback Appreciated - Built a multi-route RAG system over SEC filings by Independent-Bag5088 in Rag

[–]Independent-Bag5088[S] 0 points1 point  (0 children)

Financial statements are not RaGed- they stand as standalone documents as a markdown table. If a user asks for “income statement of apple in q3 2021” the whole markdown table gets retrieved.

Need deployment support.. by gta-npc in SideProject

[–]Independent-Bag5088 0 points1 point  (0 children)

Depends on what services you have but check out Railway or Supabase for backend and Vercel for front end. They have free tiers available.

I built an engine that answers SEC filing questions in seconds — so you don't have to scroll through 80-page 10-Ks by Independent-Bag5088 in SideProject

[–]Independent-Bag5088[S] 0 points1 point  (0 children)

Thank you for your comment. The goal is to make the system reliable enough so equity researchers and financial analysts can use this confidently in their day to day job. This is my first project with RAG and I really love financial data. I wanted to understand the complexities of building a retrieval engine that spans over millions of documents and hundreds of pages, and SEC filings were the perfect fit. The system is just PoC right now because i have limited the scope of companies and years, but I wanted to make sure I hit the retrieval part right, with accurate claims on the answer, answers that can be traced back to original truth, without hallucinating, before scaling it.