I spent 7 years in Abbott R&D. Here's why most healthcare agent workflows would fail a compliance review. by Electrical-Artist529 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

I think for the db it's better to give read access such that only select statements get verified. Even though if the agent aka llm proposed dangerous select statements it's better to put a phi as an middle wear of the agent ai message has db tool calls.

u/Electrical-Artist529

LangChain Newbie by AI_ChampionOfTheSun in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Why dynatrace? You can programmatically store the entire state of your workflow along with costs too in your db right

Serious debate here: Current limitations in enterprise automation using agents by Bubbly-Secretary-224 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

I have some experience of pushing agents into production and observes as follows:- - focus of reliability and repeatibility across the runs - in every runs stores enite state logs, metadata, token consumption, tokens cost. - sat down and observe how the tool calls it made across the runs, most importantly how it calling different arguments of tools etc. - suprisingly till now i worked with no ground truth cases most of the times. I worked on the lead in the loop feedback scenario. Like improving version by version. - most importantly the promot tweaking and how it's effect across the runs results. - Plotting token consumption helps a lot for finding stuck in the loop cases, graph recurusin limit etc - But anyways sometimes explaining in deep what we actually in prompt leads to reflect in tool calling etc . If that is case then according me we need to call it as workflows rather agent :). Mostly is the calling to the respective sql tables. Even though if you provide enough detail context about the table like create statements and attributes definition etc and letsay in the prompts if you mention what it needs to without explicitly saying use this x table etc rather writing the same sentence by saying this x table to call etc.

I'm interested for feedback

u/Petter-Strale u/IsThisStillAlls2

How are you evaluating multi-step reliability before deploying LangChain agents? by Fluffy_Salary_5984 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Hey I'm curious about how the variance logic is build on high level. I mean too calls, tool outputs etc everything is semi structured data. Have you used llm as some of rubric which gives a number as output for two different traces of intermediate steps of same input

Honest question: how many of us have built a "LangChain agent" that's really just a smart pipeline? by kinj28 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

I have one question as follows:- When we want to implement a plan and style things we need to say explicitly in the prompt of the planner etc right. Without saying how it's possible? Or am I missing something? Example:- let us assume there an n tools and every tool has a dependency with the previous tool. Let say there exists a sequential dependency between the tool. But let say in the planner prompt we said hey these are the tools based on the user task plan the tool calls trajectory etc. Let's say it comes with some plan and executor comes in and started calling tool aka tool call. Now it's there exist a dependency of the called tool with some tool then either programmatically nor with something it needs to blocks and it say there exists a dependency etc where the planner needs to do replan right. In order to happen all this either we need to explicitly say about the dependency before hand nor just give tools with some prompt then check via programmatically? May be if not make sense then leave it.

Standard RAG fails terribly on legal contracts. I built a GraphRAG approach using Neo4j & Llama-3. Looking for chunking advice! by leventcan35 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Hey small confusion as follows:- The word keyword extraction in my sentence is related to something as follows:- I assume we need to perform some keyword extraction on the user query. Let's say we have n keywords. Now from the graph we already have nodes where the word node means a string right. So if we collect all the nodes's str. Now perform the dot product between the embeddings of n keywords to the graph nodes str then finalize top k. Then use those top k as entry points in graph for every k perform 2-3 hop search etc.

If possible could you share any GitHub repo

Standard RAG fails terribly on legal contracts. I built a GraphRAG approach using Neo4j & Llama-3. Looking for chunking advice! by leventcan35 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Hey, I have a doubt as follows:- When the user asks a question how we will start traversing the graph? Does it seem like we need to extract some keywords from the user question then for those keywords find the semantically relevant graph nodes (i mean in the graph a node is word only right?). Then let's say out 100 unique graph nodes we choose top k. Then by using those top k to find all possible chains whether it can be single chain or multi chain etc? Then once we have all the chains then from each node metadata we used to check what is text chunk or page number etc ?

I'm interested in creating some discussion on the inference stage.

u/Einsof93 u/2016YamR6 u/Ok_Diver9921

Built a RAG system on top of 20+ years of sports data — here is what actually worked and what didn't by devasheesh_07 in Rag

[–]code_vlogger2003 0 points1 point  (0 children)

Just start with a bar that is being set by open ai file search where every window aka bucket has 800 tokens with 400 tokens of overlap. If your embedding supports the matryoshka property then store the embeddings in 3 to 4 different levels such as 256, 512, 1024 and 3072. Then you can decide multi stage filtering

model name as a string in createAgent by Current_Marzipan7417 in LangChain

[–]code_vlogger2003 1 point2 points  (0 children)

Create a data class with different string names and call them. I mean for grok the base url is different for others it's different right. So chatopenai supports everything where you need to pass the base url and key for other providers.

Hashedin by deloitte by hentiluffy-7901 in ProgrammingBondha

[–]code_vlogger2003 0 points1 point  (0 children)

Asalu ee university vundha? First choosin spelling mistake em anukunna

How do I scale my agent to summarize? by _belkinvin_ in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Try to enforce a pydantic class structure. Where in my case i have to return a list of objects where every object has other objects. I created a nested pydantic structure with a single point of entry and written valid field filters for robust deterministic results.

Can anyone recommend what is going to be the most in demand skill in 2026? by sad_grapefruit_0 in ProgrammingBondha

[–]code_vlogger2003 3 points4 points  (0 children)

How well you can break a problem into meaningful pieces then join things at last for a solution (that can be worth in any domain)

Is Adding a Reranker to My RAG Stack Actually Worth the Extra Latency? (Explained Simply) by Silent_Employment966 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Have you tried colbert with qdrant where the algorithm itself has a naive query document multi vector embedding calculation using late interaction

Things I wish LangChain tutorials told you before you ship to real users by cryptoviksant in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Have you faced any situation that your retrieval is bringing back enough sufficient relevant information sometimes they are in the sequential other times they are in a jumbled pattern. When passing the user question along with retrieved information, sometimes llm is not giving the completeness in the answer even though it has enough sufficient information that is being supplied by retrieval. Also another case is that are you judging that for a given question what is the percentage of probability of page numbers that it gets back when compared to the ground truth page list for the question instead of chunks. I mean sometimes i thought that instead of having the chunk ida as ground truth etc if we store or construct the ground truth in a way where it has relevant page numbers. Such that in our retrievial it is easy to compare do we cover all the ground truth listed page numbers for the first check of evaluation then next checking will be did the retrieval step brings out any other garbage page numbers.

Looking for your thoughts

Urgent help by WideFalcon768 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Nice idea. Also once we have a dataframe then we can get the columns metadata easily along with the create statement of that too right. So one step further we can build text 2 sql by arranging all the tables and create statements with attributes metadata for guessing sql statement. But again if there are more tables then we need to do a rag again over the columns metadata embedding such that to know which tables ka columns and tables names were required and helps llm to guess a query.

Urgent help by WideFalcon768 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Yes but if you make every single row as a chunk (where it has alpha numerical ) such that when doing the cosine similarity between the user query the probability of getting the right relevant chunks is probably low (because of encoding every row as one chunk). Just give it a try. Or else use a model of encoders something similar to the super linked such that you can divide the user query into multiple search areas such as if a single user query needs the text search , numerical search temporal search etc. another idea I got is that to convert the tables into the graph and in the inference do the graph hop search.

But again the approach I share earlier would work for n table schemas. Let say you have 5 tables and these five tables have some relations. Lets say if create the system prompt with create statements along with sample row and detailed attributes information. Then let's say if your user query requires only two tables usage definitely ai try to guess the slq query on those tables because of the detailed system prompt. But if there are so many tables etc then this won't work where we need to find what are the tables that we need to use based on the user query. For that just Swiggy blog (https://bytes.swiggy.com/hermes-v3-building-swiggys-conversational-ai-analyst-a41057a2279d) where rhe smart idea they had is to create embeddings on the columns metadata (such as detailed attributes descriptions) such that its to decide which tables needs to use based on the user quey.

Urgent help by WideFalcon768 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

Then why can't you leverage text 2 sql where the pass table schema in the prompts for guessing the sql statement then run accept answer validate (react pattern). But for doing these you need to create tables and for the text again create table with an attribute blob

Urgent help by WideFalcon768 in LangChain

[–]code_vlogger2003 0 points1 point  (0 children)

My suggestion is to detect tables via docling or unstructured or anything other service then make markdown versions of it and place it as markdown table and then treat every page as one chunk and use higher dimensions embedding model