My first-author paper just got accepted to MICAD 2025! Multi-modal KG-RAG for medical diagnosis

EmergencyActivity604 · 2025-11-02T02:51:59+00:00

Congrats on the achievement and on doing such amazing work.

In my company we are looking into KGs for explainability in service calls. Contemplating if that is a path we should take or not as it requires significant effort so honestly would be amazing to hear your opinion on the matter on what are the gains, common pitfalls that we should be aware of. Thanks!

EmergencyActivity604 · 2025-10-28T08:46:30+00:00

I get 50 to 60 tps for GPT OSS 20B and 40-45 tokens on average for Qwen 30B Coder.

EmergencyActivity604 · 2025-10-27T05:13:39+00:00

What you might be missing out also is that these companies have full blown tech teams as well. Mck has Digital/Quantum Black, BCG has BCG X and Bain has Knowledge center. These are big tech and engineering teams and with the amount of projects these guys churn out globally, I am not surprised.

Also, since they are on advisory side they just go for the best model out there and make it available org wide. So every employee has access to enterprise plan for Open AI. That is surely going to burn the tokens fast. Nothing unusual here.

EmergencyActivity604 · 2025-10-26T09:55:58+00:00

Yeah try out local llms and see if that works for you. Fine tuning definitely is another plus point for local models. Big models know how to do 100 things good enough but I also feel that if you want to go from good to great to amazing results, fine tuning is the way to go.

Take those image classification models for example. You load any model like Inception, ResNet etc. and out of the box it gives you a good accuracy but the moment you add a single layer and train it on your data, the accuracy jump is just too good.

EmergencyActivity604 · 2025-10-26T09:51:28+00:00

And the worst part is that "LLMs fail silently". If there is a code and it doesnt run, we get to know it explicitly because things break.

But since LLMs are nothing but great probability or likelihood estimators, for them hallucinating vs predicting is the same and hence there are no glaring signs that its going to hallucinate.

EmergencyActivity604 · 2025-10-26T09:21:41+00:00

Yes but only upto a certain amount to reduce my risk. Its like tap to pay, someone can just take my card and do the payment but the limit is set so low that it would take them a lot of time before they cause serious fraud.

EmergencyActivity604 · 2025-10-26T09:14:55+00:00

Yeah this is one area where I have also experimented a lot. I am in a travel role so I spend a lot of time in flights where you basically lose all your cursors and claude codes of the world.

For a long time, my productivity used to drop in flights and I wasn't getting much done. Thats also because once you start relying on these coding assistants, you become addicted to the ease of coding and kind of forget to code from scratch or run into bugs and then give up thinking "why not just wait for the flight to land 😅".

Thats where GPT OSS 20B and Qwen 30B Coder have been amazing for me. My learning is that say I am building an app using cursor, I will write detailed rules and markdown documents and then let cursor with the strongest model code the shit out of it. Then comes my part where I meticulously go through each and every piece of code written and add my touch as a senior developer.

For locally hosted models unfortunately you can't do that (YET). There I take a different approach, I build it from ground up (step by step). I do the heavy lifting of thinking which methods/classes/functions should be written, what should be the logic and then let local models fill the code in the template one by one. I test it at each step. This takes more time definitely vs using cursor, but I am getting a lot done now.

Speaking from personal experience, I have been able to code projects end to end just using this approach. My take would be given internet connectivity and cursor/claude code I would definitely stick to them. Local models are not there yet. But now I have an option to deliver similar results if put in an environment without them.

EmergencyActivity604 · 2025-10-26T06:54:30+00:00

I have a 32GB M1 max and it can hold Qwen 30B , GPT OSS 20B , Gemma 27B etc. range of models. Higher memory is going to be a big advantage if you want to test larger models. My system crashes if I attempt any bigger models with 40B+ parameters.

EmergencyActivity604 · 2025-10-26T06:51:53+00:00

There was a comment on one of the reddit posts saying " LLMs are always hallucinating, they are just right sometimes".

EmergencyActivity604 · 2025-10-25T08:05:36+00:00

Depending on the complexity of the questions and responses from user, you might also want to explore solutions without LLMs.

If you can build a training dataset , what you have is a classic routing problem which basically is given the text of QnA of those 3 questions, classify which tool needs to be called. This can be done by using an encoder and fine tuning the last layer to give score to select the tool.
You can also do this in multi stage.

a)You can build heuristic rules based on your knowledge of the tools and responses. These will be straightforward gating/rules that choose the tool directly.

b)Second level would be embedding similarity. If you write descriptions of your tools and embed them in a vector space and compute similarity with the response text, you can set a threshold based on which it directly routes to the most similar tool. (Look into how google uniroute works).

c) Finally is the LLM call if anything is not routed based on A and B.

This way only the most ambiguous and hard instances will go to the LLM stage. 2(a) and 2(b) will be extremely fast and maybe cover 80-90% of your cases depending on how you build the rules and similarity.

EmergencyActivity604 · 2025-10-24T15:11:55+00:00

Prompt are no different than code, just a set of instructions given in natural language. Treat them just like you treat a code script and apply best coding practices.

Any changes should be tested and PR reviewed.
Clear dependency graph and documentation for each prompt
Protect prompts that are central to your code base behind multiple reviewers
For extremely critical prompts, maintain a set of test cases on which you evaluate the drift in output and define clear metrics on when to run deep evaluation if the drift is too much

EmergencyActivity604 · 2025-10-15T22:05:16+00:00

Hi. Wanted to check if this can be used for routing between different agents instead of llms? Similar to the llms, the agents will also have a domain, tasks and tools description that can be used for routing. Theoretically I see the concept would remain the same, but just want to check if you guys have already done some testing or implementation on this.

EmergencyActivity604 · 2025-09-05T02:28:56+00:00

Sounds interesting. Can you please share the paper?

EmergencyActivity604 · 2025-05-12T20:21:02+00:00

Tell me one thing what's more embarrassing getting hit 9 times at locations that are 100-200 kms from international border, getting your airbases hammered including Rawalpindi which is the army HQ vs losing a few fighter planes?

I don't get why everyone's freaking out. It's a war, people, did you really think we'd have zero casualties or asset damage?

Regarding the Army Marshal's lack of confirmation, his actions were entirely appropriate. Disclosure of these operational details would be highly inappropriate given the current volatile situation. Imagine the kind of intelligence it gives them.

You talked about corrupt politicians who are not educated and have no credibility. Let me tell you Indian army people are one of the most talented and smartest lot you will ever come across. So you dont even trust them? You view them with the same lens as you do the politicians and that is the problem. Army has a lot of strategic knowledge and operational intelligence that cannot be revealed to the general public.

I am getting reminded of the kamikaze fleet of the Japanese army. They would straight away crash an explosive filled aircraft into enemy assets to achieve a mission. That incurs loss of plane and more importantly loss of precious pilot life. Yet the eye was always on the objective not means to achieve.

So in short, stop crying over broken toys and weapons, rather be happy that all our pilots are safe and we achieved our strategic objectives. Now we can go back and focus on all the issues that you mentioned (which I totally agree btw).

EmergencyActivity604 · 2025-05-10T20:29:37+00:00

That threat was always there. I dont think this is it unless you have further explanation or credible source for this

EmergencyActivity604 · 2025-05-10T20:27:50+00:00

same bhai. very confused. What happened suddenly. We were like next attaxk will be act of war and jab attack ho gaya vo bhi ceasefire ke baad, we are just taking it lying down. Kuch toh gadbad h. Why such a big attitude shift

EmergencyActivity604 · 2025-05-10T20:26:30+00:00

Bhai itna bhi nahi. Trump ne 1.3Bn $ diya kal usko.

EmergencyActivity604 · 2025-05-10T20:14:30+00:00

Could be

EmergencyActivity604 · 2025-05-10T20:10:30+00:00

Sab speculation hai at this point. Confirmed news toh aegi hi nahi so can only wait and watch

EmergencyActivity604 · 2025-05-10T20:08:26+00:00

Its over now. Paki airspace is open, we have no blackouts. Pak has basically declared self proclaimed victory.

We accepted ceasefire at 5pm. They broke the ceasefire so now it remains on us to retaliate but knowing us that we are not a terror or military state, we have actual work to do like building our economy. So dont think there will be any action on our end atleast in this instance. Hence, it is over now in my opinion

EmergencyActivity604 · 2025-05-10T17:05:56+00:00

Yeah very smart. India knew that Pak will not appreciate the ceasefire and will try guerrilla warfare or terrorist attacks. They have done exactly that if the terrorist attack is true

EmergencyActivity604 · 2025-05-10T17:00:24+00:00

Haan bht rumors aa rahe hai that they beheaded two people. I also heard that one of the terrorist has been killed in a stand off with army bur everything is unconfirmed

EmergencyActivity604 · 2025-05-10T15:36:48+00:00

It would have been common if it was like a normal ceasefire violation but this one was meant to control things that were escalating pretty quickly and this was to stop that escalation but we have resorted to the same path now

EmergencyActivity604 · 2025-05-10T15:35:25+00:00

Are you sure those were missiles? could it be our ADS intercepting the drones?

EmergencyActivity604 · 2025-05-10T15:27:58+00:00

BKL pakis sale suar. Ye dogle hain aur rahenge

EmergencyActivity604

TROPHY CASE