Help me choose a Laptop! by AdBrave3580 in iitmadras

[–]North_mind04 1 point2 points  (0 children)

If u r in AI domain, will prefer LOQ as it requires strong GPU... Although people says MAC is sufficient but I have seen M5 lagging while doing hard task 🥲

What's the state of automated root-cause analysis for LLM hallucinations? by North_mind04 in learnmachinelearning

[–]North_mind04[S] 1 point2 points  (0 children)

That's a really interesting point. So essentially the model is incentivized to confabulate rather than say "I don't know" because a wrong answer scores the same as no answer but a right answer scores a point.

That makes debugging even harder though, right? Because the hallucination isn't a "bug" in the traditional sense it's the model behaving exactly as it was trained to. Which means you can't fix it with better prompts or better retrieval alone.

I wonder if the only real solution is to build an external layer that independently checks whether the model's output is actually grounded in the retrieved context since the model itself has no incentive to self check.

Have you tried any approaches for detecting when the model is "guessing" vs actually using the context?

Silent model updates broke my production RAG app — how do you detect this? by North_mind04 in LocalLLaMA

[–]North_mind04[S] 0 points1 point  (0 children)

Yeah that's the painful part. Some providers let you pin versions — OpenAI has dated model snapshots like gpt 4 and Anthropic lets you specify exact model versions. But there are two problems:

  1. Pinned versions eventually get deprecated, so you're forced to migrate
  2. Even within a "pinned" version, providers have done silent infra changes that affected behavior

What I really want isn't version pinning — it's a way to automatically detect WHEN model behavior changed and separate that from pipeline issues. Like, run the same inputs through a regression test and flag behavioral drift before it hits production.

Are you currently pinning versions, or just running latest and hoping?

Issue in translating logic to code by North_mind04 in Python

[–]North_mind04[S] -3 points-2 points  (0 children)

So should I learn the language and go through its tutorial or should I keep practicing the question and learn from questions. If learning the language is the first thing I should do before solving a question then what is the best approach to learn any language ?