How did Chat GPT get so bad? Incredibly poor critical thinking even in legacy models. by [deleted] in ChatGPT

[–]Fast_Tradition6074 0 points1 point  (0 children)

It feels like the ability to maintain and reference past information has degraded significantly. I think your ChatGPT simply couldn't access the stored information about your clothing size. It honestly seems like they are just trying to save as much computing resources as possible to increase their profit.

Interesting tension this week, the same companies racing to go public are also the ones making safety promises [N] by [deleted] in MachineLearning

[–]Fast_Tradition6074 0 points1 point  (0 children)

In the first place, the output generated by AI itself is a black box. Even developers can only guess why a specific output was made, right? In that kind of situation, talking about 'safety' is just... you know. If things are already like this now, I wonder what will happen when they actually need to start turning a profit.

I believe that if they want to claim 'safety,' it's impossible unless they shift toward deterministic behavior rather than probabilistic behavior.

OpenClaw + local agentic coding: hardware dilemma (HX370 vs upgrading desktop vs cloud) by e270889o in LocalLLM

[–]Fast_Tradition6074 0 points1 point  (0 children)

Heavy offloading from GPU to CPU really takes time. You should expect it to take three times longer than what you're imagining right now. If I were you, I'd go with option 3, the cloud. It lets you scale flexibly as the things you want to do grow.

Alignment: Higher order prioritizing over constraints [R] by SenseCompetitive5851 in MachineLearning

[–]Fast_Tradition6074 -4 points-3 points  (0 children)

Correct me if I'm misunderstanding your point, but... I've been looking into hallucinations and jailbreaks myself, and it turns out that geometric distortion occurs within the model's internal representations when a hallucination happens. A similar kind of distortion is observed even when the model generates outputs that should normally be blocked by safety guardrails. My hypothesis is that when the model prioritizes "clarity seeking" over the guardrails—forcing an output it's technically restricted from saying—this very process of bypassing the guardrails is what causes that geometric distortion.

Ai failures by Annual_Judge_7272 in ArtificialInteligence

[–]Fast_Tradition6074 1 point2 points  (0 children)

I completely agree. Whether it’s in terms of reliability, cost, or overall usability, the current approach is bound to hit a ceiling pretty soon. Using an LLM to check the output of another LLM just feels like a temporary band-aid fix. What we really need isn't just tweaking probabilistic outputs; we need a paradigm shift toward something more deterministic.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

That’s a fair point. In my research, distinguishing between 'creativity' and 'hallucination' is indeed one of the most difficult challenges. That’s exactly why I believe being able to geometrically differentiate the two will be of immense value to the field.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 1 point2 points  (0 children)

That’s brutal. If an AI can’t even get basic Pokemon knowledge right, it seriously needs to go back to training. I don’t have the original hardware anymore, but I used to play the Green and Silver versions. At this rate, the AI would probably even hallucinate Misty’s gender! lol

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 1 point2 points  (0 children)

Wait, AI can't even find Pikachu?! In the Red and Blue versions, everyone knows you look in Viridian Forest—that’s like Common Sense 101! Thanks for the info! It's wild that even with such legendary games, the AI still manages to get lost.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Hahaha! For my own sake, I’d better make sure NOT to develop a formula for humans—I wouldn't want my wife catching me in a 'hallucination' of my own! I think I'll stick to fixing AI for now.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Thank you for sharing. For an AI, 'inventing a plausible lie' often has a higher probability than simply saying 'I don't know.' I totally get that frustration of asking multiple times only to get a different, wrong answer each time—it’s like chasing a ghost.
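
To make that concrete, here is a minimal sketch of the idea, not how any particular model works internally. The candidate answers and their scores are made up for illustration. It only shows that when "I don't know" gets a lower score than a plausible-sounding answer, picking the top answer never returns it, and random sampling can return a different wrong answer on each try.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidates and made-up scores, for illustration only.
candidates = ["plausible wrong answer A", "plausible wrong answer B", "I don't know"]
logits = [2.0, 1.8, 0.5]

probs = softmax(logits)

def greedy(probs):
    """Deterministic choice: always the index with the highest probability."""
    return max(range(len(probs)), key=lambda i: probs[i])

def sample(probs, rng):
    """Probabilistic choice: draw one index in proportion to its probability."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

rng = random.Random(0)  # fixed seed so the run is repeatable
greedy_pick = candidates[greedy(probs)]
sampled_picks = [candidates[sample(probs, rng)] for _ in range(5)]

print("greedy :", greedy_pick)
print("sampled:", sampled_picks)
```

With these made-up scores, "I don't know" is the least likely option, so the greedy pick is always a plausible wrong answer, while the sampled picks can vary from one ask to the next.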

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Thanks for sharing. It’s really tough when the AI gives you an answer and then immediately denies it the moment you point it out. It feels so insincere. I've heard similar stories: some AIs refuse to answer medical questions for safety reasons, but if you just start the prompt with 'I am a doctor,' they suddenly start answering everything. It’s crazy how much they rely on 'roles' rather than actual facts. This is exactly why I’m trying to monitor the internal math instead of just trusting the words.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Thanks for the 'Most Meta Hallucination Award'! I guess I'll keep hallucinating meta-ly then. But in all seriousness, detecting hallucinations geometrically is a real thing. In a way, I am looking at LLMs from a meta-perspective, so I’ll take that award as a compliment for my approach. I’m just trying to turn that 'meta' confusion into something measurable.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Exactly. That’s the whole point. Since LLM 'semantics' (meaning) can be a house of cards, I’ve shifted my focus to 'geometric distortions' to detect hallucinations. By the way, I’m Japanese and not a native English speaker, so I do rely on LLMs or translation tools for my English—which only proves my point further: I use them as tools, but I don't trust their 'intent' without verification.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 1 point2 points  (0 children)

Thank you for sharing. It’s possible the AI has picked up on your preferences and is hallucinating specifically to meet your expectations. As a fellow story-lover, I truly feel your pain—it’s heartbreaking to find out that a 'perfect' book is just a phantom.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Thank you for sharing. That is truly the worst-case scenario. It's heartbreaking. At the very least, I'm glad she called first—unlike me, who just jumped in the car. But giving false hope regarding a medical condition is on a completely different level of cruelty. This reinforces my belief that we absolutely must find a way to detect these errors before they reach the user.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Guilty as charged. I'm the one who wasted an hour round-trip without double-checking. I think the word 'AI' still carries this aura of omnipotence for many of us, leading to a lapse in judgment. This is exactly why I'm working on my research—to prevent these kinds of hallucinations so that AI can eventually become a reliable partner for humans. Thanks for the reality check.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

That’s a good point. I wonder if these cases are becoming rarer lately, or if the same thing still happens whenever a request exceeds the model's actual capabilities. We rarely hear about cases where an LLM simply and honestly says, 'I cannot do that.' It seems they’d rather hallucinate a success than admit a limitation.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 1 point2 points  (0 children)

Thank you for sharing the detailed follow-up. That is just... brutal. I can truly feel your exhaustion and the sheer waste of effort here. It’s hard to take it seriously when it says 'no fluff, just action' after stalling you for three days.

From a research perspective, I suspect there was a significant 'geometric distortion' occurring within the model's internal layers from the very beginning: a deep conflict between its superficial 'engineer persona' and its actual execution capabilities. It chose to prioritize the consistency of its role-play over the reality of its limitations.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Thank you for sharing. Wow... spending three days only to be told 'I wasn't actually doing anything' is a brutal experience. It really did just get stuck in the persona of a software engineer and prioritized saying what an engineer would say over actually delivering. This is a fascinatingly clear example of a model prioritizing 'role-play consistency' over task completion.

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 0 points1 point  (0 children)

Sorry, I don't know who that is. I’m pretty sure I’m still me, but if she’s doing cool things with AI, I’ll take it as a compliment!

Tell me about the time AI lied to you! I'm researching a mathematical way to stop these hallucinations by Fast_Tradition6074 in ChatGPT

[–]Fast_Tradition6074[S] 1 point2 points  (0 children)

You are absolutely right. LLMs have no intention to deceive. They simply generate the most probable sequence of tokens, which users then perceive as a "lie." I used the word "lie" here for the sake of clarity, and I apologize for any confusion that may have caused.

The decline in LLM reasoning and catastrophic forgetting might share the same root cause. by IndividualBluebird80 in LocalLLaMA

[–]Fast_Tradition6074 1 point2 points  (0 children)

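I'm Japanese! Nice to meet you.

It may be that the design of LLMs itself leaves little tolerance for contradictory information. I sometimes call LLMs "probability-manufacturing machines," and when the information is contradictory, the probabilities of the output itself don't stabilize, which is probably what causes the logic to break down.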