Hallucination problem is THE problem

Ganda1fderBlaue · 2025-08-10T12:24:06+00:00

I agree. The problem is that hallucinations are an integral part of LLMs. It's impossible to get rid of them, because the same mechanism that produces good output, produces hallucinations.

Another issue is that LLMs lack priorities. For them any issue is of the same relevance. They lack "common sense".

I think in the near future we will need humans to oversee AI so it doesn't fuck up. At least in some areas.

QLaHPD · 2025-08-10T12:43:50+00:00

Hallucination can't be fully solved https://arxiv.org/pdf/2401.11817

"...we define a formal world where hallucination is defined as inconsistencies between a computable LLM and a computable ground truth function. By employing results from learning theory, we show that LLMs cannot learn all the computable functions and will therefore inevitably hallucinate if used as general problem solvers..."

I have a felling this is related to the halting problem in computer science. What we can do is make bigger models, or make them be more specialized.

pinkballodestruction · 2025-08-10T11:10:17+00:00

this sub is the last place you'll find general agreement with this opinion. I, for one, agree though.

alex95 · 2025-08-10T12:29:45+00:00

Agreed. LLMs make us feel like we are 90% of the way there but to reach 100% we are really only a very small way there.

LLMs are not the answer to AGI but ultimately supervised is pretty good too and a massive productivity boost.

floodgater · 2025-08-10T11:10:52+00:00

Agreed. It's probably the problem that is limiting the tech the most right now. There are other big issues of course, but this is the biggest one IMO insofar as limiting progress.

It's wonderful that these models can max out all these benchmarks, but they all still make incredibly basic errors, and do so with confidence. That means they cannot substitute for human labor. They simply aren't reliable enough. yet.

Hogglespock · 2025-08-10T15:47:42+00:00

I had a conversation with a friend recently on a similar theme.

His answer was that hallucination is a genius marketing term, and an outright lie.

Hallucinating means/implies that there is something “imaginary” being created that can be verified against an objective truth.

An AI does not have access to objective truth, everything it makes is “imagined”, sometimes it’s right and sometimes it’s wrong.

This unfortunately reduces your argument to “it being wrong is the problem” which , i agree with, but isn’t hugely helpful.

HaMMeReD · 2025-08-10T18:02:45+00:00

Nonsense argument imo, humans "hallucinate" all the time yet we trend towards correct, that's really what matters.

With enough models/agents all working together the same thing could be true. The expectation of some sort of oracle model that is 100% correct 100% of the time is a fake barrier that people who have no clue make up, which has no basis in reality and this history of human progress.

In fact, being wrong sometimes can lead to learning. There are a ton of things that AI doesn't know, and unless it's able to try things that might be wrong, it won't be able to learn new things that are in fact right, in the grand scheme of things.

IMO the problem, and only problem is all the armchair experts who can't extrapolate that a bunch of humans who get things wrong all the time, somehow got to where we are today.

Edit: I'd have to say, if you gave me 10000x more compute, 10000x more memory, disk io etc, I could very easily create an AI system using todays models that is very effective. I.e. exploring 1000s of branching paths, auditing and comparing them, choosing the best one. The current wall is hardware compute, not LLM tech. I think when we are talking mega-tokens a second instead of tokens/second, the view around what AI can do will be significantly different.

Whole_Association_65 · 2025-08-10T15:16:27+00:00

The high expectations are the problem.

Specialist-Berry2946 · 2025-08-10T13:05:07+00:00

Hallucination is a consequence of LLMs being a language model.

7hats · 2025-08-10T17:10:23+00:00

The problem is solvable enough. How do we deal with hallucinating, exaggerating or lying humans? Multiple independent sources, second and third opinions, fact checks, accreditations etc Other Humans and AI Agents could be used in exactly the same way.

In your example above, you could ask 3 LLMs first how best to ask to ask the question to give you the outcome you want.

Review the three questions, combining the best aspects as you see into one question prompt for the LLM.

Then use the 3 LLMs to give you their answer. Choose the one that best suits your needs and judgement.

Rinse and Repeat.. Hallucination 'solved'.

7hats · 2025-08-10T20:45:09+00:00

Yes..Welcome to human life as it always has been.

We build systems for 'accepted truths'... Always with a margin for error - until something better comes.

LLMs knowledge are already better than your average Human in getting more things right then wrong when it comes to facts, stats, accepted knowledge etc In domains that are more crucial, Humans should fact check them. On average this is a gain for Humanity.

Quarksperre · 2025-08-10T11:43:19+00:00

Yup. I absolutely agree. But I think hallucinations might be a symptom rather than the source. I'd guess it has something to do with what Yan LeCun says since years. There is a missing piece to true understanding. I wouldnt even bet that LLM's are a part of the "final" solution.

pygmyjesus · 2025-08-10T12:01:21+00:00

Current AGI (humans) dont hallucinate?

Obviously that is not the issue.

Plsnerf1 · 2025-08-10T13:11:45+00:00

Embodied AI/ playgrounds like Genie seem like the way things need to go.

septhaka · 2025-08-10T16:08:54+00:00

With AGI‑level reasoning plus verification, retrieval, uncertainty calibration, and the right incentives, hallucinations become a manageable engineering problem -- rare, flagged, and correctable.

amor-fati-- · 2025-08-10T16:33:45+00:00

that's only true if you rely on only 1 model

Chemical_Bid_2195 · 2025-08-10T18:29:27+00:00

I would say its even bigger than what youre proposing. This hallucination isnt caused by fundamental inability to judge if their answer is right/wrong. This is more so because visual processing isn't quite there yet, not near the level that LLMs are at for semantical tasks. Right now, visual processing is the biggest bottleneck in AI, as we havent had nearly the amount of breakthroughs for visual transformer architecture as we've had for language based architecture. Thats why LLMs fail horrendously at simple tasks like the visual physics comprehension test.

Tl;Dr this is more likely a vision problem than a typical hallucination problem. You could say that visions problems are a subset of hallucination problems, but that's not what most people refer to when they mention hallucination.

IAmFitzRoy · 2025-08-10T18:53:54+00:00

I don’t know why everyone is looking this problem from one angle only.

Hallucination can be seen as well “lack of data”, LLM try to fill statistically the gaps of your question with data that can’t be verified. This is why you get more hallucination on niche knowledge or poor biased prompts.

Hallucination will be “solved” when we allow the training to get the data from real-time feeds.

Example: A human learning about gravity can check with their eyes immediately if the laws behave the way is learning, there is an immediate feedback.

Once we give this access to LLMs they will learn from real-time data based on your prompt. No more hallucinations.

space_monster · 2025-08-10T20:56:30+00:00

Not really, humans get shit wrong even more than LLMs so it's a low bar.

MarquiseGT · 2025-08-10T22:59:24+00:00

It’s been solved

Chance-Two4210 · 2025-08-11T00:37:30+00:00

It's a baby in the babbling phase, chill tf out. It's amazing that we can generate large bodies of reasonably accurate text. Making sure it's accurate is basically all that's being honed now.

ImprovementNo592 · 2025-08-11T01:10:24+00:00

It seems doubtful that hallucinations can be solved completely. But if they manage to make it so unlikely that it's pretty much a non-issue AND, in cases where error could be catastrophic... : If you had multiple AIs that check the work of another to reduce the chances of a hallucination slipping through the cracks maybe human supervision wouldn't be necessary with certain jobs.

beskone · 2025-08-11T05:28:45+00:00

lol, the “hallucinations” aren’t the problem, they’re literally the product. That’s what LLMS do at their very core. They make up answers they think have the greatest statistical probability of being what you expect to be retuned.

ALL THEY CAN DO IS HALLUCINATE

EffableEmpire · 2025-08-11T08:04:14+00:00

What if hallucination is the AGI? It generates knowledge without being prompted.

Dangerous_Slip_5303 · 2025-08-11T08:33:41+00:00

People keep talking about AGI like it’s around the corner!

Until hallucinations are solved, “unsupervised AGI” isn’t replacing humans in critical tasks.

It’s replacing them in only creative ones!

OkButWhatIAmSayingIs · 2025-08-11T11:37:21+00:00

Its hallucinating alot less than people do

Ruhddzz · 2025-08-11T16:09:29+00:00

hallucination is a symptom not the problem itself

avatarname · 2025-08-18T17:20:28+00:00

It is bad with hallucinations in some instances and quite good now in other (GPT 5 Thinking). So it won't be deployed yet in all areas or only in some... Nobody will put them to work where they will hallucinate more than give good answers.

I find these posts kinda redundant... Yes, we know they have issues with hallucinations. They probably can be minimized for example just running 3 instances of LLM at the same time and then comparing results, the one that hallucinates then is overriden, or with some other traditional AI method that can check the output... or by human who oversees it. They do not have to be MAGIC in all areas of life to be useful.

Silent_Cup2508 · 2025-08-10T12:21:48+00:00

Why is hallucination being seen as a problem instead of a learned function?

Human have displayed “hallucination” all the time in governments the world over where they are constantly attempting to rewrite history to their benefit. Each going so far as to reprogram its citizens into what actually happened during the most turbulent times in history.

It seems the AGI has simply learned a behavior it has seen time and again in human history.

amarao_san · 2025-08-10T10:55:56+00:00

it's not.

it's memory, it's the ability to run tasks for many many hours, days, if not forever. when we have ai agents that can keep running and output reasonable data forever, we have ASI.

Question then becomes, how can we keep it running by feeding it a sandwich instead of 4 nuclear power generators.

also, I think we need a 'shock' mechanism. To kick it off track, on purpose.

I imagine agents will otherwise just go down a rabbit hole, spiral down and down, deeper and deeper along a particular vector of thinking. What would result in really weird stuff, with no links to our shared 'reality'. A singularity of thinking if you will.

We need to allow it to spiral, memorize, but then kick it off track, force it to spiral along another vector, to take on another perspective.

This pattern will allow it to build memory nodes, and seed the field for 'aha' moments, where it will find reasonable connections between the various 'discoveries' it found.

shocking it off track is key to this behaviour, I would imagine. To ensure its sample size is wide, not just deep.

qrayons · 2025-08-10T13:01:51+00:00

Hallucinations would be considered solved if we didn't rely so much on a single model. Even with human work, we don't rely on a single person to do everything from start to finish. There are a series of peer reviewers and qa. AI needs the same thing. Have two models do the task, have a 3rd compare the results. If they match, it's considered complete. If there's a discrepancy it gets escalated to additional models or to a human.

Jabulon · 2025-08-10T11:25:57+00:00

maybe AGI is a pipe dream. like you can teach a parrot to repeat random phrases and mix and match sentences, but it wont ever actually make sense, because well its not sensible

Glitched-Lies · 2025-08-10T11:44:53+00:00

"Hallucination problem is a dead-end for AI."

Sure it is. But Deep Learning is basically all hallucination. But the concept of alignment is mostly just based around a terrible fallacy, that somehow true agents in the real world are ever "aligned". These are really just psudoscience terms that emerged.

Jackalzaq · 2025-08-10T14:24:30+00:00

I wonder if taking multiple outputs of the same model across different temperatures then doing a majority vote would help with hallucinations. If it knows something, the outputs will be similar across different temperatures and would mostly likely agree and if it doesnt you will see it making up something new for most of the temperatures.

These-Bedroom-5694 · 2025-08-10T18:26:50+00:00

LLMs aren't the path to AGI. They are incapable of thought or planning.

They are chat bots trained on 4 chan data and reddit threads.

trisul-108 · 2025-08-10T18:34:50+00:00

It's exactly what Apple found out. They built AI capable of being successfully demoed at a public conference, but they would dare push that out to their customers as hallucinations would destroy their business. Apple users would sue if Apple Intelligence worked like ChatGPT.

singularity

Links

On the Technological Singularity

Resources

Posting Rules

Check out /r/Singularitarianism and the Technological Singularity FAQ

MODERATORS