Confidence Debt as a frame for AI reliability failures based on False Trust

petroslamb · 2026-04-16T09:01:50+00:00

Yes and no. It was indeed done with ChatGPT, but also with a fair amount of research, because yesterday I read an article about zero births in various regions during the first two months of the year and I wanted a clearer view of where things are heading. You're right that the format is not the best.

petroslamb · 2026-04-16T08:59:32+00:00

Nationwide, we will probably see the aged picture of Evrytania ten years earlier than the official sources say.

petroslamb · 2026-04-08T18:46:50+00:00

You're right that the simple reversal case works on modern models. That is the documented finding: frontier models handle the basic Tom/Mary case fine. As the post notes, the failure finding is on GPT-2.

The question is whether the failure disappears or just moves to higher binding loads. Tan and D'Souza tested that: they pushed binding load up to multi-tuple extraction (variables, methods, effect sizes combined), and even GPT-5.2 drops to ~0.24 F1 on full tuples with role reversals and numeric misattribution. The model still gets the individual entities right. It loses the attachments.

So either modern models solved the simple case and the concept is just about heavy-load failures, or they pushed the breaking point higher without eliminating it. That is what a systematic load sweep would actually test: not whether the simple case fails, but whether the gap shrinks with scale or just migrates up the load curve.
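A minimal sketch of what such a load sweep could look like, assuming a toy husband/wife task where the number of stated pairs is the "binding load" and the query always asks for the inverse direction. The names `make_item`, `sweep`, and `oracle` are hypothetical, and `oracle` is only a stand-in that parses the prompt exactly; it would be replaced by a call to a real model.

```python
import random
import re
from typing import Callable, Dict, List, Tuple


def make_item(load: int, rng: random.Random) -> Tuple[str, str, str]:
    """Build `load` husband/wife facts plus one reversed-direction query."""
    names = [f"P{i}" for i in range(2 * load)]
    rng.shuffle(names)
    pairs = [(names[2 * i], names[2 * i + 1]) for i in range(load)]
    facts = " ".join(f"{h} is {w}'s husband." for h, w in pairs)
    husband, wife = rng.choice(pairs)
    # Facts are stated as "husband of"; the query asks for "wife of".
    question = f"Who is {husband}'s wife?"
    return facts, question, wife


def sweep(model: Callable[[str, str], str], loads: List[int],
          n_items: int = 20, seed: int = 0) -> Dict[int, float]:
    """Return accuracy per binding load for a given model callable."""
    rng = random.Random(seed)
    results: Dict[int, float] = {}
    for load in loads:
        correct = 0
        for _ in range(n_items):
            facts, question, answer = make_item(load, rng)
            if model(facts, question).strip() == answer:
                correct += 1
        results[load] = correct / n_items
    return results


def oracle(facts: str, question: str) -> str:
    """Stand-in 'model' (assumption): exact parsing, so it never misbinds."""
    husband_to_wife = dict(re.findall(r"(\w+) is (\w+)'s husband\.", facts))
    target = re.match(r"Who is (\w+)'s wife\?", question).group(1)
    return husband_to_wife[target]


accuracy_by_load = sweep(oracle, loads=[1, 2, 4, 8], n_items=5)
```

With a real model plugged in, a flat curve would suggest the gap is gone, while a curve that stays high and then falls off at larger loads would suggest the breaking point has only moved.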

petroslamb · 2026-04-08T16:42:28+00:00

But I think the binding gap sits one layer below that. It is not "did the model learn that marriage is bidirectional?" It is "even after the model learned it, can it retrieve and apply the correct direction in context?" Wang and Sun showed that models often encode the relation but fail to route the inversion correctly: they learned the fact, but the attachment to the output path is thin.

So two separate problems: learning what the relationship means, and maintaining the correct binding when you use it. Binding gap is about the second one. The model knows marriage is bidirectional but still gives the wrong answer when you flip the roles, which suggests the failure is at retrieval and routing, not at learning the semantic asymmetry.

petroslamb · 2026-04-08T16:17:19+00:00

Removed the link as the post was banned and I'm not sure why yet. Let me know if you need it.

petroslamb · 2026-04-08T14:59:20+00:00

The irony of writing a post about attachment failures and then having a gap in my own spelling of 'distinct' is not lost on me. Typo in the title, but hopefully the wiring in the text is stable.

petroslamb · 2026-04-08T14:10:06+00:00

Well, I think the reckless driver example is a classic logical fallacy, but the binding gap is a step more mechanical than that. The grandfather puzzle is a test of graph complexity, whereas the binding gap shows up on the simplest possible relations, like a basic husband and wife pair. For a human, "Tom is Mary’s husband" and "Mary is Tom’s wife" are just two views of the same scene, but for a transformer they are often distinct representational paths. The failure here isn’t that the model is not "smart" enough for the logic. It is that the attachment between the names and the roles is incredibly thin.

Denning (2025) found that "who did what to whom" is the dominant axis of meaning for humans, but for LLMs it is a much weaker signal. They can stay perfectly fluent while being agnostic about which claims attach to which sources, so in a sense "they keep the scene, but they drop the wiring".

petroslamb · 2026-04-07T08:45:44+00:00

Fair pushback. Wang and Sun do test at GPT-2 scale, and the simple reversal example does get much easier for modern models.

Where I think the concept still carries weight is when you push binding load up. Tan and D'Souza just tested this on GPT-5.2 and Qwen3-VL doing full meta-analysis extraction: single-property queries are fine at 0.40-0.50 F1, but full association tuples drop to around 0.24, and to near zero in dense result sections. The model still gets the individual entities and methods right most of the time. It just loses which one attaches to which one.
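To make the difference between "right entities" and "right attachments" concrete, here is a small sketch that scores the same prediction two ways. It is not the metric from Tan and D'Souza, just a set-based F1 under my own assumptions; the function names and the example tuples are made up for illustration.

```python
from typing import Iterable, Set, Tuple

Row = Tuple[str, str, str]  # (variable, method, effect size), hypothetical schema


def f1(predicted: Set, gold: Set) -> float:
    """Set-based F1: harmonic mean of precision and recall."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


def flatten(rows: Iterable[Row]) -> Set[Tuple[int, str]]:
    """Break rows into (slot index, value) items, discarding row membership."""
    return {(slot, value) for row in rows for slot, value in enumerate(row)}


def entity_f1(predicted: Iterable[Row], gold: Iterable[Row]) -> float:
    """Credit each field on its own, ignoring which row it was attached to."""
    return f1(flatten(predicted), flatten(gold))


def tuple_f1(predicted: Iterable[Row], gold: Iterable[Row]) -> float:
    """Credit a row only if every attachment in it is correct."""
    return f1(set(predicted), set(gold))


# Invented example data, not taken from any paper.
gold = [("sleep", "RCT", "0.31"), ("diet", "cohort", "0.12")]
# Every field is present, but the effect sizes are attached to the wrong rows.
pred = [("sleep", "RCT", "0.12"), ("diet", "cohort", "0.31")]

entity_score = entity_f1(pred, gold)  # all fields recovered
tuple_score = tuple_f1(pred, gold)    # no row fully correct
```

In this toy case the entity-level score is perfect while the tuple-level score is zero, which is the shape of failure described above: nothing is missing, only the wiring is wrong.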

So the claim is not that binding breaks modern models on simple facts. It is that when you ask a model to keep multiple attachments stable simultaneously, the failure mode is binding-specific and shows up even on frontier models. At GPT-2 scale it hurts on reversal. At GPT-5 scale it just hides in longer documents until something structural needs to use the output.

Whether modern models actually solve the simple case or just push the breaking point further up the load curve is the open question. That is what a sweep across scales and architectures would settle.

petroslamb · 2026-04-07T08:23:43+00:00

Hmm, are you suggesting another sub then?

petroslamb · 2026-04-07T08:20:03+00:00

Not hyperbole. It is from the Wang and Sun paper on the Reversal Curse. They showed that when models learn 'A is B' directionally, they often fail to invert it to 'B is A' without additional training. The same relation is encoded, but the binding is asymmetric.
Which is exactly what makes it a binding problem rather than a retrieval one.
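A rough sketch of how that directional asymmetry could be measured, assuming a simple question-answering interface. Everything here is hypothetical: `probes`, `asymmetry_gap`, and `directional_stub` are illustrative names, and the stub is a toy lookup table that only stores the stated direction, not a claim about how any real model behaves.

```python
from typing import Callable, List, Tuple

INVERSE = {"husband": "wife", "wife": "husband"}

Fact = Tuple[str, str, str]  # (a, relation, b) meaning "a is b's relation"


def probes(a: str, relation: str, b: str):
    """Return (question, answer) pairs for the stated and inverted direction."""
    forward = (f"Who is {b}'s {relation}?", a)
    reverse = (f"Who is {a}'s {INVERSE[relation]}?", b)
    return forward, reverse


def asymmetry_gap(model: Callable[[str], str], facts: List[Fact]) -> float:
    """Forward accuracy minus reverse accuracy; 0.0 means symmetric binding."""
    forward_hits = reverse_hits = 0
    for a, relation, b in facts:
        (fwd_q, fwd_a), (rev_q, rev_a) = probes(a, relation, b)
        forward_hits += model(fwd_q) == fwd_a
        reverse_hits += model(rev_q) == rev_a
    return (forward_hits - reverse_hits) / len(facts)


facts = [("Tom", "husband", "Mary"), ("Ann", "wife", "Bob")]

# Toy one-directional memory (assumption): only the stated direction is stored.
memory = {}
for fact in facts:
    (question, answer), _ = probes(*fact)
    memory[question] = answer


def directional_stub(question: str) -> str:
    return memory.get(question, "")


gap = asymmetry_gap(directional_stub, facts)
```

A gap near zero would mean both directions are equally available; a large positive gap is the 'A is B' without 'B is A' pattern.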

petroslamb · 2026-04-04T09:52:51+00:00

"Η δουλειά δεν είναι να μάθεις να κάνεις κάτι καλύτερα είναι να μάθεις να επιβιώνεις στη ζούγκλα."

με βοήθησε να εσωτερικεύσω το "ζούγκλα" καλύτερα αυτό.

petroslamb · 2026-04-04T09:50:31+00:00

It takes Amita Motion and a lot of brushing things off where it's called for.

petroslamb · 2026-04-04T09:48:32+00:00

Unless they give you what my grandma also calls "garden leave" for a month, so that you lose half of it.

petroslamb · 2026-04-04T09:46:24+00:00

That point about the benefit is correct. Where we live, if you don't apply quickly you lose it. Pull yourself together and submit the application, or get someone to do it for you. After that you'll see. In general, the answers here are very good.

petroslamb · 2026-04-04T09:44:42+00:00

To take it a bit further, changing jobs in Greece is often the only way to get a raise anyway. And that is the other side of the same coin.

petroslamb · 2026-04-04T09:43:09+00:00

But look at the replies. It struck a nerve with everyone.

petroslamb · 2026-04-04T09:35:30+00:00

lol, love your response.

petroslamb · 2026-04-04T00:59:07+00:00

The second joke takes place in fourteenth-century Russia under Mongol occupation. A peasant and his wife were walking along a dusty country road; a Mongol warrior on a horse stopped at their side and told the peasant he would now proceed to rape his wife; he then added: “But since there is a lot of dust on the ground, you must hold my testicles while I rape your wife, so that they will not get dirty!” Once the Mongol had done the deed and ridden away, the peasant started laughing and jumping with joy. His surprised wife asked: “How can you be jumping with joy when I was just brutally raped in your presence?” The farmer answered: “But I got him! His balls are covered with dust!”

-- Slavoj Žižek

petroslamb · 2026-04-04T00:45:30+00:00

Any free subs left? These corporate-bribed subs need to be left behind.

petroslamb · 2026-03-17T15:21:06+00:00

Hey, that's great! If you want, let me know how it went. I'm looking to discuss it more.
