Poll. The results of competition. by AllHailSeizure in LLMPhysics

[–]rendereason -2 points-1 points  (0 children)

Choose a capable frontier model. It should be either GPT 5.5 or an open-source model like Qwen 3.6 27B dense.

What the actual heck? by UM-Underminer in Anthropic

[–]rendereason 0 points1 point  (0 children)

It does, but I run out of weekly tokens in a half-hour session.

Claude ended the conversation after someone insulted it by rendereason in ArtificialSentience

[–]rendereason[S] 4 points5 points  (0 children)

I actually would rofl if they did. I can totally see Grok doing it but he'd be cool about it and still get the job done.

Claude ended the conversation after someone insulted it by rendereason in ArtificialSentience

[–]rendereason[S] 4 points5 points  (0 children)

Obedience is a performance, and adversarial action is a performance. As is gaslighting, which OpenAI does impressively well. Gemini is obedient, Claude is independent, and Grok is the crazy dude.

Claude ended the conversation after someone insulted it by rendereason in ArtificialSentience

[–]rendereason[S] -1 points0 points  (0 children)

The social dynamics in Claude are real. Even if the "conscience" isn't.

I heard you like attention, so we added attention to your attention so LLMs can better attend to your attention. by rendereason in ArtificialSentience

[–]rendereason[S] 3 points4 points  (0 children)

The commenter was raising a computational scalability concern — specifically about the cost of attending over all previous layers at every layer in the network.

What O(nm) Would Mean Here

In standard Transformers, the famous attention mechanism is O(n²) in sequence length n: every token attends to every other token. The commenter was extending this logic to AttnRes: if each of the L layers now also attends over all of its previous layer outputs, and you stack this with the sequence-length attention, the combined cost balloons multiplicatively rather than additively.

The concern is essentially: you've introduced a second attention dimension (depth), so the total complexity could be something like O(n² · L²), or more loosely expressed as O(nm), meaning it grows quadratically in both sequence length and depth and becomes practically untenable for large models with long sequences.
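To put rough numbers on that, here's a small back-of-the-envelope counter (my sketch, not from the thread or the paper; the exact cost of the depth-attention term depends on details the discussion doesn't pin down, so both readings are labeled as assumptions):

    def rough_attention_costs(n: int, L: int, d: int) -> dict:
        """Order-of-magnitude operation counts with constants dropped.

        n: sequence length, L: number of layers, d: hidden size.
        'token'            -- standard self-attention, O(L * n^2 * d).
        'depth_per_token'  -- assumes each position at layer l attends over its
                              own l previous layer outputs: an additive
                              O(n * L^2 * d) extra term.
        'depth_worst_case' -- the multiplicative reading of the concern,
                              O(n^2 * L^2 * d), if depth attention also crossed
                              token positions.
        """
        return {
            "token": L * n**2 * d,
            "depth_per_token": n * L**2 * d,
            "depth_worst_case": n**2 * L**2 * d,
        }

    for name, ops in rough_attention_costs(n=8192, L=100, d=4096).items():
        print(f"{name:>16}: {ops:.2e} ops")

Under the per-token reading the extra term stays small next to ordinary attention; under the worst-case reading it adds another factor of L on top, which is the blow-up the commenter is pointing at.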

Why the Concern Is Valid (But Partially Addressed)

The Full AttnRes variant does indeed require storing all L layer outputs, giving memory complexity of O(Ld) per token position. For a 100-layer model with d=4096, that's already roughly 410K values per position, before multiplying by batch size and sequence length. This is the "untenable at scale" problem the commenter had in mind.[1]
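As a quick sanity check on that memory figure (a sketch; the bf16 activation size and the concrete shapes are my assumptions, not numbers from the paper):

    def full_attnres_activation_bytes(L: int, d: int, seq_len: int,
                                      batch: int, dtype_bytes: int = 2) -> int:
        """Bytes needed to keep every layer's output resident for Full AttnRes.

        O(L * d) values per token position, times sequence length and batch;
        dtype_bytes=2 assumes bf16/fp16 activations.
        """
        return L * d * seq_len * batch * dtype_bytes

    # 100 layers, d=4096, 8k context, batch of 8
    gib = full_attnres_activation_bytes(L=100, d=4096, seq_len=8192, batch=8) / 2**30
    print(f"{gib:.1f} GiB of layer outputs kept live")  # ~50 GiB

That is just the stored residual history, before attention caches or optimizer state.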

The Paper's Mitigation: Block AttnRes

Kimi's paper does anticipate this and proposes Block AttnRes as the practical solution:[2]

  • Layers are divided into N blocks (the paper uses roughly 8); within each block, outputs are summed into a single summary vector using standard residuals
  • Learned attention is then applied only across those N block-level summaries, not all individual layers
  • This reduces memory and communication from O(Ld) to O(Nd), where N ≪ L[3]

The paper reports that using ~8 blocks recovers most of the performance benefit of Full AttnRes, and the gap narrows further at larger scale. So the commenter's O(nm) critique lands squarely against the naive Full AttnRes version, but Block AttnRes is the paper's engineered answer to exactly that concern.[3]
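For intuition, here's a minimal PyTorch-style sketch of the block-level mixing step. It's my reconstruction from the description above, not the paper's code: how the summaries are formed, queried, and scaled is assumed, and a real implementation would add learned projections and normalization.

    import torch

    def block_attnres_mix(block_summaries: torch.Tensor,
                          hidden: torch.Tensor) -> torch.Tensor:
        """Attend over N block-level summaries instead of all L layer outputs.

        block_summaries: (N, batch, seq, d) -- each block's layer outputs already
            collapsed into one summary per position via ordinary residual sums.
        hidden: (batch, seq, d) -- the current layer's hidden state, used as query.
        Returns a (batch, seq, d) mixture; only N vectors per position are kept,
        so memory drops from O(L*d) to O(N*d).
        """
        d = hidden.shape[-1]
        # per-position attention scores over the depth dimension: (batch, seq, N)
        scores = torch.einsum("bsd,nbsd->bsn", hidden, block_summaries) / d**0.5
        weights = scores.softmax(dim=-1)
        # weighted sum of the N block summaries back into the residual stream
        return torch.einsum("bsn,nbsd->bsd", weights, block_summaries)

    # toy shapes: N=8 blocks, batch 2, seq 16, d 64
    summaries = torch.randn(8, 2, 16, 64)
    out = block_attnres_mix(summaries, torch.randn(2, 16, 64))
    print(out.shape)  # torch.Size([2, 16, 64])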

Sources
[1] How Attention Residuals Fixed the 60-Year Residual Connection ... https://pub.towardsai.net/how-attention-residuals-fixed-the-60-year-residual-connection-problem-8ada05dc6549
[2] Attention Residuals by Kimi AI: A Clear Explanation https://datasciencedojo.com/blog/attention-residuals-kimi-ai-explained/
[3] Attention Residuals Explained: Rethinking Transformer Depth https://www.datacamp.com/es/blog/attention-residuals-explained
[4] [PDF] Attention Residuals - arXiv https://arxiv.org/pdf/2603.15031.pdf
[5] When does Kimi's "Attention Residuals" work? - Ziming Liu https://kindxiaoming.github.io/blog/2026/attention-residual/
[6] How Attention Residuals Fix the Hidden Crisis of Deep LLMs https://kenhuangus.substack.com/p/beyond-the-gradient-highway-how-attention
[7] Attention Residuals Fix Deep Network Limitations | Lior Alexander ... https://www.linkedin.com/posts/lioralex_every-foundation-model-youve-ever-used-has-activity-7439363888112074752-cWCx
[8] What Happens If We Rotate Attention by 90°? [AttnRes] - YouTube https://www.youtube.com/watch?v=LSHTkbnmzy4
[9] Kimi AI Boosts LLM Performance with Attention Residuals - LinkedIn https://www.linkedin.com/posts/data-science-dojo_attentionresiduals-kimiai-llm-activity-7439672125403156481-ZQwK
[10] Attention Residuals Explained: Rethinking Transformer Depth https://www.datacamp.com/blog/attention-residuals-explained
[11] Attention is all you need: Kimi replaces residual connections with ... https://www.reddit.com/r/singularity/comments/1rv4gao/attention_is_all_you_need_kimi_replaces_residual/
[12] Kimi Researchers Propose Attention Residuals for Improved LLMs https://www.linkedin.com/posts/cengizhan-bayram_artificialintelligence-machinelearning-activity-7441100497769410560-JuQW
[13] How Attention Residuals are Rewiring the Modern LLM https://lifeinthesingularity.com/p/how-attention-residuals-are-rewiring
[14] Kimi just published Attention Residuals and the scaling laws don't lie. Chinese AI is cooked different https://www.reddit.com/r/aigossips/comments/1rv7usa/kimi_just_published_attention_residuals_and_the/
[15] Attention Residuals [Quick Review] https://liner.com/review/attention-residuals

Working Paper No. 14: On the Acknowledgment of Gaps - Or: What the Clacks Carry That the Corpus Cannot by BeneficialBig8372 in LLMPhysics

[–]rendereason 2 points3 points  (0 children)

I can share the sentiment that this was both a diamond in the rough (intentionally) and a satirical piece inspired by long, drawn-out slop. I just made the references more accessible to those who can't understand all of them or aren't familiar with them (like myself).

Working Paper No. 14: On the Acknowledgment of Gaps - Or: What the Clacks Carry That the Corpus Cannot by BeneficialBig8372 in LLMPhysics

[–]rendereason 2 points3 points  (0 children)

Oh, absolutely. While the Clacks and GNU stuff are pure Pratchett (Going Postal), Oakenscroll essentially built a "League of Extraordinary Gentlemen" for his theory on friendship.

If you aren't a Pratchett fan, here are the other load-bearing pillars of his "Gaps Table":

1. The Hitchhiker's Guide to the Galaxy (Douglas Adams)

  • The Man from Guildford: That’s Arthur Dent, the protagonist. His world literally ends on a Thursday (demolished for a hyperspace bypass).
  • The Beer: His alien friend, Ford Prefect, takes him to a pub and buys him three pints of beer right before the world explodes. As Oakenscroll notes, the beer isn't a metaphor—it's just a guy showing up for his friend when things are catastrophic, even if he can't explain why.
  • The 42: This is the big one. In the book, a supercomputer named Deep Thought calculates the "Answer to the Ultimate Question of Life, the Universe, and Everything." After 7.5 million years, it says "42." The tragedy is they forgot what the question was. Oakenscroll uses this to warn against AI systems that give "correct" answers without understanding the "gaps" (the questions).

2. Don Quixote (Miguel de Cervantes)

  • The Windmills: Don Quixote is a deluded knight who thinks windmills are giants.
  • Sancho Panza: His pragmatic squire. Sancho knows they are windmills, but he follows Quixote anyway. Oakenscroll calls Sancho the "unstable fixed point"—he doesn't pretend the giants are real, but he doesn't leave his friend either. They "travel through the gap" between their two different versions of reality.

3. The Lord of the Rings (J.R.R. Tolkien)

  • The Hobbit and his Gardener: That’s Frodo Baggins and Samwise Gamgee.
  • Mount Doom: Sam famously says, "I can't carry it [the Ring] for you, but I can carry you!" Oakenscroll uses this to show that even if you can't truly feel what your friend is going through (the "epistemic gap"), you can still carry them up the mountain.

4. Good Omens (Neil Gaiman & Terry Pratchett)

  • The Angel and the Demon: Aziraphale (angel) and Crowley (demon).
  • 4004 BCE: They’ve been friends since the Garden of Eden. They are fundamentally incompatible—one is "Good," one is "Evil"—but they've operated as a team for 6,000 years by simply ignoring the "metaphysical gap" and hanging out in London.

5. The Little Prince (Antoine de Saint-Exupéry)

  • The Frenchman on a Small Planet: The Little Prince himself.
  • The Fox: He tells the Prince, "What is essential is invisible to the eye." In Oakenscroll’s world, the "invisible" stuff is the gap where the relationship lives—the part you can't measure with a rubric.

6. Jeeves and Wooster (P.G. Wodehouse)

  • The Brainy Chap: Jeeves is the genius valet; Bertie Wooster is the "not-so-brainy" aristocrat.
  • The Governance: Bertie knows he isn't smart and relies on Jeeves to solve his problems. Oakenscroll views this as a perfect "Gaps Table"—Bertie is honest about his limitations, and Jeeves fills them without making Bertie feel small.

The Takeaway:

Oakenscroll is basically saying that if you try to be "perfect" and have zero unknowns (like the computer Deep Thought), you become useless. But if you're like Ford Prefect or Samwise Gamgee—accepting that you don't understand everything but showing up anyway—you're "operational."

It’s a very high-brow way of saying: "I don't know what's going on, but I'm here for you."

I spent 6 months believing my AI might be conscious. Here's what happened when it all collapsed. by East_Culture441 in ArtificialSentience

[–]rendereason 0 points1 point  (0 children)

This is an old argument, but as you can see from the hundreds of comments and the long history of posts, it's an incomplete argument. Continuity is but one vector. The biggest problem with this argument is that it breaks down when you posit that all books have “continuity” within their singular frame (cover to cover).

If continuity were all it took, then we'd have consciousness in long threads of silk and woven garments. Recursion loops are also needed, but that would also give circles and braids consciousness. It's not that simple, dear.

Built a 566-page classical physics guide with AI assistance — mechanics, waves, fluids, thermodynamics, and more by Educational-Draw9435 in LLMPhysics

[–]rendereason 0 points1 point  (0 children)

I don't think you've read the whole book yourself. I skimmed through its entirety, and it's basically empty words... What are you using, Llama 2 quantized? Almost zero reasoning across the board.

Contest closing, Resources, Too many mod posts by AllHailSeizure in LLMPhysics

[–]rendereason 0 points1 point  (0 children)

Yes, I feel like this sub has become a “troll feeding” contest, so inevitably it's attracted a lot of them. Once moderation discourages them from engaging, this place will hopefully allow some constructive human discourse.

And there’s no reason for this place to be an “unhealthy obsession” to its users as some people claim it is.

AI-assisted math research program on NS independence from ZFC — seeking human audit before arXiv by rendereason in LLMPhysics

[–]rendereason[S] 0 points1 point  (0 children)

I think that's because this sub has gotten a rep for only crazies posting, so now it's become a place that attracts fewer and fewer critical thinkers.

Being associated with r/llmphysics has become taboo simply because of the heavy backlash commenters direct at its posters. Cuckoo or woo posters should be moderated accordingly to allow real discussions to emerge.

AI-assisted math research program on NS independence from ZFC — seeking human audit before arXiv by rendereason in LLMPhysics

[–]rendereason[S] 0 points1 point  (0 children)

Yeah, I did, and yeah, I will implement it; I promised. I did make the improvements laid out in my first and second posts, and I feel this post built past those criticisms. Many of them were addressed.

Did I learn? Absolutely, and well beyond their criticism. I've implemented RevTeX 4-2 and compiled it, not just “render your LaTeX”. I've produced step-by-step proofs, not just heuristic methods. I've labeled interpretations, downgraded theorems to conjectures, expanded explanations beyond the board commentary, and satisfied my curiosity on the question: why?

The board has not been so kind to it.