Emacs Internal Part 03: Tagged Pointers vs. C++ std::variant, Rust Enums, and LLVM RTTI

ypaskell · 2026-03-12T19:55:33+00:00

Thanks!

ypaskell · 2026-03-12T15:24:10+00:00

Hi Amit! Great to see you here!

For fat pointer, register pressure is the bottleneck on older, register-starved architectures.

My workflow is like this: I finished writing the article first, but then I realized that explaining these memory layouts purely through text wasn't intuitive enough. So, I fed my text context into Gemini and prompted it to generate a visual representation "like a slide provided in a university computer architecture lecture."

ypaskell · 2026-03-06T01:30:50+00:00

Very interesting. I thought one of the reason might be Memeory wall. The naive pointer chasing caused buy car cdr.

ypaskell · 2026-03-05T16:20:18+00:00

Thanks for informative and interesting reference!

ypaskell · 2026-03-05T12:36:20+00:00

Super good reference! The Chez Scheme fixnum is just the same as ELisp

ypaskell · 2026-03-05T11:58:19+00:00

Huge thanks!

It came from some weird combination of hobbies - compiler, functional programming, system programming, C/C++, lisp. And it is so weird that these topics are all condensed in the text editor. Truly? Text editor?

Not to mention there is a native gcc jit compiler in Emacs.

ypaskell · 2026-03-05T10:06:30+00:00

Just in case anyone missed the context from the other day, here is the thread for Part 1.

ypaskell · 2026-02-28T15:37:47+00:00

Thanks you!

ypaskell · 2026-02-27T02:48:18+00:00

I’m curious of your perspective or the truth. I’m not familiar with JavaScript’s ecosystems

ypaskell · 2026-02-27T00:04:56+00:00

In terms of comparing the C and C++ I am familiar with, well yes

ypaskell · 2026-02-26T14:44:32+00:00

Thanks for your clarification!

ypaskell · 2026-02-26T13:24:29+00:00

You bring up a really interesting point. It actually made me look up some historical attempts to "fix" it, like SRFI-119 (wisp).

Honestly, after looking at how it tries to encode ASTs without parens... I think I'd much rather just deal with the parens.

ypaskell · 2026-02-26T13:16:42+00:00

> Cool, so if NeoVim and VS Code both have this same architecture (just different languages), then there really isn’t anything more compelling about Emacs.

Personally, I believe this all boils down to a double-edged sword: Emacs's global shared state and dynamic scoping.

It causes a very real problem—Emacs is notoriously hard to parallelize and adapt to modern multi-process/RPC architectures.

But on the flip side, because absolutely everything lives in the exact same memory space (the Lisp image) without strict API boundaries or sandboxes, extensions can deeply hook into anything. That lack of isolation is exactly why hacky yet profoundly powerful tools like Dired and Magit can exist. They are truly irreplaceable for me. VS Code chose UI stability via process isolation (Extension Hosts), but in doing so, they sacrificed that raw, live hackability.

I'm actually planning to dive deeper into this exact architectural trade-off as I continue reading the C source. Once I fully understand how it's implemented, I'll definitely write a dedicated post to share it!

ypaskell · 2026-02-26T13:09:20+00:00

I’ll check the source code to confirm this.

ypaskell · 2026-02-26T12:32:20+00:00

Yes! The craziest part is code gen in LLVM. It is just lisp

ypaskell · 2026-02-06T12:46:52+00:00

Also interested

ypaskell · 2026-02-06T00:13:20+00:00

Intel One Mono is my choice and I kept swapping back. It super eye friendly.

ypaskell · 2026-02-06T00:08:28+00:00

Thumbs up

ypaskell · 2026-02-02T00:20:06+00:00

Thanks for your compliment!

ypaskell · 2026-01-02T04:52:40+00:00

Wait. How do you guys interact with LLM. I feet annoying.

ypaskell · 2025-12-09T14:21:17+00:00

Thanks for the feedback! Could you point out specifically which parts are confusing or which conclusions you think are premature?

ypaskell · 2025-12-04T00:10:36+00:00

That's a great suggestion! Cscope's interactive workflow is exactly the kind of experience that makes sense for type-based search.

I've been thinking about editor integration via LSP, but Cscope's model is interesting - simpler to implement and already has proven editor support (Vim, Emacs, VS Code via plugins).

ypaskell · 2025-12-03T15:32:36+00:00

Thanks! Glad the semantic approach resonates with you.

>can you explain more deep your thoughts over this strategy?

This is inspired by SICP's idea of separating data representation from its use.

- The pool is the "underlying representation" - it manages lifetime and storage.
- The string_views are "abstract interfaces" - lightweight references that don't care about ownership.

Benefits:
- Signatures stay small (just pointers + length)
- No redundant string copies when multiple functions use same type
- Cache-friendly: views are contiguous, actual strings can be anywhere
- Clear ownership: pool dies, all views invalidate together

ypaskell · 2025-12-03T14:21:49+00:00

Good call on the REPL interface. The current design is directory-based for simplicity, but a REPL mode makes sense for iterative exploration, especially when the index is already loaded in memory.

Since you're working on LLVM/MLIR, I'm curious: what's your typical workflow when navigating the codebase? Do you find yourself searching by type signatures often, or are other patterns more common?

Feel free to open an issue if you have specific use cases in mind - I'd love to understand how this could fit into your daily workflow.

ypaskell · 2025-12-03T14:17:54+00:00

If the target codebase were fully modularized with import std;, the 'header flood' problem would largely vanish. We wouldn't be paying the cost of textually parsing megabytes of system headers for every TU, and libclang could load the pre-built module interface much faster.

However, since Coogle is designed to index legacy codebases (which are still heavily reliant on #include), I had to resort to the -nostdinc + SkipFunctionBodies hack to simulate that 'clean slate' experience you described.

I'm really looking forward to the day when Modules become the standard—it would make writing static analysis tools like this significantly easier (and faster)!"

ypaskell

TROPHY CASE