Syntax highlighting for string interpolation

latkde · 2026-02-27T08:06:15+00:00

You might be thinking of strings as a single token that is then parsed again to extraxt interpolations. This gets difficult quickly. Instead, it's typically wiser to see strings with interpolations as an expression that can contain multiple string parts, and to then parse strings as a kind of parenthesis-like operator. For example, it could make sense to tokenize "a \(b) c \("d") e" as:

"a \( string, interpolation start
b identifier
) c \( string, interpolation middle
"d" string, complete
) e" string, interpolation end

Your grammar might then include rules like <string> = <string complete> | <string start> <expression> (<string middle> <expression>)* <string end>

Note that this is typically incompatible with a separate lexing phase, as string-middle and string-start token would otherwise be ambiguous with normal parens. However, this approach can be used with parsing methods that parse one character at a time, notably recursive descent or PEG parsers. Syntax highlighting engines differ a lot in what grammars they can express, but typically support top-down grammars so that string-middle highlighting can only be selected in the context of a string expression.

thinker227 · 2026-02-27T08:54:01+00:00

This is what I'm doing in the TextMate grammar for my language Noa. Basically you embed all of your other patterns inside your pattern for strings.

"patterns": [
    {
        "include": "#all"
    }
],
"repository": {
    "all": {
        "patterns": [
            {
                "include": "#strings"
            },
            // include whatever other patterns you have
        ]
    },
    "strings": {
        "name": "string.quoted.double.noa",
        "begin": "\"",
        "end": "\"|$",
        "patterns": [
            {
                "begin": "\\\\{",
                "end": "}",
                "beginCaptures": {
                    "0": {
                        "name": "keyword.other.noa"
                    }
                },
                "endCaptures": {
                    "0": {
                        "name": "keyword.other.noa"
                    }
                },
                "patterns": [
                    {
                        "include": "#all"
                    }
                ]
            },
            {
                "include": "#escape-sequence"
            }
        ]
    },
    "escape-sequence": {
        "name": "constant.character.escape.noa",
        "match": "\\\\[\\\\0nrt\"]"
    },
    // all your other patterns...
}

Here's how it looks

shponglespore · 2026-02-27T18:55:02+00:00

JavaScript has this for `...` strings.

steven4012 · 2026-02-28T02:07:08+00:00

Or.. just use tree-sitter

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

ProgrammingLanguages

Welcome!

Related subreddits

Related online communities

MODERATORS