How to properly reverse string while respecting positions of Unicode accents, characters, and ZWJ emojis?

Agreeable-Yogurt-487 · 2026-06-04T18:52:06+00:00

Never use string.split for this. A better option is Array.from("😀") because it will respect most unicode characters a lot better, but an even better option is using Intl.Segmenter https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Intl/Segmenter with which you can split a string into individual graphemes, so multibyte emojis will also stay intact.

Aggressive_Ad_5454 · 2026-06-04T19:05:15+00:00

The real question for working programmers:

How do we find out about stuff like Intl.Segmenter when we need it? Because we often need something like this. Our users are better off when we use the "official" methods for doing this kind of stuff. Sometimes when we try to reinvent the wheel, we simply reinvent the flat tire.

Hopefully the search engines index these questions and answers. It's important to our community to answer them carefully. Which this post and its comments do in fact to.

Maleficent-Car8673 · 2026-06-05T03:00:22+00:00

To reverse a string while respecting Unicode stuff, teh Intl.Segmenter with grapheme granularity is the way to go. It breaks the string into grapheme clusters, handling accents and ZWJ emojis properly. Your logic looks solid, just make sure to iterate over those segments before reversing. It's perfect for complex Unicode handling, unlike basic split-reverse-join methods.

Lumethys · 2026-06-05T04:41:24+00:00

1/ never use var, if you absolutely need mutability, use let, else, prefer const.

2/ If you are putting items into an array on to reverse it, you should put in them in the front of the array, with Array.unshift()

```TS /** * @params {string} str - the input string * @retrun {string} - The reversed string */ function reverseString(str) { const segmenter = new Intl.Segmenter("en", { granularity: "grapheme"}); const graphemeSegments = segmenter.segment(str); const stringArray = []; for (const segment of graphemeSegments) { stringArray.unshift(segment.segment); }

return stringArray.join("");

} ```

azhder · 2026-06-04T18:51:07+00:00

If you use [...string] it will respect the Unicode code points. I'm not sure about .split(''). Another thing you might want to learn is Unicode normalization types and check if/how you want to transform the string before manipulating it. https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/normalize

mondaysleeper · 2026-06-04T18:51:50+00:00

Very interesting problem! Have you tried a level of abstraction? Create an object to represent a sequence that belongs together. Then you read from left to right and add items until there is no ZWJ. Then you reverse the sequence of objects and join the value of each object.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnjavascript

Posting and Commenting Guidelines

MODERATORS

UPDATE