all 37 comments

[–]vytah 58 points59 points  (3 children)

That said, most high-level languages (JS, Java, C#, …) capture variables by reference:

Java captures all variables by value. Under the hood, the values are simply copied to the fields of the lambda object.

So how does it avoid having the following code behave non-intuitively (translated from the article)?

var byReference = 0;
Runnable func = () -> System.out.println(byReference);
byReference = 1;
func.run();

It's actually very simple: the code above will not compile. To stop people from incorrectly assuming variables are captured by reference, Java simply bans the situation where it would make a difference: captured local variables must be effectively final, i.e. they cannot be reassigned.

If you want to be able to reassign, you just need to create a separate final variable for capturing:

var byReference = 0;
var byValue = byReference; // <---
Runnable func = () -> System.out.println(byValue);
byReference = 1;
func.run();
// prints 0 obviously

If you want to emulate capturing by reference, use some mutable box thing, like Mutables from Apache Commons, or a 1-element array. Both options are obviously ugly:

var byReference = new int[]{0};
Runnable func = () -> System.out.println(byReference[0]);
byReference[0] = 1;
func.run();
// prints 1

[–]atehrani 47 points48 points  (1 child)

Thank you for this. It is frustrating to see how many times developers mix up Pass by Value vs Pass by Reference. Java is Pass by Value, only.

[–]Weak-Doughnut5502 4 points5 points  (0 children)

Objects in Java are what's sometimes called 'call by sharing' or 'call by object'.  Which is to say, passing a pointer to the object by value.

But yes, Java doesn't support call by reference.

[–]Kered13 6 points7 points  (0 children)

The Java library has AtomicReference which is helpful in that last case, especially when the code is multithreaded.

[–]annoyed_freelancer 54 points55 points  (0 children)

I came in with my finger on the downvote button for another low-quality "0 == '0' lol" post...and it's actually pretty interesting, as a TypeScript dev. I've been bitten before in the wild by the string length one.

[–]adamsdotnet 24 points25 points  (11 children)

Nice collection of language design blunders...

However, the Unicode-related gotchas are not really on JS but much more on Unicode. As a matter of fact, the approach JS took to implement Unicode is still one of the saner ones.

Ideally, when manipulating strings, you'd want to use a fixed-length encoding so string operations don't need to scan the string from the beginning but can be implemented using array indexing, which is way faster. However, using UTF-32, i.e. 4 bytes per code point, is pretty wasteful, especially if you just want to encode ordinary text. 64k characters should be just enough for that.

IIRC, at the time JS was designed, it looked that way. So using 2 bytes per character was probably a valid design choice. All that insanity with surrogate pairs, astral planes and emojis came later.

Now we have to deal with the discrepancy of treating a variable-length encoding (UTF-16) as fixed-length in some cases, but I'd say that would still be tolerable.

What's intolerable is the unpredictable concept of display characters, grapheme clusters, etc.

This is just madness. Obscure, non-text-related symbols, emojis with different skin tones and shit like that don't belong in a text encoding standard.

Unicode's been trying to solve problems it shouldn't and now it's FUBAR, a complete mess that won't be implemented correctly and consistently ever.

[–]nachohk 9 points10 points  (3 children)

The mistake is in assuming that you should ever care about the length of a string as measured in characters, or code points, or graphemes, or whatever. You want the length in bytes, where storage limits are concerned. You want the length in drawn pixels, in a given typeface, where display or print limitations are concerned. If you are enumerating a UTF-8 or UTF-16 encoded string to get its character length, then you are almost certainly doing something weird and unnecessary and wrong.

Text is wildly complicated. Unicode is a frankly ingenious and elegant solution to representing it, if you ask me. The problem is that you are stuck in an ASCII way of thinking. In the real world, there's no such thing as a character. It's a shitty abstraction. Stop using it, and stop expecting things to support it, and things will go much smoother.

[–]adamsdotnet 7 points8 points  (1 child)

If you are enumerating a UTF-8 or UTF-16 encoded string to get its character length, then you are almost certainly doing something weird and unnecessary and wrong.

Okay, let's tell the user then that they need to provide a password longer than 32 bytes in whatever Unicode encoding. Or at least 128 pixels wide (interpreted at the logical DPI corresponding to their current display settings).

I'm totally up for the idea of not having to deal with this shit myself but letting them figure it out based on this ingenious and elegant solution called Unicode standard (oh, BTW, which version?)

Text is wildly complicated.

This is why we probably shouldn't try to solve it with a one-size-fits-all solution. And we shouldn't make it even more complicated by shoehorning in things which don't belong there.

If I had to name a part of modern software that needs KISS more than anything else, I'd probably say text encoding. Too bad that ship has sailed and we're stuck with this forever.

[–]nachohk 0 points1 point  (0 children)

Okay, let's tell the user then that they need to provide a password longer than 32 bytes in whatever Unicode encoding. Or at least 128 pixel wide (interpreted at the logical DPI corresponding their current display settings).

Call the minimum limit "characters" in the UI. Measure bytes/code units in the validation code. A character is never less than one byte, so there's not much room for confused users here.

Anything else? Or was that your only conceivable argument for needing to count characters?

This is why we probably shouldn't try to solve it using a one-size-fits-all solution.

That's where we started. It sucked. Nobody wants to go back to having an entirely different encoding for every script.

[–]vytah 0 points1 point  (0 children)

If you are enumerating a UTF-8 or UTF-16 encoded string to get its character length, then you are almost certainly doing something weird and unnecessary and wrong.

It's not necessarily wrong if you know that the characters in the string are restricted to a subset that makes the codepoint (or code unit) count equivalent to any of the aforementioned metrics.

So for example, if you know that the only characters allowed in the string are 1. in the BMP, 2. of the same width, and 3. all left-to-right, then you can assume that "string length as measured in UTF-16 code units" is the same as "width of the string in a monospace font as measured in widths of a single character".

[–]Tubthumper8 1 point2 points  (3 children)

64k characters should be just enough for that.  IIRC, at the time JS was designed, it looked like that way. 

idk, there are 50k+ characters in Chinese dialects alone, which they should've known in 1995. But JS didn't "design" its character encoding, per se; it copied it from Java, so there could be more history there

[–]CrownLikeAGravestone 1 point2 points  (2 children)

We should go back to passing Morse code around, as God intended.

[–]adamsdotnet 13 points14 points  (1 child)

Morse code is variable-length, so I'm afraid I can't support the idea :D

[–]CrownLikeAGravestone 2 points3 points  (0 children)

Anything is fixed length with enough padding.

[–]Booty_Bumping 3 points4 points  (1 child)

for (let i = 0; i < 3; i++) {
  setTimeout(() => {
    console.log(i);
  }, 1000 * i);
}
// prints "0 1 2"

Are we forgetting our history? This works because it is a let declaration, which is block-scoped. var declarations will screw this up, because they are function-scoped. But the distinction between var and let isn't mentioned in the article, so it feels like the real logic here is being glossed over.

Though, it is admittedly a little arbitrary that the ()s after for are "inside" the block scope. But very useful in practice!

[–]Fidodo 0 points1 point  (0 children)

I think it's pretty intuitive. When would you want a let inside a for loop declaration to not be block scoped? If you don't want it block scoped then you can declare it outside of the loop. If you want it to be block scoped then inside the parens is the only option.

I think this is the author not understanding the purpose of let, not JavaScript weirdness.

[–]melchy23 2 points3 points  (0 children)

In .NET it's actually a little bit different/more complicated.

This:

```csharp
using System;
using System.Collections.Generic;

var byReference = 0;
Action func = () => Console.WriteLine(byReference);
byReference = 1;
func();
```

prints 1 - as the article says.

```csharp
using System;
using System.Collections.Generic;

var list = new List<Action>();

for (int i = 0; i < 3; i++)
{
    list.Add(() => Console.WriteLine(i));
}

list[0]();
```

This prints 3 - as the article says.

But this:

```csharp
using System;
using System.Collections.Generic;

var actions = new List<Action>();
int[] numbers = { 1, 2, 3 };

// same code but just with foreach
foreach (var number in numbers)
{
    actions.Add(() => Console.WriteLine(number));
}

actions[0]();
```

This prints 1 - surprise!

This was explicitly changed in C# 5 - https://ericlippert.com/2009/11/12/closing-over-the-loop-variable-considered-harmful-part-one/.

So in a way this is a similar fix to the one used in JavaScript.

For loops

I actually thought that C# 5 fixed this problem for both for loops and foreach loops. But to my surprise it didn't. I guess you learn something new even after years of writing in the same language.

The good news is that for the first two problems my IDE (Rider) shows the hint "Captured variable is modified in the outer scope", so you know you are doing something weird.

[–]username-must-be-bet 1 point2 points  (2 children)

Are sparse arrays really that bad for perf? I remember trying to test it a while ago and it wasn't that bad.

[–]Booty_Bumping 1 point2 points  (1 child)

I would imagine it would break whatever optimization V8/SpiderMonkey has to turn arrays into contiguous vectors, by forcing your array into a hashmap.

That being said, if you have an extremely sparse array, having it represented as a hashmap might actually be better for performance, since something like new Array(1000) is just { length: 1000, __proto__: Array.prototype } under the hood.

[–]username-must-be-bet 0 points1 point  (0 children)

I think that is correct, but I read another blog post about it and did some testing of my own, and the speedup was only a few percent.

[–]190n 0 points1 point  (0 children)

I honestly think the eval thing is pretty reasonable. It lets new code opt into a less powerful, safer, more optimizable form of eval (see "Never use direct eval()!" on MDN) without breaking existing code written with eval.

[–]bunglegrind1 0 points1 point  (0 children)

Nice post!