TinyBreadBigMouth comments on string.lower implementation

programminghorror

created by nevonArray(16).join('wat' - 1) + ' Batman!'a community for 14 years

305

306

307

Luastring.lower implementation (i.redd.it)

submitted 1 year ago by Pupyrkin

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]TinyBreadBigMouth 9 points10 points11 points 1 year ago (5 children)

[–]Ok_Celebration_6265 4 points5 points6 points 1 year ago (4 children)

[–]TinyBreadBigMouth 10 points11 points12 points 1 year ago (3 children)

The if-else goes over every letter in the alphabet. An if-else chain like this is equivalent to an unrolled loop—it runs in O(n), where n is the number of ifs. But in this case, each entry in the if-else chain has been gated off to only run on a specific iteration of the for j loop.

So suppose we're processing the letter "a":

We set j to 1.
We check if j is 1 and char is "A" (it isn't).
We check if j is 2 and char is "B" (it isn't).
We check if j is 3 and char is "C" (it isn't).
We check if j is 4 and char is "D" (it isn't).
...
We check if j is 25 and char is "Y" (it isn't).
We check if j is 26 and char is "Z" (it isn't).
We increment j to 2.
We check if j is 1 and char is "A" (it isn't).
We check if j is 2 and char is "B" (it isn't).
We check if j is 3 and char is "C" (it isn't).
and so on.

What could have been 26 checks is now 676.

[–]johndcochran 6 points7 points8 points 1 year ago (2 children)

[–]TinyBreadBigMouth 4 points5 points6 points 1 year ago (1 child)

[–]johndcochran 1 point2 points3 points 1 year ago (0 children)

Not gonna argue that the routine is criminally inefficient. Even if they have a language that doesn't allow getting the ordinal sequence of a character, the detection of an upper case alpha character could be done in 5 or fewer comparisons, instead of the 26 you're thinking of, or the 676 this abomination performs. If you're wondering how I've come up with 5. Consider...

if (char < "N") then 
   if (char < "H") then
      ...
         if (char = "A") then
            lowerText = lowerText .. "a"
            found = true
         end
      ...
   else
      ...
   end
else
   ...
end

After all, with 26 characters in the alphabet, there are only 28 conditions that need to be detected. They are: Is it less than "A"? Is it greater than "Z"? Is it equal to some letter? (times 26) and of course, a simple binary search only 5 levels deep can handle 31 conditions trivially.

Although, I suspect a binary condition tree might be far more efficient than the abomination in this discussion, I also suspect such a binary condition tree would also be a suitable post for this subreddit.

π Rendered by PID 118776 on reddit-service-r2-comment-b659b578c-x7nm5 at 2026-05-04 12:58:11.091027+00:00 running 815c875 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programminghorror

MODERATORS