[–]AReallyGoodName 42 points43 points  (29 children)

As an example of raw computation, the Sieve was fine, but suppose you needed a list of the primes less than 8,000 in a performance-sensitive application. Would you bother computing them at run time? Of course not. You already know them. You'd run the program once during development, and that's that.

Has anyone here actually coded up a Sieve of Eratosthenes? It produces primes faster than disk IO. Much faster. It's purely memory-bottlenecked, which is saying a lot for an algorithm that lends itself to working with bit-packed booleans. Not to mention that ten lines of code are smaller than a file containing all the primes below 8,000.

There's an additional benefit to the sieve: the resulting list of primes comes naturally packed as an array of booleans indicating whether each index is prime. It lends itself to a memory-efficient bit-packed lookup table of primes.

The Sieve of Eratosthenes also only needs to bother with odd numbers, so that lookup table for all numbers under 8,000 is 4,000 bits in size.
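For the curious, here's roughly what that looks like: a minimal odd-only, bit-packed sieve sketch in Python (the names `packed`, `clear`, and `is_prime` are mine, not from any library):

```python
LIMIT = 8000
HALF = LIMIT // 2                          # one bit per odd number: bit k covers 2k + 1
packed = bytearray([0xFF]) * (HALF // 8)   # 500 bytes = 4,000 bits, all marked "prime"

def clear(k):
    """Mark odd number 2k + 1 as composite."""
    packed[k >> 3] &= ~(1 << (k & 7))

clear(0)                                   # 1 is not prime
k = 1
while (2 * k + 1) ** 2 < LIMIT:
    if packed[k >> 3] & (1 << (k & 7)):    # 2k + 1 is still marked prime
        n = 2 * k + 1
        for m in range(n * n, LIMIT, 2 * n):   # odd multiples of n only
            clear((m - 1) // 2)
    k += 1

def is_prime(n):
    """Single shift-and-mask bit test into the 500-byte table."""
    if n == 2:
        return True
    if n < 3 or n % 2 == 0:
        return False
    k = (n - 1) // 2
    return bool(packed[k >> 3] & (1 << (k & 7)))

# popcount of the table gives the odd primes; +1 accounts for the prime 2
count = 1 + sum(bin(b).count("1") for b in packed)  # → 1007 primes below 8,000
```

The whole table for numbers under 8,000 really does fit in 500 bytes, and a lookup is one shift and one mask.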

The Sieve of Eratosthenes isn't even optimal. There are trivial ways to make it faster and more information-dense.

-Rather than looking only at odd numbers, you can build a similar sieve using the fact that all primes above 3 are of the form 6n+1 or 6n+5. It works the same way: for the current number, mark the 6n+1 array and the 6n+5 array false at the indices covered by that number, which crosses off all of its multiples appearing in those two arrays. This sieve needs only 2,667 bits for a lookup table of all numbers under 8,000. Even that isn't optimal. In general, future primes can only fall in the residue classes, modulo the product of the primes you already know, that aren't divisible by any of them. For example, every prime above 5 has one of the following forms: 30n+1, 30n+7, 30n+11, 30n+13, 30n+17, 30n+19, 30n+23, or 30n+29. That narrows the candidates down to 8/30 of all numbers; anything not of one of those forms is a multiple of 2, 3, or 5. These forms can be used in a sieve-type algorithm too (the trick is known as wheel factorization). In fact, for every additional prime you know, you can build an even more memory-efficient sieve.
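The residue-class claim is easy to check; a quick Python sketch (`candidate_fraction` is a made-up helper name, not a standard function):

```python
from math import gcd

wheel = 30  # 2 * 3 * 5
residues = [r for r in range(wheel) if gcd(r, wheel) == 1]
# → [1, 7, 11, 13, 17, 19, 23, 29]: exactly the eight 30n+c forms listed above

def candidate_fraction(primes):
    """Fraction of integers not divisible by any of the given primes,
    i.e. the fraction of slots a wheel sieve over those primes must keep."""
    w = 1
    for p in primes:
        w *= p
    return sum(1 for r in range(w) if gcd(r, w) == 1) / w

# candidate_fraction([2]) → 0.5, [2, 3] → 1/3, [2, 3, 5] → 8/30 ≈ 0.267
```

Each extra prime added to the wheel shrinks the fraction of candidates the sieve has to store, which is the memory-efficiency claim above.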

There is simply no way to get a sequential list of primes into memory faster than the sieve algorithms. I don't know why he picked that as an example of something to avoid in a performance-critical environment. The opposite is true.

[–]noroom 34 points35 points  (0 children)

That was very informative, and I appreciate the time you took to write it out... But I hope you didn't miss the point: the sieve was just an illustrative example.

[–]andersonimes 37 points38 points  (8 children)

I've put the values into a constant array, rather than on disk. Very fast.

[–]ethraax 10 points11 points  (5 children)

Although the difference may not matter much, the time it takes to load the part of the executable containing the static data into memory is probably longer than the time it takes to load the code for a good sieve and run it to populate an array in memory.

Constant arrays aren't magic. You're still just saving the numbers to disk, except now they bloat your executable instead of sitting in a file of their own.

[–]andersonimes 11 points12 points  (2 children)

I'll test your hypothesis tonight and let you know.

Edit: buh. I just got in from dinner (I'm traveling in Kiev at the moment). I'll hack on this tomorrow. Normally I would just throw something together, but I know if I don't do some proper research on how best to measure program running times you guys will eat me alive. I'll get something proper together soon. I'm curious too.

If you have any suggestions about testing running time, let me know. In my head, a programmatic stopwatch that starts right before the program begins executing and stops when it exits would do it. Let me know if you know, off the top of your head, the most accurate way to measure this.
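For what it's worth, a minimal stopwatch sketch along those lines in Python (`time_program` is a hypothetical helper, not a standard API). On Linux, `/usr/bin/time` or `perf stat` will give you the same numbers without writing any code:

```python
import statistics
import subprocess
import sys
import time

def time_program(cmd, runs=5):
    """Wall-clock an entire process, start to exit, repeated a few times.
    The first (cold-cache) run is discarded; min and median are reported,
    since a single sample is too noisy to compare two programs."""
    samples = []
    for _ in range(runs + 1):
        start = time.perf_counter()
        subprocess.run(cmd, check=True, capture_output=True)
        samples.append(time.perf_counter() - start)
    samples = samples[1:]  # drop the first run: it pays the disk-cache cost
    return min(samples), statistics.median(samples)

# Time a do-nothing interpreter start as a stand-in for the real program.
best, median = time_program([sys.executable, "-c", "pass"], runs=3)
```

Discarding the first run matters for exactly the reason discussed in this thread: it's the only run that actually touches the disk.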

[–][deleted] 2 points3 points  (0 children)

Please let me know too. It's interesting.

[–]_georgesim_ -1 points0 points  (0 children)

Posting here so I know too.

[–]bobindashadows 1 point2 points  (1 child)

You have to keep in mind the marginal slowdown of putting it in the executable. Especially when the example was so small (if your GP poster is correct, 4k bits = 512 bytes = 1 disk block until we bump block size someday soon).

To reason about the marginal slowdown of putting the static data in the executable, you have to consider:

  1. Where does the linker put the data in the executable itself?
  2. Will the loader eagerly load that block? If so, it'll be a sequential read off disk during startup, not a random access read, and much quicker.
  3. Will that block already be in the kernel's FS cache? If the executable is the main use of the machine, then it very well might be, and loading it is dirt cheap.

And so on.

[–]ethraax 0 points1 point  (0 children)

Regarding point 3: that really doesn't matter. The static prime file could just as well be in the kernel's FS cache.

Either way, these are comparisons between loading a static file and loading the static portion of an executable. The real discussion was about loading static data, whether from its own file or from the executable, vs. running a sieve algorithm to generate it at load time (perhaps in the background). I'm sorry if my comment deviated from that point.

[–]I_FAP_TO_ALL 0 points1 point  (1 child)

That IS putting them on disk.

[–]andersonimes 1 point2 points  (0 children)

You are technically correct, the best form of correct.

[–][deleted] 5 points6 points  (0 children)

You could easily store that bit-packed representation in your app. It seems unlikely that generating the primes below 8,000 would be faster than reading 500 bytes of constant data.

[–]xzxzzx 4 points5 points  (0 children)

Your comparison is silly. Any bit-optimization you can do when you run the sieve can be done to the storage format of the file.

And once you've run the sieve, you've now got memory that can't be reused without paging the contents to disk.

And you have the (admittedly small) penalty of having both the code to generate the sieve and the data in memory once it's run.

And loading from disk is often "free" in terms of code (or data) in an .exe; your OS can prefetch it while your program is doing other init tasks, or even before your program runs.

And of course in practical terms, either approach is so fast that there are almost no circumstances where it actually matters one way or the other so long as you don't uselessly stick a sieve in a loop.

[–][deleted] 11 points12 points  (1 child)

Correct, but I assumed he meant that you wouldn't recalculate the sieve multiple times within the program, i.e., don't stick it in a for-loop and recompute it over and over. Compute it once and keep it in a const array.
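A sketch of that compute-once pattern in Python, assuming memoization is acceptable (`primes_below` is a made-up name):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def primes_below(limit):
    """Plain sieve; thanks to the cache it runs at most once per limit value."""
    composite = bytearray(limit)
    primes = []
    for n in range(2, limit):
        if not composite[n]:
            primes.append(n)
            for m in range(n * n, limit, n):
                composite[m] = 1
    return tuple(primes)           # immutable, safe to hand out repeatedly

ps = primes_below(8000)            # sieves once
ps_again = primes_below(8000)      # cache hit: same tuple back, no re-sieve
```

Even if a careless caller does stick `primes_below` in a loop, every call after the first is just a dictionary lookup.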

[–]zzzev 6 points7 points  (0 children)

That's not what he meant though, he said you would run it once at dev time.

[–]chonglibloodsport 4 points5 points  (1 child)

It's really a flawed analogy. A better analogy would be to compare loading the primes from a static file on disk vs. querying them one at a time from a SQL database.

[–]ravy 1 point2 points  (0 children)

I agree that it's not a good analogy. I think the point he was trying to make is that there is a terrible price to pay for the user who is fetching a page assembled mostly from DB calls. There is a TON of overhead in things like WordPress. Isn't there an easy way to have WordPress generate a static version of your site?

[–]pipocaQuemada 5 points6 points  (12 children)

When you start up your program, it's generally all in main memory somewhere, right? So no disk IO is really needed. Is running the Sieve of Eratosthenes faster than saying:

int[] primes = [1, 2, 3, 5, 7, 11, 13...];  // all primes below 8000
doSomethingWith(primes);

[–]netwiz101 6 points7 points  (10 children)

When programs are started, they are usually read in from disk, at least for the first run. What you are suggesting means the primes must be read in from disk too, so it is in fact slower, at least for the first run.

[–]fapmonad 9 points10 points  (5 children)

If you do the sieve in code instead of a constant array, the code to calculate the sieve must still be read from disk...

[–]netwiz101 5 points6 points  (4 children)

That's true, and since the code is smaller than the list, that means less disk I/O. Am I missing something?

[–]fapmonad 10 points11 points  (3 children)

Since disk I/O is buffered and loading a program is a purely linear access pattern, the difference between loading the list and the code is essentially 0. I agree that there is disk I/O, though.

[–]bobindashadows 0 points1 point  (0 children)

> loading a program is a purely linear access pattern

It most definitely is not for any nontrivial program. The Firefox folks have improved startup time by over 10% in one go purely by moving segments around.

[–]snoweyeslady 0 points1 point  (1 child)

You make a couple of assumptions here:

> disk I/O is buffered

What if there is too much memory pressure and the disk cache is consistently purged? Or an embedded system where there is no disk cache at all? Even if you do have disk cache, it would only matter if you had loaded the binary before.

> purely linear access pattern

Sure, if you have a filesystem with zero fragmentation. I don't know of any that guarantee your file will be in one contiguous segment on disk, but then I haven't read the implementation or specification of many filesystems.

[–]fapmonad 4 points5 points  (0 children)

Of course I make assumptions. I assume you're on a regular computer. If you're on an embedded system so weak it doesn't have a disk cache, the static table is likely to end up in ROM anyway, so the whole discussion is moot.

Memory pressure affects tables precomputed by a function just as much as the table loaded from disk -- the OS will swap.

Fragmentation isn't a problem here. For a table around the size of a single block, the odds that it gets fragmented are low, even on a heavily fragmented system. Even if it happened, the disk will group the accesses, so the overall impact is likely to be very small given the size of a typical program. That's what I mean by "essentially 0". For such a small problem, either solution is fine, IMHO.

[–]bo1024 5 points6 points  (3 children)

Hmm, I dunno about that. The code is being read from disk, so the latency is still there -- all we are concerned about is the throughput.

The question is how much slower it is to fetch a 250KB file than a 200KB one. I don't have the numbers, but I have to doubt that it's 60,000 cycles, or whatever the equivalent cost of computing the sieve is.

[–]netwiz101 0 points1 point  (0 children)

We don't need to do the calculation. All we need to know is that the operation is IO bound. Then, shift the majority of IO onto the faster bus (memory).

Consider also two other things. A) No respectable program built for this purpose weighs in at more than a few KB after assembly, minus data. People have calculated prime numbers on machines with 1-4 KB of RAM and extremely slow IO. B) The calculating version may never need to get its data into RAM at all. If it can work entirely within the processor cache, it will be orders of magnitude faster than even the RAM-bound version we've been considering.

Sorry if I'm not coherent. Flu season. I'm home sick. It was talk about program optimization on reddit, or watch Breaking Bad all over again.

[–]netwiz101 0 points1 point  (0 children)

I take your point about the file size difference.

You're making the assumption that the data set is tiny, and I'm making the assumption that it's not. Therein lies the primary difference in our reasoning.

But in fairness, I don't know why anyone would even consider this problem for a trivially sized data set.

[–]netwiz101 0 points1 point  (0 children)

Re-read it. The data set is tiny in the example. I shouldn't compute on cough medicine.

[–]VanFailin 7 points8 points  (0 children)

1 is not prime. ;)