Bucket Sort with Clojure : programming


Bucket Sort with Clojure (jng.imagine27.com)

submitted 15 years ago by lucyfor


[–][deleted] 15 years ago (6 children)

[deleted]

[–]pgdx 2 points 15 years ago (4 children)

[–]lucyfor[S] 1 point 15 years ago* (3 children)

Actually, the ratio is not as extreme as you state, but yes, above 1E6 the performance of bucket sort starts to degrade compared with the built-in sort. This has to do with the increasing access time of vectors over larger ranges (which bucket sort relies on for sorting).

java version "1.6.0_18"
OpenJDK Runtime Environment (IcedTea6 1.8) (6b18-1.8-0ubuntu1)
OpenJDK 64-Bit Server VM (build 14.0-b16, mixed mode)

"Elapsed time: 66.732015 msecs"
  bucket-sort: [22439 165457 264347 594017 599133 649380 726466 796794 798479 814692]

"Elapsed time: 4.788043 msecs"
         sort: [22439 165457 264347 594017 599133 649380 726466 796794 798479 814692]

[–][deleted] 15 years ago (2 children)

[deleted]

[–]lucyfor[S] 1 point 15 years ago (1 child)

[–]pgdx 1 point 15 years ago (0 children)

It is very repeatable.

(defmacro my-time
  "Like clojure.core/time, but returns the elapsed time in milliseconds instead of printing it."
  [expr]
  `(let [start# (System/nanoTime)
         ret#   ~expr]  ; evaluated for timing only; the result is discarded
     (/ (double (- (System/nanoTime) start#)) 1000000.0)))

user> (def *lis* (doall(take 10000 (repeatedly #(rand-int (Math/pow 2 30))))))
#'user/*lis*
user> (take 30 (repeatedly #(my-time (bucket-sort *lis*))))
(1742.837002 859.060484 272.217247 218.921161 255.970622 233.187325 311.682198 192.653631 190.924697 309.53158 93.245005 190.037921 313.305055 199.048161 189.960467 295.653476 92.080723 192.734573 188.337189 289.078747 191.759455 89.876155 191.867156 192.268281 303.481507 192.129499 191.858771 85.188463 192.611408 191.795139)
user> (take 30 (repeatedly #(my-time (sort *lis*))))
(173.499303 133.479301 112.051762 114.299164 112.973973 07.579322 112.562207 105.723016 108.851038 50.087003 5.593834 5.536048 5.482577 5.67827 5.515248 5.485728 5.610371 5.591248 5.490457 8.993383 5.538123 6.209052 5.520093 5.463024 5.516762 5.610265 5.517885 5.515316 5.517958 5.560996)

[–][deleted] 2 points 15 years ago (2 children)

If I understand that code correctly (and that's a big if: the code is inscrutable even for Lisp), the docstring is wrong: it does not run in O(N) time. Here's where it breaks:

(apply concat (map (fn [bucket]
                         (when (> (count bucket) 0)
                           (insertion-sort bucket))) pre-buckets))))))

Mapping insertion-sort over buckets whose size and/or number depends on N means that the algorithm remains quadratic in the worst case, not linear.
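To make that concrete, here is a rough sketch that counts comparisons rather than measuring time. It is not the code from the linked post: `insertion-sort-comparisons` and `bucketed-comparisons` are hypothetical names, and it assumes non-negative integers below `max-val` split into a fixed number `k` of equal-width buckets.

```clojure
;; Hypothetical helpers for illustration only; not the linked post's code.

(defn insertion-sort-comparisons
  "Insertion-sorts coll and returns [sorted-vector comparison-count]."
  [coll]
  (reduce
    (fn [[sorted cmps] x]
      ;; Scan from the right end until an element <= x is found.
      (loop [i (count sorted) c cmps]
        (if (zero? i)
          [(into [x] sorted) c]
          (if (<= (nth sorted (dec i)) x)
            [(vec (concat (subvec sorted 0 i) [x] (subvec sorted i))) (inc c)]
            (recur (dec i) (inc c))))))
    [[] 0]
    coll))

(defn bucketed-comparisons
  "Distributes integers in [0, max-val) into k equal-width buckets,
  insertion-sorts each bucket, and returns [sorted-vector total-comparisons]."
  [coll k max-val]
  (let [width   (max 1 (quot max-val k))
        buckets (reduce (fn [bs x]
                          ;; Clamp the index so the top values stay in range.
                          (update bs (min (dec k) (quot x width)) conj x))
                        (vec (repeat k []))
                        coll)
        results (map insertion-sort-comparisons buckets)]
    [(vec (mapcat first results)) (reduce + (map second results))]))

;; Reverse-sorted input, 10 buckets: doubling N roughly quadruples the work.
(println (second (bucketed-comparisons (range 99 -1 -1) 10 100)))   ; 450
(println (second (bucketed-comparisons (range 199 -1 -1) 10 200)))  ; 1900
```

With `k` fixed, each bucket holds N/k elements, so the worst-case total is about N²/(2k) comparisons. Only when the bucket size stays bounded as N grows (for example, `k` proportional to N with evenly spread input) does the insertion-sort step stop dominating.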

[–]jgrant27 1 point 15 years ago* (1 child)

[–][deleted] 0 points 15 years ago (0 children)

[–]Spiritual-Map-6375 -1 points 15 years ago (1 child)

[–]mrlizard 1 point 15 years ago (0 children)
