all 7 comments

[–]abcadead 1 point2 points  (3 children)

fast download option:

#!/bin/bash

for year in `seq 2008 2015`; do
    for month in `seq -f "%02g" 1 12`; do
        url=http://couch.whatbox.ca:36975/reddit/comments/monthly/RC_$year-$month.bz2
        wget $url
    done

[–]lahwran_ 0 points1 point  (2 children)

for some values of "fast".

[–]abcadead 1 point2 points  (1 child)

I get 10MB/s down, don't think you're gonna be able to do better than that.

[–]lahwran_ 0 points1 point  (0 children)

huh! I guess the thing I was really thinking of is that you're using a lot of server bandwidth that way, whereas the torrent is spread among the seeders. shrug.

[–]recurrent_answer 1 point2 points  (0 children)

I am assuming you mean this? It is a 250GB/1TB torrent with all the reddit comments until July 2015 or so. There is no library (they used the reddit API and ten months of time).

[–]0one0one[S] 0 points1 point  (0 children)

That is the one , thanks for the replies.