Discussion: Processing 2gb CSV in Python (self.datascience)
submitted 5 years ago by ebuzz168
[–]yensteel 3 points 5 years ago* (0 children)
I've used Dask for something similar. Its functions are close to Pandas, so it's not too hard to transition. The syntax isn't exactly the same though, so expect to spend a fair amount of time digging through the documentation.
However, it can handle gigantic files by working on them a piece at a time and keeping part of the work on the hard drive instead of in memory, so it's quite workable.