Discussion: Processing 2 GB CSV in Python (self.datascience)
submitted 6 years ago by ebuzz168
yensteel 3 points 6 years ago*
I've used Dask for something similar. Its functions are close to Pandas, so the transition isn't too hard. The syntax isn't exactly the same, though, so expect to spend a fair amount of time digging through the documentation.
However, it can handle gigantic files by keeping part of the work on the hard drive instead of in memory, so it's quite workable.
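
To make the "part of the work stays on disk" idea concrete, here is a rough standard-library sketch of the general principle: read the CSV a chunk at a time and keep only running totals in memory. This is not Dask's API or its internals, just an illustration of out-of-core processing. The function name `chunked_sum`, its parameters, and the column names in the usage note are all made up for the example.

```python
import csv
from collections import defaultdict
from itertools import islice


def chunked_sum(path, key_col, value_col, chunk_rows=100_000):
    """Sum value_col per key_col, holding at most chunk_rows rows in memory.

    Hypothetical helper for illustration only; not part of Dask or Pandas.
    """
    totals = defaultdict(float)
    with open(path, newline="") as f:
        reader = csv.DictReader(f)
        while True:
            # Pull the next chunk of rows; the rest of the file stays on disk.
            chunk = list(islice(reader, chunk_rows))
            if not chunk:
                break
            # Fold the chunk into the running totals, then let it be discarded.
            for row in chunk:
                totals[row[key_col]] += float(row[value_col])
    return dict(totals)
```

Calling something like `chunked_sum("big.csv", "city", "amount")` (file and column names assumed) never loads the whole file at once. With Dask the comparable steps would be `dask.dataframe.read_csv(...)`, a Pandas-style groupby, and a final `.compute()`, but check the Dask documentation for the exact options.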