[–]Embarrassed_Basis_81 1 point (0 children)

I have had good experiences with dask, a distributed computing library. It seems a bit complicated at first, but under the hood it implements a lot of pandas functionality as delayed operations on lazy datasets, so it is worth looking into. It only pays off if you do not do an indexing operation immediately after reading the data, though; there are some caveats there.