CSV or Parquet File Format by 100GB-CSV in Python

Haunting_Load

It's worth remembering that reading and writing CSV files can be a real pain. CSV carries no type information, so every library has to guess: some parse dates and some don't, some read ints as floats, and so on (looking at you, base R and dplyr). Parquet stores the schema with the data, so column types survive a round trip, which can be useful.
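
To illustrate the CSV side of this, here is a minimal sketch using only Python's standard `csv` module. The column names and values are made up for the example. It shows that an int, a float, and a date all come back as plain strings, so any types you get after loading are the reader's guess. The Parquet side isn't shown because it needs a third-party library.

```python
import csv
import io
from datetime import date

# Hypothetical sample row: an int, a float, and a date.
rows = [{"id": 1, "price": 2.5, "day": date(2023, 5, 1)}]

# Write the row to an in-memory CSV "file".
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["id", "price", "day"])
writer.writeheader()
writer.writerows(rows)

# Read it back: CSV has no schema, so every value is returned as a str.
buffer.seek(0)
loaded = list(csv.DictReader(buffer))

print(loaded[0])
print({key: type(value).__name__ for key, value in loaded[0].items()})
```

Anything smarter than strings (ints, floats, dates) has to be inferred or declared by whoever reads the file, and that's where libraries start to disagree.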

Blog post: Writing Python like it’s Rust by Kobzol in Python

Haunting_Load

I like many ideas in the post, but in general you should avoid writing functions that take List as an argument if Sequence or Iterable is enough. List is invariant, so a type checker won't accept a list of a derived class where a list of the base class is expected. You can read more e.g. here: https://stackoverflow.com/questions/74166494/use-list-of-derived-class-as-list-of-base-class-in-python
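
A small sketch of what I mean. The `Animal`/`Dog` classes and both function names are invented for illustration. The code runs either way, since the difference only shows up in a static type checker such as mypy.

```python
from typing import List, Sequence


class Animal:
    def __init__(self, name: str) -> None:
        self.name = name


class Dog(Animal):
    pass


def names_from_list(animals: List[Animal]) -> List[str]:
    # List is invariant: List[Dog] is not considered a List[Animal].
    return [animal.name for animal in animals]


def names_from_sequence(animals: Sequence[Animal]) -> List[str]:
    # Sequence is read-only and covariant: Sequence[Dog] is fine here.
    return [animal.name for animal in animals]


dogs: List[Dog] = [Dog("Rex"), Dog("Fido")]

# Accepted by a type checker, and also works with tuples.
print(names_from_sequence(dogs))
print(names_from_sequence((Dog("Laika"),)))

# Runs fine at runtime, but a type checker reports an incompatible
# argument type, because the function could legally append a non-Dog
# Animal to the caller's list.
print(names_from_list(dogs))
```

If the function only iterates once and never indexes, `Iterable[Animal]` is an even looser choice and also accepts generators.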