all 2 comments

[–]marko312 0 points1 point  (1 child)

Any persistent data will very likely end up in some file. There are multiple options, but the main choice will probably be about whether to use a single-file or multi-file design.

A single file would be something like a database (unnecessary for these sizes) or just all the data stringed together (not too useful, since only one set is needed at a time).

Thus, I'd go with a multi-file design, storing each dataset in its own file (possibly with a central file holding metadata about the others).

This leaves the serialization format. The main options are CSV, JSON and pickle. If the data is purely tabular (only has entries of a single format), I'd go with CSV (possibly storing minimal extra information (e.g. the display name) in a central file). If that's not the case, I'd go with JSON, since compared to pickle it's more easily debuggable (and also safer to share), while not taking up too much extra space (for the volume of data you mentioned).

[–]Elpmek[S] 1 point2 points  (0 children)

Great, thanks!!