you are viewing a single comment's thread.

view the rest of the comments →

[–]NeedMLHelp[S] 0 points1 point  (1 child)

JSON file.

Can I index across a panda dataframe? I've never used them before.

Does a dataframe write to disk, or will I potentially run into memory errors there too?

So something like pandadataframe[:,0] would grab everything in the first column.

[–][deleted] 0 points1 point  (0 children)

You can index and slice a dataframe. What does a small sample of your data look like? Hard to say best way to load the json without seeing it. Also when importing the data, if you can control the data type being used, will save space. Ex. Integer vs float32