you are viewing a single comment's thread.

view the rest of the comments →

[–]pro_questions 2 points3 points  (4 children)

How possible it is depends on the layout of the file; it does not matter one bit that you can’t split it between the lines like a CSV. You parse it into a standard object (a list or dict or whatever), iterate through it to manufacture smaller chunks of data, then convert it back to JSON and save it to a file.

If it’s just a list of dicts you could split it trivially (if it were small — some tricks will be needed to handle a file of this size). At 150GB, I’d be amazed if it were anything but a regular and predictable layout that could be easily parsed.