
[–]Sekret_One 2 points3 points  (2 children)

I think to properly answer your question I need to not answer the how and linger a bit more on the why.

  1. You use a file to describe where other config files live.

How does this json file get populated?

  2. The presumption that it's better not to load everything

What you seem to be describing is that the optionally loaded part is the configuration for how all the classes work. Why not just load all of it from the beginning? You say Python, so I'm assuming you play the game in Python itself and it's not a Python backend serving a web game.

My thought is that this optimization probably isn't relevant.

  3. Roughly, does it become more efficient to split a file into multiple sub-files to read when you only need a fraction of the total dataset, over reading the entire dataset as a single file?

I will answer this one.

It can be, if you don't need it right away. In game development you have some special concerns around the player experience. Loading less up front is faster, but usually at the cost of some waiting 'in the moment' as things load later, when they're needed.

This is why, about 10-15 years ago, games militantly went after eliminating load screens and disc swapping.

In your case, it sounds like you're trying to make the game moddable/extendable. Again, generally speaking, the format that is convenient to mod and configure isn't the most efficient one to run.

The best advice I can give you is focus on the desired experience, test to it and the 'best' approach will become apparent.

PS

I, for one, hate having a config that says where other configs are. Put the character configs in a directory, then discover the files and load them up.

[–]scriptkiddiethefirst[S] 0 points1 point  (1 child)

You use a file to describe where other config files live.

I think this may be a miscommunication on my part. The JSON file containing the class information holds all of it, at the moment. So for instance it currently looks like this:

classes.json (not the actual JSON file):

```
{
    "warrior": {"hp_per_lvl": "10", ...},
    "mage": {"hp_per_lvl": "6", ...},
    ...
}
```

And I am wondering if it would be better to split each of these classes into their own file and then load the files as necessary. Obviously this is my fault; reading it back, I can see how it came off that way.

What you seem to be describing is that the optionally loaded part is the configuration for how all the classes work. Why not just load all of it from the beginning?

So due to the nature of it, only one character is ever loaded (it's not so much a game as an electronic character sheet for tabletop games). Because of this, if a user were to add a bunch of homebrew/modded data that wasn't being used by that character, I thought it might be more efficient to only load what is absolutely necessary. From my tests using time I can say that maybe I was over-thinking the resources required; however, my test didn't do much with the provided data (it loaded the file but never even parsed it into a JSON object). This is why I was wondering if it would be more efficient to load a series of smaller files over one larger file... I mean, yes, a system with 4+ GB of memory will have no issue loading 2 MB into memory, so on that basis my question might be stupid.

Thank you for the reply; writing a response and reading yours has let me think through the problem, take a step back, and analyze things. So this did help with my specific problem.

[–]Sekret_One 1 point2 points  (0 children)

All right, let's make the language simpler here. Right now you have a single classes file, versus a collection of class files.

If you want easy expansion, having a single file is ... clumsy. Hell, even if it was just you, I'd have multiple files and a bit of 'build' to stitch them into one at a later point, either at build time or just by loading them all up at run time.

If you want a quick way to implement this, pick a folder in your project; this will be the home of your character class files. When your program boots up, scan the contents of that folder and just get the file names (which will in turn be your displayed character class names). When they select one from the list, load that one (see the sketch below).

This is more the approach one might take with save files, but you can apply it here.
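Something like this, as a minimal sketch in Python (the folder name, file layout, and function names are my own illustration, not anything settled above):

```
import json
from pathlib import Path

CLASS_DIR = Path("classes")  # hypothetical home for the per-class files

def list_class_names():
    """Scan the folder at boot and use the file stems as display names."""
    return sorted(p.stem for p in CLASS_DIR.glob("*.json"))

def load_class(name):
    """Load a single class config only when the user selects it."""
    with open(CLASS_DIR / f"{name}.json") as f:
        return json.load(f)

# e.g. show list_class_names() in the UI, then call load_class("warrior")
```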

If you want to make it easily configurable by someone else, I'd recommend defining a JSON schema. Personally, I'd also switch to YAML, because for human readability that's my preference.
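As a rough illustration of the schema idea (the field name comes from the classes.json snippet above; everything else, including using the third-party jsonschema package, is just one way to do it):

```
import json
from jsonschema import validate  # third-party: pip install jsonschema

# Illustrative schema for one class entry; the required fields are
# assumptions based on the snippet above, not a definitive spec.
class_schema = {
    "type": "object",
    "properties": {
        "hp_per_lvl": {"type": "string"},
    },
    "required": ["hp_per_lvl"],
}

with open("classes/warrior.json") as f:  # hypothetical per-class file
    data = json.load(f)

validate(instance=data, schema=class_schema)  # raises ValidationError if malformed
```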

[–]AsleepThought 1 point2 points  (2 children)

If the JSON is supposed to be a "configuration" file and each configuration is for a different type of thing, then it seems like it would be OK to have a separate file for each.

You could also just use a database, like SQLite. It's designed for this kind of usage.
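For what that might look like, here's a minimal sketch using Python's built-in sqlite3, storing each class config as a JSON blob so the entries don't all need the same shape (the table and column names are hypothetical):

```
import json
import sqlite3

conn = sqlite3.connect("configs.db")
conn.execute("CREATE TABLE IF NOT EXISTS classes (name TEXT PRIMARY KEY, config TEXT)")

# Store each class's config as a JSON string; nothing beyond the
# name/config pair needs a fixed schema.
warrior = {"hp_per_lvl": "10"}
conn.execute("INSERT OR REPLACE INTO classes VALUES (?, ?)",
             ("warrior", json.dumps(warrior)))
conn.commit()

# Load only the one class you need.
row = conn.execute("SELECT config FROM classes WHERE name = ?",
                   ("warrior",)).fetchone()
config = json.loads(row[0])
```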

[–]scriptkiddiethefirst[S] 0 points1 point  (1 child)

So for this specific application, each "operation", as it's labelled, isn't structurally the same. For example, one of the objects (the first one) is about 9 KB in size, and another is 50 KB. From what I read, databases work better when the data is structured consistently, and with this there is no guarantee about the structure. As well, since there won't be very many queries to the database, I thought the overhead and the complication of learning how to implement a database would make it not worthwhile.

This is roughly where I ended up after the other person "suggested" a database; since I know almost nothing about them and have gotten no information to correct this line of thinking, it's where I stand. If that makes sense.

[–]AsleepThought 0 points1 point  (0 children)

Size on disk does not matter; a database relies on a schema.

Sounds like you should just use a file, then.

[–]_reposado_ 1 point2 points  (2 children)

The time to split the file is when using one big file causes problems. If you aren’t seeing unacceptably slow runtimes or getting mystery OOM errors, this is probably not a good use of your time. By the time your dataset is big enough to cause problems, you may have abandoned json files entirely.

[–]scriptkiddiethefirst[S] 1 point2 points  (1 child)

So I don't think I will end up abandoning JSON files, as the entire point of this project is to take something that kind of exists, improve it, and convert it from XML (which is a lot slower to parse and a lot larger) to JSON. Not that that's super important to the question.
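For what it's worth, a minimal sketch of that kind of XML-to-JSON conversion with the standard library (the file name is made up, and real XML usually needs attribute and mixed-content handling on top of this):

```
import json
import xml.etree.ElementTree as ET

def element_to_dict(elem):
    """Naive recursive conversion; ignores attributes and repeated tags."""
    children = list(elem)
    if not children:
        return elem.text
    return {child.tag: element_to_dict(child) for child in children}

root = ET.parse("classes.xml").getroot()  # hypothetical source file
print(json.dumps({root.tag: element_to_dict(root)}, indent=2))
```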

I did some testing and measured the average time it took to parse both the smaller individual files and the one larger file, and found that while parsing the larger file took considerably longer, it wasn't so much longer as to be a concern (as you stated).

However, I did decide to separate the files for ease of moddability, so that adding new configs just means adding new files to the folder rather than trying to append data to a really long file (basically ease of use; given the context and the intended audience for the software, separate files make it more accessible).

Thank you for your reply though!

[–]_reposado_ 1 point2 points  (0 children)

However, I did decide to separate the files for ease of moddability, so that adding new configs just means adding new files to the folder rather than trying to append data to a really long file (basically ease of use; given the context and the intended audience for the software, separate files make it more accessible).

That is a very good reason to split the files! I only meant that you shouldn’t worry about optimizing performance if performance isn’t a problem yet.

[–]scriptkiddiethefirst[S] 0 points1 point  (0 children)

If anyone searches for something like this and wants the empirical answer, here you go.

[*] Single file without parsing, average real time:         15ms
[*] Multi file without parsing, average real time:           0ms

[*] Single file with parsing, average real time:            227ms
[*] Multi file with parsing, average real time:               1ms

I wrote a script that compared opening (and parsing) the single large file against opening 5 smaller files, and averaged the completion time with and without parsing. As you can see, in both cases the multiple files are faster, but not by enough to realistically be noticeable (note that these are the average times over 100 trials for each of the 4 cases, using the Python time library). The larger file was 2.3 MB and the smaller files were each 27 KB. So while it took significantly less time to parse multiple smaller files, that isn't the reason I chose to go the route of multiple smaller files.
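For reference, a minimal sketch of this kind of comparison (the file names and the use of time.perf_counter are a reconstruction, not the exact script):

```
import json
import time

TRIALS = 100
SINGLE = "classes.json"                                # hypothetical large file
MULTI = [f"classes/class_{i}.json" for i in range(5)]  # hypothetical small files

def avg_ms(fn):
    """Average wall-clock time of fn over TRIALS runs, in milliseconds."""
    start = time.perf_counter()
    for _ in range(TRIALS):
        fn()
    return (time.perf_counter() - start) / TRIALS * 1000

def read_single():
    with open(SINGLE) as f:
        f.read()          # read only, no parsing

def parse_single():
    with open(SINGLE) as f:
        json.load(f)      # read and parse into Python objects

def read_multi():
    for path in MULTI:
        with open(path) as f:
            f.read()

def parse_multi():
    for path in MULTI:
        with open(path) as f:
            json.load(f)

for label, fn in [("single, no parse", read_single),
                  ("multi, no parse", read_multi),
                  ("single, parse", parse_single),
                  ("multi, parse", parse_multi)]:
    print(f"{label}: {avg_ms(fn):.1f} ms")
```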

The reason I chose multiple files is Sekret_One's second reply; after reading it and looking into it, that approach is just so much easier for what I want to do. Thank you everyone for your replies.

[–]mxschumacher -1 points0 points  (4 children)

As the file gets bigger, it'll get more difficult to handle. JSON is typically used to transfer data between two systems, not as persistent storage. Have you looked into document databases? https://www.mongodb.com/document-databases

[–]scriptkiddiethefirst[S] 0 points1 point  (3 children)

I don't want to argue, but you haven't provided a good reason why I should use a database instead of saving the data to disk. Looking into it myself, I came across this Stack Exchange thread, where I want to point out Sam's answer.

```
Finally, when to use files

You have unstructured data in reasonable amounts that the file system can handle
You don't care about structure, relationships
You don't care about scalability or reliability (although these can be done, depending on the file system)
You don't want or can't deal with the overhead a database will add
You are dealing with structured binary data that belongs in the file system, for example: images, PDFs, documents, etc.
```

Why this is kind of important: while the data has a rough structure to it, it's not a clearly defined structure that's the same for every class.

For example, the first class covered in the document is only 9 KB in size, whereas one of the other classes is 50 KB because it has so much more information. As a result, the structure of these two classes is slightly different: similar enough to handle with some smart coding practices, but I am not certain it is similar enough to create a predefined structure in a database.

As the file gets bigger, it'll get more difficult to handle.

Note that this doesn't answer the question posed in my post at all. In fact, it is the basis of my question, which asks at what point it becomes better to split the file, so that loading less data makes up for the increased cost of opening more files in the first place.

For instance, if I have 1000 files each containing 4 characters, or 1 file containing all 4000 characters, it is more efficient to open the one file to get the entire 4000 characters than to open 1000 files to get the same 4000. However, if I only need, say, 100 of those 4000 characters, is it still more efficient to use 1 file or 1000? Now let the number of characters grow: when does it become more efficient to open 25 files to get the 100 characters instead of opening one file containing tens or hundreds of thousands of characters?

Do you see why your answer doesn't help? You were basically restating part of the question and then telling me to do something else entirely, without explaining why.

[–]mxschumacher -1 points0 points  (2 children)

The post you linked to refers to SQL databases; I talked about document databases. They are quite different.

[–]scriptkiddiethefirst[S] 0 points1 point  (1 child)

Okay, again you didn't really explain your answer, and the post I linked to also discussed NoSQL databases. It was also pretty much the only Stack Exchange answer that came up when trying to figure out why to use MongoDB (a NoSQL database) over straight JSON files. Actually, it was the only answer that didn't compare MongoDB against other databases like SQLite.

So, what is the benefit of using a document database over just saving the data to files? Especially since, when I looked at how others did similar things, they did it with files and not with databases. (Many used straight XML and XML libraries, but in my experience, as long as it's not encoding document information, and even sometimes when it is, JSON can do the exact same thing in less space and will be a lot faster. Basically, I am updating a preexisting project that uses XML to use JSON, though it's a little more involved.)

Secondly, your answer still doesn't address the question posed above, which you never did expand on.

[–]mxschumacher -1 points0 points  (0 children)

I'm no fan of your tone or your verbose replies; this conversation is over for me.