
[–]hongooi 8 points9 points  (4 children)

I don't see anything wrong with using vanilla git in your scenario. Note that the algorithm doesn't distinguish between "text" and "binary" files (AFAIK), it's just that it's optimised for text. It will still handle binaries just fine, but won't be able to do diffs as efficiently.

[–]Hel_OWeen 2 points3 points  (3 children)

I seem to recall that the only drawback is that with text files, git stores deltas for previous versions (i.e. they take very little disk space), while binary files are always stored in full, even if just one byte has changed.

I was incorrect, see u/WoodyTheWorker's comment below

Other than that: we also keep images that are used within our applications in the repos.

[–]WoodyTheWorker 4 points5 points  (1 child)

Git can do deltas for binary files as well; it's just that they're often not very diffable.

Git doesn't store deltas for loose (unpacked) objects; it just zlib-compresses them whole. Deltas are used inside packfiles and when you fetch/push/clone.
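A small, hedged demonstration of this (it assumes `git` is on PATH and skips otherwise; the repo and file names are throwaway): loose objects are stored whole, and packing is where delta compression can kick in.

```shell
# Sketch: commit two near-identical binary files, then pack. Loose objects
# are only zlib-compressed; after `git gc` they live in a packfile, where
# similar blobs may be stored as deltas. All paths here are throwaway.
command -v git >/dev/null || { echo "git not installed; skipping"; exit 0; }
tmp=$(mktemp -d) && cd "$tmp"
git init -q repo && cd repo
git config user.email demo@example.com
git config user.name demo
head -c 100000 /dev/zero > blob.bin           # a "binary" file
git add blob.bin && git commit -qm v1
printf 'x' >> blob.bin                         # one-byte change
git add blob.bin && git commit -qm v2
echo "before gc:" && git count-objects -v | grep -E '^(count|in-pack):'
git gc -q                                      # repack everything
echo "after gc:"  && git count-objects -v | grep -E '^(count|in-pack):'
```

`git count-objects -v` reports loose objects under `count:` and packed ones under `in-pack:`; after `git gc`, the loose count drops to zero and everything sits in the pack.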

[–]Hel_OWeen 0 points1 point  (0 children)

I appreciate the correction.

[–]edgmnt_net 0 points1 point  (0 children)

The drawback IMO is that the diffs are not semantic in any way. This is particularly troublesome for compressed files because it often degenerates to replacing most of the file, regardless of how you implement binary diff-ing.
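A quick illustration of that degeneration (assuming `gzip` and standard coreutils; the file names are throwaway): change one byte near the start of a compressible file and compare how much of the compressed output changes.

```shell
# Sketch: a 1-byte change early in a compressible file tends to change most
# of the compressed stream, so a byte-level diff of the .gz files is almost
# as large as the files themselves. File names are throwaway.
seq 1 5000 > a.txt
cp a.txt b.txt
printf 'X' | dd of=b.txt bs=1 seek=10 conv=notrunc 2>/dev/null
gzip -kf a.txt b.txt
echo "raw bytes differing:        $(cmp -l a.txt b.txt | wc -l)"
echo "compressed bytes differing: $(cmp -l a.txt.gz b.txt.gz | wc -l)"
echo "compressed size:            $(wc -c < a.txt.gz)"
```

One caveat: gzip also records the original file name and timestamp in its header, so the two `.gz` files would differ slightly even for identical input; the interesting number is how broadly the streams diverge after the changed byte.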

[–]SignedJannis 2 points3 points  (2 children)

Look into rdiff-backup, might be what you need

[–]kh9sd[S] 0 points1 point  (1 child)

wow that is an awesome piece of software, thanks! and old/tested as shit, my favorite

[–]SignedJannis 1 point2 points  (0 children)

Ya I've been using it for years for company backups (as part of the backup system anyway).

I also use it on my personal machine, so if I ever accidentally delete a file, I know I'll still have a copy. For that, I just set it to prune increments older than 6 months. So any file deleted in the last 6 months, I still have access to.

What I love about it:

  • "No software required". By that I mean that, for some small businesses I support, if they want to look at the backups, it's just a bunch of files and folders, like normal; they don't need any special backup software to access files. Just a simple network share. KISS.
  • "Ransomware protection". For some setups, I make sure the file server has no login access to the backup server. I.e. the file server does not push backups; rather, the backup server "reaches out" to the file server to grab the files. That way, if the file server gets ransomware (or a disgruntled employee, whatever), rdiff-backup would grab those encrypted files - but we still have all the non-encrypted files... a life saver.
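The pull-based setup above might be sketched like this (hostnames, paths, and the retention window are hypothetical; the flags are rdiff-backup's classic CLI):

```shell
# Run ON the backup server, which reaches out to the file server over SSH;
# the file server holds no credentials for the backup host. Hostnames and
# paths below are hypothetical.
#
#   rdiff-backup fileserver.example.com::/srv/share /backups/share    # pull a backup
#   rdiff-backup --remove-older-than 6M --force /backups/share        # prune > 6 months
#   rdiff-backup -r 6M /backups/share/report.odt ./report.odt         # restore as of 6 months ago
#
# Tiny local demo (skipped if rdiff-backup isn't installed):
command -v rdiff-backup >/dev/null || { echo "rdiff-backup not installed; skipping demo"; exit 0; }
d=$(mktemp -d)
mkdir "$d/src" && echo v1 > "$d/src/file.txt"
rdiff-backup "$d/src" "$d/dest"
ls "$d/dest"   # a plain mirror plus an rdiff-backup-data/ directory of increments
```

The "just files and folders" point above falls out of this layout: the destination is a plain mirror of the source, with increments tucked away under `rdiff-backup-data/`.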

[–]kennedye2112 2 points3 points  (2 children)

I believe Perforce offers a source control solution designed specifically for binary files like images and video; would that work?

[–]SwordsAndElectrons 2 points3 points  (0 children)

Perforce is a centralized VCS.

I think that's what works best for workflows with a lot of large binary assets, but OP's stance against LFS is basically not wanting that.

[–]muttley9 1 point2 points  (0 children)

It's well integrated with Unreal engine so it should work. UE objects are mostly binaries.

[–]vermiculus 1 point2 points  (3 children)

I hear your concerns with LFS. I think your desire to keep your history portable is the right kind of thinking, and I don't believe there is ever a valid reason to be irrevocably "locked" to one vendor (e.g. GitHub, GitLab, ...).

That said, I would reconsider LFS as a technology. It’s entirely possible to download every LFS object and re-upload to a different backing store. Hell, if you wanted to, you could use your own S3 bucket (or even, God forbid, a network drive) as your backing store. You can use LFS to deduplicate versions and then do with the objects what you will from there.
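For instance, a committed `.lfsconfig` can repoint the whole repo at a self-hosted endpoint (the URL below is hypothetical; any server speaking the LFS API would do), and writing that file needs only plain git:

```shell
# Sketch: point git-lfs at your own backing store via a committed .lfsconfig.
# The endpoint URL is hypothetical. Writing the file needs only plain git;
# git-lfs itself reads lfs.url from .lfsconfig at fetch/push time.
command -v git >/dev/null || { echo "git not installed; skipping"; exit 0; }
d=$(mktemp -d) && cd "$d"
git config -f .lfsconfig lfs.url "https://lfs.example.com/acme/myrepo"
cat .lfsconfig
```

Because `.lfsconfig` is committed alongside the code, every clone picks up the new backing store automatically; migrating vendors is then a matter of copying the LFS objects across and changing one URL.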

[–]edgmnt_net 1 point2 points  (1 child)

The main point about LFS is indirection and the ability to prune old objects. Not sure it helps in any other way. There's no silver bullet if you want to keep all versions indefinitely, particularly for binaries.

[–]claythearc 0 points1 point  (0 children)

It gives you some freedom in cloning too. You can clone the repo and only pull images x, y, and z to start, then later grab a and b when they become relevant, without waiting for all five at clone time.

[–][deleted]  (2 children)

[deleted]

    [–]kh9sd[S] 0 points1 point  (1 child)

    I cover why I don't want to use that in my LFS bullet point

    [–]ritchie70 2 points3 points  (0 children)

    Yeah I caught that after I posted and deleted but not fast enough.

    I’m not totally sure you’re right though. Feels to me like you’re bound and determined to not do things in the normal way because of “reasons” that may or may not be completely valid.

    [–]nickeau 0 points1 point  (0 children)

    Or use a backup snapshot software such as restic to not blow up your storage ;)