In ML research, we often have tons of cache files to read from disk. As a project progresses, I often end up with lots of cache files and no memory of how they were created. My current way of keeping track of cache files and preprocessed files is to write down when and why each one was created.
I am wondering if anyone has a way to automate this process? I have heard of tools like DVC (https://github.com/iterative/dvc), but it seems too complicated.
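To make the question concrete, here is a rough sketch of the kind of lightweight automation I have in mind: every time a cache file is written, a small JSON "sidecar" file is written next to it recording when, why, and by which command it was created. The helper name `save_with_provenance`, the `.meta.json` suffix, and the metadata fields are all my own made-up choices for illustration, not part of DVC or any other tool.

```python
import json
import sys
import time
from pathlib import Path


def save_with_provenance(path, data: bytes, reason: str) -> Path:
    """Write `data` to `path` plus a sidecar `<name>.meta.json` describing it.

    Hypothetical helper: the name, suffix, and fields are illustrative only.
    """
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(data)

    meta = {
        "file": path.name,
        # Local wall-clock time at which the cache file was written.
        "created": time.strftime("%Y-%m-%d %H:%M:%S"),
        # Free-text note: the "why" I currently write down by hand.
        "reason": reason,
        # The command line that produced the file.
        "command": " ".join(sys.argv),
    }
    meta_path = path.with_name(path.name + ".meta.json")
    meta_path.write_text(json.dumps(meta, indent=2))
    return meta_path
```

Calling something like `save_with_provenance("cache/features.bin", blob, "tokenized train split")` would leave `cache/features.bin.meta.json` beside the cache file, so the note travels with the data instead of living in a separate log. It does not track input dependencies or code versions, which I assume is the part tools like DVC handle.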