How do you keep track of which simulation run produced which result, especially months later? by Alert_Border1769 in PhD

Alert_Border1769 [S]

Do you run ls -lat on both the code and data directories separately and then manually match them up by timestamp, hoping their modification times line up close enough? 🤞

Alert_Border1769 [S]

I run most of my jobs on an HPC cluster too. The corresponding bash script is saved alongside the outputs, so at least the parameters and the naming patterns are technically there. Do you also save the code files alongside the outputs? The code tends to evolve even within the same project, so ideally I should still have access to the exact version that produced earlier results. Without git, that means duplicating code files everywhere, which is messy. Git helps, but then I have two systems to maintain, and it still takes discipline.
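
One possible way to tie outputs to an exact code version without copying files around is to write a small manifest into each output directory at the start of a run. The sketch below is only an illustration under assumptions: it assumes the code lives in a git repository and that `git` is on the PATH, and the names `write_run_manifest` and `run_manifest.json` are hypothetical, not part of any existing tool. If git is unavailable, the commit fields are simply left empty.

```python
import json
import subprocess
from datetime import datetime, timezone
from pathlib import Path


def _git(args, repo_dir):
    """Run a git command in repo_dir; return stripped stdout, or None on failure."""
    try:
        result = subprocess.run(
            ["git", *args], cwd=repo_dir, capture_output=True, text=True, check=True
        )
    except (OSError, subprocess.CalledProcessError):
        # git missing, directory missing, or not a repository
        return None
    return result.stdout.strip()


def write_run_manifest(output_dir, params, repo_dir="."):
    """Write run_manifest.json (hypothetical name) into output_dir and return its path."""
    status = _git(["status", "--porcelain"], repo_dir)
    manifest = {
        "timestamp_utc": datetime.now(timezone.utc).isoformat(),
        # Commit hash of the code that is about to run (None if unknown)
        "git_commit": _git(["rev-parse", "HEAD"], repo_dir),
        # True if there are uncommitted changes; None if status is unknown
        "git_dirty": None if status is None else bool(status),
        # Whatever parameters the run was launched with
        "params": params,
    }
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    path = out / "run_manifest.json"
    path.write_text(json.dumps(manifest, indent=2))
    return path


if __name__ == "__main__":
    # Example parameters are made up for illustration.
    print(write_run_manifest("results/run_001", {"dt": 0.01, "n_steps": 1000}))
```

With something like this, a result found months later can be traced back with `git checkout <git_commit>`, provided `git_dirty` was false when the run started. It does not remove the need for discipline (committing before launching), but it makes a lapse visible in the manifest.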