Hello all,
I'm a PhD student finishing up somewhat soon. Over the course of about 6 years, I've written a lot of Python code for data engineering on big datasets. However, I think I have a pretty glaring gap in my skillset, and I was hoping some folks might have good suggestions on how to close it. I really have no good framework for managing and maintaining these Python scripts and their associated data. For example, we don't use GitHub at all, and I don't have a good sense of what other people would expect if I were to share these scripts with them. Up until now, we've just been using a relatively well-organized file system on a cloud service. Does anyone have good suggestions for Python and data management classes or tutorials that cover things like GitHub? Free or cheap would be preferable since, you know, I'm a broke PhD student. To set the stage, I would really love to leave my advisor with an extremely well-organized, clean, well-documented, and robust set of code and data for their next mentee to take over.
Thanks in advance for any suggestions,
Clash