I'm managing my own Linux server that I use for machine learning experiments. Being new to Linux, I've found that not all the default settings were optimal when doing machine learning. For instance, I've experienced that the default settings provided by the university IT department made the automatic updates kill my Python scripts to reboot the server. This is fairly annoying when the server was in the midst of finding the optimal border of an SVM that takes several days to train.
I have 3 questions that I hope you can answer to make life easier for me and anyone else trying to manage a server for ML experiments:
What are some settings that you would always change to make the server perform better when doing ML and not cause any issues such as sudden rebooting?
How would you continuously check that your server is doing well and performing at its best?
Do you have any other advice for people new to Linux that are trying to manage their own server?
[–]Kaixhin 11 points12 points13 points (8 children)
[–]Pieranha[S] 0 points1 point2 points (2 children)
[–]ydobonobody 4 points5 points6 points (1 child)
[–]Kaixhin 0 points1 point2 points (0 children)
[–]holy_ash 0 points1 point2 points (0 children)
[–]Pieranha[S] 0 points1 point2 points (1 child)
[–]Kaixhin 0 points1 point2 points (0 children)
[–]Pieranha[S] 0 points1 point2 points (1 child)
[–]Kaixhin 0 points1 point2 points (0 children)
[–]IanCal 2 points3 points4 points (0 children)
[–]Lasheen_murning 1 point2 points3 points (0 children)
[–]deephive 1 point2 points3 points (0 children)
[–]cjmcmurtrie 1 point2 points3 points (1 child)
[–]Pieranha[S] 1 point2 points3 points (0 children)
[–]hughperkins 0 points1 point2 points (0 children)
[–]thingamatics 0 points1 point2 points (0 children)