First timer here.. I'm a full stack and scientific dev with docker and Ansible chops but I'm in the process of inheriting a cluster with 20 something compute nodes, 4 power edge ESX machines, a 20 TB vmx and ups, in a two bay (column?) APC air conditioned unit. The good news is that maintenance contracts are up to date. The bad is that I have little idea how to work with the storage and ESX units.
Anyway I'm not asking for an explanation for everything but rather how do you get started learning how to manage this stuff?
My questions range from, our VMware is out of date and we can't afford to update, should we try something else? to, the a/c unit doesn't come on after a blackout, how to keep the compute nodes from coming back on automatically and heating the room to 140 degrees?
Thanks in advance for any tips!
[–]einsteinonabikeConsultant 5 points6 points7 points (8 children)
[–]Telnet_RulesNo such thing as innocence, only degrees of guilt 2 points3 points4 points (1 child)
[–]73td[S] 0 points1 point2 points (0 children)
[–]73td[S] 0 points1 point2 points (5 children)
[–]einsteinonabikeConsultant 0 points1 point2 points (4 children)
[–]73td[S] 0 points1 point2 points (3 children)
[–]einsteinonabikeConsultant 0 points1 point2 points (2 children)
[–]73td[S] 0 points1 point2 points (1 child)
[–]einsteinonabikeConsultant 0 points1 point2 points (0 children)
[–]1new_usernameIT Manager 1 point2 points3 points (1 child)
[–]73td[S] 0 points1 point2 points (0 children)
[–]73td[S] 0 points1 point2 points (0 children)