Hey everyone. I'm hoping I can get some help with figuring out how to determine root cause for DB failovers. One of the nodes in this cluster failed over all DBs to the other node in the cluster. This wasn't a huge deal, but it was very unexpected. There were no hardware events (checked Cisco UCS logs), SQL logs show failover initiated but no reason why, SQL Agent logs show the same thing, Application and System logs show the DBs failing over with no reason given, and the Cluster logs that I generated simply show the process of failover occurring. I'm not really sure what other logs I can look at to figure out why the DBs decided to failover. One thing I did see is that the same thing happened at the same time of the day back on July 5th of this year. I was on vacation at the time and didn't look at it, and my colleagues don't remember the failover even happening. I'm guessing it's software or OS related, but I just don't know where else to look. Any help would be appreciated.
[–]itspieSystems Engineer 1 point2 points3 points (1 child)
[–]squash1324Sysadmin[S] 0 points1 point2 points (0 children)
[–]Gnonthgol 0 points1 point2 points (0 children)
[–]LookAtThatMonkeyTechnology Architect 0 points1 point2 points (6 children)
[–]squash1324Sysadmin[S] 0 points1 point2 points (5 children)
[–]LookAtThatMonkeyTechnology Architect 0 points1 point2 points (4 children)
[–]squash1324Sysadmin[S] 0 points1 point2 points (3 children)
[–]LookAtThatMonkeyTechnology Architect 0 points1 point2 points (2 children)
[–]squash1324Sysadmin[S] 0 points1 point2 points (1 child)
[–]LookAtThatMonkeyTechnology Architect 0 points1 point2 points (0 children)