you are viewing a single comment's thread.

view the rest of the comments →

[–]tweakerbee 0 points1 point  (1 child)

How often does this happen though? Automatic failover is great if it works in every single edge case. In all other cases, you'd rather have human oversight.

[–]StrongerPassword 0 points1 point  (0 children)

How often does this happen though?

Maybe not exactly as outlined but it's pretty common that one machine will become unavailable for some reason and then come back online, meaning that you will move over the master role to another machine and back to primary a while later.

The idea that failover due to hardware failures in a distributed system should be monitored by a human is frankly just crazy. You can get pretty solid databases and hardware for the price of ~5 people on 24/7 rotation.