Klaus Darilion
2018-09-07 08:27:40 UTC
Hi!
I have a HA question if the cluster network partitions.
E.g. 3 nodes. VM100 is running on node 3.
Suddenly the network breaks and node3 is isolated. Hence, node3 is alone
without quroum, node1+2 form a new groub with quorum.
What happens now exactly when HA is configured for VM100?
According to https://pve.proxmox.com/wiki/High_Availability node 3 will
reboot after 60 seconds ("When a cluster member determines that it is no
longer in the cluster quorum, the LRM waits for a new quorum to form. As
long as there is no quorum the node cannot reset the watchdog. This will
trigger a reboot after the watchdog then times out, this happens after
60 seconds.")
But what is the timing for starting VM100 on another node? Is it
guaranteed that this only happens after 60 seconds? (avoiding concurrent
access to the shared storage, and service-network on node3 may still be
functional although the cluster network broke)
Thanks
Klaus
I have a HA question if the cluster network partitions.
E.g. 3 nodes. VM100 is running on node 3.
Suddenly the network breaks and node3 is isolated. Hence, node3 is alone
without quroum, node1+2 form a new groub with quorum.
What happens now exactly when HA is configured for VM100?
According to https://pve.proxmox.com/wiki/High_Availability node 3 will
reboot after 60 seconds ("When a cluster member determines that it is no
longer in the cluster quorum, the LRM waits for a new quorum to form. As
long as there is no quorum the node cannot reset the watchdog. This will
trigger a reboot after the watchdog then times out, this happens after
60 seconds.")
But what is the timing for starting VM100 on another node? Is it
guaranteed that this only happens after 60 seconds? (avoiding concurrent
access to the shared storage, and service-network on node3 may still be
functional although the cluster network broke)
Thanks
Klaus