Leader election clusters are recommended in these cases:
Trade-offs include:
Leader election asynchronous clusters can guarantee continuity of service at no performance cost for the leader and in specific conditions achieve zero data loss.
In production deployments, hardware failures occur infrequently. When failures happen, repairing the replication stream quickly is critical. replication-manager facilitates rapid recovery and tracks failover SLA (Service Level Availability) to provide historical view of replication stream latency. This enables routine switchovers for database maintenance.
Automatic failover in asynchronous clusters is not always desirable. replication-manager provides tunable settings to constrain the architecture state in which failover can occur.
Automatic failover scenarios can be classified into three SLA states:
Staying in sync
When replication can be monitored in sync, failover can be performed without data loss, provided that replication-manager waits for all replicated events to be applied to the elected replica before re-opening traffic. To maintain this state, use the settings described in the next section.