Available since: replication-manager v3.1.26
Intervention mode — also called Mute — silences all alert notifications for a cluster or globally across all clusters during planned maintenance. Unlike the scheduled alert blackout window which uses a fixed cron expression, intervention mode is started and stopped on demand from the GUI or API, with optional scheduling and auto-unmute.
When intervention mode is active:
All suppressed notifications are counted and shown as "blocked notifications" in the GUI badge and intervention history.
Monitoring, state detection, failover, and switchover continue to operate normally — only the notification delivery is muted.
Exception — intervention start/stop notifications are always delivered:
| Event | Level | When sent | Message |
|---|---|---|---|
| Scheduled | ALERT | Immediately when scheduled | "Intervention scheduled by {user} at {time}: {reason} — notifications will be muted" |
| Started | ALERT | Before muting activates | "Intervention started by {user}: {reason} (scope: {scope}) — notifications are now muted" |
| Ended | ALERTOK | After unmuting | "Intervention ended by {user}: {reason}. Duration: {duration}. Suppressed alerts: {count} — notifications resumed" |
These three notifications bypass the mute so that all team members are informed when maintenance begins and ends, and how many alerts were silenced.
Two warning codes track intervention status in the cluster state machine:
| Code | Meaning |
|---|---|
| WARN0172 | Intervention scheduled at a future time — not yet active |
| WARN0173 | Intervention active — notifications muted |
These states appear in the cluster's warning badge and are visible in the GUI alongside other warnings. They are set on every monitoring tick while the condition holds and cleared automatically when the intervention starts (WARN0172) or ends (WARN0173).
| Scope | Effect | Started from |
|---|---|---|
| Cluster | Mutes a single cluster | Cluster view → Mute badge |
| Global | Mutes all clusters simultaneously | Home view (no cluster selected) → Mute badge |
A global intervention starts a separate intervention on every cluster with scope: "global". Ending a global intervention ends all of them at once via the "Close All" button.
The Mute badge appears in the navbar next to Blockers and Warnings:
Clicking the badge opens the Intervention Panel which shows:
The Intervention Modal requires:
| Field | Description | Default |
|---|---|---|
| Description | Free-text reason for the intervention (required) | — |
| Start time | When the intervention begins | Now |
| Estimated duration | How long the maintenance is expected to last | 30 minutes |
| Auto-unmute | Automatically end the intervention after the estimated duration | Checked |
An intervention ends when:
interventions.json)All interventions are persisted to {working-dir}/interventions.json per cluster. The file stores:
The history records who started the intervention, why, when it started and ended, and how many notifications were blocked. This provides an audit trail for maintenance activities.
The GUI shows the last 5 interventions in the panel. The full history is available in the JSON file.
POST /api/clusters/{clusterName}/actions/intervention-start
Request body (JSON):
{
"reason": "Rolling restart for v3.1.26",
"startAt": "2026-06-02T14:00:00Z",
"endAt": "2026-06-02T15:00:00Z"
}
| Field | Required | Description |
|---|---|---|
reason |
Yes | Free-text description of the maintenance |
startAt |
No | ISO 8601 start time. Omit or set to now for immediate start. Future time schedules the intervention. |
endAt |
No | ISO 8601 auto-unmute time. Omit for manual-only unmute. |
POST /api/clusters/{clusterName}/actions/intervention-end
Ends the active intervention on the specified cluster.
POST /api/actions/intervention-start
Same request body as cluster-level. Starts an intervention on all clusters with scope: "global".
POST /api/actions/intervention-end
Ends the active intervention on all clusters.
Alert Blackout (scheduler-alert-disable) |
Intervention Mode (Mute) | |
|---|---|---|
| Trigger | Cron schedule | On-demand (GUI or API) |
| Scope | Per-cluster config | Per-cluster or global |
| Duration | Fixed (scheduler-alert-disable-time) |
Operator-defined with optional auto-unmute |
| History | No audit trail | Full history with user, reason, blocked count |
| Warning state | No | WARN0172 (scheduled), WARN0173 (active) |
| Available since | v2.1.0 | v3.1.26 |
Both can be active simultaneously. When either is active, notifications are suppressed.