900 Series Alarm Messages¶
Alarm Severities
One or more of the following severity levels is associated with each alarm.
CriticalIndicates that a platform service affecting condition has occurred and immediate corrective action is required. (A mandatory platform service has become totally out of service and its capability must be restored.)
MajorIndicates that a platform service affecting condition has developed and urgent corrective action is required. (A mandatory platform service has developed a severe degradation and its full capability must be restored.)
- or -
An optional platform service has become totally out of service and its capability should be restored.
MinorIndicates that a platform non-service affecting fault condition has developed and corrective action should be taken in order to prevent a more serious fault. (The fault condition is not currently impacting / degrading the capability of the platform service.)
WarningIndicates the detection of a potential or impending service affecting fault. Action should be taken to further diagnose and correct the problem in order to prevent it from becoming a more serious service affecting fault.
Alarm ID: 900.001 |
Patching operation in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Complete reboots of affected hosts. |
Management Affecting Severity |
warning |
Alarm ID: 900.002 |
Patch host install failure. Command “sw-patch host-install” failed. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log) |
Management Affecting Severity |
warning |
Alarm ID: 900.003 |
A patch with state ‘obsolete’ in its metadata has been uploaded. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
warning |
Proposed Repair Action |
Remove and delete obsolete patches. |
Management Affecting Severity |
warning |
Alarm ID: 900.004 |
The upgrade and running software version do not match. Command host-upgrade failed. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Reinstall host to update applied load. |
Management Affecting Severity |
warning |
Alarm ID: 900.005 |
System Upgrade in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
No action required. |
Management Affecting Severity |
warning |
Alarm ID: 900.006 |
Device image update operation in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Complete reboots of affected hosts. |
Management Affecting Severity |
warning |
Alarm ID: 900.007 |
Kubernetes upgrade in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
No action required. |
Management Affecting Severity |
warning |
Alarm ID: 900.008 |
Kubernetes rootca update in progress |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Wait for kubernetes rootca procedure to complete |
Management Affecting Severity |
warning |
Alarm ID: 900.009 |
Kubernetes root CA update aborted, certificates may not be fully updated. Command “system kube-rootca-update-abort” has been run. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Fully update certificates by a new root CA update. |
Management Affecting Severity |
warning |
Alarm ID: 900.010 |
System Config update in progress |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Wait for system config update to complete |
Management Affecting Severity |
warning |
Alarm ID: 900.011 |
System Config update aborted, configurations may not be fully updated |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Lock the host, wait for the host resource in the deployment namespace to become in-sync, then unlock the host |
Management Affecting Severity |
warning |
Alarm ID: 900.101 |
Software patch auto-apply in progress |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software patch auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.102 |
Software patch auto-apply aborting |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software patch auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.103 |
Software patch auto-apply failed. Command “sw-manager patch-strategy apply” failed. |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply software patches manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.201 |
Software upgrade auto-apply in progress |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software upgrade auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.202 |
Software upgrade auto-apply aborting |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software upgrade auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.203 |
Software upgrade auto-apply failed. Command “sw-manager update-strategy apply” failed |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply software upgrade manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.301 |
Firmware Update auto-apply in progress |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for firmware update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.302 |
Firmware Update auto-apply aborting |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for firmware update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.303 |
Firmware Update auto-apply failed. Command “sw-manager kube-rootca-update-strategy apply” failed. |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply firmware update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.501 |
Kubernetes rootca update auto-apply in progress |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for kubernetes rootca update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.502 |
Kubernetes rootca update auto-apply aborting |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for kubernetes rootca update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.503 |
Kubernetes rootca update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed. |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply kubernetes rootca update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.601 |
System config update auto-apply in progress |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for system config update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.602 |
System config update auto-apply aborting |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for system config update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.603 |
System config update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply system config update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.701 |
Node <hostname> tainted. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
major |
Severity: |
major |
Proposed Repair Action |
“Execute ‘kubectl taint nodes <hostname> services=disabled:NoExecute-’ If it fails, Execute ‘system host-lock <hostname>’ followed by ‘system host-unlock <hostname>’. If issue still persists, contact next level of support.” |
Management Affecting Severity |
warning |