900 Series Alarm Messages¶
Alarm Severities
One or more of the following severity levels is associated with each alarm.
Critical
Indicates that a platform service affecting condition has occurred and immediate corrective action is required. (A mandatory platform service has become totally out of service and its capability must be restored.)
Major
Indicates that a platform service affecting condition has developed and urgent corrective action is required. (A mandatory platform service has developed a severe degradation and its full capability must be restored.)
- or -
An optional platform service has become totally out of service and its capability should be restored.
Minor
Indicates that a platform non-service affecting fault condition has developed and corrective action should be taken in order to prevent a more serious fault. (The fault condition is not currently impacting / degrading the capability of the platform service.)
Warning
Indicates the detection of a potential or impending service affecting fault. Action should be taken to further diagnose and correct the problem in order to prevent it from becoming a more serious service affecting fault.
Alarm ID: 900.001 |
Patching operation in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Complete reboots of affected hosts. |
Management Affecting Severity |
warning |
Alarm ID: 900.002 |
Patch host install failure. Command “sw-patch host-install” failed. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log) |
Management Affecting Severity |
warning |
Alarm ID: 900.003 |
A patch with state ‘obsolete’ in its metadata has been uploaded. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
warning |
Proposed Repair Action |
Remove and delete obsolete patches. |
Management Affecting Severity |
warning |
Alarm ID: 900.004 |
The upgrade and running software version do not match. Command host-upgrade failed. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Reinstall host to update applied load. |
Management Affecting Severity |
warning |
Alarm ID: 900.005 |
System Upgrade in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
No action required. |
Management Affecting Severity |
warning |
Alarm ID: 900.006 |
Device image update operation in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Complete reboots of affected hosts. |
Management Affecting Severity |
warning |
Alarm ID: 900.007 |
Kubernetes upgrade in progress. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
No action required. |
Management Affecting Severity |
warning |
Alarm ID: 900.008 |
Kubernetes rootca update in progress |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Wait for kubernetes rootca procedure to complete |
Management Affecting Severity |
warning |
Alarm ID: 900.009 |
Kubernetes root CA update aborted, certificates may not be fully updated. Command “system kube-rootca-update-abort” has been run. |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Fully update certificates by a new root CA update. |
Management Affecting Severity |
warning |
Alarm ID: 900.010 |
System Config update in progress |
Entity Instance |
host=controller |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Wait for system config update to complete |
Management Affecting Severity |
warning |
Alarm ID: 900.011 |
System Config update aborted, configurations may not be fully updated |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
none |
Severity: |
minor |
Proposed Repair Action |
Lock the host, wait for the host resource in the deployment namespace to become in-sync, then unlock the host |
Management Affecting Severity |
warning |
Alarm ID: 900.101 |
Software patch auto-apply in progress |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software patch auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.102 |
Software patch auto-apply aborting |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software patch auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.103 |
Software patch auto-apply failed. Command “sw-manager patch-strategy apply” failed. |
Entity Instance |
orchestration=sw-patch |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply software patches manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.201 |
Software upgrade auto-apply in progress |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software upgrade auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.202 |
Software upgrade auto-apply aborting |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for software upgrade auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.203 |
Software upgrade auto-apply failed. Command “sw-manager update-strategy apply” failed |
Entity Instance |
orchestration=sw-upgrade |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply software upgrade manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.301 |
Firmware Update auto-apply in progress |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for firmware update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.302 |
Firmware Update auto-apply aborting |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for firmware update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.303 |
Firmware Update auto-apply failed. Command “sw-manager kube-rootca-update-strategy apply” failed. |
Entity Instance |
orchestration=fw-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply firmware update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.501 |
Kubernetes rootca update auto-apply in progress |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for kubernetes rootca update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.502 |
Kubernetes rootca update auto-apply aborting |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for kubernetes rootca update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.503 |
Kubernetes rootca update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed. |
Entity Instance |
orchestration=kube-rootca-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply kubernetes rootca update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.601 |
System config update auto-apply in progress |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for system config update auto-apply to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.602 |
System config update auto-apply aborting |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
major |
Proposed Repair Action |
Wait for system config update auto-apply abort to complete; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.603 |
System config update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed |
Entity Instance |
orchestration=system-config-update |
Degrade Affecting Severity: |
none |
Severity: |
critical |
Proposed Repair Action |
Attempt to apply system config update manually; if problem persists contact next level of support |
Management Affecting Severity |
warning |
Alarm ID: 900.701 |
Node <hostname> tainted. |
Entity Instance |
host=<hostname> |
Degrade Affecting Severity: |
major |
Severity: |
major |
Proposed Repair Action |
“Execute ‘kubectl taint nodes <hostname> services=disabled:NoExecute-’ If it fails, Execute ‘system host-lock <hostname>’ followed by ‘system host-unlock <hostname>’. If issue still persists, contact next level of support.” |
Management Affecting Severity |
warning |