Kubernetes Alerts
This table contains the alerts configured for Kubernetes.
| Alert name | Description |
|---|---|
| KubeStateMetricsListErrors | kube-state-metrics is experiencing errors at an elevated rate in list operations. This is likely causing it to not be able to expose metrics about Kubernetes objects correctly or at all. |
| KubeStateMetricsWatchErrors | kube-state-metrics is experiencing errors at an elevated rate in watch operations. This is likely causing it to not be able to expose metrics about Kubernetes objects correctly or at all. |
| KubePodCrashLooping | Pod is crash looping. |
| KubePodNotReady | Pod has been in a non-ready state for more than 15 minutes. |
| KubeDeploymentGenerationMismatch | Deployment generation mismatch due to possible roll-back |
| KubeDeploymentReplicasMismatch | Deployment has not matched the expected number of replicas. |
| KubeStatefulSetReplicasMismatch | StatefulSet generation mismatch due to possible roll-back |
| KubeStatefulSetUpdateNotRolledOut | StatefulSet update has not been rolled out. |
| KubeDaemonSetRolloutStuck | DaemonSet rollout is stuck. |
| KubeContainerWaiting | Pod container waiting longer than 1 hour |
| KubeDaemonSetNotScheduled | DaemonSet pods are not scheduled. |
| KubeDaemonSetMisScheduled | DaemonSet pods are misscheduled. |
| KubeJobCompletion | Job did not complete in time. |
| KubeJobFailed | Job failed to complete. |
| KubeHpaReplicasMismatch | HPA has not matched the desired number of replicas. |
| KubeHpaMaxedOut | HPA is running at max replicas. |
| KubePersistentVolumeFillingUpCritical | PersistentVolume is filling up. |
| KubePersistentVolumeFillingUp | PersistentVolume is filling up. |
| KubePersistentVolumeErrors | PersistentVolume is having issues with provisioning. |
| KubeClientCertificateExpiration | Client certificate is about to expire. |
| KubeClientCertificateExpirationCritical | Client certificate is about to expire. |
| KubeAggregatedAPIErrors | Kubernetes aggregated API has reported errors. |
| KubeAggregatedAPIDown | Kubernetes aggregated API is down. |
| KubeAPIDown | Target disappeared from Prometheus target discovery. |
| KubeNodeNotReady | Node is not ready. |
| KubeNodeUnreachable | Node is unreachable. |
| KubeletTooManyPods | Kubelet is running at capacity. |
| KubeNodeReadinessFlapping | Node readiness status is flapping. |
| KubeletPlegDurationHigh | Kubelet Pod Lifecycle Event Generator is taking too long to relist. |
| KubeletPodStartUpLatencyHigh | Kubelet Pod startup latency is too high. |
| KubeletClientCertificateExpiration | Kubelet client certificate is about to expire. |
| KubeletClientCertificateExpirationCritical | Kubelet client certificate is about to expire. |
| KubeletServerCertificateExpiration | Kubelet server certificate is about to expire. |
| KubeletServerCertificateExpirationCritical | Kubelet server certificate is about to expire. |
| KubeletClientCertificateRenewalErrors | Kubelet has failed to renew its client certificate. |
| KubeletDown | Target disappeared from Prometheus target discovery. |
| KubeVersionMismatch | Different semantic versions of Kubernetes components running |
| KubeClientErrors | Kubernetes API server client is experiencing errors. |