Self-Healing & Health Checks in Kubernetes

Kubernetes provides powerful built-in features to automatically detect failures and recover applications without manual intervention. This makes applications more reliable and production-ready.

👉 In simple words: Kubernetes can automatically detect problems and fix broken applications.

What is Self-Healing?

Self-healing is a core feature of Kubernetes that allows it to automatically restart, replace, or reschedule failed containers and pods to maintain the desired state.

👉 Instead of manually fixing issues, Kubernetes ensures your application keeps running smoothly.

How Kubernetes Detects Failures

Kubernetes continuously monitors the health of applications and detects issues using multiple mechanisms:

Container exit status → Detects if a container has stopped or crashed.
Health checks (probes) → Verify whether the application is running correctly.
Node failures → Detected when a worker node stops reporting its status to the control plane.

Pod Restart Policy

The Pod Restart Policy (the restartPolicy field in the pod spec) defines what Kubernetes should do when a container inside a pod exits or crashes. It applies to every container in the pod.

Restart Policy Options:

Always → The container is restarted every time it exits, whatever the exit code. This is the default, and the only value allowed for pods managed by a Deployment.
OnFailure → The container is restarted only if it exits with a non-zero exit code.
Never → The container is not restarted after it exits.

Example

apiVersion: v1
kind: Pod
metadata:
  name: restart-pod
spec:
  restartPolicy: OnFailure
  containers:
  - name: my-container
    image: busybox
    command: ["sh", "-c", "exit 1"]

What This Configuration Does

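Sets the restart policy to OnFailure.
The container exits with an error (exit 1) as soon as it starts.
Kubernetes automatically restarts the container inside the same pod. Because the command fails every time, the restarts repeat with a growing delay and the pod eventually shows CrashLoopBackOff.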
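
For contrast, here is a minimal sketch of the same policy when the container exits cleanly. The pod name success-pod and the echoed message are assumptions made up for this illustration. Because the exit code is 0, OnFailure should not trigger a restart, and the pod should end up with the status Completed.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: success-pod            # hypothetical name for this sketch
spec:
  restartPolicy: OnFailure     # restart only on a non-zero exit code
  containers:
  - name: my-container
    image: busybox
    # Exits with code 0, so the container is left stopped, not restarted
    command: ["sh", "-c", "echo done; exit 0"]
```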

Liveness Probe (Is app alive?)

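A Liveness Probe checks whether the application inside a container is still working, not just whether its process exists. It catches cases such as a deadlock, where the process is running but can no longer make progress.

If the probe fails several times in a row (failureThreshold, 3 by default), Kubernetes assumes the application is unhealthy, kills the container, and restarts it according to the restart policy.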

Example

apiVersion: v1
kind: Pod
metadata:
  name: liveness-pod
spec:
  containers:
  - name: my-app
    image: nginx
    livenessProbe:
      httpGet:
        path: /
        port: 80
      initialDelaySeconds: 5
      periodSeconds: 10
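
HTTP is not the only probe type. The sketch below uses an exec probe instead and deliberately breaks itself so the restart can be observed. The pod name, the /tmp/healthy marker file, and the timings are assumptions chosen for illustration.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: liveness-exec-pod      # hypothetical name
spec:
  containers:
  - name: my-app
    image: busybox
    # Create a marker file, then delete it after 30s to simulate a hung app
    command: ["sh", "-c", "touch /tmp/healthy; sleep 30; rm -f /tmp/healthy; sleep 600"]
    livenessProbe:
      exec:
        command: ["cat", "/tmp/healthy"]   # non-zero exit counts as a failure
      initialDelaySeconds: 5
      periodSeconds: 5
      failureThreshold: 3      # restart after 3 consecutive failures (the default)
```

After roughly 30 seconds the probe should start failing, and kubectl describe pod liveness-exec-pod should list the failed probes followed by a container restart.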

Readiness Probe (Is app ready?)

A Readiness Probe checks whether the application is ready to receive traffic.

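If the probe fails, Kubernetes removes the pod from the endpoints of any matching Service, so no traffic is sent to it. The container is not restarted, and the pod receives traffic again once the probe succeeds.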

Example

apiVersion: v1
kind: Pod
metadata:
  name: readiness-pod
spec:
  containers:
  - name: my-app
    image: nginx
    readinessProbe:
      httpGet:
        path: /
        port: 80
      initialDelaySeconds: 5
      periodSeconds: 10
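
Readiness only has a visible effect when something routes traffic to the pod, typically a Service. Here is a minimal sketch, assuming a label app: my-app and a Service named my-app-svc (both hypothetical). While the probe fails, the pod should be left out of the Service's endpoints.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: readiness-demo-pod     # hypothetical name
  labels:
    app: my-app                # label the Service selects on
spec:
  containers:
  - name: my-app
    image: nginx
    ports:
    - containerPort: 80
    readinessProbe:
      httpGet:
        path: /
        port: 80
      periodSeconds: 10
---
apiVersion: v1
kind: Service
metadata:
  name: my-app-svc             # hypothetical name
spec:
  selector:
    app: my-app                # only Ready pods with this label get traffic
  ports:
  - port: 80
    targetPort: 80
```

Running kubectl get endpoints my-app-svc should list the pod's IP only while the pod is Ready.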

Startup Probe

A Startup Probe is used for slow-starting applications to give them enough time to initialize before other health checks begin.

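While the startup probe has not yet succeeded, liveness and readiness probes are held back, so Kubernetes does not kill or restart the container just because the application is slow to start. If the startup probe never succeeds within its allowed number of attempts, the container is killed and the restart policy applies.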

It helps to:

Give the application time to fully start.
Prevent unnecessary restarts during initialization.
Delay Liveness and Readiness checks until startup is complete.

Example

apiVersion: v1
kind: Pod
metadata:
  name: startup-pod
spec:
  containers:
  - name: my-app
    image: nginx
    startupProbe:
      httpGet:
        path: /
        port: 80
      failureThreshold: 30
      periodSeconds: 10
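
In practice a startup probe is usually paired with a liveness probe on the same endpoint. The sketch below reuses the nginx image from the example above; the pod name is hypothetical. The startup budget is failureThreshold × periodSeconds = 30 × 10 = 300 seconds, and the liveness probe only begins once the startup probe has succeeded.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: startup-liveness-pod   # hypothetical name
spec:
  containers:
  - name: my-app
    image: nginx
    startupProbe:
      httpGet:
        path: /
        port: 80
      failureThreshold: 30     # up to 30 attempts...
      periodSeconds: 10        # ...10s apart = 300s to finish starting
    livenessProbe:
      httpGet:
        path: /
        port: 80
      periodSeconds: 10        # runs only after the startup probe succeeds
      failureThreshold: 3
```

Keeping the liveness thresholds tight and putting the long wait in the startup probe means a slow start is tolerated, while a hang after startup is still caught quickly.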

CrashLoopBackOff Explained

CrashLoopBackOff is a common Kubernetes pod status, not an error in itself. It appears when a container keeps crashing shortly after starting and Kubernetes is waiting before the next restart attempt.

The cycle looks like this:

Start → Crash → Restart → Crash → Repeat

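👉 Kubernetes keeps restarting the container, but with an increasing delay between attempts (the back-off time). By default the delay starts at 10 seconds and doubles each time (10s, 20s, 40s, ...) up to a cap of 5 minutes. It resets once the container has run for 10 minutes without problems.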

Common Reasons

Wrong configuration (environment variables, ports, etc.).
Application crash or bug in code.
Missing dependencies or required services.
Incorrect command or entrypoint.

Example

Check pod status:

kubectl get pods

Output:

NAME        READY   STATUS             RESTARTS
my-app      0/1     CrashLoopBackOff   5

How to Debug

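Check logs of the container. If it has already restarted, add --previous to see the output of the crashed instance:

kubectl logs <pod_name>
kubectl logs <pod_name> --previous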

Describe the pod to see the last exit code, the restart count, and recent events:

kubectl describe pod <pod_name>

Real-World Understanding

Suppose your app has a wrong database URL:

Container starts
Fails to connect → crashes
Kubernetes restarts it
Same issue happens again → CrashLoopBackOff
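
The loop above can be reproduced with a small stand-in. This is only a simulation under stated assumptions: the pod name, the DATABASE_URL variable, and its value are hypothetical, and the shell command merely imitates an application that fails to connect and exits.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: crashloop-demo         # hypothetical name
spec:
  # restartPolicy is not set, so the default (Always) applies
  containers:
  - name: my-app
    image: busybox
    env:
    - name: DATABASE_URL       # hypothetical variable with a deliberately wrong value
      value: "db.invalid:5432"
    # Stand-in for an app that cannot reach its database and exits with an error
    command: ["sh", "-c", "echo \"cannot connect to $DATABASE_URL\" >&2; exit 1"]
```

After a few restarts, kubectl get pods should show the pod in CrashLoopBackOff, and kubectl logs crashloop-demo should print the connection message that points to the cause.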

👉 In simple words: CrashLoopBackOff means your container is stuck in a crash-restart loop and needs fixing.

Auto-Recovery of Failed Containers

Kubernetes provides automatic recovery mechanisms to ensure that your applications remain available and stable, even when failures occur.

It continuously monitors the system and takes corrective actions without manual intervention.

Kubernetes automatically:

Restarts failed containers → If a container crashes, it is restarted in place based on the restart policy.
Recreates pods → If a pod managed by a controller is deleted or fails, a replacement is created (via Deployment/ReplicaSet).
Reschedules pods → If a node fails, the controller creates replacement pods and the scheduler places them on healthy nodes. Pods created directly, without a controller, are not replaced.
Maintains desired state → Ensures the defined number of replicas is always running.

Example

Suppose you have a Deployment with 2 replicas:

spec:
  replicas: 2

What happens:

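One pod is deleted, evicted, or lost with its node (a container that merely crashes is restarted inside its existing pod)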
Kubernetes detects the failure
A new pod is automatically created
Total pods return to 2 running instances
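
For reference, a complete manifest around that fragment might look like the sketch below. The Deployment name, the app: my-app label, and the probe settings are assumptions for illustration; only replicas: 2 comes from the example above.

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app                 # hypothetical name
spec:
  replicas: 2                  # desired state: two pods at all times
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app            # must match the selector above
    spec:
      containers:
      - name: my-app
        image: nginx
        livenessProbe:         # restart the container if it stops responding
          httpGet:
            path: /
            port: 80
          periodSeconds: 10
        readinessProbe:        # keep traffic away until the app answers
          httpGet:
            path: /
            port: 80
          periodSeconds: 10
```

Deleting one pod with kubectl delete pod <pod_name> and then running kubectl get pods should show a replacement being created to bring the count back to 2.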

Conclusion

Kubernetes provides powerful self-healing capabilities that ensure applications remain stable and available even when failures occur.

Self-healing → Automatically detects and recovers from failures.
Liveness Probe → Restarts containers whose application has stopped responding.
Readiness Probe → Controls traffic so that only ready pods receive requests.
Startup Probe → Supports slow-starting applications.
Restart Policy → Defines how containers behave after failure.
CrashLoopBackOff → Indicates repeated failures that need debugging.
