Troubleshooting & Incident Cases

Each case includes Problem, Solution, Tools, Outcome, and a GitHub link.

K8s: CrashLoopBackOff

Pod restarts repeatedly after deploy.
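As a first triage step, the pod's status can be inspected for containers that are waiting with reason CrashLoopBackOff, together with how the previous run ended. The sketch below is a minimal illustration, not the method used in the linked case: the function name crashlooping_containers is hypothetical, and it assumes the input is the JSON object printed by kubectl get pod <pod> -o json.

```python
def crashlooping_containers(pod):
    """Return one summary dict per container waiting in CrashLoopBackOff.

    `pod` is assumed to be the parsed output of `kubectl get pod <pod> -o json`.
    """
    findings = []
    for cs in pod.get("status", {}).get("containerStatuses", []):
        waiting = cs.get("state", {}).get("waiting") or {}
        if waiting.get("reason") != "CrashLoopBackOff":
            continue
        # lastState.terminated describes how the previous run of the container ended.
        last = cs.get("lastState", {}).get("terminated") or {}
        findings.append({
            "container": cs.get("name"),
            "restarts": cs.get("restartCount", 0),
            "last_exit_code": last.get("exitCode"),
            "last_reason": last.get("reason"),
        })
    return findings
```

The exit code and reason of the last run (for example OOMKilled, or a non-zero application exit) usually indicate where to look next; kubectl logs <pod> --previous shows the output of the crashed run.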

K8s: ImagePullBackOff

Cluster cannot pull the container image.
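A pull failure is reported on the affected container as a waiting state with reason ErrImagePull or ImagePullBackOff, and the accompanying message often names the cause (wrong tag, missing credentials, unreachable registry). The following is a small sketch under the assumption that the input is the parsed JSON from kubectl get pod <pod> -o json; image_pull_failures is a hypothetical helper name.

```python
PULL_FAILURE_REASONS = {"ErrImagePull", "ImagePullBackOff"}

def image_pull_failures(pod):
    """Return (container, image, message) for each container whose image pull failed."""
    status = pod.get("status", {})
    # Init containers pull images too, so both status lists are checked.
    statuses = status.get("initContainerStatuses", []) + status.get("containerStatuses", [])
    failures = []
    for cs in statuses:
        waiting = cs.get("state", {}).get("waiting") or {}
        if waiting.get("reason") in PULL_FAILURE_REASONS:
            failures.append((cs.get("name"), cs.get("image"), waiting.get("message", "")))
    return failures
```

The image string returned here is worth comparing character by character against the registry, since a mistyped repository or tag is a common cause.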

K8s: Ingress 404/503

Ingress exists but returns 404 or 503.
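With controllers such as ingress-nginx, a 404 often means that no rule matched the request's host and path, while a 503 often means a rule matched but its backend Service has no ready endpoints; other controllers may behave differently. The sketch below checks the first half of that: whether a given host and path match any rule in an Ingress object (networking.k8s.io/v1, as printed by kubectl get ingress <name> -o json). The function names are hypothetical, ImplementationSpecific paths are treated as Prefix (an assumption), and the first match is returned rather than the longest one a real controller would prefer.

```python
def host_matches(rule_host, request_host):
    """An empty rule host matches everything; '*.' covers exactly one DNS label."""
    if not rule_host:
        return True
    if rule_host.startswith("*."):
        first, _, rest = request_host.partition(".")
        return bool(first) and rest == rule_host[2:]
    return rule_host == request_host

def path_matches(rule_path, path_type, request_path):
    """Exact compares whole paths; anything else is matched per path element."""
    if path_type == "Exact":
        return request_path == rule_path
    rule_parts = [p for p in rule_path.split("/") if p]
    req_parts = [p for p in request_path.split("/") if p]
    # '/api' matches '/api' and '/api/v1' but not '/apix'.
    return req_parts[:len(rule_parts)] == rule_parts

def find_backend(ingress, host, path):
    """Return (service_name, port) for the first matching rule, or None."""
    for rule in ingress.get("spec", {}).get("rules", []):
        if not host_matches(rule.get("host", ""), host):
            continue
        for p in rule.get("http", {}).get("paths", []):
            if not path_matches(p.get("path", "/"), p.get("pathType", "Prefix"), path):
                continue
            svc = p.get("backend", {}).get("service")
            if not svc:
                continue  # resource backends are out of scope for this sketch
            port = svc.get("port", {})
            return svc.get("name"), port.get("number", port.get("name"))
    return None
```

If find_backend returns None, the rule set is the likely culprit. If it returns a service, the next check is whether that Service has endpoints (kubectl get endpoints <service>).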

DNS: Not resolving

Hostname does not resolve or resolves incorrectly.
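The two symptoms can be told apart with a small check that resolves the name and compares the result against the addresses that are expected. This is a sketch only: check_dns and its return strings are hypothetical, and socket.getaddrinfo goes through the system resolver (including the hosts file), so it shows what applications on this machine see rather than what a specific DNS server answers.

```python
import socket

def resolve(hostname, resolver=socket.getaddrinfo):
    """Return the sorted unique addresses for hostname, or [] if resolution fails."""
    try:
        infos = resolver(hostname, None)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the address.
    return sorted({info[4][0] for info in infos})

def check_dns(hostname, expected, resolver=socket.getaddrinfo):
    """Classify the result as 'ok', 'not resolving', or 'resolves incorrectly: ...'."""
    actual = resolve(hostname, resolver)
    if not actual:
        return "not resolving"
    # Any address outside the expected set counts as an incorrect answer.
    if not set(actual) <= set(expected):
        return "resolves incorrectly: " + ", ".join(actual)
    return "ok"
```

The resolver parameter exists so the check can be exercised without network access; in normal use it is left at its default.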

Terraform: State lock

Terraform apply blocked by an existing state lock.
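When the lock is held, Terraform's error output normally includes a "Lock Info:" block with the lock ID and who created it. The sketch below pulls those fields out of captured error text and builds the matching terraform force-unlock command without running it. The helper names are hypothetical, and the parser assumes the usual indented "Key: value" layout, optionally prefixed by the box-drawing bar that newer Terraform versions print; the exact layout may vary by version and backend.

```python
import re

def parse_lock_info(error_text):
    """Extract the key/value pairs listed under 'Lock Info:' in the error text."""
    info = {}
    in_block = False
    for raw in error_text.splitlines():
        # Drop the leading box-drawing bar used by newer diagnostics, if present.
        line = re.sub(r"^[│╷╵]\s?", "", raw)
        if line.strip() == "Lock Info:":
            in_block = True
            continue
        if in_block:
            m = re.match(r"\s+(\w+):\s*(.*)$", line)
            if not m:
                break  # a blank or unindented line ends the block
            info[m.group(1)] = m.group(2).strip()
    return info

def force_unlock_command(info):
    """Build (but do not run) the force-unlock command for the parsed lock."""
    if not info.get("ID"):
        raise ValueError("no lock ID found in error text")
    return ["terraform", "force-unlock", info["ID"]]
```

The Who and Created fields are worth checking before unlocking: forcing the lock while another apply is still running can corrupt the state.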

Linux: systemd failing

Service fails to start or keeps restarting.
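The two symptoms map onto unit properties that systemctl show exposes: a unit that failed has ActiveState=failed, while one caught in a restart loop typically shows SubState=auto-restart and a growing NRestarts counter. The sketch below reads those properties and summarises them; the function names and summary strings are hypothetical, and NRestarts is assumed to be available (it is absent on older systemd versions).

```python
import subprocess

PROPS = ["ActiveState", "SubState", "Result", "NRestarts", "ExecMainStatus"]

def parse_show(output):
    """Parse the KEY=VALUE lines printed by 'systemctl show' into a dict."""
    return dict(line.split("=", 1) for line in output.splitlines() if "=" in line)

def unit_properties(unit):
    """Query the selected properties of a unit via systemctl."""
    result = subprocess.run(
        ["systemctl", "show", unit, "--property=" + ",".join(PROPS)],
        capture_output=True, text=True, check=True,
    )
    return parse_show(result.stdout)

def diagnose(props):
    """Summarise whether the unit failed, is restart-looping, or looks healthy."""
    restarts = int(props.get("NRestarts") or 0)
    if props.get("ActiveState") == "failed":
        return "failed: result {}, exit status {}".format(
            props.get("Result"), props.get("ExecMainStatus"))
    if props.get("SubState") == "auto-restart" or restarts > 0:
        return "restarting: {} restarts, last result {}".format(
            restarts, props.get("Result"))
    return "no failure recorded"
```

For example, diagnose(unit_properties("myapp.service")) gives a one-line summary for a hypothetical unit; journalctl -u <unit> then shows the log lines behind the reported result.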