K8s Troubleshooting — Pod Zombie Process
K8s Troubleshooting handbook
Recently, our K8s cluster ran into zombie process issues. Pods could not be deleted or created, and we could not even SSH into the affected node. We found a lot of defunct processes in many Pods. The symptom inside a Pod looks like this:
CPU: 0% usr 0% sys 0% nic 98% idle 0% io 0% irq 0% sirq
Load average: 0.02 0.39 0.46 4/7217 25257…
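To confirm that defunct processes are piling up, one option is to scan `/proc` for processes in state `Z`. The following is a minimal sketch, not the tooling we actually used; it assumes a Linux `/proc` filesystem and a Python interpreter available in the Pod or on the node. The helper names `parse_stat` and `find_zombies` are hypothetical.

```python
import os


def parse_stat(text):
    """Parse the contents of /proc/<pid>/stat into (pid, comm, state, ppid)."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    left = text.index("(")
    right = text.rindex(")")
    pid = int(text[:left].strip())
    comm = text[left + 1:right]
    fields = text[right + 1:].split()
    state, ppid = fields[0], int(fields[1])
    return pid, comm, state, ppid


def find_zombies(proc_root="/proc"):
    """Return a list of (pid, ppid, comm) for every process in state 'Z'."""
    zombies = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as 'meminfo'
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                pid, comm, state, ppid = parse_stat(f.read())
        except (OSError, ValueError, IndexError):
            continue  # process exited during the scan, or stat was unreadable
        if state == "Z":
            zombies.append((pid, ppid, comm))
    return zombies


if __name__ == "__main__":
    for pid, ppid, comm in find_zombies():
        print(f"zombie pid={pid} ppid={ppid} comm={comm}")
```

The `ppid` column is the useful part: a zombie only disappears once its parent reaps it with `wait()`, so many zombies sharing one parent PID point at the process that is failing to do so.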