Troubleshooting the “Failed to Create Pod Sandbox” Error

Kubernetes is a widely popular container orchestration tool, but its complexity can cause a number of problems as well. DevOps and cloud engineers deploying workloads to Kubernetes frequently deal with issues from failed health checks and unhealthy nodes to being unable to bind a volume to a name.

Luckily, Kubernetes has an active user base. Members of the community will readily answer questions and offer resources to help you through any situation.

This article will focus on one error message in particular: “failed to create pod sandbox”. You’ll learn what causes it to pop up when you’re creating a pod and how you can troubleshoot this error.

What causes `FailedCreatePodSandBox`?

This error usually occurs due to some issues with the networking, although it can also be caused by suboptimal system resource limit configuration. Since networking is one of the most complex parts of a Kubernetes setup, figuring out the exact cause requires you to have a good understanding of Kubernetes networking.

If pods are stuck in the ContainerCreating state, your first step is to check the pod status and get more details with the kubectl describe podname command. The output should provide a detailed error message, which you can use as a baseline for further investigation.

It’s important that you’re familiar with such error messages. In a production environment, you will need to fix the issue as soon as possible, and understanding common error messages will help you find the root cause quickly.

Following are all the possible messages you could see related to this error, as well as possible root causes and methods for fixing them.

Scenario 1: CNI not working on the node

The Kubernetes Container Network Interface (CNI) configures networking between pods. If CNI isn’t running properly on the nodes, pods can’t be created because they will be stuck in the ContainerCreating state.

Here’s how to simulate, troubleshoot, and fix this issue. Say you have a two-node (one control plane, one node) kubeadm cluster running on Kubernetes version 1.23.4, with Weave Net as your CNI. Weave Net runs as a DaemonSet.

To simulate the issue, you’ll prevent the Weave Net from running on your node. Follow these steps:

Step 1

Label your control plane node with label weave=yes (control-plane is your node name):

bash

Step 2

Edit the weave-net DaemonSet and add a node selector under spec.template.spec. This node selector will use label weave=yes, which ensures the CNI pod only runs on the control plane node:

bash

The CNI pod should be running only on the control plane node.

Step 3

Now, try to run a pod using kubectl run nginx --image=nginx. If you check the pod status, it should be stuck in the ContainerCreating state.

If you run the command kubectl describe pod nginx, you’ll see the error message “FailedCreatePodSandBox: failed to setup network for sandbox”.

Debugging and resolution

The error message indicates that CNI on the node—where nginx pod is scheduled to run—is not functioning properly, so the first step should be to check if the CNI pod is running on that node. If the CNI pod is running properly, you’ve eliminated one possible root cause.

In this case, once you remove the nodeSelector from the DaemonSet definition and ensure the CNI pod is running on the node, the nginx pod should be running fine.

Scenario 2: Missing or incorrect CNI configuration files

Even if the CNI pod is running, you may still experience problems if the CNI configuration files have errors. To simulate this, you’ll make some changes in the CNI configuration files, which are stored under the /etc/cni/net.d directory.