Troubleshooting
Diagnose and resolve cluster, node, and application issues in Kubernetes environments (30% of exam).
Troubleshooting is a significant domain in the Certified Kubernetes Administrator (CKA) exam, accounting for approximately 30% of the curriculum. It requires a hierarchical approach to diagnosis, moving from the Application layer to the Worker Nodes, and finally to the Control Plane. At the **Appl…
Concepts covered: Manage and evaluate container output streams, Troubleshoot services and networking, Debugging pods and containers, Analyzing control plane logs, Node troubleshooting and kubelet issues, Resource quota and limit troubleshooting, kubectl debug and ephemeral containers, etcd troubleshooting and backup, Certificate and authentication issues, Troubleshoot clusters and nodes, Troubleshoot cluster components, Monitor cluster and application resource usage
CKA - Troubleshooting Example Questions
Test your knowledge of Troubleshooting
Question 1
A Kubernetes administrator wants to determine which pods are using the most CPU resources in the 'development' namespace to optimize workload distribution. The metrics-server has been running for over an hour. What kubectl command would retrieve the CPU and memory metrics for pods in that specific namespace?
Question 2
A Pod named 'client-app' in namespace 'frontend' is attempting to connect to a Service named 'data-api' in namespace 'backend'. The Service is of type ClusterIP with port 443 and targetPort 8443. Running 'kubectl exec client-app -n frontend -- curl -k https://data-api.backend.svc.cluster.local:443' returns 'connection refused'. However, 'kubectl get endpoints data-api -n backend' shows three healthy endpoints. Running 'kubectl exec client-app -n frontend -- curl -k https://10.244.1.50:8443' (one of the endpoint IPs) successfully returns a response. kube-proxy logs show no errors and iptables rules appear correct. What should you investigate to resolve this Service connectivity issue?
Question 3
You are troubleshooting an etcd backup operation that consistently produces a snapshot file of 0 bytes. The etcdctl command completes with exit code 0 and no error messages are displayed. The etcd cluster is healthy with all three members responding properly. Certificate files are readable and endpoint connectivity is confirmed. What is the most probable cause of this empty snapshot file?