DevOps Engineer Interview Questions
10 curated questions with evaluation guidance for hiring managers.
How would you design a CI/CD pipeline for a microservices architecture with 20+ services?
Should discuss monorepo vs. polyrepo strategies, parallelized builds, service-level pipelines, artifact management, and deployment strategies (canary, blue-green). Look for handling of dependencies between services.
Explain the difference between containers and virtual machines. When would you choose one over the other?
Should explain isolation levels, resource efficiency, and startup time differences. Containers for microservices and consistent environments, VMs for stronger isolation or legacy workloads.
How do you handle secrets management in a production environment?
Should mention tools (HashiCorp Vault, AWS Secrets Manager, K8s secrets), rotation policies, least-privilege access, and auditing. Red flag if they mention hardcoding secrets or committing to version control.
Describe your approach to infrastructure as code. What tools do you prefer and why?
Should compare tools (Terraform, Pulumi, CloudFormation) and explain state management, modularity, and testing. Look for understanding of drift detection and plan-before-apply workflows.
How do you design monitoring and alerting that avoids alert fatigue?
Should discuss SLOs/SLIs, meaningful thresholds, alert grouping, escalation policies, and runbooks. Look for understanding that fewer actionable alerts are better than many noisy ones.
Walk me through how you would troubleshoot a Kubernetes pod that keeps crashing.
Should follow a logical sequence: check pod events, describe pod, inspect logs, check resource limits, examine liveness/readiness probes, and review recent deployments. Systematic diagnosis matters.
How do you handle disaster recovery and backup strategies for cloud infrastructure?
Should discuss RPO/RTO targets, backup automation, cross-region replication, regular testing of recovery procedures, and documentation. Look for practical experience with actual recovery scenarios.
Explain how you would implement zero-downtime deployments.
Should mention blue-green deployments, canary releases, rolling updates, health checks, and database migration strategies. Look for understanding of rollback procedures.
How do you manage cost optimization in cloud environments?
Should discuss right-sizing, reserved instances, spot instances, auto-scaling, unused resource cleanup, and cost tagging. Look for balance between cost and reliability.
Describe your experience with GitOps. How does it differ from traditional deployment approaches?
Should explain declarative infrastructure, Git as single source of truth, automated reconciliation (ArgoCD, Flux), and auditability benefits. Look for practical implementation experience.
Want AI-generated interview questions tailored to your specific job description? Workro analyses your JD and generates behavioural and technical questions calibrated for the role, seniority level, and required skills — in seconds.
Try free