Top 10 Interview Questions for
DevOps Engineer
1. What is DevOps, and why is it important? Can you
explain the key principles of DevOps?
Explanation:
This might be the first question after your introduction. You don’t have
to memorize the long and boring paragraph on Wikipedia. Instead, try
use an example to explain how DevOps approach helps on the software
development lifecycle. Regarding the key principles, you don’t have to
explain them in details here since this will be across the whole
interview process and asked in the following questions.
Example Answer:
DevOps is a software development methodology that emphasises
collaboration and communication between development teams and
operations teams to improve the speed and efficiency of software
delivery.
Imagine a software development team works separately from operation
team. Every time, when the code is handed off to operation team,
miscommunication and delays happens. It might cause slow
deployment or deployment failure. With DevOps approach, where
development and operational teams work together throughout the
entire software development lifecycle, code can developed, tested, and
deployed more quickly and frequently. Also, with DevOps approach,
the whole team can focus more on the scalability of the system,
building better monitoring and logging solutions, etc.
The key principles of DevOps I think includes:
   1. Collaboration and communication which ensures everyone of
      the team is aligned and work towards a common goal.
   2.Continuous integration and delivery automate the software
     delivery process and ensure the changes are tested before
     being deployed to production
   3.Infrastructure as Code can help the team automate the
      provisioning and management of infrastructure resources,
      and make it easier to manage and scale complex
      environments.
   4.Monitoring and feedback which could continuously optimise
      the reliability and performance of software system.
2. What are some of the tools and technologies
commonly used in DevOps?
Explanation:
This question is used to check your knowledge base and your
experience as a DevOps engineer. No one could master all the tools but
you should know what essential skills are required for a DevOps
engineer. You can check out this article — <How to Become a
Cloud DevOps Engineer #Skills You Need> which covers all the
technical skills required. Finally, you just need to explain the tools you
have used in each domain and make sure you let the interviewer know
you are a fast learner.
Example Answer:
As a DevOps engineers, I have used quite a lot of different tools.
For the cloud platform, I have 2-year experience on AWS and I have
built multiple systems based on different AWS services such as EC2,
ALB, CloudFront, Lambda, API Gateway, DynamoDB, RDS,
ElasticCache, SQS, etc and I also have some experience on Microsoft
Azure but not good as AWS.
I am familiar with Linux as I had been a system administrator for
more than 3 years.
For the version control system, I use Git quite a lot and I have
experience on Github and Bitbucket. Our team is following the
industrial standard to manage the code.
I have been using Docker quite a lot and deployed
many containerised applications on AWS ECS. Also, I am learning
Kubernetes and preparing for the CKA exam.
Regarding CI/CD, I use Github Actions quite a lot as it provides
great flexibility to our systems across different cloud platforms. I also
know Jenkins and AWS Code Build and Code Pipeline as I have
done some small projects with them.
For IaC, I use Terraform a lot also because we have used multiple
cloud platforms such as AWS, Azure and GCP.
For monitoring and logging, I usually build the logging solution
natively on the cloud platform such as CloudWatch on AWS and
Azure Monitor on Azure. I can also use tools like Grafana, Splunk
and Data Dog.
Overall, every tool has its own pros and cons. As a DevOps engineer, I
enjoy learning new tools and I can select appropriate tools and
technologies for different projects and as far as I could say, they are all
working well.
3. What is the role of automation in DevOps?
Explanation:
For this question, you cannot just explain the role of automation in
DevOps. You need to give the interviewer an example about what you
have done using automation and how it benefited your work.
You might hear the job interview technique — STAR method
which stands for Situation, Task, Action,Result. Yes, it can also be
used in technical interview. I will mark the four steps in the
answer for this question.
Example Answer:
S — Situation: In my previous role as a DevOps Engineer, I was
responsible for implementing and managing the automation tools used
in our software delivery pipeline.
T — Task: One of my key tasks was to identify opportunities to
automate manual processes and reduce the amount of manual
intervention required in our software delivery pipeline.
A — Action: To accomplish this task, I worked with the development
and operations teams to identify the most time-consuming and error-
prone manual processes in our software delivery pipeline. I then
evaluated various automation tools and technologies and
recommended the ones that were most suitable for our needs. Some of
the tools we implemented included:
   •   Continuous Integration tools like Jenkins and AWS Code
       Pipeline to automatically build and test code changes as soon
       as they are committed to the source code repository.
   •   IaC tools like Terraform to automate the deployment and
        configuration of servers and infrastructure resources.
   •   Python and Shell script for customised batch task.
R — Result: The implementation of automation tools resulted in
significant improvements in our software delivery pipeline. We were
able to reduce the amount of manual intervention required in our
processes, which resulted in faster and more reliable software releases.
The automation also helped us to identify and resolve issues more
quickly, resulting in improved uptime and customer satisfaction.
4. What is infrastructure as code, and how does it
benefit DevOps?
Explanation:
You might have experience on IaC tools such
as Terraform or Pulumi but if you cannot explain how it benefit
DevOps, the interviewer cannot be convinced just by the number of
projects you have done. You can first cover the benefits that IaC has
but don’t worry if you miss any benefits, you just pick the key things
related to your work and give them an example on how you use IaC
tool in your work. And you can mention some detailed techniques to
show your expertise.
If you feel unfamiliar with below example, you can watch my video on
Youtube about it.
Example Answer:
With IaC tools, infrastructure can be treated as software code. As I
mentioned before, I use Terraform for most of the projects. It provides
lots of benefits includes:
   1. Consistency and Repeatability: with Terraform, I can
      deployed the consistent infrastructure across different
      environment. Also, it reduces errors and improves
      reliabilities especially for complex systems.
   2.Flexibility: with Terraform, I can quickly provision, modify
     and tear down infrastructure resources which allow us to
     response quickly to changing business requirement
   3.Automation: with Terraform, lots of manual work can be
      reduced and I can focus more on the architecture design.
In my previous role, I was responsible for implementing the IaC
approach for one of our systems running on AWS. The key task was to
use Terraform to manage our existing test environment and build a
consistent production environment.
I first finished the terraform code for the test environment where I put
all the resources in the self-build modules based on different
components. I use terraform import command to import all the
existing resources into the Terraform code I built for test environment
and then replicate the code for building the production environment.
For fitting different environments in the same module, I use
condition expression, dynamic blocks to create conditional
resources and block. In order to integrate with the existing
resources which were created for multiple projects, I use data sources
for terraform to get the information.
The implementation of IaC using Terraform resulted in significant
benefits for the existing project. I feel great as I was able to contribute
to the overall success of the team by ensuring that we were able to
deliver high-quality software more efficiently.
5. What is a CI/CD pipeline, and how does it work? What is the
difference between continuous integration and continuous
delivery?
Explanation:
At first, you can simply explain the basic concept of CI/CD pipeline and
how it can benefit the software deployment process. Regarding the
difference between CI and CD, you can use your work example to
explain to the interviewer which might be easy for both of you.
Example Answer:
A CI/CD pipeline is a set of automated processes that help in the
building, testing, and deployment of software applications. It is a
crucial component of modern software development practices and
helps to ensure that code changes are thoroughly tested and ready for
production release.
In my previous role as a DevOps Engineer, I was responsible for
designing and implementing the CI/CD pipeline for software delivery
process using Github Actions. I have never used Github
Actions before but fortunately, I have experience on some other tools
such as AWS code pipeline, Azure DevOps as well as Jenkins. This
helps me quickly pick up the skills to implement the pipeline on
GitHub Actions.
I worked with the development and operations teams to design the
CI/CD pipeline for our software delivery process. Here is how it
worked:
   1. Continuous Integration (CI): Whenever a developer
      made a code change, it was automatically built and tested in
      a clean environment to ensure that the code changes didn’t
      break the application.
   2.Continuous Delivery (CD): Once the code changes were
     tested and approved in the CI stage, they were automatically
     deployed to a staging environment for further testing and
      review. If everything looked good in the staging
      environment, the approved version of the code were
      automatically deployed to the production environment.
As we use Github as our code repository and our software products are
across different platforms, Github Actions provides us flexibilities.
Also, we built lots of common methods which can be reused across the
workflows. The implementation of the Github Actions pipeline brought
in lot of benefits for our projects such as fast and reliable software
release, no human errors, etc. And the most important thing is we
standardise the deployment process across all the projects
6. How do you ensure the security of software in a DevOps
environment?
Explanation:
As a DevOps engineer, you should always think about Security. This
is one of the very important aspects you need to show to the
interviewers. Talking about security in DevOps, you can cover aspects
such as secure coding practices, security testing, infrastructure
security, continuous monitoring, etc in a high-level and pick one or two
you are confident in and explain to the interviewer with examples.
Example Answer:
Ensuring the security of software in a DevOps environment is a critical
aspect of modern software development.
The architect should design and build secured infrastructure with
considering access management, network security, patch
management, backup and recovery, data encryption,
compliance, etc. The developers should follow secure coding
practices such as input validation, proper error handling. Utilising
security testing tools such as penetration testing tool to simulate real-
world attacks to test the security of the application. Another key point
is to continuously monitor the security vulnerabilities and breaches of
the software and infrastructure. For all of them I mentioned,
promoting security culture is one of the most important
responsibility for DevOps engineer.
In my previous job, I was tasked with building a secure infrastructure
on AWS to host a web application. I need to implement security best
practices to ensure the infrastructure compliant with industry
standards.
Here are the steps I took:
You just need to pick some of them based on your own experience.
   1. Identity and Access Management (IAM): I created IAM
       users, groups, and roles with the principle of least privilege,
      and enabled MFA for added security. I also ensured that
      strong passwords were enforced for all users.
   2.Network Security: I used VPCs and subnets to isolate the
      infrastructure, and implemented security groups and NACLs
      to control network traffic. I encrypted data in transit using
      SSL/TLS, and data at rest using server-side encryption.
   3.Web Application Firewall: I used AWS WAF integrated
      with CloudFront to protect against common web exploits
      such as SQL injection and cross-site scripting.
   4.Logging and Monitoring: I enabled logging and
     monitoring to detect and respond to security incidents. I
     used AWS CloudTrail to log API calls, and AWS Config to
      track resource configurations. I also implemented Amazon
      GuardDuty to detect and respond to security threats in real-
      time.
   5. Patch Management: I automated patch management
      using AWS Systems Manager, ensuring that all software and
      operating systems were up-to-date with the latest security
      patches.
   6.Backup and Recovery: I implemented regular backups
      and stored them securely. I also created disaster recovery
      procedures to ensure business continuity in case of a security
      incident.
   7. Compliance: I ensured that the infrastructure complied
      with regulatory requirements such as HIPAA, PCI-DSS, and
      GDPR. I used AWS Artifact to access compliance reports,
      and AWS Config to ensure that resources complied with
      security policies.
By following these steps, I was able to build a secure infrastructure on
AWS that was resilient against security threats, compliant with
industry standards, and provided a stable and high-performance
environment for the web application to operate in.
7. What is your experience with cloud computing platforms such
as AWS or Azure?
Explanation:
You might only have work experience on a single Cloud platform. But
don’t worry, if you are close to the interview, you can just focus on the
one you are good at instead of learning another new platform. I have
been working with AWS, Azure, GCP, Tencent Cloud for many years.
So I could say even if these platforms have different ways to build their
lower infrastructure, however, for the users, like Cloud engineer or
infrastructure engineer, you can easily pick up the new one.
To answer this question, you can cover the services you have used
based on categories such as compute, storage, networking, database
and tools, etc. You can explain these services based on a project.
Example Answer:
I have been working on AWS for more than 3 years and I am familiar
with all of the most popular services. I can explain these services based
on one the recent project I have done which is a three-tier application
on AWS.
   1. For the networking layer, I used VPC to isolate the
      application from the public internet. I used private subnets
      to deploy the application server as well as the databases with
      security group to control the access. I also used Application
      Load Balancing to distribute traffic across multiple EC2
      instances and provide fault tolerance.
   2.For the compute layer, I used Amazon EC2 instances to run
      the application servers. I also used Auto Scaling to
      automatically adjust the number of instances based on traffic
      and demand.
   3.For the storage layer, I used Amazon S3 to store static
     content such as images, videos, and other media files. I also
     used Amazon Elastic File System (EFS) for shared storage
     across multiple EC2 instances.
   4.For the database layer, I used Amazon RDS to deploy a
      managed database instance of MySQL. I also used Amazon
      DynamoDB, a NoSQL database for storing and retrieving
      non-relational data.
   5. For some of the event-based services, I built a bunch of REST
      APIs via API Gateway and Lambda Functions. This provides
      a scalable and secure way to expose APIs.
By using these services with event-based architecture, I was able to
build a highly scalable and available three-tier application that could
handle high traffic loads and provide a great user experience while
minimising operational overhead.
8. What is Containerisation, and how does it benefit DevOps?
Explanation:
As a DevOps engineer, you have to know container technology. So to
answer this question, you should be able to convince the interviewer
with your knowledge and deep understanding on containerisation and
how it benefit DevOps. Same as other questions, you should give them
an example on your experience of implementing containerisation for
your DevOps work even if they did not ask for it.
Example Answer:
Containerisation is a technique that enables the creation and
deployment of applications in isolated, self-contained environments
called containers. A container consists of an entire runtime
environment, including the application, its dependencies, and the
underlying system libraries, all packaged together in a lightweight
container that can run on any system that supports containerisation.
Containerisation offers several benefits such as
   1. Consistency: consistent environment for running
      applications;
   2.Portability: container can run on any systems that support
     containerisation
   3.Scalability: containers can be easily replicated and deployed
      across multiple systems
   4.Automation: containerisation can be integrated with DevOps
      automation tools and processes
I have been using Docker which is the most popular
container platform for many years. For all of the new
projects, I built local development environment using
Docker Compose for our development team. The developers
can easily run the scripts to start it. Using this method, we
can make sure the local development environment exactly
matches the staging and production environments, which
helps us to trouble shoot when any issue happens.
I used AWS ECS Fargate quite a lot to host the applications
we built. I chose Fargate as it provides a serverless way to
run containers without the need to manage underlying EC2
instances. ECS Fargate makes it easy to scale containerised
applications based on demand. I set up automatic scaling
rules to automatically adjust the number of containers
running based on metrics such as CPU utilisation and
network traffic. With ECS Fargate, I can easily integrated it
with services like Application Load Balancer to distribute
and redirect traffic, or IAM to manage the permission and
access to other AWS services.
9. What is your experience with monitoring and logging tools?
Explanation:
Monitoring and Logging are very important for DevOps, which is also
called Observability. If you have direct experience with monitoring
and logging tools, you can answer this interview question by
highlighting your specific knowledge and skills related to these tools.
For example, you could discuss the types of monitoring and logging
tools you have used in the past, how you implemented them, and the
benefits they provided.
Example Answer:
Observability including monitoring and logging is important for
DevOps because it provides development and operation team with the
ability to gain insight into complex systems and applications. In a
DevOps environment, where teams are responsible for developing,
deploying, and operating software, observability is crucial for
understanding how the various components of the system are
interacting and performing.
I have extensive experience working with various monitoring and
logging tools. In my previous role as a DevOps Engineer, I regularly
used tools such as Datadog and Prometheus to monitor the
performance and behaviour of our systems and applications. I
implemented these tools by configuring various metrics, alerts, and
dashboards to provide real-time visibility into our infrastructure.
As a result, we were able to proactively identify and resolve issues
before they impacted our users. I also utilised tools such
as CloudWatch Logs, Splunk, and ELK Stack for logging
purposes, to capture and analyse application events, errors, and
transactions. These logs were instrumental in troubleshooting issues
and identifying areas for optimisation
10. What is version control and how is it used in DevOps
Explanation:
To answer this question, you need to explain the concept of version
control and how it benefits DevOps. You can also explain the process of
using Github, for example, to manage the code within you team.
As this is the last question, I would like to emphasise that real-
world example is always the best way to show the interviewer
your experience and expertise.
Example Answer:
In my previous role as a DevOps Engineer, I was responsible for
managing the version control system for our software development
team. I’ve used Github for managing code changes, merging
code branches, and collaborating with other developers and
stakeholders.
I have created and reviewed pull requests to merge code
changes from different branches or forks into the main repository.
I’ve used Github’s review tools, such as code review comments and
approval workflows, to ensure that changes are thoroughly tested and
reviewed before being merged.
I have created and managed different branches in Github to isolate
changes for specific features or bug fixes. I’ve used Github’s merge
tools to ensure that changes are merged correctly and don’t conflict
with other changes in the codebase.
I have implemented branch protection with restricted merging,
status check, code review requirement as well as push restriction. This
helps us to enforce best practices, reduce errors, and improve the
overall quality of code changes in a repository
I think the key point is to standardise the version control
process. Team members may have different ways of managing code
changes, which can lead to confusion, errors, and delays. So I have
documented the version control workflow for the team to learn and
follow and kept it optimised.