explaingit

mxssl/sre-interview-prep-guide

8,903Audience · ops devopsComplexity · 1/5Setup · easy

TLDR

A curated study guide with links to articles, videos, and courses covering Linux, networking, containers, Kubernetes, Terraform, and reliability topics for people preparing for Site Reliability Engineer job interviews.

Mindmap

mindmap
  root((SRE Prep))
    Fundamentals
      Linux internals
      Processes and threads
      Networking
    Containers
      Docker
      Kubernetes
    Infrastructure
      Terraform
      Ansible
    Reliability
      Uptime metrics
      Incident reports
      Monitoring
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Work through a structured reading list to prepare for an SRE job interview, checking off topics as you complete them.

USE CASE 2

Find curated resources on Kubernetes networking, Terraform, and Linux internals without searching each topic separately.

Getting it running

Difficulty · easy Time to first run · 5min

In plain English

This repository is a study guide for people preparing to interview for a Site Reliability Engineer (SRE) role. An SRE is a type of engineer who focuses on keeping software systems up and running reliably, bridging the gap between writing code and managing the infrastructure it runs on. The guide is a curated collection of links to articles, videos, courses, and books across the topics that tend to come up in SRE job interviews. The material is organized into sections. A Basics section covers foundational concepts like what happens when you type a web address into a browser, how operating system processes work, and how threads relate to processes. The Linux section goes deeper into the operating system that most servers run on, covering how a machine boots up, how filesystems and memory management work, and how to troubleshoot a system that is misbehaving. The Networking section explains how computers communicate with each other, including protocols, load balancing, and virtual networking. The Containers section covers tools like Docker that package software into isolated units. Kubernetes, a system for running many containers across a cluster of machines, gets its own section with material ranging from beginner introductions to production best practices and networking specifics. There are also sections on infrastructure-as-code tools like Terraform and Ansible, monitoring and observability, and reliability topics specific to SRE work, such as how teams measure uptime, plan for failures, and write post-incident reports after something goes wrong. The repository does not contain original writing. Every entry is a link to an external resource. Items appear as checkboxes so you can track your progress as you work through the material. The repo is open to contributions.

Copy-paste prompts

Prompt 1
I'm preparing for an SRE interview in 4 weeks, help me build a weekly study schedule using the topics in the sre-interview-prep-guide.
Prompt 2
Based on the SRE interview prep guide, explain what happens when you type a URL into a browser, covering all the networking layers.
Prompt 3
I'm weak on Kubernetes networking, quiz me on the key concepts from the Kubernetes section of the SRE interview prep guide.
Prompt 4
Explain Linux process management, memory, and filesystem concepts using the SRE interview prep guide topics as a syllabus.
Open on GitHub → Explain another repo

← mxssl on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.