AWS Site Reliability Engineer (SRE) - Basics Quiz
This quiz covers the core principles of Site Reliability Engineering, including observability, error budgets, and basic resilience patterns.
What are the "Three Pillars of Observability"?
Metrics tell you what is happening, logs tell you why, and traces tell you where in the request path it is happening.
What are the "Golden Signals" of monitoring?
The four Golden Signals (latency, traffic, errors, and saturation) summarize your service's health from a user's perspective.
What is an "Error Budget"?
An error budget is the amount of unreliability your SLO permits (100% minus the SLO target). If you burn your error budget, you stop releasing features and focus on stability.
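The arithmetic behind an error budget can be sketched as follows. This is a minimal illustration, assuming an availability SLO measured over a rolling window; the function names and the 30-day window are hypothetical choices, not part of the quiz.

```python
def error_budget_minutes(slo_percent: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window."""
    total_minutes = window_days * 24 * 60
    return total_minutes * (1 - slo_percent / 100)


def budget_remaining(slo_percent: float, window_days: int, downtime_minutes: float) -> float:
    """Fraction of the error budget still unspent (negative once overspent)."""
    budget = error_budget_minutes(slo_percent, window_days)
    return (budget - downtime_minutes) / budget


if __name__ == "__main__":
    # A 99.9% SLO over 30 days allows roughly 43.2 minutes of downtime.
    print(round(error_budget_minutes(99.9, 30), 1))
    # After 21.6 minutes of downtime, about half the budget is left.
    print(round(budget_remaining(99.9, 30, 21.6), 2))
```

A team might gate feature releases on `budget_remaining` staying above zero, though the exact policy is a local decision.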
Which AWS service allows you to introduce chaos (fault injection) into your environment to test resilience?
AWS Fault Injection Service (FIS) lets you stop instances, fail over databases, or inject latency in a controlled manner.
What is "RTO" (Recovery Time Objective)?
If RTO is 1 hour, your disaster recovery plan must restore service within 1 hour.
What is "RPO" (Recovery Point Objective)?
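RPO is the maximum acceptable data loss, measured in time, so it dictates your backup or replication frequency (e.g., an RPO of 5 minutes means capturing data at least every 5 minutes).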
How does "Exponential Backoff" help during an outage?
Waiting progressively longer between retries reduces load on a struggling service and helps prevent a "thundering herd" from overwhelming it.
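As a rough sketch of the idea, the retry delay can double after each failed attempt up to a cap. The helper names, the 0.5-second base, and the 30-second cap are illustrative assumptions rather than values from any particular SDK.

```python
import time


def backoff_delays(base=0.5, factor=2.0, cap=30.0, attempts=5):
    """Delay before each retry: base, base*factor, ... capped at `cap`."""
    return [min(cap, base * factor ** n) for n in range(attempts)]


def call_with_backoff(operation, attempts=5, base=0.5, cap=30.0, sleep=time.sleep):
    """Call `operation`, sleeping for a growing delay after each failure."""
    delays = backoff_delays(base=base, cap=cap, attempts=attempts)
    for attempt, delay in enumerate(delays, start=1):
        try:
            return operation()
        except Exception:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            sleep(delay)


if __name__ == "__main__":
    print(backoff_delays())  # [0.5, 1.0, 2.0, 4.0, 8.0]
```

The `sleep` parameter is injectable only so the schedule can be exercised in tests without actually waiting.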
What is a "Circuit Breaker" pattern?
It protects the system from cascading failures by failing fast.
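One way to picture "failing fast" is a small state machine that stops calling a dependency after repeated failures and lets a trial call through once a timeout has elapsed. The sketch below is a simplified illustration; the class name, threshold, and timeout are hypothetical, and production libraries typically add more (per-endpoint state, metrics, thread safety).

```python
import time


class CircuitOpenError(Exception):
    """Raised when the breaker rejects a call without trying the dependency."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, operation):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                raise CircuitOpenError("circuit open; failing fast")
            # Timeout elapsed: half-open, so let one trial call through.
        try:
            result = operation()
        except Exception:
            self.failures += 1
            # A failed trial call, or too many failures, (re)opens the circuit.
            if self.opened_at is not None or self.failures >= self.failure_threshold:
                self.opened_at = self.clock()
            raise
        # Success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

Callers would wrap each downstream request in `breaker.call(...)` and treat `CircuitOpenError` as a signal to return a fallback or an immediate error.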
The goal of a blameless postmortem is learning and system improvement, not punishment.
In the context of the Golden Signals, what is "Saturation"?
Saturation tells you how "full" your service is: how close its most constrained resources (such as CPU, memory, or I/O) are to their limits.
What is "Jitter" in the context of retries?
Jitter adds randomness to retry delays, which smooths out traffic spikes caused by synchronized retries.
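A common variant is "full jitter", where each client sleeps for a random time between zero and the current exponential backoff ceiling. The sketch below assumes an exponential schedule with a hypothetical 0.5-second base and 30-second cap; the function name is illustrative.

```python
import random


def full_jitter_delay(attempt, base=0.5, cap=30.0, rng=random):
    """Random delay in [0, min(cap, base * 2**attempt)] for a retry attempt."""
    ceiling = min(cap, base * 2 ** attempt)
    return rng.uniform(0, ceiling)


if __name__ == "__main__":
    # Clients that failed at the same moment now retry at different times.
    for client in range(3):
        print(client, [round(full_jitter_delay(n), 2) for n in range(4)])
```

Because every client draws its own delay, retries that would otherwise land in the same instant are spread across the whole interval.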
Which AWS service acts as a "Dead Letter Queue" (DLQ) for failed Lambda invocations?
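DLQs capture messages that could not be processed so they can be analyzed later. For asynchronous Lambda invocations, the DLQ target can be an Amazon SQS queue or an Amazon SNS topic.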
What is "Distributed Tracing"?
Distributed tracing follows a single request as it passes through multiple services. AWS X-Ray is one tool for distributed tracing.
What does a "504 Gateway Timeout" error typically indicate?
It usually means the backend is too slow or hung (idle timeout exceeded).
What is "SLA" (Service Level Agreement)?
SLO is the internal goal; SLA is the external promise.
How does AWS Auto Scaling prevent "Oscillation" (flapping)?
Cool-downs allow the system to stabilize before making another decision.
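The effect of a cooldown can be illustrated with a toy scaler that ignores new scaling decisions until a fixed interval has passed since the last one. This is only a conceptual sketch, not how AWS Auto Scaling is implemented; the class name and the 300-second value are assumptions for illustration.

```python
class CooldownScaler:
    def __init__(self, cooldown_seconds=300.0):
        self.cooldown_seconds = cooldown_seconds
        self.last_scaled_at = None  # time of the last applied scaling action

    def maybe_scale(self, now, desired_change):
        """Return the capacity change to apply at time `now` (0 if suppressed)."""
        if desired_change == 0:
            return 0
        in_cooldown = (
            self.last_scaled_at is not None
            and now - self.last_scaled_at < self.cooldown_seconds
        )
        if in_cooldown:
            return 0  # still stabilizing from the previous action
        self.last_scaled_at = now
        return desired_change


if __name__ == "__main__":
    scaler = CooldownScaler(cooldown_seconds=300)
    print(scaler.maybe_scale(0, +2))    # applied
    print(scaler.maybe_scale(100, -2))  # suppressed: would undo the scale-out too soon
    print(scaler.maybe_scale(300, -1))  # applied: cooldown has elapsed
```

Without the cooldown, the scale-in at `now=100` would fire before the new instances had affected the metric, which is the flapping the question describes.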
What is "Infrastructure as Code" (IaC)?
IaC tools such as Terraform and CloudFormation make environments reproducible and reduce configuration drift.
What is the "Blast Radius" of a failure?
SREs aim to minimize blast radius using cells, bulkheads, and regions.
GameDays build muscle memory for handling real outages.
Idempotency is critical for reliable systems that use retries, because a retried request may be processed more than once.