Skip to content

AWS Site Reliability Engineer (SRE) - Advanced Quiz

Back to Quiz Home


This quiz tests your ability to design robust distributed systems, implement complex observability pipelines, and manage failure at scale.


#

How can you systematically test if your application can withstand the loss of an entire Availability Zone?

#

What is "Priority Load Shedding"?

#

How do you implement "Sampling" in Distributed Tracing (X-Ray) to control costs?

#

What is the "Control Plane" vs "Data Plane" distinction in AWS resilience?

#

How do you mitigate "TCP Incast" collapse in a cluster?

#

What is "Cashflow Protection" in AWS Shield Advanced?

#

How do you debug high "Steal Time" (CpuSteal) on an EC2 instance?

#

What is "Wait Time" vs "Service Time" in queueing theory?

#

How do you implement "Cross-Region Disaster Recovery" using Route 53?

#

What is the "N+1 Problem" in database queries and how does it affect reliability?

#

What implies a "bimodal" latency distribution graph?

#

How do you monitor "Connection Leaks" in a Java application?

#

What is "Adaptive Concurrency Control"?

#

How can "Key Spaces" in DynamoDB cause throttling?

#

What is the purpose of "Log Structured Merge Trees" (LSM) awareness for SREs?

#

How do you secure Prometheus metrics in a Kubernetes cluster?

#

What is "Toil reduction"?

#

How does "S3 Intelligent-Tiering" affect performance?

#

What is a "Retry Storm" and how do you prevent it?

#

How do you validate Terraform/CloudFormation templates before deployment?

Quiz Progress

0 / 0 questions answered (0%)

0 correct


📚 Study Guides


📬 Weekly DevOps, Cloud & Gen AI quizzes & guides