From the course: Site Reliability Engineering Essential Training
Unlock this course with a free trial
Join today to access over 25,300 courses taught by industry experts.
Using error budgets
From the course: Site Reliability Engineering Essential Training
Using error budgets
Error budgets. What are error budgets? Error budget represents the allowable amount of downtime or performance degradation while still meeting the Service Level Objective. It's basically what is left after the SLO. So let's dig into the details. Error budget depicts what is left after meeting the SLO. Here's an example. 99% of requests complete with the HTTP status code of 200 over a calendar month. Again, over a month, I need to have 99% of my requests complete with a status code of 200. So what is the error budget here? If you serve 10 million requests per month, then the error budget is 1% or 100,000 requests. So 1% is basically what you get after 99%, right? So if your objective is 99%, you have a wiggle room of 1% for errors. That's what error budget is. So just remember this, error budget is what is left after meeting the SLO. So how do you leverage error budget? Why do you need error budget? How does it help you? It accelerates release velocity. Now this may be of a surprise…