From the course: Site Reliability Engineering Essential Training
Being on call
Being on call. Nobody likes to hear the words "on call." I personally don't like to be on call, but as an SRE, it is an important part of your job. On-call responsibilities: what do we do? First, respond to pages. Once again, it is not the easiest thing to do when you get a page at 2 a.m., but it is our responsibility. Be the first line of defense. You are the one who will be looking at what went wrong and what is going on. Escalate appropriately. When an issue isn't getting resolved, or when you need to bring in your supervisor, escalate it appropriately. Document and update playbooks. Because you are the first line of defense, you have the best opportunity to identify gaps and update the playbooks. Participate in postmortems. More often than not, it's the on-call engineer who works on the critical issues that result in outages. So naturally, during the postmortem analysis, the on-call engineer takes part to explain what went wrong and what was done to fix the issue. Finally, identifying automation…