Agile, Scrum & DevOps:
DevOps Site Reliability Engineering (SRE) Practitioner Training
500 Learners
Intermediate
The SRE (Site Reliability Engineering) Practitioner course teaches how to grow services in a company efficiently and reliably. It investigates techniques for improving agility, cross-functional cooperation, and transparency of service health in order to promote resiliency through design, automation, and closed-loop repair.
DevOps Site Reliability Engineering (SRE) Practitioner Training
Accreditation With
Certified DevOps Site Reliability Engineering (SRE) Practitioner Training Overview
Through the use of real-life situations and case studies, the course seeks to provide learners with the techniques, methodologies, and resources needed to engage individuals throughout the organization interested in dependability. Participants will have tangible takeaways to leverage when they return to the office, such as implementing SRE models that fit their organizational context, building advanced observability in distributed systems, building resiliency by design, and effective incident responses using SRE practices, after completing the course.
What You Will Learn?
  • Successfully establish a thriving SRE culture in your organization.
  • Manage the organizational implications of SRE implementation.
  • In a distributed, zero-trust system, design security and resilience.
  • Prepare for the DevOps Institute SRE Practitioner test.
  • Participation in one-of-a-kind exercises designed to put principles into practice
  • Obtain examples of documents, templates, tools, and procedures.
  • Access to more valuable resources and communities
  • Continue your education and confront new difficulties with one-on-one instructor tutoring after the course.
Course Key Features
  • Pre-course consultation
  • Exam voucher included
  • Access to DevOps Institute additional sources of information and communities
  • Real-Life case studies are weaved throughout the course
  • After-course coaching available
Training Options
In-Class
Starts from
No price
  • 3-days in-class training 
  • Exam vouchers included
  • Pre-course consultation
  • Highly experienced instructor(s)
  • Post-course follow-up
  • All related Averest's quality control tools
  • Required stationary
  • 5 or 4 stars training venue
  • Pay later by invoice -OR- at the time of checkout by credit card
  • 24x7 learner assistance and support
Online Instructor-Led
Starts from
No price
  • 3-day instructor-led training course
  • Exam vouchers included
  • Highly experienced instructor(s)
  • Post-course follow-up
  • Pay later by invoice -OR- at the time of checkout by credit card
  • 24x7 learner assistance and support
Certified DevOps Site Reliability Engineering (SRE) Practitioner Training Schedules
You can get this course with 2 training options and 2 venues
Filter:
Customized to your team’s needs
We will tailor the Certified DevOps Site Reliability Engineering (SRE) Practitioner Training Program to meet your company's specific needs
Customized to your team’s needs
Certified DevOps Site Reliability Engineering (SRE) Practitioner Training Curriculum
Eligibility
Anyone starting or leading a DevOps cultural transformation program Anyone interested in modern IT leadership and organizational change approaches Business Managers Change Agents DevOps Consultants DevOps Engineers IT Managers Lean Coaches Practitioners Product Owners Scrum Master
Pre-requisites
Before taking the SRE Practitioner course, participants must take the SRE Foundation course with an approved DevOps Institute Education Partner. It is suggested that you have a working grasp of common SRE terminology, ideas, principles, and related job experience. Please remember that the DevOps Institute SRE Foundation certification is required before taking the SRE Practitioner test.
Course Content
Section 01 - SRE Anti-patterns
Rebranding Ops or DevOps or Dev as SRE​
Users notice an issue before you do​
Measuring until my Edge​
False positives are worse than no alerts​
Configuration management trap for snowflakes​
The Dogpile: Mob incident response​
Point fixing​
Production Readiness Gatekeeper​
Fail-Safe?
Section 02 - SLO is a Proxy for Customer Happiness
Define SLIs that meaningfully measure the reliability of a service from a user’s perspective​
Defining System boundaries in a distributed ecosystem for defining correct SLIs
Use error budgets to help your team have better discussions and make better data-driven decisions
Overall, reliability is only as good as the weakest link on your service graph
Error thresholds when 3rd party services are used
Section 03 - Building Secure and Reliable Systems
SRE and their role in Building Secure and Reliable systems​
Design for Changing Architecture​
Fault-tolerant Design​
Design for Security​
Design for Resiliency​
Design for Scalability
Design for Performance
Design for Reliability
Ensuring Data Security and Privacy
Section 04 - Full-Stack Observability
Modern Apps are Complex & Unpredictable​
Slow is the new down​
Pillars of Observability​
Implementing Synthetic and End-user monitoring
Observability driven development
Distributed Tracing
What happens to the monitor?
Instrumenting using Libraries and Agents
Section 05 - Platform Engineering and AIOPs
Taking a Platform Centric View solves Organizational scalability challenges such as fragmentation, inconsistency, and unpredictability
How do you use AIOps to improve resiliency?
How can DataOps help you in the journey?
A simple recipe to implement AIOps
Indicative measurement of AIOps
Section 06 - SRE & Incident Response Management
SRE Key Responsibilities towards incident response​
DevOps & SRE and ITIL​
OODA and SRE Incident Response​
Closed Loop Remediation and the Advantages
Swarming – Food for Thought
AI/ML for better incident management
Section 07 - Chaos Engineering
Navigating Complexity
Chaos Engineering Defined
Quick Facts about Chaos Engineering
Chaos Monkey Origin Story
Who is adopting Chaos Engineering?
Myths of Chaos
Chaos Engineering Experiments
GameDay Exercises
Security Chaos Engineering
Chaos Engineering Resources
Section 08 - SRE is the Purest form of DevOps
Key Principles of SRE​
SREs help increase reliability across the product spectrum​
Metrics for Success​
Selection of Target areas
SRE Execution Model​
Culture and Behavioral Skills are key​
SRE Case study
Certified DevOps Site Reliability Engineering (SRE) Practitioner Exam & Certification
Successfully passing (65%) the 90-minute examination, consisting of 40 multiple-choice questions, leads to the SRE Practitioner certificate. The certification is governed and maintained by DevOps Institute. The certification is managed and supported by the DevOps Institute; exams are delivered through an independent, global examination partner.
Certified DevOps Site Reliability Engineering (SRE) Practitioner Exam & Certification
Certified DevOps Site Reliability Engineering (SRE) Practitioner Training FAQs
What makes a good SRE?
  • 2+ years in operations or software engineering role.
  • Excellent verbal and written communication skills.
  • Strong problem-solving skills.
  • Passion for technology as well as helping customers and team members.
Does DevOps pay well?

The median annual salary for DevOps engineers is around $93,000, while the top 10% earn approximately $135,000 per year.

What is SRE model?

Site reliability engineering (SRE) is a software engineering approach to IT operations. SRE helps teams find a balance between releasing new features and making sure that they are reliable for users.

You Maybe Interested
Let Us Help You!
Please fill the contact form and we'll get back to you soon.