1. ©2008–18 New Relic, Inc. All rights reserved
SRE-iously!
Defining the Principles, Habits, and
Practices of Site Reliability Engineering
Beth Long, Senior Software Engineer
2.
Our Speaker
Beth Adele Long
Senior Software Engineer
New Relic
@BethAdeleLong
linkedin.com/in/beth-adele-long
3.
Housekeeping
Today’s webinar is being recorded. A link to the recording will be emailed to all registrants shortly after.
Please be sure to ask any questions via the “questions panel”.
There will be a brief survey after the webinar for any additional questions or feedback.
4.
New Relic is Your Partner on Your DevOps Journey
NEWR: NYSE, 2014
16k+ Customers
1.8B Events & Metrics per Minute (Scale)
13,500+ Disruptors
2,000+ Global Enterprises
50% of Fortune 100 Use New Relic
5.
“Part of the [Apollo program] culture
was to learn from everyone and
everything, including from that which
one would least expect.”
Margaret Hamilton
6.
The SRE Handbook
But what if you’re not Google?
7.
What is Site Reliability Engineering?
Site Reliability Engineering applies Software Engineering to Ops:
Software Development Lifecycle
System Maintenance
System Lifecycle
8.
SRE at New Relic
1 Goal: Continuously improve the reliability of systems in the New Relic platform.
2 Roles: Pure SRE, Embedded SRE
3 Spheres: Stability, Reliability, Engineering
9.
How it was
[Architecture diagram. Labels: On-Premises, Relational Data, Customers, NoSQL Data Store, Public Cloud, Micro Services, API, Mobile Apps, Browser]
10.
We Asked Our Stakeholders
Why do we have SREs at New Relic?
What’s the vision for our SRE team?
How can SREs most effectively contribute to the future of our platform?
11.
One Goal
Continuously improve the reliability of systems in the New Relic platform.
12.
Two Roles
Pure SRE
Build and support our core internal platform: Container Fabric, Networking Systems
Embedded SRE
Partner with Eng Teams
Domain Experts in: Reliability, Tooling, Scaling
13.
Three Spheres
Stability, Reliability, Engineering
14.
What SREs Do
Champion reliability best practices.
Guide designs and processes with an eye toward resilience and low toil.
Reduce technical complexity and sprawl.
Drive the usage of tooling and common components.
Implement software and tooling to improve resilience and automate operations.
15.
SRE Tasks
Work with teams to adopt operational best practices.
Build, or help teams adopt, core shared internal platform components.
Stay current on the New Relic platform, and know the top risks for their team(s).
Improve the monitoring and observability of the New Relic platform.
16.
SRE Tasks (continued)
Implement automation, tooling, and application code to improve reliability and reduce toil.
Mentor less senior SREs and grow the SRE community and practice at New Relic.
Perform task-based operational work (toil) as required.
17.
Keys to SRE Success
Reliability is a feature: Query your stakeholders
Reliability depends on shared understanding: Develop clear, specific guidelines
SRE is a challenging, cross-disciplinary practice: Build a strong SRE community
18.
SRE at New Relic
1 Goal: Continuously improve the reliability of systems in the New Relic platform.
2 Roles: Pure SRE, Embedded SRE
3 Spheres: Stability, Reliability, Engineering