STAYING ALIVE: PATTERNS FOR FAILURE MANAGEMENT FROM THE BOTTOM OF THE OCEAN
RONNIE CHEN, SLACK
1 — Ronnie Chen @rondoftw
WHY DID I BECOME A DIVER?
2 — Ronnie Chen @rondoftw
A WHOLE NEW WORLD
3 — Ronnie Chen @rondoftw
TECHNICAL DIVING
▸ longer dive times
▸ deeper dives
▸ overhead ceiling
▸ decompression obligations
▸ more gear. a lot more.
▸ higher pressure
▸ more risks
9 — Ronnie Chen @rondoftw
RISKS MAY INCLUDE...
1. hypoxia
2. hyperoxia
3. nitrogen narcosis
4. carbon dioxide buildup
5. oxygen sensor failure
6. deep tissue isobaric counterdiffusion (ICD)
7. high pressure nervous syndrome (HPNS)
8. software failure
9. exhausting your carbon dioxide scrubber
10. carbon dioxide channeling from a poorly packed scrubber
11. carbon buildup causing a spark leading to an oxygen fire. underwater.
12. flooding of breathing loop or circuitry
13. water mixing with the scrubbing agent to produce a toxic caustic soda that will give you chemical burns on your mouth, airway, and lungs
14. plain old decompression sickness
10 — Ronnie Chen @rondoftw
 
If you own a rebreather for five years, two percent of you
are going to die on it.
— Jill Heinerth, underwater explorer
11 — Ronnie Chen @rondoftw
HOLD UP!
15 — Ronnie Chen @rondoftw
THIS IS A TALK ABOUT
COMMUNICATION AND PROCESS
? ? ?
16 — Ronnie Chen @rondoftw
(it was a trap)
17 — Ronnie Chen @rondoftw
YOU CAME TO HEAR COOL
STORIES...
18 — Ronnie Chen @rondoftw
BUT YOU'RE GETTING A MEANDERING MEDITATION ON
BEST PRACTICES*
WHEN DEALING WITH COMPLEX SYSTEMS INSTEAD
* These guidelines have only been shown to work for life-or-death situations under the ocean.
They have not been proven to work for tech.
19 — Ronnie Chen @rondoftw
How failures really happen
20 — Ronnie Chen @rondoftw
Complex systems are designed
to protect against simple failures.
21 — Ronnie Chen @rondoftw
But accidents still happen.
22 — Ronnie Chen @rondoftw
CATASTROPHES ARE CAUSED BY A FAILURE CASCADE
▸ you have a rebreather malfunction
▸ which you would have caught if you were testing your equipment on a regular basis
▸ your backup tank had a leak and is running low and that wasn't caught either
▸ and your buddy is too far away and isn't checking in with you
▸ and your dive light that you use to communicate at a distance is out of power
▸ and in the excitement you kick up silt and the visibility drops
▸ and in your panic your air consumption goes up and then you breathe through the last of the air in your tank
▸ so you swim for the surface even though you have a decompression obligation
23 — Ronnie Chen @rondoftw
A post-mortem that blames this incident on a simple mechanical
malfunction would only cover 12.5% of the issues that led up to this
accident.
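
The 12.5% figure follows from counting the links in the cascade on the previous slide: one mechanical malfunction out of eight contributing events. A minimal sketch of that arithmetic, with the event labels paraphrased from the slide and the variable names chosen here purely for illustration:

```python
# Contributing events, paraphrased from the failure cascade slide.
cascade = [
    "rebreather malfunction",
    "equipment not tested regularly",
    "backup tank leak not caught",
    "buddy too far away",
    "dive light out of power",
    "silt kicked up, visibility drops",
    "panic raises air consumption",
    "ascent despite decompression obligation",
]

# A post-mortem that names only the first event covers this share of the cascade.
covered = 1 / len(cascade)
print(f"{covered:.1%}")  # prints 12.5%
```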
24 — Ronnie Chen @rondoftw
Complex system failures don't happen because a single part of the
system fails. They happen because all the safety procedures that are
supposed to protect the system from that simple failure didn't work.
25 — Ronnie Chen @rondoftw
CORE RULES OF SAFETY SYSTEMS
1. An unused safety system doesn't exist.
26 — Ronnie Chen @rondoftw
NORMALIZATION OF DEVIANCE
That natural human tendency,
particularly in pressure circumstances,
to take a safety shortcut.
To accept a lower standard of performance.
— Colonel Mike Mullane, astronaut
27 — Ronnie Chen @rondoftw
FALSE FEEDBACK
the absence of something bad happening means that it was safe
ADAPTATION
experience is no longer a suitable gauge of risk
SOCIAL PRESSURE
this is just how we do things
28 — Ronnie Chen @rondoftw
CORE RULES OF SAFETY SYSTEMS
2. An untested safety system doesn't exist either!
29 — Ronnie Chen @rondoftw
CORE RULES OF SAFETY SYSTEMS
3. Unused or untested safety systems are more dangerous than not
having one at all. Therefore, safety systems must be tested at regular
intervals.
The length of this interval should be determined not only by how likely it
is for this system to fail but also how great the impact will be if it does.
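
One way to picture this rule is a function that shortens the test interval as either the likelihood or the impact of failure goes up. The talk gives no formula, so the 1 to 5 scoring scale, the division rule, and the name `inspection_interval_days` below are all assumptions made for the sake of a sketch:

```python
def inspection_interval_days(base_days: int, likelihood: int, impact: int) -> int:
    """Hypothetical heuristic: the worse of the two scores sets the interval.

    likelihood and impact are scores from 1 (low) to 5 (high). The scale and
    the division rule are illustrative assumptions, not from the talk.
    """
    for score in (likelihood, impact):
        if not 1 <= score <= 5:
            raise ValueError("scores must be between 1 and 5")
    # Taking the max means a rare but catastrophic failure is still tested often.
    return max(1, base_days // max(likelihood, impact))


# Made-up scores: a backup tank rarely leaks (1) but losing it is catastrophic (5).
print(inspection_interval_days(90, likelihood=1, impact=5))  # prints 18
# Made-up scores: something that rarely fails (1) and barely matters if it does (1).
print(inspection_interval_days(90, likelihood=1, impact=1))  # prints 90
```

Using the maximum of the two scores, rather than their product or average, is one possible design choice: it keeps a low likelihood from masking a high impact.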
30 — Ronnie Chen @rondoftw
A QUICK SIDENOTE ON ASSESSING RISK
▸ Make assessments based on likelihood of occurrence.
▸ Make assessments based on magnitude of regret.
If you are only evaluating risk based on the chance of it happening, you
must be prepared to experience the corresponding level of regret if it
does.
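
The two bullets above can be read as two independent filters: a risk deserves a plan if it is likely, or if the regret would be severe, even when it is unlikely. A rough sketch, assuming a 1 to 5 scale for both dimensions; the `Risk` class, the thresholds, and the scores assigned to each example are hypothetical and not from the talk:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Risk:
    name: str
    likelihood: int  # 1 (rare) to 5 (frequent), hypothetical scale
    regret: int      # 1 (minor) to 5 (unrecoverable), hypothetical scale


def needs_plan(risk: Risk, likelihood_threshold: int = 3, regret_threshold: int = 4) -> bool:
    """Plan for a risk if it is likely OR if its regret is high."""
    return risk.likelihood >= likelihood_threshold or risk.regret >= regret_threshold


# Scores are made up for illustration only.
risks = [
    Risk("dive light out of power", likelihood=4, regret=2),
    Risk("oxygen fire", likelihood=1, regret=5),
    Risk("cosmetic gear scuff", likelihood=2, regret=1),
]

planned = [r.name for r in risks if needs_plan(r)]
print(planned)  # prints ['dive light out of power', 'oxygen fire']
```

Filtering on likelihood alone would drop the oxygen fire from the list, which is the trap the slide warns about.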
31 — Ronnie Chen @rondoftw
failures will happen
32 — Ronnie Chen @rondoftw
WHAT IS SAFETY?
33 — Ronnie Chen @rondoftw
FAILURE MANAGEMENT
▸ A framework for redundancy
▸ The training and judgment to use it
34 — Ronnie Chen @rondoftw
FAILURE MANAGEMENT FOR SYSTEMS
▸ Have redundancy for systems that you cannot survive without.
▸ Have a redundant pathway to success: a procedure for graceful
degradation for systems that are important but not critical.
▸ Have a process for changing over from primary to redundant
systems.
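
In software terms, these three bullets might look like an ordered list of sources with an explicit changeover step and an optional degraded answer. The sketch below is one possible shape, not a prescribed design; `first_working`, `primary`, and `backup` are hypothetical names invented for the example:

```python
from typing import Callable, Optional, Sequence


def first_working(sources: Sequence[Callable[[], str]], degraded: Optional[str] = None) -> str:
    """Try each source in order; fall back to a degraded answer if all fail."""
    errors = []
    for source in sources:
        try:
            return source()
        except Exception as exc:
            # Changeover: record the failure and move on to the next source.
            errors.append(exc)
    if degraded is not None:
        # Important but not critical: degrade gracefully.
        return degraded
    # Critical with no degraded mode: fail loudly instead of guessing.
    raise RuntimeError(f"all {len(sources)} sources failed: {errors}")


def primary() -> str:
    raise ConnectionError("primary down")


def backup() -> str:
    return "fresh data from backup"


print(first_working([primary, backup]))  # prints fresh data from backup
print(first_working([primary], degraded="cached data (stale)"))  # prints cached data (stale)
```

Leaving `degraded` unset corresponds to a system you cannot survive without: the only acceptable outcomes are a working redundant source or a loud failure.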
35 — Ronnie Chen @rondoftw
FAILURE MANAGEMENT FOR SYSTEMS (CONT)
▸ Keep failures contained so that they don't bring down other systems
▸ Make it easy to do the right thing and hard to do the dangerous
things
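
A common software pattern for containing failures is a circuit breaker, which stops calling a dependency once it has failed repeatedly so that the failure does not spread to callers. The talk does not name this pattern; the minimal version below is offered as an assumption about how the idea might translate, and it deliberately omits the timed reset that a production breaker would usually have:

```python
class CircuitBreaker:
    """Stops calling a failing dependency after max_failures consecutive errors."""

    def __init__(self, max_failures: int = 3):
        self.max_failures = max_failures
        self.failures = 0

    @property
    def open(self) -> bool:
        return self.failures >= self.max_failures

    def call(self, func, fallback):
        if self.open:
            # Contained: the failing dependency is no longer touched.
            return fallback
        try:
            result = func()
        except Exception:
            self.failures += 1
            return fallback
        self.failures = 0  # a success resets the consecutive-failure count
        return result


def flaky():
    raise TimeoutError("dependency hung")


breaker = CircuitBreaker(max_failures=2)
results = [breaker.call(flaky, fallback="degraded") for _ in range(3)]
print(results, breaker.open)  # prints ['degraded', 'degraded', 'degraded'] True
```

Returning the fallback by default also makes the safe path the easy one: callers get a usable answer without writing their own error handling.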
36 — Ronnie Chen @rondoftw
FAILURE MANAGEMENT FOR HUMAN SYSTEMS
37 — Ronnie Chen @rondoftw
TRAINING FOR PRESSURE
38 — Ronnie Chen @rondoftw
TRAINING: INEXPERIENCED PEOPLE TO THE FRONT
▸ Most inexperienced person leads
▸ Experienced person advises and intervenes only when necessary
▸ Team is invested in personal success to ensure mission success
39 — Ronnie Chen @rondoftw
TRAINING: INEXPERIENCED PEOPLE TO THE FRONT (CONT)
▸ Frees up more experienced people from micromanaging
▸ Opportunity to revise and improve problematic systems
▸ One of the best ways to equalize a gap in experience
40 — Ronnie Chen @rondoftw
GOOD JUDGMENT
Good judgment enables the reshaping of rules and frameworks
to adapt to a changing environment.
41 — Ronnie Chen @rondoftw
REFINING JUDGMENT
▸ Post-Mortems
▸ Pre-Mortems
▸ Fire Drills
▸ Revisit Past Decisions
42 — Ronnie Chen @rondoftw
POST-MORTEMS
▸ Look at the safety procedures that failed to stop the cascade
▸ Look for opportunities to create new safety systems at critical
points
43 — Ronnie Chen @rondoftw
PRE-MORTEMS
▸ Don't wait for failures to build safety frameworks
▸ Identify potential avenues of failure and make plans for them
▸ Include both likely failures and high regret failures
44 — Ronnie Chen @rondoftw
FIRE DRILLS
▸ Vet your plans and safety systems
▸ Perform targeted training
▸ Evaluate effectiveness of tools and documentation
45 — Ronnie Chen @rondoftw
REVISIT PAST DECISIONS
▸ Examine successful operations to see what key insights were helpful
▸ Identify any dependency on luck in previous projects
▸ Share rationale for decisions
46 — Ronnie Chen @rondoftw
RECOGNIZING SUCCESS
47 — Ronnie Chen @rondoftw
I WANT TO LEARN MORE!
1. Diane Vaughan
- The Challenger Launch Decision
2. Richard I. Cook
- How Complex Systems Fail
3. Mike Mullane
- https://www.youtube.com/watch?v=Ljzj9Msli5o
4. Steve Lewis aka decodoppler
- Staying Alive
5. Sidney Dekker
- Drift into Failure
48 — Ronnie Chen @rondoftw
Any Questions?
50 — Ronnie Chen @rondoftw

Staying Alive: Patterns for Failure Management From the Bottom of the Ocean - Ronnie Chen - Codemotion Rome 2017
