SlideShare una empresa de Scribd logo
1 de 6
Descargar para leer sin conexión
itSM Solutions®
DITY™ Newsletter
Reprint
This is a reprint of an itSM Solutions® DITY™ Newsletter. Our members receive our weekly DITY Newsletter, and
have access to practical and often entertaining articles in our archives. DITY is the newsletter for IT professionals
who want a workable, practical guide to implementing ITIL best practices -- without the hype.

become a member
(It's Free. Visit http://www.itsmsolutions.com/newsletters/DITY.htm)

Publisher
itSM Solutions™ LLC
31 South Talbert Blvd #295
Lexington, NC 27292
Phone (336) 510-2885
Fax (336) 798-6296
Find us on the web at: http://www.itsmsolutions.com.
To report errors please send a note to the editor, Hank Marquis at hank.marquis@itsmsolutions.com
For information on obtaining copies of this guide contact: sales@itsmsolutions.com
Copyright © 2006 Nichols-Kuhn Group. ITIL Glossaries © Crown Copyright Office of Government Commerce. Reproduced with the
permission of the Controller of HMSO and the Office of Government Commerce.
Notice of Rights / Restricted Rights Legend
All rights reserved. Reproduction or transmittal of this guide or any portion thereof by any means whatsoever without prior written permission of
the Publisher is prohibited. All itSM Solutions products are licensed in accordance with the terms and conditions of the itSM Solutions Partner
License. No title or ownership of this guide, any portion thereof, or its contents is transferred, and any use of the guide or any portion thereof
beyond the terms of the previously mentioned license, without written authorization of the Publisher, is prohibited.
Notice of Liability
This guide is distributed "As Is," without warranty of any kind, either express or implied, respecting the content of this guide, including but not
limited to implied warranties for the guide's quality, performance, merchantability, or fitness for any particular purpose. Neither the authors, nor
itSM Solutions LLC, its dealers or distributors shall be liable with respect to any liability, loss or damage caused or alleged to have been caused
directly or indirectly by the contents of this guide.
Trademarks
itSM Solutions is a trademark of itSM Solutions LLC. Do IT Yourself™ and DITY™ are trademarks of Nichols-Kuhn Group. ITIL ® is a
Registered Trade Mark, and a Registered Community Trade Mark of the Office of Government Commerce, and is registered in the U.S. Patent
and Trademark Office, and is used here by itSM Solutions LLC under license from and with the permission of OGC (Trade Mark License No.
0002). IT Infrastructure Library ® is a Registered Trade Mark of the Office of Government Commerce and is used here by itSM Solutions LLC
under license from and with the permission of OGC (Trade Mark License No. 0002). Other product names mentioned in this guide may be
trademarks or registered trademarks of their respective companies.
Fire Fighting is What You Want

Subscribe

Vol. 2.49

PDF Download

Back Issues

DECEMBER 20, 2006

Fire Fighting is What
You Want
DITY Weekly Reader
The workable, practical guide to Do IT
Yourself
Many IT organizations think they are ‘fighting fires’, and that this is a bad
thing. I disagree. I think its insanity, and that every IT organization and
every person working in IT should strive to become more like firefighters.

http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (1 of 5)12/20/2006 6:04:23 AM
Fire Fighting is What You Want

By Hank Marquis

Albert Einstein once said “The definition of insanity is doing the same thing over
and over again and expecting different results”. Think about that quote for a
moment and ask yourself if this applies to the way your IT organization operates.

hank

Have you or your staff been fighting the same fires time and time again? Have
you been doing the same thing over and over again expecting different results?

MARQUIS
Articles
E-mail
Bio

Many people equate “fighting fires” with being reactive and even chaotic. I have
a different idea. It seems to me that reactive IT organizations are not “fighting
fires”, because if they were they would be a well-oiled smoothly operating
operation.

Think about fire fighters and what fire fighters really do:
q

q

q

q

Team members from the various areas come together at the fire house, each knowing their
role, duties, and position in the team
They don’t quibble about who is going to drive the fire truck, or complain that they always have
to open the door to the firehouse
They assemble their tools, perform the required tasks, complete the mission, and then disband
until required again
They make every effort after putting out the fire to determine why and how it happened,
uncover the root cause, and take steps to make sure it never happens again.

Chasing the same problems day after day is not fire fighting – its insanity. Many IT workers today
have an attitude of “If It Ain’t Broke Then Don’t Fix It”, but if you are living Einstein’s definition
of insanity than it is surely broken!
Based on my experience with many IT organizations, this article describes how Problem
Management can help stop the insanity, and turn you and your team into real firefighters!

Root Cause Analysis
All IT outages occur from some combination of equipment, software and human error. ITIL
Problem Management exists to establish root cause, examine proposed solutions, and to perform
trend analysis.
What most miss is that the underlying root causes of most failures are often traceable to weak
management; that is, procedures, programs and policies with errors or omissions. Such
management weaknesses set the stage for failure and enable individuals to make mistakes. This is
the real cause of most reactive organizational response to failures, incorrectly called “fire fighting”
by many in IT.

http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (2 of 5)12/20/2006 6:04:23 AM
Fire Fighting is What You Want

Root causes are the most basic causes of an outage or failure event. Root causes have to meet the
following conditions:
q

They can be identified

q

Management has control over them

Often for a single failure event there is more than one underlying root cause, yet most
troubleshooters stop at the first root cause, which is usually the Configuration Item (CI) that
failed or needs to change. This is NOT root cause. Root cause is the reason for the failure; not
simply identification of the failed CI.
If the real root causes are not found and corrected, the underlying management weaknesses will
lead to similar if not identical other failures—and this is the cause of the insanity within IT today.
For example, an IT organization claims not to have the time to perform preventative maintenance
since they are so busy responding to failures. In this example, the failure symptoms (e.g., failing
CI) are failing power supplies and other server components.
It is a failure of management to stop root-cause analysis after identifying which faulty component
in the server to replace since the components will simply fail again without solving the true rootcause. Getting back to our example of failing server components, perhaps the true root cause is
that the exhaust fan screens are clogged with dust; which limits airflow; which increases internal
server temperatures; which causes components to go non-linear and fail.
A bit of management applied to the situation would try to determine the managerial (e.g., true)
root cause of the physical root cause of the failure and remove it from the equation. Thus, the root
cause is failure to perform preventative maintenance. If IT would vacuum the screens regularly,
they would have fewer failures. The management root cause is failure to allocate and authorize
resources to perform preventative maintenance. Without addressing this issue, the server will
continue to experience regular failures.
Interestingly, most trapped in the endless loop of Einstein’s insanity believe they don’t have time
to perform root cause analysis; yet they always have time and resource to fix a failure...

A Higher Standard
Root Cause Analysis (RCA) provides the means to determine how and why an event or failure
happened. Understanding what happened, for example, that a power supply failed in the server is
simply not enough. We also need to know why management allowed it to happen, and make no
mistake, every failure is ultimately the result of some form of management error or omission.
Understanding what happened shows you the symptoms (e.g., failure physical root cause), but
does not show you the underlying condition that requires correction (e.g., management control
http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (3 of 5)12/20/2006 6:04:23 AM
Fire Fighting is What You Want

root cause.) Failure events are just the symptoms of the underlying shortcomings of the
administrative controls that are supposed to keep failures from happening in the first place. This
is a very different thought process than that employed by most within IT today.
Failure to address the underlying management root cause simply sets the stage for the next
physical failure event. Sound RCA is a deeper analysis of the underlying conditions to uncover and
then correct those omissions that will contribute to future failures. Key features of RCA include:
q
q
q

Understanding how a failure occurred; not just what occurred
Discovering the underlying management weakness of the key contributors to the failure
Developing and implementing practical and effective solutions for preventing future failures

While this may sound like what you are already doing, RCA is actually very different as experience
shows that most technicians operate from intuition and assumption. One study shows that most
technicians have jumped to an intuitive conclusion of what they plan to do, and formulate a
strategy before they even finish reading the problem description! Key differences between RCA
and traditional problem solving include:
q
q
q
q
q
q

Reasoning through cause-effect instead of jumping to conclusions
Focusing on facts versus supposition and intuition
Considering a range of possibilities
Thinking about procedural (management) failures vs. technical failures
Discovering multiple root causes
Trending data in order to discover systemic reasons for failure across more than one failure

Formal ITIL Problem Management include Problem Control (discovers root cause), Error Control
(evaluates root cause data), Major Problem Review (deeper analysis of specific Problems), and
Proactive Problem Management (trending.) These activities provide a very workable workflow
and set of tasks that can easily help discover those management failures that keep IT staff “doing
the same thing over and over again and expecting different results.”
RCA is not over until you have identified the management failure that allowed, enabled, or
encouraged the hardware, software, or person failure. ITIL describes a Known Error as the root
cause defined (e.g., what requires physically change to restore service), and a workaround
established. I would like to add another requirement – documentation of what requires
management change to prevent similar outages as well.
So consider adding another level of analysis to your existing troubleshooting—make an effort to
discover the missing management control that set the stage for the physical root cause. After
evaluating and trending a series of “management root causes” you will discover those key controls
that you can easily implement to reduce future failures. Your reward will be true firefighting, and
much more time to proactively eliminate even more physical and managerial root causes.
So, the next time someone tells you that fighting fires is a symptom of failing IT organization, tell
them that’s insanity, and your goal is to be more like a fire fighter!
http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (4 of 5)12/20/2006 6:04:23 AM
Fire Fighting is What You Want

-Where to go from here:
q
q
q

Subscribe to our newsletter and get new skills delivered right to your Inbox, click here.
Download this article in PDF format for use at your own convenience, click here.
Browse back-issues of the DITY Newsletter, click here.

Related articles:
q

“How To Measure IT Service Quality” by Hank Marquis for how to establish cost of downtime

Entire Contents © 2006 itSM Solutions LLC. All Rights Reserved.

http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (5 of 5)12/20/2006 6:04:23 AM

Más contenido relacionado

Destacado (17)

Dit yvol3iss50
Dit yvol3iss50Dit yvol3iss50
Dit yvol3iss50
 
Dit yvol4iss30
Dit yvol4iss30Dit yvol4iss30
Dit yvol4iss30
 
Dit yvol3iss33
Dit yvol3iss33Dit yvol3iss33
Dit yvol3iss33
 
Dit yvol4iss19
Dit yvol4iss19Dit yvol4iss19
Dit yvol4iss19
 
Dit yvol4iss34
Dit yvol4iss34Dit yvol4iss34
Dit yvol4iss34
 
Dit yvol4iss46
Dit yvol4iss46Dit yvol4iss46
Dit yvol4iss46
 
Dit yvol2iss22
Dit yvol2iss22Dit yvol2iss22
Dit yvol2iss22
 
Dit yvol5iss39
Dit yvol5iss39Dit yvol5iss39
Dit yvol5iss39
 
Dit yvol4iss14
Dit yvol4iss14Dit yvol4iss14
Dit yvol4iss14
 
Dit yvol5iss40
Dit yvol5iss40Dit yvol5iss40
Dit yvol5iss40
 
Dit yvol3iss7
Dit yvol3iss7Dit yvol3iss7
Dit yvol3iss7
 
Dit yvol2iss3
Dit yvol2iss3Dit yvol2iss3
Dit yvol2iss3
 
Dit yvol3iss16
Dit yvol3iss16Dit yvol3iss16
Dit yvol3iss16
 
Open it sm solutions final
Open it sm solutions   finalOpen it sm solutions   final
Open it sm solutions final
 
Dit yvol6iss23
Dit yvol6iss23Dit yvol6iss23
Dit yvol6iss23
 
Dit yvol3iss27
Dit yvol3iss27Dit yvol3iss27
Dit yvol3iss27
 
Dit yvol1iss3
Dit yvol1iss3Dit yvol1iss3
Dit yvol1iss3
 

Similar a Dit yvol2iss49 (20)

Dit yvol2iss17
Dit yvol2iss17Dit yvol2iss17
Dit yvol2iss17
 
Dit yvol2iss32
Dit yvol2iss32Dit yvol2iss32
Dit yvol2iss32
 
Dit yvol1iss7
Dit yvol1iss7Dit yvol1iss7
Dit yvol1iss7
 
Dit yvol2iss30
Dit yvol2iss30Dit yvol2iss30
Dit yvol2iss30
 
Dit yvol1iss4
Dit yvol1iss4Dit yvol1iss4
Dit yvol1iss4
 
Dit yvol2iss50
Dit yvol2iss50Dit yvol2iss50
Dit yvol2iss50
 
Dit yvol3iss6
Dit yvol3iss6Dit yvol3iss6
Dit yvol3iss6
 
Dit yvol2iss19
Dit yvol2iss19Dit yvol2iss19
Dit yvol2iss19
 
Dit yvol2iss24
Dit yvol2iss24Dit yvol2iss24
Dit yvol2iss24
 
Dit yvol2iss16
Dit yvol2iss16Dit yvol2iss16
Dit yvol2iss16
 
Dit yvol2iss45
Dit yvol2iss45Dit yvol2iss45
Dit yvol2iss45
 
Dit yvol3iss3
Dit yvol3iss3Dit yvol3iss3
Dit yvol3iss3
 
Dit yvol2iss26
Dit yvol2iss26Dit yvol2iss26
Dit yvol2iss26
 
Dit yvol2iss38
Dit yvol2iss38Dit yvol2iss38
Dit yvol2iss38
 
Dit yvol3iss10
Dit yvol3iss10Dit yvol3iss10
Dit yvol3iss10
 
Dit yvol2iss23
Dit yvol2iss23Dit yvol2iss23
Dit yvol2iss23
 
Dit yvol3iss12
Dit yvol3iss12Dit yvol3iss12
Dit yvol3iss12
 
Dit yvol2iss27
Dit yvol2iss27Dit yvol2iss27
Dit yvol2iss27
 
Dit yvol2iss41
Dit yvol2iss41Dit yvol2iss41
Dit yvol2iss41
 
Dit yvol2iss14
Dit yvol2iss14Dit yvol2iss14
Dit yvol2iss14
 

Más de Rick Lemieux

Más de Rick Lemieux (20)

IT Service Management (ITSM) Model for Business & IT Alignement
IT Service Management (ITSM) Model for Business & IT AlignementIT Service Management (ITSM) Model for Business & IT Alignement
IT Service Management (ITSM) Model for Business & IT Alignement
 
Dit yvol5iss41
Dit yvol5iss41Dit yvol5iss41
Dit yvol5iss41
 
Dit yvol5iss38
Dit yvol5iss38Dit yvol5iss38
Dit yvol5iss38
 
Dit yvol5iss37
Dit yvol5iss37Dit yvol5iss37
Dit yvol5iss37
 
Dit yvol5iss36
Dit yvol5iss36Dit yvol5iss36
Dit yvol5iss36
 
Dit yvol5iss35
Dit yvol5iss35Dit yvol5iss35
Dit yvol5iss35
 
Dit yvol5iss34
Dit yvol5iss34Dit yvol5iss34
Dit yvol5iss34
 
Dit yvol5iss31
Dit yvol5iss31Dit yvol5iss31
Dit yvol5iss31
 
Dit yvol5iss33
Dit yvol5iss33Dit yvol5iss33
Dit yvol5iss33
 
Dit yvol5iss32
Dit yvol5iss32Dit yvol5iss32
Dit yvol5iss32
 
Dit yvol5iss30
Dit yvol5iss30Dit yvol5iss30
Dit yvol5iss30
 
Dit yvol5iss29
Dit yvol5iss29Dit yvol5iss29
Dit yvol5iss29
 
Dit yvol5iss28
Dit yvol5iss28Dit yvol5iss28
Dit yvol5iss28
 
Dit yvol5iss26
Dit yvol5iss26Dit yvol5iss26
Dit yvol5iss26
 
Dit yvol5iss25
Dit yvol5iss25Dit yvol5iss25
Dit yvol5iss25
 
Dit yvol5iss24
Dit yvol5iss24Dit yvol5iss24
Dit yvol5iss24
 
Dit yvol5iss23
Dit yvol5iss23Dit yvol5iss23
Dit yvol5iss23
 
Dit yvol5iss22
Dit yvol5iss22Dit yvol5iss22
Dit yvol5iss22
 
Dit yvol5iss21
Dit yvol5iss21Dit yvol5iss21
Dit yvol5iss21
 
Dit yvol5iss20
Dit yvol5iss20Dit yvol5iss20
Dit yvol5iss20
 

Último

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Último (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Dit yvol2iss49

  • 1. itSM Solutions® DITY™ Newsletter Reprint This is a reprint of an itSM Solutions® DITY™ Newsletter. Our members receive our weekly DITY Newsletter, and have access to practical and often entertaining articles in our archives. DITY is the newsletter for IT professionals who want a workable, practical guide to implementing ITIL best practices -- without the hype. become a member (It's Free. Visit http://www.itsmsolutions.com/newsletters/DITY.htm) Publisher itSM Solutions™ LLC 31 South Talbert Blvd #295 Lexington, NC 27292 Phone (336) 510-2885 Fax (336) 798-6296 Find us on the web at: http://www.itsmsolutions.com. To report errors please send a note to the editor, Hank Marquis at hank.marquis@itsmsolutions.com For information on obtaining copies of this guide contact: sales@itsmsolutions.com Copyright © 2006 Nichols-Kuhn Group. ITIL Glossaries © Crown Copyright Office of Government Commerce. Reproduced with the permission of the Controller of HMSO and the Office of Government Commerce. Notice of Rights / Restricted Rights Legend All rights reserved. Reproduction or transmittal of this guide or any portion thereof by any means whatsoever without prior written permission of the Publisher is prohibited. All itSM Solutions products are licensed in accordance with the terms and conditions of the itSM Solutions Partner License. No title or ownership of this guide, any portion thereof, or its contents is transferred, and any use of the guide or any portion thereof beyond the terms of the previously mentioned license, without written authorization of the Publisher, is prohibited. Notice of Liability This guide is distributed "As Is," without warranty of any kind, either express or implied, respecting the content of this guide, including but not limited to implied warranties for the guide's quality, performance, merchantability, or fitness for any particular purpose. Neither the authors, nor itSM Solutions LLC, its dealers or distributors shall be liable with respect to any liability, loss or damage caused or alleged to have been caused directly or indirectly by the contents of this guide. Trademarks itSM Solutions is a trademark of itSM Solutions LLC. Do IT Yourself™ and DITY™ are trademarks of Nichols-Kuhn Group. ITIL ® is a Registered Trade Mark, and a Registered Community Trade Mark of the Office of Government Commerce, and is registered in the U.S. Patent and Trademark Office, and is used here by itSM Solutions LLC under license from and with the permission of OGC (Trade Mark License No. 0002). IT Infrastructure Library ® is a Registered Trade Mark of the Office of Government Commerce and is used here by itSM Solutions LLC under license from and with the permission of OGC (Trade Mark License No. 0002). Other product names mentioned in this guide may be trademarks or registered trademarks of their respective companies.
  • 2. Fire Fighting is What You Want Subscribe Vol. 2.49 PDF Download Back Issues DECEMBER 20, 2006 Fire Fighting is What You Want DITY Weekly Reader The workable, practical guide to Do IT Yourself Many IT organizations think they are ‘fighting fires’, and that this is a bad thing. I disagree. I think its insanity, and that every IT organization and every person working in IT should strive to become more like firefighters. http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (1 of 5)12/20/2006 6:04:23 AM
  • 3. Fire Fighting is What You Want By Hank Marquis Albert Einstein once said “The definition of insanity is doing the same thing over and over again and expecting different results”. Think about that quote for a moment and ask yourself if this applies to the way your IT organization operates. hank Have you or your staff been fighting the same fires time and time again? Have you been doing the same thing over and over again expecting different results? MARQUIS Articles E-mail Bio Many people equate “fighting fires” with being reactive and even chaotic. I have a different idea. It seems to me that reactive IT organizations are not “fighting fires”, because if they were they would be a well-oiled smoothly operating operation. Think about fire fighters and what fire fighters really do: q q q q Team members from the various areas come together at the fire house, each knowing their role, duties, and position in the team They don’t quibble about who is going to drive the fire truck, or complain that they always have to open the door to the firehouse They assemble their tools, perform the required tasks, complete the mission, and then disband until required again They make every effort after putting out the fire to determine why and how it happened, uncover the root cause, and take steps to make sure it never happens again. Chasing the same problems day after day is not fire fighting – its insanity. Many IT workers today have an attitude of “If It Ain’t Broke Then Don’t Fix It”, but if you are living Einstein’s definition of insanity than it is surely broken! Based on my experience with many IT organizations, this article describes how Problem Management can help stop the insanity, and turn you and your team into real firefighters! Root Cause Analysis All IT outages occur from some combination of equipment, software and human error. ITIL Problem Management exists to establish root cause, examine proposed solutions, and to perform trend analysis. What most miss is that the underlying root causes of most failures are often traceable to weak management; that is, procedures, programs and policies with errors or omissions. Such management weaknesses set the stage for failure and enable individuals to make mistakes. This is the real cause of most reactive organizational response to failures, incorrectly called “fire fighting” by many in IT. http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (2 of 5)12/20/2006 6:04:23 AM
  • 4. Fire Fighting is What You Want Root causes are the most basic causes of an outage or failure event. Root causes have to meet the following conditions: q They can be identified q Management has control over them Often for a single failure event there is more than one underlying root cause, yet most troubleshooters stop at the first root cause, which is usually the Configuration Item (CI) that failed or needs to change. This is NOT root cause. Root cause is the reason for the failure; not simply identification of the failed CI. If the real root causes are not found and corrected, the underlying management weaknesses will lead to similar if not identical other failures—and this is the cause of the insanity within IT today. For example, an IT organization claims not to have the time to perform preventative maintenance since they are so busy responding to failures. In this example, the failure symptoms (e.g., failing CI) are failing power supplies and other server components. It is a failure of management to stop root-cause analysis after identifying which faulty component in the server to replace since the components will simply fail again without solving the true rootcause. Getting back to our example of failing server components, perhaps the true root cause is that the exhaust fan screens are clogged with dust; which limits airflow; which increases internal server temperatures; which causes components to go non-linear and fail. A bit of management applied to the situation would try to determine the managerial (e.g., true) root cause of the physical root cause of the failure and remove it from the equation. Thus, the root cause is failure to perform preventative maintenance. If IT would vacuum the screens regularly, they would have fewer failures. The management root cause is failure to allocate and authorize resources to perform preventative maintenance. Without addressing this issue, the server will continue to experience regular failures. Interestingly, most trapped in the endless loop of Einstein’s insanity believe they don’t have time to perform root cause analysis; yet they always have time and resource to fix a failure... A Higher Standard Root Cause Analysis (RCA) provides the means to determine how and why an event or failure happened. Understanding what happened, for example, that a power supply failed in the server is simply not enough. We also need to know why management allowed it to happen, and make no mistake, every failure is ultimately the result of some form of management error or omission. Understanding what happened shows you the symptoms (e.g., failure physical root cause), but does not show you the underlying condition that requires correction (e.g., management control http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (3 of 5)12/20/2006 6:04:23 AM
  • 5. Fire Fighting is What You Want root cause.) Failure events are just the symptoms of the underlying shortcomings of the administrative controls that are supposed to keep failures from happening in the first place. This is a very different thought process than that employed by most within IT today. Failure to address the underlying management root cause simply sets the stage for the next physical failure event. Sound RCA is a deeper analysis of the underlying conditions to uncover and then correct those omissions that will contribute to future failures. Key features of RCA include: q q q Understanding how a failure occurred; not just what occurred Discovering the underlying management weakness of the key contributors to the failure Developing and implementing practical and effective solutions for preventing future failures While this may sound like what you are already doing, RCA is actually very different as experience shows that most technicians operate from intuition and assumption. One study shows that most technicians have jumped to an intuitive conclusion of what they plan to do, and formulate a strategy before they even finish reading the problem description! Key differences between RCA and traditional problem solving include: q q q q q q Reasoning through cause-effect instead of jumping to conclusions Focusing on facts versus supposition and intuition Considering a range of possibilities Thinking about procedural (management) failures vs. technical failures Discovering multiple root causes Trending data in order to discover systemic reasons for failure across more than one failure Formal ITIL Problem Management include Problem Control (discovers root cause), Error Control (evaluates root cause data), Major Problem Review (deeper analysis of specific Problems), and Proactive Problem Management (trending.) These activities provide a very workable workflow and set of tasks that can easily help discover those management failures that keep IT staff “doing the same thing over and over again and expecting different results.” RCA is not over until you have identified the management failure that allowed, enabled, or encouraged the hardware, software, or person failure. ITIL describes a Known Error as the root cause defined (e.g., what requires physically change to restore service), and a workaround established. I would like to add another requirement – documentation of what requires management change to prevent similar outages as well. So consider adding another level of analysis to your existing troubleshooting—make an effort to discover the missing management control that set the stage for the physical root cause. After evaluating and trending a series of “management root causes” you will discover those key controls that you can easily implement to reduce future failures. Your reward will be true firefighting, and much more time to proactively eliminate even more physical and managerial root causes. So, the next time someone tells you that fighting fires is a symptom of failing IT organization, tell them that’s insanity, and your goal is to be more like a fire fighter! http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (4 of 5)12/20/2006 6:04:23 AM
  • 6. Fire Fighting is What You Want -Where to go from here: q q q Subscribe to our newsletter and get new skills delivered right to your Inbox, click here. Download this article in PDF format for use at your own convenience, click here. Browse back-issues of the DITY Newsletter, click here. Related articles: q “How To Measure IT Service Quality” by Hank Marquis for how to establish cost of downtime Entire Contents © 2006 itSM Solutions LLC. All Rights Reserved. http://www.itsmsolutions.com/newsletters/DITYvol2iss49.htm (5 of 5)12/20/2006 6:04:23 AM