SlideShare una empresa de Scribd logo
1 de 35
How not to release software
            Laura Thomson
    laura@{mozilla,laurathomson}.com
                  @lxt
Conference speakers and book authors are
people too. Despite our smug demeanor,
we also fail.

Frequently.
This talk sparked by the release-that-shall-
not-be named.

It went so badly that the version number
was retired.

After 24 hours, people started bringing in
casseroles.
Thus I present this catalogue of failure, a
compendium of things not to do when you
want to release any kind of software.
Methodology
Agile fail #1

Cynical basis of agile: estimation is intractable
Avoid estimation by working in short sprints
- surely we can estimate the next two
weeks, right?
Right?
Sprint fail #1: rollover
We will build these five thingies/bugs/stories
in the next two week sprint
Two are more complex than anticipated
One is blocked
Ship two, roll the other three over
Rinse, repeat
Sprint fail #2:
         Mara-sprint
We can’t get these five things done. Should
we ship with only two?
If we just push the deadline out a week,
everything will be fine.
And then another.
And then another.
Sprint fail #3:
        Death sprints
We HAVE to get these five things done
The deadline is an immovable object
We’ll just work really really late or all night
every night for the next two weeks
And then the same thing again....and
again...and again
Things I’ve learned
Plan the dimension of flex and choose:
     Less functionality in the same time
     Flexible timelines or slower velocity
     Continuous deployment
     Train model
Coding
Coding fail #1:
Build a new bikeshed
Sitting down to code when you don’t know
what you’re doing
                      vs
Analysis paralysis/bikeshedding: getting stuck
talking through a complex problem, over and
over
Coding fail #2:
      One True Ring
There’s one right way to solve this problem
Complex
Going to take a long time
Scope keeps expanding
Coding fail #3:
    Over-engineering
Closely related to the One True Ring
Beautiful class hierarchy and architecture
Lovely diagrams
It doesn’t do anything
Things I’ve learned
Different people work in different ways.
Some people deal better with uncertainty.
Those people should build a prototype.
Do The Simplest Thing That Could Possibly
Work
Be happy with 60-80% solutions. Iterate.
Even if you’re not a startup, Minimum Viable
Product is valid.
Testing
Test fail #1:
The Null Hypothesis


Have no tests.
Test fail #2:
     Epic confidence

Have some tests, but no idea about
coverage.
Ensures you have a great level of over-
confidence going into a release.
Test fail #3:
UR DOING IT RONG
Test the wrong things.
99% unit test coverage, no load tests.
Push something out and don’t realize that it’s
too slow, causes data or user loss, until too
late.
Things I’ve learned
Know your coverage and your limitations
Use CI
Use browser testing
Use load and smoke testing
Get QA’s opinion on whether you’re ready
to ship something
Release and
deployment
Deployment fail #1:
   Version chaos
Push master/trunk/head.
Have no other branches.
Have no tags.
Have things that aren’t under version
control.
Deployment fail #2:
Fail forward, fail back

Have no rollback plan.
Have no ability to fail forward, either.
If failing forward fails, have no backup plan.
Deployment fail #3:
   Manual pushes
Release by manually logging into each
machine and updating a checkout.
This is even better under high load.
Scaling is something best not talked about.
Deployment fail #4:
ALTER TABLE ADD FAIL
Require manual changes to databases. Have
no record of the changes in version control.
Don’t estimate how long these changes will
take, how much they will degrade
performance, or how much downtime will be
required before kicking them off.
Deployment fail #5:
  Config of Doom

Require manual changes to configuration,
especially if there are more than two
machines involved - 30 or more is especially
good.
Deployment fail #6:
Single Person Of Fail

Have only one person that knows how to
deploy your stuff.
(Don’t document any of it.)
Deployment fail #7:
 It Worked In Staging
Staging not the same as prod:
Different OS, packages, hardware
Only one of something where there are >1
in prod (classic examples: db replication,
memcache)
Things I’ve learned
Have a release management system
Build the capability to deploy continuously
Automate everything: build, test, deploy
Use configuration management
Document everything
Load test when needed
Eliminate SPOFs, including people
Staging should reflect production
Operations
Ops fail #1:
         Devops Fail
Ops and Dev don’t speak to, work with, or
trust each other


Most communication takes place through
aggressive ticket filing and email
Ops fail #2:
      Dilettante devs
Stuff breaks at 2am
Devs have not documented anything and
aren’t available to fix it
Ops gets the blame for extended downtime
Ops fail #3:
 Telepathic troubleshooting

Stuff breaks
Devs have to troubleshoot via remote:
Did you look at this log, what did it say?
What is this config value set to?
Things I’ve learned
Everybody who works on a system is
responsible for it
Cross train your teams
Open up systems to devs for read access to
config + logging
Troubleshoot together
Questions?
laura@mozilla.com
   PS We’re hiring

Más contenido relacionado

Último

1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPTiSEO AI
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...FIDO Alliance
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...FIDO Alliance
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfFIDO Alliance
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024Stephanie Beckett
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfFIDO Alliance
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfSrushith Repakula
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...CzechDreamin
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...marcuskenyatta275
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераMark Opanasiuk
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfFIDO Alliance
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxJennifer Lim
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty SecureFemke de Vroome
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FIDO Alliance
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 

Último (20)

Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
1111 ChatGPT Prompts PDF Free Download - Prompts for ChatGPT
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdfLinux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
Linux Foundation Edge _ Overview of FDO Software Components _ Randy at Intel.pdf
 
What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024What's New in Teams Calling, Meetings and Devices April 2024
What's New in Teams Calling, Meetings and Devices April 2024
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024Enterprise Knowledge Graphs - Data Summit 2024
Enterprise Knowledge Graphs - Data Summit 2024
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptxWSO2CONMay2024OpenSourceConferenceDebrief.pptx
WSO2CONMay2024OpenSourceConferenceDebrief.pptx
 
ECS 2024 Teams Premium - Pretty Secure
ECS 2024   Teams Premium - Pretty SecureECS 2024   Teams Premium - Pretty Secure
ECS 2024 Teams Premium - Pretty Secure
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 

Destacado

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Destacado (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

How not to_release_software_read_version

  • 1. How not to release software Laura Thomson laura@{mozilla,laurathomson}.com @lxt
  • 2. Conference speakers and book authors are people too. Despite our smug demeanor, we also fail. Frequently.
  • 3. This talk sparked by the release-that-shall- not-be named. It went so badly that the version number was retired. After 24 hours, people started bringing in casseroles.
  • 4. Thus I present this catalogue of failure, a compendium of things not to do when you want to release any kind of software.
  • 6. Agile fail #1 Cynical basis of agile: estimation is intractable Avoid estimation by working in short sprints - surely we can estimate the next two weeks, right? Right?
  • 7. Sprint fail #1: rollover We will build these five thingies/bugs/stories in the next two week sprint Two are more complex than anticipated One is blocked Ship two, roll the other three over Rinse, repeat
  • 8. Sprint fail #2: Mara-sprint We can’t get these five things done. Should we ship with only two? If we just push the deadline out a week, everything will be fine. And then another. And then another.
  • 9. Sprint fail #3: Death sprints We HAVE to get these five things done The deadline is an immovable object We’ll just work really really late or all night every night for the next two weeks And then the same thing again....and again...and again
  • 10. Things I’ve learned Plan the dimension of flex and choose: Less functionality in the same time Flexible timelines or slower velocity Continuous deployment Train model
  • 12. Coding fail #1: Build a new bikeshed Sitting down to code when you don’t know what you’re doing vs Analysis paralysis/bikeshedding: getting stuck talking through a complex problem, over and over
  • 13. Coding fail #2: One True Ring There’s one right way to solve this problem Complex Going to take a long time Scope keeps expanding
  • 14. Coding fail #3: Over-engineering Closely related to the One True Ring Beautiful class hierarchy and architecture Lovely diagrams It doesn’t do anything
  • 15. Things I’ve learned Different people work in different ways. Some people deal better with uncertainty. Those people should build a prototype. Do The Simplest Thing That Could Possibly Work Be happy with 60-80% solutions. Iterate. Even if you’re not a startup, Minimum Viable Product is valid.
  • 17. Test fail #1: The Null Hypothesis Have no tests.
  • 18. Test fail #2: Epic confidence Have some tests, but no idea about coverage. Ensures you have a great level of over- confidence going into a release.
  • 19. Test fail #3: UR DOING IT RONG Test the wrong things. 99% unit test coverage, no load tests. Push something out and don’t realize that it’s too slow, causes data or user loss, until too late.
  • 20. Things I’ve learned Know your coverage and your limitations Use CI Use browser testing Use load and smoke testing Get QA’s opinion on whether you’re ready to ship something
  • 22. Deployment fail #1: Version chaos Push master/trunk/head. Have no other branches. Have no tags. Have things that aren’t under version control.
  • 23. Deployment fail #2: Fail forward, fail back Have no rollback plan. Have no ability to fail forward, either. If failing forward fails, have no backup plan.
  • 24. Deployment fail #3: Manual pushes Release by manually logging into each machine and updating a checkout. This is even better under high load. Scaling is something best not talked about.
  • 25. Deployment fail #4: ALTER TABLE ADD FAIL Require manual changes to databases. Have no record of the changes in version control. Don’t estimate how long these changes will take, how much they will degrade performance, or how much downtime will be required before kicking them off.
  • 26. Deployment fail #5: Config of Doom Require manual changes to configuration, especially if there are more than two machines involved - 30 or more is especially good.
  • 27. Deployment fail #6: Single Person Of Fail Have only one person that knows how to deploy your stuff. (Don’t document any of it.)
  • 28. Deployment fail #7: It Worked In Staging Staging not the same as prod: Different OS, packages, hardware Only one of something where there are >1 in prod (classic examples: db replication, memcache)
  • 29. Things I’ve learned Have a release management system Build the capability to deploy continuously Automate everything: build, test, deploy Use configuration management Document everything Load test when needed Eliminate SPOFs, including people Staging should reflect production
  • 31. Ops fail #1: Devops Fail Ops and Dev don’t speak to, work with, or trust each other Most communication takes place through aggressive ticket filing and email
  • 32. Ops fail #2: Dilettante devs Stuff breaks at 2am Devs have not documented anything and aren’t available to fix it Ops gets the blame for extended downtime
  • 33. Ops fail #3: Telepathic troubleshooting Stuff breaks Devs have to troubleshoot via remote: Did you look at this log, what did it say? What is this config value set to?
  • 34. Things I’ve learned Everybody who works on a system is responsible for it Cross train your teams Open up systems to devs for read access to config + logging Troubleshoot together
  • 35. Questions? laura@mozilla.com PS We’re hiring

Notas del editor

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. \n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n