SlideShare una empresa de Scribd logo
1 de 48
www.productschool.com
The Scientific Method of
Experimentation by Google PM
Join 35,000+Product
Managers on
Free Resources
Discover great job
opportunities
Job Portal
prdct.school/PSJobPortalprdct.school/events-slack
C O U R S E S
Product
Management
Learn the skills you need to land
a Product Manager job
C O U R S E S
Coding
for Managers
Build a website and gain the
technical knowledge to lead
software engineers
C O U R S E S
Data Analytics
for Managers
Learn the skills to understand web
analytics, SQL and machine learning
concepts
C O U R S E S
Learn how to acquire more users
and convert them into clients
Digital Marketing
for Managers
C O U R S E S
UX Design
for Managers
Gain a deeper understanding of your
users and deliver an exceptional end-to-
end experience
C O U R S E S
For experienced Product Managers
looking to gain strategic skills needed
for top leadership roles
Product
Leadership
C O U R S E S
Corporate
Training
Level up your team’s Product
Management skills
Ruben Lozano
T O N I G H T ’ S S P E A K E R
The scientific method
of experimentation
Seattle, WA | June 25, 2019
A/B
User Research
101
For Product Managers
What is user research?
Systematic approach to discovering
users' aspirations, goals, tasks, needs,
pain points, and information and
interaction requirements.
User research grounds, verifies, and
validates what a team builds.
Where does user research fit to product?
Iterative research
Foundational research
Evaluative research
PRODUCT DEVELOPMENT
Dimensions of user research methods
Context
Natural or near-natural
Scripted
Not using the product
A hybrid of the above
Attitudinal
what people say
Behavioral
what people do
1
Qualitative
answers why
Quantitative
answers how
much/many
2
3
Source: https://www.nngroup.com/articles/which-ux-research-methods/
Experiments
Experiments 101
What is an experiment?
What is an experiment?
An experiment is a way to test a hypothesis about the
product.
An experiment may also refer to the gradual launch of a
new feature.
LIVE
EVAL
Note: Tests, while they are an important part of the software development
journey, are not experiments, since you know in advance the result you expect
Why run live
experiments?
I’m a PM. I know what will happen.
Humans are terrible at
making predictions
1. Hindsight bias
2. Observational selection bias
3. Projection bias
4. Anchoring bias
… and hundreds of cognitive biases...
Doing a pre/post analysis is enough
Brazil Search Traffic
June 2014
A/B isolates the impact of
just the product changes
Is something a good idea?
A B
ORGANIC HONEYCRISP APPLE
GRANNY SMITH APPLE
FUJI APPLE
ORGANIC HONEYCRISP APPLE
GRANNY SMITH APPLE
Live
experiments
are not
magic wands
“I suppose it is tempting, if the only
tool you have is a hammer, to treat
everything as if it were a nail.‘”
Abraham Maslow
bring the science
Fundamentals of experiment design
The scientific method is an empirical method of acquiring knowledge. It is the
systematic observation, measurement, and experimentation of a hypothesis.
Observation1 Hypothesis2 Design3
Experiment4 Analysis5 Prove/Reject6
PM flavor of scientific method
Observation1 Hypothesis2 Design3
Experiment4 Analysis5 Prove/Reject6
Ask a question0
Communicate results7
0. Ask a question
How can I increase usage of my product?
How can I increase revenue attributed to my product?
How can I increase user happiness?
How can I simplify code without changing metrics?
How can I affect click behavior?
1. Observation: do background research
What others have done before
Are you doing something different?
Did something change since the previous attempt?
Quantitative data
Behavioral metrics
Surveys
Trends
Qualitative data
Perceptions
Attitudes
Assumptions
Preferences
2. Develop a hypothesis
A (1) testable (2) explanation for a
phenomenon.
The goal of an experiment is to prove or
disprove the hypothesis.
AVOID running experiments to see what happens
or to gather data with no hypothesis. Use other
user research methods and have a POV.
2. Develop a hypothesis
Example
1. Ask a question
a. How can I increase sales for Prime users on the mobile
app?
2. Do background research
a. Users had troubles finding filters on mobile
b. Users get overwhelmed with too many results
c. Decreasing options simplifies decision-making
d. BUT, past experiments limiting results had negative
results
2. Develop a hypothesis
Hypothesis:
Prime users will spend more $ if they can easily narrow their
search results to prime products
Is it valid?
● Is it testable?
● Does it have an explanation?
● Do I have an educated guess?
3. Design experiment
Hypothesis:
Prime users will spend more $ if they can easily narrow their
search results to prime products
Design experiment:
1. Show a prime toggle on the navigation bar for all US prime
users on the iOS app
2. Toggle off by default
3. No changes to
a. Backend algorithms
b. Logic that decides when to enable the prime filter
c. Current prime filter behind the filter button
3. Design experiment
BTriggering criteria
● Who: US prime users using iOS app
● When: If results include a prime product
● How: Session-based
Duration
● 2 weeks
Launch criteria (success metric)
● Statistically significant increase in revenue
● No increase in latency
4. Run experiment
A B
5. Analyze the data
BResults
+2.5% Revenue [1.9%, 3.1%], p=0.05
5. Analyze the data
1. Statistical significance is the likelihood that the numeric difference
between a control and treatment outcome is not due to random chance
2. Null hypothesis states there is no significant difference between control and
treatment, any observed difference is due to sampling or experimental error
3. P-value evaluates how well the sample data supports the argument that the
null hypothesis is true. A low p value suggests you can reject the null hypothesis
4. Confidence interval is a range of values (lower and upper bound) that is
likely to contain an unknown population parameter
5. Analyze the data
significantly positive
significantly negative
inconclusive
flat* (still inconclusive)
(-) 0% (+)
-0.5% | practical significance
Results
+2.5% Revenue [1.9%, 3.1%], p=0.05
6. Draw conclusions
Hypothesis:
Prime users will spend more $ if they can easily narrow
their search results to prime products
1. Validate data
2. Craft a story
3. Evaluate results
a. Arguments in favor and against it
b. Key observations and durable learning
c. Next steps
B
7. Communicate results
“The most exciting phrase to hear
in science, the one that heralds
new discoveries, is not ‘Eureka‘ but
‘That’s funny…’”
Isaac Asimov
Best practicesfrom my time at Amazon and at Google
Choose the right metrics
1. Think both short-term and long-term
2. Use metrics that matter
3. Align on the success metrics beyond your
own team
Be a good wannabe scientist
1. The scientific method is not a suggestion
2. Be suspicious if you didn’t predict a specific
result in advance
3. The more you slice and dice your data, the
more false positives you’ll get
4. Lean against rolling out flat experiments,
unless there are valid reasons
Create and follow templates and processes
1. Setup an intake process to get ideas from
everyone
2. Establish a pre and post-experiment design
template
3. Document all learnings and make them widely
available
Thank you
“Somewhere, something
incredible is waiting to
be known”
Carl Sagan
www.productschool.com
Part-time Product Management, Coding, Data Analytics, Digital
Marketing, UX Design, Product Leadership courses and
Corporate Training

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Customer Centric & Hypothesis Driven Innovation by Cruise VP of Product Engin...
Customer Centric & Hypothesis Driven Innovation by Cruise VP of Product Engin...Customer Centric & Hypothesis Driven Innovation by Cruise VP of Product Engin...
Customer Centric & Hypothesis Driven Innovation by Cruise VP of Product Engin...
 
Managing an Experimentation Platform by LinkedIn Product Leader
Managing an Experimentation Platform by LinkedIn Product LeaderManaging an Experimentation Platform by LinkedIn Product Leader
Managing an Experimentation Platform by LinkedIn Product Leader
 
Psychological Safety to Build Great Products by King SVP of Product.pdf
Psychological Safety to Build Great Products by King SVP of Product.pdfPsychological Safety to Build Great Products by King SVP of Product.pdf
Psychological Safety to Build Great Products by King SVP of Product.pdf
 
Designers and Product Managers_ Leveling Up Product Development and Each Othe...
Designers and Product Managers_ Leveling Up Product Development and Each Othe...Designers and Product Managers_ Leveling Up Product Development and Each Othe...
Designers and Product Managers_ Leveling Up Product Development and Each Othe...
 
Controlled Experimentation aka A/B Testing for PMs by Tinder Sr PM
Controlled Experimentation aka A/B Testing for PMs by Tinder Sr PMControlled Experimentation aka A/B Testing for PMs by Tinder Sr PM
Controlled Experimentation aka A/B Testing for PMs by Tinder Sr PM
 
Building Better Tech: The Product Manager's Role in Infrastructure & Platform...
Building Better Tech: The Product Manager's Role in Infrastructure & Platform...Building Better Tech: The Product Manager's Role in Infrastructure & Platform...
Building Better Tech: The Product Manager's Role in Infrastructure & Platform...
 
Product Leadership - from FAANG to Traditional Media by The New York Times SV...
Product Leadership - from FAANG to Traditional Media by The New York Times SV...Product Leadership - from FAANG to Traditional Media by The New York Times SV...
Product Leadership - from FAANG to Traditional Media by The New York Times SV...
 
The Future of Product
The Future of ProductThe Future of Product
The Future of Product
 
How to Master Product-Led Growth Strategy in B2B by Gainsight CTO
How to Master Product-Led Growth Strategy in B2B by Gainsight CTOHow to Master Product-Led Growth Strategy in B2B by Gainsight CTO
How to Master Product-Led Growth Strategy in B2B by Gainsight CTO
 
How to Get Promoted and Stand Out from Your Peers by Match fmr VP of Product.pdf
How to Get Promoted and Stand Out from Your Peers by Match fmr VP of Product.pdfHow to Get Promoted and Stand Out from Your Peers by Match fmr VP of Product.pdf
How to Get Promoted and Stand Out from Your Peers by Match fmr VP of Product.pdf
 
Designing Great Products The Power of Design and Leadership
Designing Great Products The Power of Design and LeadershipDesigning Great Products The Power of Design and Leadership
Designing Great Products The Power of Design and Leadership
 
How to Build a Robust Product Roadmap by Salesforce VP of Product
How to Build a Robust Product Roadmap by Salesforce VP of ProductHow to Build a Robust Product Roadmap by Salesforce VP of Product
How to Build a Robust Product Roadmap by Salesforce VP of Product
 
Product Discovery At Google
Product Discovery At GoogleProduct Discovery At Google
Product Discovery At Google
 
Beyond the Cart: Unleashing AI Wonders with Instacart’s Shopping Revolution
Beyond the Cart: Unleashing AI Wonders with Instacart’s Shopping RevolutionBeyond the Cart: Unleashing AI Wonders with Instacart’s Shopping Revolution
Beyond the Cart: Unleashing AI Wonders with Instacart’s Shopping Revolution
 
The Future of Product Management by Product School Founder & CEO
The Future of Product Management by Product School Founder & CEOThe Future of Product Management by Product School Founder & CEO
The Future of Product Management by Product School Founder & CEO
 
Building AI products by Google Group Product Manager.pdf
Building AI products by Google Group Product Manager.pdfBuilding AI products by Google Group Product Manager.pdf
Building AI products by Google Group Product Manager.pdf
 
SEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and BeyondSEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and Beyond
 
Growing as a PM in the Course of Your Career by Google PM Director
Growing as a PM in the Course of Your Career by Google PM DirectorGrowing as a PM in the Course of Your Career by Google PM Director
Growing as a PM in the Course of Your Career by Google PM Director
 
Customer-Centric PM: Anticipating Needs Across the Product Life Cycle
Customer-Centric PM: Anticipating Needs Across the Product Life CycleCustomer-Centric PM: Anticipating Needs Across the Product Life Cycle
Customer-Centric PM: Anticipating Needs Across the Product Life Cycle
 
Taking Your Product From 0 to 100 by Facebook Product Manager
Taking Your Product From 0 to 100 by Facebook Product ManagerTaking Your Product From 0 to 100 by Facebook Product Manager
Taking Your Product From 0 to 100 by Facebook Product Manager
 

Similar a The Scientific Method of Experimentation by Google PM

How to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your BusinessHow to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your Business
Kissmetrics on SlideShare
 

Similar a The Scientific Method of Experimentation by Google PM (20)

How to Correctly Use Experimentation in PM by Google PM
How to Correctly Use Experimentation in PM by Google PMHow to Correctly Use Experimentation in PM by Google PM
How to Correctly Use Experimentation in PM by Google PM
 
Creating a culture that provokes failure and boosts improvement
Creating a culture that provokes failure and boosts improvementCreating a culture that provokes failure and boosts improvement
Creating a culture that provokes failure and boosts improvement
 
Testing the unknown: the art and science of working with hypothesis
Testing the unknown: the art and science of working with hypothesisTesting the unknown: the art and science of working with hypothesis
Testing the unknown: the art and science of working with hypothesis
 
Organizing Your First Website Usability Test - Cornell Drupal Camp 2016 - part 4
Organizing Your First Website Usability Test - Cornell Drupal Camp 2016 - part 4Organizing Your First Website Usability Test - Cornell Drupal Camp 2016 - part 4
Organizing Your First Website Usability Test - Cornell Drupal Camp 2016 - part 4
 
Better Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data DecisionsBetter Living Through Analytics - Strategies for Data Decisions
Better Living Through Analytics - Strategies for Data Decisions
 
User Research to Validate Product Ideas Workshop
User Research to Validate Product Ideas WorkshopUser Research to Validate Product Ideas Workshop
User Research to Validate Product Ideas Workshop
 
Rapid Prototyping
Rapid PrototypingRapid Prototyping
Rapid Prototyping
 
Organizing Your First Website Usability Test - WordCamp Toronto 2016
Organizing Your First Website Usability Test - WordCamp Toronto 2016Organizing Your First Website Usability Test - WordCamp Toronto 2016
Organizing Your First Website Usability Test - WordCamp Toronto 2016
 
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, NetflixUX STRAT Online 2020: Dr. Martin Tingley, Netflix
UX STRAT Online 2020: Dr. Martin Tingley, Netflix
 
Usability Testing - 10 Tips For Getting It Right
Usability Testing - 10 Tips For Getting It Right Usability Testing - 10 Tips For Getting It Right
Usability Testing - 10 Tips For Getting It Right
 
[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep Laja[CXL Live 16] Opening Keynote by Peep Laja
[CXL Live 16] Opening Keynote by Peep Laja
 
Users are Losers! They’ll Like Whatever we Make! and Other Fallacies.
Users are Losers! They’ll Like Whatever we Make! and Other Fallacies.Users are Losers! They’ll Like Whatever we Make! and Other Fallacies.
Users are Losers! They’ll Like Whatever we Make! and Other Fallacies.
 
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-DrivenWeapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
 
AI x Design_Haverinen 2023.pdf
AI x Design_Haverinen 2023.pdfAI x Design_Haverinen 2023.pdf
AI x Design_Haverinen 2023.pdf
 
Fail Well, Pivot Fast: Product Experimentation for Continuous Discovery
Fail Well, Pivot Fast: Product Experimentation for Continuous DiscoveryFail Well, Pivot Fast: Product Experimentation for Continuous Discovery
Fail Well, Pivot Fast: Product Experimentation for Continuous Discovery
 
How to Use User Science to Your Product's Benefit by XO Group PM
How to Use User Science to Your Product's Benefit by XO Group PMHow to Use User Science to Your Product's Benefit by XO Group PM
How to Use User Science to Your Product's Benefit by XO Group PM
 
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfResearch and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
 
Presentation: Philips
Presentation: PhilipsPresentation: Philips
Presentation: Philips
 
EIA2019HK - Prototyping and Design Hacks - Alar Kolk
EIA2019HK - Prototyping and Design Hacks - Alar KolkEIA2019HK - Prototyping and Design Hacks - Alar Kolk
EIA2019HK - Prototyping and Design Hacks - Alar Kolk
 
How to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your BusinessHow to Use Data to Inform Your Design and Drive Your Business
How to Use Data to Inform Your Design and Drive Your Business
 

Más de Product School

Más de Product School (20)

Webinar: The Art of Prioritizing Your Product Roadmap by AWS Sr PM - Tech
Webinar: The Art of Prioritizing Your Product Roadmap by AWS Sr PM - TechWebinar: The Art of Prioritizing Your Product Roadmap by AWS Sr PM - Tech
Webinar: The Art of Prioritizing Your Product Roadmap by AWS Sr PM - Tech
 
Harnessing the Power of GenAI for Exceptional Product Outcomes by Booking.com...
Harnessing the Power of GenAI for Exceptional Product Outcomes by Booking.com...Harnessing the Power of GenAI for Exceptional Product Outcomes by Booking.com...
Harnessing the Power of GenAI for Exceptional Product Outcomes by Booking.com...
 
Relationship Counselling: From Disjointed Features to Product-First Thinking ...
Relationship Counselling: From Disjointed Features to Product-First Thinking ...Relationship Counselling: From Disjointed Features to Product-First Thinking ...
Relationship Counselling: From Disjointed Features to Product-First Thinking ...
 
Launching New Products In Companies Where It Matters Most by Product Director...
Launching New Products In Companies Where It Matters Most by Product Director...Launching New Products In Companies Where It Matters Most by Product Director...
Launching New Products In Companies Where It Matters Most by Product Director...
 
Cultivating Entrepreneurial Mindset in Product Management: Strategies for Suc...
Cultivating Entrepreneurial Mindset in Product Management: Strategies for Suc...Cultivating Entrepreneurial Mindset in Product Management: Strategies for Suc...
Cultivating Entrepreneurial Mindset in Product Management: Strategies for Suc...
 
Revolutionizing The Banking Industry: The Monzo Way by CPO, Monzo
Revolutionizing The Banking Industry: The Monzo Way by CPO, MonzoRevolutionizing The Banking Industry: The Monzo Way by CPO, Monzo
Revolutionizing The Banking Industry: The Monzo Way by CPO, Monzo
 
Synergy in Leadership and Product Excellence: A Blueprint for Growth by CPO, ...
Synergy in Leadership and Product Excellence: A Blueprint for Growth by CPO, ...Synergy in Leadership and Product Excellence: A Blueprint for Growth by CPO, ...
Synergy in Leadership and Product Excellence: A Blueprint for Growth by CPO, ...
 
Act Like an Owner, Challenge Like a VC by former CPO, Tripadvisor
Act Like an Owner,  Challenge Like a VC by former CPO, TripadvisorAct Like an Owner,  Challenge Like a VC by former CPO, Tripadvisor
Act Like an Owner, Challenge Like a VC by former CPO, Tripadvisor
 
The Future of Product, by Founder & CEO, Product School
The Future of Product, by Founder & CEO, Product SchoolThe Future of Product, by Founder & CEO, Product School
The Future of Product, by Founder & CEO, Product School
 
Webinar How PMs Use AI to 10X Their Productivity by Product School EiR.pdf
Webinar How PMs Use AI to 10X Their Productivity by Product School EiR.pdfWebinar How PMs Use AI to 10X Their Productivity by Product School EiR.pdf
Webinar How PMs Use AI to 10X Their Productivity by Product School EiR.pdf
 
Webinar: Using GenAI for Increasing Productivity in PM by Amazon PM Leader
Webinar: Using GenAI for Increasing Productivity in PM by Amazon PM LeaderWebinar: Using GenAI for Increasing Productivity in PM by Amazon PM Leader
Webinar: Using GenAI for Increasing Productivity in PM by Amazon PM Leader
 
Unlocking High-Performance Product Teams by former Meta Global PMM
Unlocking High-Performance Product Teams by former Meta Global PMMUnlocking High-Performance Product Teams by former Meta Global PMM
Unlocking High-Performance Product Teams by former Meta Global PMM
 
The Types of TPM Content Roles by Facebook product Leader
The Types of TPM Content Roles by Facebook product LeaderThe Types of TPM Content Roles by Facebook product Leader
The Types of TPM Content Roles by Facebook product Leader
 
Match Is the New Sell in The Digital World by Amazon Product leader
Match Is the New Sell in The Digital World by Amazon Product leaderMatch Is the New Sell in The Digital World by Amazon Product leader
Match Is the New Sell in The Digital World by Amazon Product leader
 
Command the Room: Empower Your Team of Product Managers with Effective Commun...
Command the Room: Empower Your Team of Product Managers with Effective Commun...Command the Room: Empower Your Team of Product Managers with Effective Commun...
Command the Room: Empower Your Team of Product Managers with Effective Commun...
 
Metrics That Matter: Bridging User Needs and Board Priorities for Business Su...
Metrics That Matter: Bridging User Needs and Board Priorities for Business Su...Metrics That Matter: Bridging User Needs and Board Priorities for Business Su...
Metrics That Matter: Bridging User Needs and Board Priorities for Business Su...
 
AI in Action The New Age of Intelligent Products and Sales Automation
AI in Action The New Age of Intelligent Products and Sales AutomationAI in Action The New Age of Intelligent Products and Sales Automation
AI in Action The New Age of Intelligent Products and Sales Automation
 
Cracking the Product Sense Interview by TikTok Product Leader.pdf
Cracking the Product Sense Interview by TikTok Product Leader.pdfCracking the Product Sense Interview by TikTok Product Leader.pdf
Cracking the Product Sense Interview by TikTok Product Leader.pdf
 
The Future of Product
The Future of ProductThe Future of Product
The Future of Product
 
Polymathic Product Managers
Polymathic Product ManagersPolymathic Product Managers
Polymathic Product Managers
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

The Scientific Method of Experimentation by Google PM

  • 1. www.productschool.com The Scientific Method of Experimentation by Google PM
  • 2. Join 35,000+Product Managers on Free Resources Discover great job opportunities Job Portal prdct.school/PSJobPortalprdct.school/events-slack
  • 3. C O U R S E S Product Management Learn the skills you need to land a Product Manager job
  • 4. C O U R S E S Coding for Managers Build a website and gain the technical knowledge to lead software engineers
  • 5. C O U R S E S Data Analytics for Managers Learn the skills to understand web analytics, SQL and machine learning concepts
  • 6. C O U R S E S Learn how to acquire more users and convert them into clients Digital Marketing for Managers
  • 7. C O U R S E S UX Design for Managers Gain a deeper understanding of your users and deliver an exceptional end-to- end experience
  • 8. C O U R S E S For experienced Product Managers looking to gain strategic skills needed for top leadership roles Product Leadership
  • 9. C O U R S E S Corporate Training Level up your team’s Product Management skills
  • 10. Ruben Lozano T O N I G H T ’ S S P E A K E R
  • 11. The scientific method of experimentation Seattle, WA | June 25, 2019
  • 12. A/B
  • 14. What is user research? Systematic approach to discovering users' aspirations, goals, tasks, needs, pain points, and information and interaction requirements. User research grounds, verifies, and validates what a team builds.
  • 15. Where does user research fit to product? Iterative research Foundational research Evaluative research PRODUCT DEVELOPMENT
  • 16. Dimensions of user research methods Context Natural or near-natural Scripted Not using the product A hybrid of the above Attitudinal what people say Behavioral what people do 1 Qualitative answers why Quantitative answers how much/many 2 3 Source: https://www.nngroup.com/articles/which-ux-research-methods/ Experiments
  • 17. Experiments 101 What is an experiment?
  • 18. What is an experiment? An experiment is a way to test a hypothesis about the product. An experiment may also refer to the gradual launch of a new feature. LIVE EVAL Note: Tests, while they are an important part of the software development journey, are not experiments, since you know in advance the result you expect
  • 20. I’m a PM. I know what will happen. Humans are terrible at making predictions 1. Hindsight bias 2. Observational selection bias 3. Projection bias 4. Anchoring bias … and hundreds of cognitive biases...
  • 21. Doing a pre/post analysis is enough Brazil Search Traffic June 2014
  • 22. A/B isolates the impact of just the product changes
  • 23. Is something a good idea? A B ORGANIC HONEYCRISP APPLE GRANNY SMITH APPLE FUJI APPLE ORGANIC HONEYCRISP APPLE GRANNY SMITH APPLE
  • 25. “I suppose it is tempting, if the only tool you have is a hammer, to treat everything as if it were a nail.‘” Abraham Maslow
  • 27. Fundamentals of experiment design The scientific method is an empirical method of acquiring knowledge. It is the systematic observation, measurement, and experimentation of a hypothesis. Observation1 Hypothesis2 Design3 Experiment4 Analysis5 Prove/Reject6
  • 28. PM flavor of scientific method Observation1 Hypothesis2 Design3 Experiment4 Analysis5 Prove/Reject6 Ask a question0 Communicate results7
  • 29. 0. Ask a question How can I increase usage of my product? How can I increase revenue attributed to my product? How can I increase user happiness? How can I simplify code without changing metrics? How can I affect click behavior?
  • 30. 1. Observation: do background research What others have done before Are you doing something different? Did something change since the previous attempt? Quantitative data Behavioral metrics Surveys Trends Qualitative data Perceptions Attitudes Assumptions Preferences
  • 31. 2. Develop a hypothesis A (1) testable (2) explanation for a phenomenon. The goal of an experiment is to prove or disprove the hypothesis. AVOID running experiments to see what happens or to gather data with no hypothesis. Use other user research methods and have a POV.
  • 32. 2. Develop a hypothesis Example 1. Ask a question a. How can I increase sales for Prime users on the mobile app? 2. Do background research a. Users had troubles finding filters on mobile b. Users get overwhelmed with too many results c. Decreasing options simplifies decision-making d. BUT, past experiments limiting results had negative results
  • 33. 2. Develop a hypothesis Hypothesis: Prime users will spend more $ if they can easily narrow their search results to prime products Is it valid? ● Is it testable? ● Does it have an explanation? ● Do I have an educated guess?
  • 34. 3. Design experiment Hypothesis: Prime users will spend more $ if they can easily narrow their search results to prime products Design experiment: 1. Show a prime toggle on the navigation bar for all US prime users on the iOS app 2. Toggle off by default 3. No changes to a. Backend algorithms b. Logic that decides when to enable the prime filter c. Current prime filter behind the filter button
  • 35. 3. Design experiment BTriggering criteria ● Who: US prime users using iOS app ● When: If results include a prime product ● How: Session-based Duration ● 2 weeks Launch criteria (success metric) ● Statistically significant increase in revenue ● No increase in latency
  • 37. 5. Analyze the data BResults +2.5% Revenue [1.9%, 3.1%], p=0.05
  • 38. 5. Analyze the data 1. Statistical significance is the likelihood that the numeric difference between a control and treatment outcome is not due to random chance 2. Null hypothesis states there is no significant difference between control and treatment, any observed difference is due to sampling or experimental error 3. P-value evaluates how well the sample data supports the argument that the null hypothesis is true. A low p value suggests you can reject the null hypothesis 4. Confidence interval is a range of values (lower and upper bound) that is likely to contain an unknown population parameter
  • 39. 5. Analyze the data significantly positive significantly negative inconclusive flat* (still inconclusive) (-) 0% (+) -0.5% | practical significance Results +2.5% Revenue [1.9%, 3.1%], p=0.05
  • 40. 6. Draw conclusions Hypothesis: Prime users will spend more $ if they can easily narrow their search results to prime products 1. Validate data 2. Craft a story 3. Evaluate results a. Arguments in favor and against it b. Key observations and durable learning c. Next steps B
  • 42. “The most exciting phrase to hear in science, the one that heralds new discoveries, is not ‘Eureka‘ but ‘That’s funny…’” Isaac Asimov
  • 43. Best practicesfrom my time at Amazon and at Google
  • 44. Choose the right metrics 1. Think both short-term and long-term 2. Use metrics that matter 3. Align on the success metrics beyond your own team
  • 45. Be a good wannabe scientist 1. The scientific method is not a suggestion 2. Be suspicious if you didn’t predict a specific result in advance 3. The more you slice and dice your data, the more false positives you’ll get 4. Lean against rolling out flat experiments, unless there are valid reasons
  • 46. Create and follow templates and processes 1. Setup an intake process to get ideas from everyone 2. Establish a pre and post-experiment design template 3. Document all learnings and make them widely available
  • 47. Thank you “Somewhere, something incredible is waiting to be known” Carl Sagan
  • 48. www.productschool.com Part-time Product Management, Coding, Data Analytics, Digital Marketing, UX Design, Product Leadership courses and Corporate Training

Notas del editor

  1. "As you checked in we sent you an email to join our online communities, events, and to apply for product management jobs. As members of the Product School community we'd like to provide you with these resources at your disposal."
  2. Hello everyone, it’s a pleasure to be here. My name is Ruben Lozano and I’m a Product Manager at Google Maps. Before Maps, I was a PM at Google Cloud, Amazon, and Microsoft. And today, I want to talk to you about using the scientific method when conducting experiments as product managers
  3. From my experience, when people talk about conducting experiments in tech--they talk about A/B testing. For the few of you who may not be familiar with A/B testing, at its most basic, it is a way to compare two versions of something to figure out which of the two performs better. There are other more advanced methodologies of experimentation, like Multivariate Testing or Multi-armed Bandit, but I won’t be covering them during this presentation. But in general, experiments are one of many methods withini your product management toolkit to conduct user research when building products.
  4. That is why I want to briefly talk to you about user research So you understand when is good a idea to use experiments compared to other research methodologies.
  5. User research is a systematic approach to discovering users’ aspirations, goals, tasks, needs, paint points---you name it. To me, it is that magical component that helps you ground, verify, and validate what you and your team build.
  6. Research fits in every phase of product development For example, foundational research usually starts before design and development; but I encourage you to use it even after your product has been launched. Examples of foundational research are diary or ethnographic studies, these help you build empathy towards people, uncover opportunities, and inform your overall product strategy and direction. Iterative research is commonly used when you have already identified the problem you want to solve, and you may want to conduct an in-lab usability study to gather user input to direct which path your solution should focus on. Experimentation, fits into evaluative research. In other words, you use it when your product is done or almost done, and you want to improve it.
  7. Experiments will provide you rich data--but not in every dimension. Experiments will provide you “Behavioral” data, in other words--what people do. Experiments will not provide you attitudinal data--like how people feel, what they want, or their aspiration. Experiments will provide you “Quantitative” data, in other words--they will answer “how much” or “how many”---but not exactly “why” users do or what they like. And finally, Experiments will provide you data from a Natural or near-natural context. In other words, you need to have a product already in the wild to collect this data accurately
  8. With that in mind, let’s define an experiment.
  9. An experiment is a way to test a hypothesis about your product. At Google or Amazon, experiments may also refer to the gradual launch of a new feature. For this talk, I would only focus on the first ones, the live experiments. It is important to note that Tests are not experiments, as in tests, you know in advance the result you expect.
  10. So why run live experiments? Most of the time, you already built the feature. You did user research, you conducted usability studies.
  11. You are the PM--you are smart. You know what will happen, right? But let me tell you--humans are terrible at making predictions. Too soon? I know. The worst part is that our own mind tricks us with multiple cognitive biases. For example, hindsight bias. I am confident you, or many people you know, say that they deeply knew the results of the 2016 election. So they feel they are good at making predictions, but they are not. We are not. The same happens with product. We don’t always know.
  12. So what about a pre/post analysis? You already built the feature. Launch it and see what happens. But the world is complicated. Let me give you an example. This graph roughly shows Google Search traffic over time The Google Search team released a feature right when you see a big drop in Searches. Just by looking at pre/post, the team should have been concerned--but they were not. Why? Let me give you a hint. These data comes from Brazil in June 2014. Any ideas? Yes. The World Cup. People were not searching, they were watching soccer--it was not your feature. Thank you A/B experiments.
  13. This is the beauty of A/B testing. It isolates the impact of just the product changes you deploy.
  14. Experiments help you understand if something is a good idea. For example, you decide to add images to your search results. It seems like a better UX, people like images. Buf if you think deeply--will it be better? What if the site gets slower, what if you show less results on the same screen space, what if the most relevant result doesn’t have an image? Not that straightforward--but If you do an A/B test, you could measure its impact. A/B tests are very useful, they can help you Iterate on a good idea Remove features from your product Measure impact of changes. At some point, you may even feel they are magical.
  15. But it’s not true. They are not magical. The A/B test concept is very easy to understand and there are tools that make it easy to implement. Ergo, they are overused and used incorrectly.
  16. And as Maslow wisely said: “if the only tool you have is a hammer, you will treat everything as if it were a nail.”
  17. This is when we bring science. Conducting experiments means doing science--and science follow a very strict methodology. If you don’t, you are doing pseudoscience. Not sure about you, but I don’t trust pseudoscience--not even “directionally” or as a “better than nothing” outcome.
  18. To conduct a sound experiment, we should follow the scientific method. Yes, the one you learned long time ago. It follows 6 steps Observer the world Formulate a hypothesis Design an experiment Run an experiment Analyze that experiment And prove or reject your hypothesis
  19. For product management, it is basically the same. I would just add two steps. First, you may have a specific question you want to answer And last, you should invest on communicating your results.
  20. Let’s start by asking questions And these questions could be like--how can I increase revenue or usage of my product Or something more philosophical--like---how can I increase happiness or make people love my product?
  21. Then, you move to the observation step, do background research. First, look at what others have done before, when, and why--has something changed--should we try it again? Then, look at quantitative and qualitative data from all those user research methods you conducted. What can you learn about your product?
  22. And after that, you develop a hypothesis A hypothesis is a testable explanation for a phenomenon. It has two parts Testable: you should be able to measure it Explanation: you should have a story to explains it Before you run an experiment, you actually need to have an educated guess of what you think it will happen. This is required because the goal of an experiment is to prove or disprove a hypothesis.
  23. As an example, let’s use an experiment I conducted at Amazon as the PM of the mobile app. I asked the question: How can I increase sales of Prime users on the mobile app? I did background research: I found through different data sources that users get overwhelmed with too many results, users had troubles finding filters on mobile, but also that experiments that limited number of results had negative results, and overall psychological research on how decreasing options helps decision-making So… let’s try to develop a hypothesis
  24. My hypothesis is that Prime users will spend more $ if they can easily narrow their search results to prime products Let’s check they hypothesis Is it testable? Yes, I can measure changes in revenue Does it have an explanation? Yes, I am saying the change will happen because Prime users will be able to easily narrow their search results to prime products Do I have an educated guess? Yes, I am saying revenue will increase
  25. Based on that hypothesis, I designed the experiment in this way Show a prime toggle on the navigation bar for all US prime users on the iOS app No changes to algorithms No changes to when the prime filter is enabled No changes to the prime filter within the filter menu Toggle off by default
  26. Then, you define the triggering of your experiment Who is going to see it: US primer users using iOS app When: When results include a prime product How: Session-based--it means each session is a data point Duratios is two weeks. In most consumer products, you test weekly, as user behavior between a Tuesday and a Sunday are drastically different. Be careful which 2 weeks--avoid experimenting on holidays or on anything that could disrupt regular user behavior Think if the 2 first weeks are actually the best. Sometimes you could have features with a novelty effect--in other words, their impact can wear off over time; or with a learnability effect--users require time to adapt Launch criteria This is when you define your success metric I won’t go over details here because it could be its own session But whatever you decide on your experiment duration or launch criteria, don’t change it after the experiment starts. Why? To prevent data manipulation or the perception of data manipulation. Many experiment owners will be tempted to stop or keep running an experiment, or change the narrative of success, to fit their own agenda.
  27. So you run the experiment. Here you see the only difference.
  28. Two weeks pass. You get the results, and they look something like that Increase of +2.5% in revenue with a confidence interval from 1.9% to 3.2% and a p-value of 0.02 So--how should you read this?
  29. There are four concepts that are important to understand. Statistical significance. That is the likelihood that the numeric difference between a control and treatment outcome is not due to random chance. In other words, most of the times you want them to be statistically significant. Null hypothesis. The null hypothesis says that there is no significant difference between control and treatment, and that any observed difference is due to sampling or experimental error. In other words, most of the times, you want to reject the null hypothesis--as you expect a difference between control and treatment. P-value evaluates how well the sample data supports the argument that the null hypothesis is true. A low p value suggests you can reject the null hypothesis. In other words, most of the times, you want a low p-value Confidence interval is a range of values (lower and upper bound) that is likely to contain an unknown population parameter
  30. If we look at our data, we will see that our result is significantly positive as the confidence interval is on the positive side. So you can make the assumption that the metric will increase If the full confidence interval were in the negative side, you could make the assumption that the metric will decrease If the confidence interval crosses the zero, you don’t have enough data to know if your metric will increase or decrease. And finally, there is an inconclusive result known as “flat” where the lower bound of your confidence interval is above a threshold called “practical significance”. Let’s say, you put that threshold at -0.5%. This means that you are ok losing up to 0.5% of revenue when launching your feature product. Put it in another way. “Do no harm” experiments do not exist--you just need to define how much harm you are ok with. Also, “Leaning positive” or “Leaning negative” outcomes do not exist. If you hear someone using them, make sure they take some statistic courses.
  31. And finally, it’s time to draw conclusions First, validate the data. Do the numbers seem off, or are they too good to be true? Then, craft a story. Use the experiment data but also, all your previous data. Does it make sense? Does it approve or reject your hypothesis? And write it down. I recommend to Write arguments in favor or against launching the feature based on your pre-define launch criteria and other metrics you were tracking. Record any observations and learnings Write down next steps? Will you do another iteration, will you expand to other markets?
  32. And after you capture everything--share it. Share successful and failed experiments. Not only because sharing is caring--but because these insights are very helpful. Even to people who were not involved at all in the experiment.
  33. And as one of my heroes would say, Isaac Asimov, “The most exciting phrase to hear in science, the one that heralds new discoveries, is not ‘Eureka‘ but ‘That’s funny…’”