SlideShare una empresa de Scribd logo
1 de 29
Descargar para leer sin conexión
Backstage to Data Driven
Culture
Success with an
Agile Data Science
Stack
Big Data LA Day 2016
Pauline Chow
2
So, You are the First Data
Scientist…?
WORLDWIDE BUSINESS BUSINESS TO GO CREATIVE SOLUTIONS
WORLDWIDE BUSINESS BUSINESS TO GO CREATIVE SOLUTIONS
What my Friends Think I Do What my Mom Thinks I Do What Society Thinks I Do
What my Boss Think I Do What I Think I Do What I Actually Do
Misconceptions about Data Scientists
3
4
So, You are the First or Lead Data
Scientist…?
Open Source
& New Tools
Profits Steady ,
Adding Products
Report to VP
Marketing
Non Technical
Culture
First Data
Scientist
What does the organization do
best? How does it relate to
data and technology?
What is the business
core competencies?
What are existing tools,
processes, and code? Do you
have a budget for new tools and
resources?
What Tools are
Available ?
This is both a team members
and expectations related
question.
Where is your Team?
What is the mood of the
organization? How are they
solving problems? Why are they
adding DS/A into the
organization?
What is the State of
the Organization?
Who are the stakeholders?
How is data able to contribute
to their goals and
expectations?
Who has the
Influence On the
Roadmap?
Context for Presentation
Case Study: Startup in Digital Media
5
Effectively
Implement
Solutions
Maximize
Impact &
Commun-
ication
Set a Blueprint that
promotes flexibility,
iteration, and
scalability. It facilities
agile-oriented
mindsets for data
practices and it crucial
for implementation.
Build a Roadmap
from Blueprint to
shape data practices
and implement goals
from stakeholders,
company, as well as
strong DS/A
foundations.
Develop key
qualitative and
quantitative
milestones.
Communicate
consistently and
frequently to the
organization.
Influence
Expectations
Influence from both
angles, yours and
stakeholders
expectations. Find
explicit and implicit
goals and bridge the
gaps that you find.
6
Key Drivers Integrating Data Culture
Create an
Agile Data
Science
Stack
Non-technical focused
Actively
Listen
Implement
Explore Collaborate
Influence Grow
Guiding Verbs for “First” Data Scientist
7
In no particular order
ACTIVE LISTENING:
What Are you Trying to Hear?
Explicit Goals & Expectations
Structured, straight-forward, logical, and safe
inquiries
Document, share, and openly discuss with team
members and stakeholders.
Jungwoo Hong @ Unsplash
Implicit Goals & Expectations
Thom @ Unsplash
IMPLEMENT:
HOW TO APPROACH YOUR
BLUEPRINT FOR DATA
DRIVEN-INFORMED
CULTURE?
Architecture
First
Process
First
12
STACK AGILE APPROACHES
Anthony Delanoix @ Unsplash Jeff Sheldon @ Unsplash
Blueprint approach from infrastructure perspective
AGILE BY ARCHITECTURE
13
Customize as the team grows
SaaS & PaaS Integration
14
IDENTIFY
BUILD SYS &
MODELS
- Select Appropriate Models
- Build Models and Pipelines
for Scalability
- Evaluate and refine Models
ACQUIRE
DATA
- Identify the “right” source
- Import data and set up
remote / local storage
- Determine tools to work
with selected sources
CREATE PROBLEM
STATEMENT
- Identify business, data,
product objectives
- Brainstorm potential
solutions
- Create questions and
identify people/stakeholders
to help
PARSE & MINE DATA
- Determine distribution of
data and necessary
transformations
- Format, clean, splice, etc
- Create new derived data
PRESENT RESULTS
- Summarize Findings
- Add Storytelling aspects
- Identify next questions
and additional analysis
- For teams and
stakeholders
15
AGILE BY PROCESS
Blueprint approach from workflow perspective
ACQUIRE PARSE & MINE PRESENTBUILD DEPLOY
IDENTIFY
BUILD SYS &
MODELS + DEPLOY
Leverage platforms that document
models, pipelines, and feature
iterations. Collaboration is a plus.
-  Sklearn pipelines
-  DS/ML platforms: Yhat,
domino labs, anaconda
ACQUIRE DATA
Curate data from existing sources that
is cleaned, reliable, and automated,
where ETL can be skipped
-  Segement.io
-  Zapier
-  CrowdFlower
-  Open Data
CREATE PROBLEM
STATEMENT
Keep most attributes of
this section in-house and
within your team
PARSE & MINE DATA
For the data that cannot be
automated or acquired
cleanly, sklearn pipelines or
open source Luigi
(Spotify) or airflow
(AirBNB) can mitigate this
process.
PRESENT RESULTS
Adopt platforms that allow for
iterations and data mining/
parsing process to feed into
reports and presentations
-  Ipython Jupyter
Notebooks
-  Dashboards: Looker,
RJMetrics, Tableau
16
SaaS & PaaS Integration
Customize as the Process Increases in Complexity
ACQUIRE PARSE & MINE PRESENTBUILD DEPLOY
COLLABORATE:
What Metrics to Emphasize for
Teamwork?
Burn Rate
Most companies do not widely
broadcast but transparency can put
decisions into perspective for the
organization. Time and urgency can
also be of the essence.
Customer
Acquisition
Cost (CAC)
Illustrates market competitiveness
with your products, services, and
market saturation. Social media ad
platforms can make up a large portion
of these costs.
Gross
Profit &
Revenue
Actual revenue & profit after
expenses, investors, and
ongoing costs. If the business
model and product are viable
then the company will be able
to stand on its own without
external capital.
Active Users
Measure the ongoing stickiness
of a service or product. Clearly
define “active” to not
overcompensate first-time, new,
and experimental users. Can
the company move beyond
early adopters and fans?
Churn Rate &
Retention
How many people are leaving or
become inactive after a certain
period of time? When in the
customer’s lifetime is churn more
likely to occur? The higher the
expected churn rate, then the
more the company has to spend
on acquiring new customers.
Cumulative
Growth
Cumulative growth puts a long
term and sustainable
perspective to just month over
month growth. Short-term
growth can unabashedly take
over and cause decision
makers to lose sight of an
organization’s mission and
goals.
Response
Time
The amount of time teams take
to respond and complete tasks,
which includes bug fixes,
technological improvements,
product upgades, and customer
service. Responsiveness
demonstrates staff and team
dedication, effective allocation of
resources, operational
effectiveness, and no tech debt.
Customer
LIfetime
Value (CLV)
Total dollars from a customer
during the lifetime relationship
with that customer. Intersection
of frequency of customer
purchases, revenue per
customer, acquisition costs.
This measure can have
predictive qualities
INFLUENCE
How to align and connect
goals and expectations?
"Leadership is the art of giving people
a platform for spreading ideas that
work."
-Seth Godin
23
Evaluate milestones,
iterate and grow
Month 12
Blueprint for Agile
Data Science and
Analytics Stack
Day 30
Establish clear
measures for success
as widespread as
possible
Day 90
Good first
impressions. Listen
and Learn!
Day 1
Celebrate improvements
to workflow,
effectiveness, and
access
Day 60
Democratize data
access and streamline
measures to external
and internal teams
Month 6
Communicate, Strategize, Communicate...
Connect the Dots
24
Anything Else Reporting &
Urgent
Requests
Data
Acquisition,
Cleaning
Exploration &
Analysis,
Reports, &
Presentation
20% 80% 80% 20%
25
Allocate Time & Resources Effectively
Business as Usual Allocation New Data Science Allocation
GROW YOUR TEAM
When to increase the ability and
capabilities of your team?
Technical Project
Manager
Data Scientist
Data Engineer
Data Engineer
Analyst
Researcher
Team Members
6
1
2
5Central to the ability to
juggle and balance
responsibility of being the
first/lead data scientist.
Agile Data Science
& Analytics Stack
3
4
Active
Listeni
ng
Influen
ce
Collabora
te with
Metrics
Explore
Implement
Grow
Actionable Agile DS/A Stack is Key to
Success
28
@DataThinker
WhenThereIsData.com
pauline.chow@gmail.com

Más contenido relacionado

La actualidad más candente

Imarticus Roundtable Analytics Conference Summary
Imarticus Roundtable Analytics Conference SummaryImarticus Roundtable Analytics Conference Summary
Imarticus Roundtable Analytics Conference Summary
Narasimhalu Senthil
 
20130711 - Customer Journey - Oracle - Matthew Banks
20130711 - Customer Journey - Oracle - Matthew Banks20130711 - Customer Journey - Oracle - Matthew Banks
20130711 - Customer Journey - Oracle - Matthew Banks
Werbeplanung.at Summit
 
Collaboration Proposel
Collaboration ProposelCollaboration Proposel
Collaboration Proposel
CLse
 
7 Dimensions of Agile Analytics by Ken Collier
7 Dimensions of Agile Analytics by Ken Collier 7 Dimensions of Agile Analytics by Ken Collier
7 Dimensions of Agile Analytics by Ken Collier
Thoughtworks
 

La actualidad más candente (19)

Thought provoking content and other things
Thought provoking content and other thingsThought provoking content and other things
Thought provoking content and other things
 
Change Management: Expanding OMS Use Within Your Organization
Change Management: Expanding OMS Use Within Your OrganizationChange Management: Expanding OMS Use Within Your Organization
Change Management: Expanding OMS Use Within Your Organization
 
Sharktower: Will AI change the way you manage change?
Sharktower: Will AI change the way you manage change?Sharktower: Will AI change the way you manage change?
Sharktower: Will AI change the way you manage change?
 
Higher Education Computing - Best Practices for Cloud Migration
Higher Education Computing - Best Practices for Cloud MigrationHigher Education Computing - Best Practices for Cloud Migration
Higher Education Computing - Best Practices for Cloud Migration
 
Imarticus Roundtable Analytics Conference Summary
Imarticus Roundtable Analytics Conference SummaryImarticus Roundtable Analytics Conference Summary
Imarticus Roundtable Analytics Conference Summary
 
Digital Employee Experience Breakfast - 28th March Brisbane
Digital Employee Experience Breakfast - 28th March BrisbaneDigital Employee Experience Breakfast - 28th March Brisbane
Digital Employee Experience Breakfast - 28th March Brisbane
 
20130711 - Customer Journey - Oracle - Matthew Banks
20130711 - Customer Journey - Oracle - Matthew Banks20130711 - Customer Journey - Oracle - Matthew Banks
20130711 - Customer Journey - Oracle - Matthew Banks
 
2016 Data Science Salary Survey
2016 Data Science Salary Survey2016 Data Science Salary Survey
2016 Data Science Salary Survey
 
Workfront: 7 Experts on Flawless Campaign Execution
Workfront: 7 Experts on Flawless Campaign ExecutionWorkfront: 7 Experts on Flawless Campaign Execution
Workfront: 7 Experts on Flawless Campaign Execution
 
Mighty Guides- Data Disruption
Mighty Guides- Data Disruption Mighty Guides- Data Disruption
Mighty Guides- Data Disruption
 
Case Study: Analytics at CMC Markets: from measuring clicks to driving business
Case Study: Analytics at CMC Markets: from measuring clicks to driving businessCase Study: Analytics at CMC Markets: from measuring clicks to driving business
Case Study: Analytics at CMC Markets: from measuring clicks to driving business
 
[AIIM16] What Did AIIM16 Mean?
[AIIM16]  What Did AIIM16 Mean?[AIIM16]  What Did AIIM16 Mean?
[AIIM16] What Did AIIM16 Mean?
 
Making sense of BI
Making sense of BIMaking sense of BI
Making sense of BI
 
Mgi the-age-of-analytics-full-report
Mgi the-age-of-analytics-full-reportMgi the-age-of-analytics-full-report
Mgi the-age-of-analytics-full-report
 
Attivio Big Data Survey
Attivio Big Data SurveyAttivio Big Data Survey
Attivio Big Data Survey
 
Organizational self-direction
Organizational self-directionOrganizational self-direction
Organizational self-direction
 
Pro bono OR webinar - Making sense of data
Pro bono OR webinar - Making sense of data Pro bono OR webinar - Making sense of data
Pro bono OR webinar - Making sense of data
 
Collaboration Proposel
Collaboration ProposelCollaboration Proposel
Collaboration Proposel
 
7 Dimensions of Agile Analytics by Ken Collier
7 Dimensions of Agile Analytics by Ken Collier 7 Dimensions of Agile Analytics by Ken Collier
7 Dimensions of Agile Analytics by Ken Collier
 

Similar a Big Data LA 2016: Backstage to a Data Driven Culture

Cost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponderCost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponder
Marshall Sponder
 
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
Julia Grosman
 
Os Nolen Gebhart
Os Nolen GebhartOs Nolen Gebhart
Os Nolen Gebhart
oscon2007
 

Similar a Big Data LA 2016: Backstage to a Data Driven Culture (20)

Marcus Baker: People Analytics at Scale
Marcus Baker: People Analytics at ScaleMarcus Baker: People Analytics at Scale
Marcus Baker: People Analytics at Scale
 
Cost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponderCost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponder
 
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
 
[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi
[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi
[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi
 
2020 05-data-skills-framework
2020 05-data-skills-framework2020 05-data-skills-framework
2020 05-data-skills-framework
 
How to Modernize Your Data Strategy to Fuel Digital Transformation
How to Modernize Your Data Strategy to Fuel Digital TransformationHow to Modernize Your Data Strategy to Fuel Digital Transformation
How to Modernize Your Data Strategy to Fuel Digital Transformation
 
Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016Creating a Data-Driven Organization, Data Day Texas, January 2016
Creating a Data-Driven Organization, Data Day Texas, January 2016
 
Delivering an effective customer experience dashboard
Delivering an effective customer experience dashboardDelivering an effective customer experience dashboard
Delivering an effective customer experience dashboard
 
Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015Creating a Data-Driven Organization, Crunchconf, October 2015
Creating a Data-Driven Organization, Crunchconf, October 2015
 
Secrets Of Successful Portal Implementations Dec2008
Secrets Of Successful Portal Implementations   Dec2008Secrets Of Successful Portal Implementations   Dec2008
Secrets Of Successful Portal Implementations Dec2008
 
6 steps to start your artificial intelligence project
6 steps to start your artificial intelligence project6 steps to start your artificial intelligence project
6 steps to start your artificial intelligence project
 
Dsa presentation 5
Dsa presentation 5Dsa presentation 5
Dsa presentation 5
 
Optimizing Organizational Knowledge With Project Cortex & The Microsoft Digit...
Optimizing Organizational Knowledge With Project Cortex & The Microsoft Digit...Optimizing Organizational Knowledge With Project Cortex & The Microsoft Digit...
Optimizing Organizational Knowledge With Project Cortex & The Microsoft Digit...
 
Os Nolen Gebhart
Os Nolen GebhartOs Nolen Gebhart
Os Nolen Gebhart
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Data & Analytics: A Point of View
Data & Analytics: A Point of ViewData & Analytics: A Point of View
Data & Analytics: A Point of View
 
Analytics Isn’t Enough To Create A Data–Driven Culture
Analytics Isn’t Enough To Create A Data–Driven CultureAnalytics Isn’t Enough To Create A Data–Driven Culture
Analytics Isn’t Enough To Create A Data–Driven Culture
 
How organizations can become data-driven: three main rules
How organizations can become data-driven: three main rulesHow organizations can become data-driven: three main rules
How organizations can become data-driven: three main rules
 
Planning your analytics journey - webinar slides
Planning your analytics journey  - webinar slidesPlanning your analytics journey  - webinar slides
Planning your analytics journey - webinar slides
 
The Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impactThe Softer Skills Analysts need to make an impact
The Softer Skills Analysts need to make an impact
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 

Big Data LA 2016: Backstage to a Data Driven Culture

  • 1. Backstage to Data Driven Culture Success with an Agile Data Science Stack Big Data LA Day 2016 Pauline Chow
  • 2. 2 So, You are the First Data Scientist…?
  • 3. WORLDWIDE BUSINESS BUSINESS TO GO CREATIVE SOLUTIONS WORLDWIDE BUSINESS BUSINESS TO GO CREATIVE SOLUTIONS What my Friends Think I Do What my Mom Thinks I Do What Society Thinks I Do What my Boss Think I Do What I Think I Do What I Actually Do Misconceptions about Data Scientists 3
  • 4. 4 So, You are the First or Lead Data Scientist…?
  • 5. Open Source & New Tools Profits Steady , Adding Products Report to VP Marketing Non Technical Culture First Data Scientist What does the organization do best? How does it relate to data and technology? What is the business core competencies? What are existing tools, processes, and code? Do you have a budget for new tools and resources? What Tools are Available ? This is both a team members and expectations related question. Where is your Team? What is the mood of the organization? How are they solving problems? Why are they adding DS/A into the organization? What is the State of the Organization? Who are the stakeholders? How is data able to contribute to their goals and expectations? Who has the Influence On the Roadmap? Context for Presentation Case Study: Startup in Digital Media 5
  • 6. Effectively Implement Solutions Maximize Impact & Commun- ication Set a Blueprint that promotes flexibility, iteration, and scalability. It facilities agile-oriented mindsets for data practices and it crucial for implementation. Build a Roadmap from Blueprint to shape data practices and implement goals from stakeholders, company, as well as strong DS/A foundations. Develop key qualitative and quantitative milestones. Communicate consistently and frequently to the organization. Influence Expectations Influence from both angles, yours and stakeholders expectations. Find explicit and implicit goals and bridge the gaps that you find. 6 Key Drivers Integrating Data Culture Create an Agile Data Science Stack Non-technical focused
  • 7. Actively Listen Implement Explore Collaborate Influence Grow Guiding Verbs for “First” Data Scientist 7 In no particular order
  • 8. ACTIVE LISTENING: What Are you Trying to Hear?
  • 9. Explicit Goals & Expectations Structured, straight-forward, logical, and safe inquiries Document, share, and openly discuss with team members and stakeholders. Jungwoo Hong @ Unsplash
  • 10. Implicit Goals & Expectations Thom @ Unsplash
  • 11. IMPLEMENT: HOW TO APPROACH YOUR BLUEPRINT FOR DATA DRIVEN-INFORMED CULTURE?
  • 12. Architecture First Process First 12 STACK AGILE APPROACHES Anthony Delanoix @ Unsplash Jeff Sheldon @ Unsplash
  • 13. Blueprint approach from infrastructure perspective AGILE BY ARCHITECTURE 13
  • 14. Customize as the team grows SaaS & PaaS Integration 14
  • 15. IDENTIFY BUILD SYS & MODELS - Select Appropriate Models - Build Models and Pipelines for Scalability - Evaluate and refine Models ACQUIRE DATA - Identify the “right” source - Import data and set up remote / local storage - Determine tools to work with selected sources CREATE PROBLEM STATEMENT - Identify business, data, product objectives - Brainstorm potential solutions - Create questions and identify people/stakeholders to help PARSE & MINE DATA - Determine distribution of data and necessary transformations - Format, clean, splice, etc - Create new derived data PRESENT RESULTS - Summarize Findings - Add Storytelling aspects - Identify next questions and additional analysis - For teams and stakeholders 15 AGILE BY PROCESS Blueprint approach from workflow perspective ACQUIRE PARSE & MINE PRESENTBUILD DEPLOY
  • 16. IDENTIFY BUILD SYS & MODELS + DEPLOY Leverage platforms that document models, pipelines, and feature iterations. Collaboration is a plus. -  Sklearn pipelines -  DS/ML platforms: Yhat, domino labs, anaconda ACQUIRE DATA Curate data from existing sources that is cleaned, reliable, and automated, where ETL can be skipped -  Segement.io -  Zapier -  CrowdFlower -  Open Data CREATE PROBLEM STATEMENT Keep most attributes of this section in-house and within your team PARSE & MINE DATA For the data that cannot be automated or acquired cleanly, sklearn pipelines or open source Luigi (Spotify) or airflow (AirBNB) can mitigate this process. PRESENT RESULTS Adopt platforms that allow for iterations and data mining/ parsing process to feed into reports and presentations -  Ipython Jupyter Notebooks -  Dashboards: Looker, RJMetrics, Tableau 16 SaaS & PaaS Integration Customize as the Process Increases in Complexity ACQUIRE PARSE & MINE PRESENTBUILD DEPLOY
  • 17. COLLABORATE: What Metrics to Emphasize for Teamwork?
  • 18. Burn Rate Most companies do not widely broadcast but transparency can put decisions into perspective for the organization. Time and urgency can also be of the essence. Customer Acquisition Cost (CAC) Illustrates market competitiveness with your products, services, and market saturation. Social media ad platforms can make up a large portion of these costs.
  • 19. Gross Profit & Revenue Actual revenue & profit after expenses, investors, and ongoing costs. If the business model and product are viable then the company will be able to stand on its own without external capital. Active Users Measure the ongoing stickiness of a service or product. Clearly define “active” to not overcompensate first-time, new, and experimental users. Can the company move beyond early adopters and fans?
  • 20. Churn Rate & Retention How many people are leaving or become inactive after a certain period of time? When in the customer’s lifetime is churn more likely to occur? The higher the expected churn rate, then the more the company has to spend on acquiring new customers. Cumulative Growth Cumulative growth puts a long term and sustainable perspective to just month over month growth. Short-term growth can unabashedly take over and cause decision makers to lose sight of an organization’s mission and goals.
  • 21. Response Time The amount of time teams take to respond and complete tasks, which includes bug fixes, technological improvements, product upgades, and customer service. Responsiveness demonstrates staff and team dedication, effective allocation of resources, operational effectiveness, and no tech debt. Customer LIfetime Value (CLV) Total dollars from a customer during the lifetime relationship with that customer. Intersection of frequency of customer purchases, revenue per customer, acquisition costs. This measure can have predictive qualities
  • 22. INFLUENCE How to align and connect goals and expectations?
  • 23. "Leadership is the art of giving people a platform for spreading ideas that work." -Seth Godin 23
  • 24. Evaluate milestones, iterate and grow Month 12 Blueprint for Agile Data Science and Analytics Stack Day 30 Establish clear measures for success as widespread as possible Day 90 Good first impressions. Listen and Learn! Day 1 Celebrate improvements to workflow, effectiveness, and access Day 60 Democratize data access and streamline measures to external and internal teams Month 6 Communicate, Strategize, Communicate... Connect the Dots 24
  • 25. Anything Else Reporting & Urgent Requests Data Acquisition, Cleaning Exploration & Analysis, Reports, & Presentation 20% 80% 80% 20% 25 Allocate Time & Resources Effectively Business as Usual Allocation New Data Science Allocation
  • 26. GROW YOUR TEAM When to increase the ability and capabilities of your team?
  • 27. Technical Project Manager Data Scientist Data Engineer Data Engineer Analyst Researcher Team Members
  • 28. 6 1 2 5Central to the ability to juggle and balance responsibility of being the first/lead data scientist. Agile Data Science & Analytics Stack 3 4 Active Listeni ng Influen ce Collabora te with Metrics Explore Implement Grow Actionable Agile DS/A Stack is Key to Success 28