SlideShare una empresa de Scribd logo
1 de 36
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
Confidential Prepared by Ver.
Data-driven approaches in a technology startup
1.0Michal Szczecinski
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
2
Hong Kong
Taiwan
Singapore
South Korea
China
China (300+ cities)
Since 02.2015
08. 2017 merged with 58
Suyun
Hong Kong
Since 07.2013
Singapore
Since 06.2014
South Korea (2 cities)
Since 10.2015
Taiwan
Since 11.2014
India
India
Since 03.2016
Established in 2013, GOGOVAN is the first app-based platform for
delivering goods in Asia.
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
3
What I will talk about?
Startup context
Goals of Analytics
Why data matters
Work cases
Lessons learnt
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
4
Hong Kong
Oxford
London
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
“Data guy”
• Business Intelligence
• Data engineering
• Data Science (data products)
• Data quality
• Digital Marketing/Growth
• Product analytics
• Financial modelling/forecasting
• Strategy analysis
• Big Data Research
• Data compliance
…...
Established multi-team contribution:
Corporate vs Startup
Multidisciplinary, tech wizz, “all-knowing”…. :
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 6
Goals of Analytics
Underlying vision is to make GOGOVAN data-driven.
6
1. Decision support
2. Knowledge discovery
3. Optimization
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 7
- Supporting all teams (product, operations,
marketing, customer service, engineering, finance,
management, legal and more…)
- Supporting all countries
- Everything related to data
- Multiple outputs (in house build dashboards, etl
jobs, interactive tools, notebooks, ML models,
scientific papers, ad hoc queries, alerts,
infrastructure and tools)
- Multiple input
- Data users across whole organisation
Data team
“Everything data” in GOGOVAN
7
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
8
Why data matters?
Just 3 examples (there is more…)
read more: https://towardsdatascience.com/what-does-a-data-team-really-do-12484482e683
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 9
1. Price - what user pays/what driver
earns.
2. Time - response, arrival, completion.
3. Quality - customer experience, effort,
reliability...
Service level
improving key components of our
service
9
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 10
1. Frontier - After certain point x as the
volume of orders grows, the completion
rate starts to fall exponentially.
2. Wall - Also there is a wall of soft limit of
numbers of orders that can be completed
no matter what is the volume of orders.
3. Improvement - In the whole history of
GOGOVAN that wall has been overcome
just once, very recently. Also this wall
has been steadily raising.
Completion rate
growing business activity
10
*axes and details removed for data confidentiality purposes
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 11
1. Transactions - is there any unusual
activity?
2. Partners - do all partners play fair?
3. Systems - are systems working fine?
4. Community - what are people saying?
5. Safety - are people and goods safe?
6. Competition - what’s going on in other
camps?
Anomaly detection
avoiding unexpected
11
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
12
What are we working on?
Applications - examples of projects and solutions..
Real use cases and tools (with transformed, hidden or masked details)
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
13
Decision Support
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 14
1. All-in-one place for data
2. Multi-use - reports, interactive tools,.
Self service, dashboards, algorithms,
docs, training videos etc.
3. Search
4. Tagging
5. Collaboration
Data Platform
Operating data services
14
Main dashboard charts
Goal: Provide decision support on all important areas of the company for the respective team members.
Action: Get important metrics by different breakdowns and time periods. Monitor progress and
Outcome: Lower Costs/More GMV/More Users
Next Generation self service analytics
Goal: Enable end users to to effectively analyse and retrieve the data.
Action: Build custom reports, share comments and insights, optimised UX.
Outcome: Lower costs/Better Service
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
17
Knowledge discovery
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 18
1. Focused on particular problem/question
2. Thousands of searchable and reproducible reports
3. Publishing tools
4. Auto Generated reports and alerts
5. Metadata and templates
6. Analytics Meetings
Notebooks
Scaling deep knowledge
18
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 19
Real Time Heatmap
19
1. Interactive monitoring
2. Adopted - used by ops
3. Goal: Visualize drivers and orders
4. Action: identify idle drivers and pending
orders, understand and affect distribution
of supply/demand
5. Outcome: Higher GMV/Better service
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 20
Marketplace analysis
Monitoring and stimulating GOGOVAN ecosystem
20
1. Arrival time
2. Distribution of orders
3. Supply/demand proportion
4. Completion time
5. Utilization rate
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
21
Optimization
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
22
Predicting demand
Algorithms
Selected examples
Predicting unmet demand
Predicting order status
Driver Matching
Route Optimization
Churn prediction
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 23
1. Responding to questions, what was an
impact of x ?
2. ARMA Exogenous Variable Model
(ARMAX)
3. DOW, Weather, Holiday
Demand prediction
Causal inference
23
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 24
1. Goal: Predict unmet demand and balance
supply/demand.
2. Action: Know how many more drivers we
need at particular regions at the
particular time in order to fulfill expected
demand.
3. Outcome: Better Service/Higher GMV
Unmet demand prediction
Balancing supply/demand
24
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 25
1. Goal: Optimise supply and demand.
2. Action: Match drivers to orders better so
that we optimise key operational KPIs.
3. Outcome: Lower Costs/Better
Service/Higher GMV
Dynamic Supply and Demand dispatching in
spatially structured region based on big data
analytics
Matching best driver.
25
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 26
1. Goal: Plan route in a way that utilizes
drivers time and provide cost benefits for
the customer.
2. Action: choose quickest route; avoid
obstacles, traffic and hot spots, predict
ETA, bundle orders so that is more cost
efficient for the driver
3. Outcome: Higher GMV/Lower
costs/Better Service
4. Scalable
5. Cost efficient
6. High performance
7. Customizable
8. In-house competitive advantage
Route Optimization
Increasing operations efficiency: route optimization.
Bundling and scheduling.
26
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 27
1. Interactive real time tool predicting
if/how fast order will be picked up
2. Response time (percentiles, absolute)
3. Zero rated probability
4. Feature Importance
5. Action: identify risky orders, assign
orders before they are cancelled by user,
add bonus/subsidy, redirect bad
performing orders to specified pool of
drivers/incentivize drivers/notify user to
add bonus in the app
6. Outcome: More revenue/More
users/Improved experience for user
Order status prediction
Estimating attractiveness of the order
27
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 28
1. Predicting churn
2. Identifying things that lead to churn
3. Prevent churn
Predicting churn
Engaging clients
28
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
29
How to become a data-driven organisation?
Lessons learnt
read more : article coming soon “Principles for becoming data-driven”.
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
30
30
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 31
1. Goals: 1) cross system data integration
2) analytics abstraction 3) data analytics
4) real time data services
2. Minimal management cost
3. Scalable
4. Well integrated with data analytics tools
5. Universal, being able to support different
type of systems and events
6. Facilitating productivity of data science
team , with minimized maintenance
effort and cognitive load
7. Ideally unified data science workflow
across batch and real time
Data Infrastructure
(GOGOTRACK)
Real Time analytics source
31
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 32
1. Scaling
2. Traceability
3. Multiple models
4. Flexibility
5. Multiple consumers
6. Reproducibility
7. Performance and availability
ML Logistics
(GOGOMI)
Operating data services
32
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
33
Data-driven Framework
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
34
ML/AI Initiatives
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
Ops Data Brain
35
Real-time Heatmap on steroids with ML recommendations for ops
Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission.
Confidential Prepared by Ver.
Michal Szczecinski
https://www.linkedin.com/in/michalszczecinski/
michal@gogotech.hk
Thank you
Michal Szczecinski

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

AI in the Enterprise
AI in the EnterpriseAI in the Enterprise
AI in the Enterprise
 
H2O World - Transamerica's Product Recommender Platform - Vishal Bamba & Niti...
H2O World - Transamerica's Product Recommender Platform - Vishal Bamba & Niti...H2O World - Transamerica's Product Recommender Platform - Vishal Bamba & Niti...
H2O World - Transamerica's Product Recommender Platform - Vishal Bamba & Niti...
 
Fraud detection
Fraud detectionFraud detection
Fraud detection
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOps
 
How to add Artificial Intelligence Capabilities to Existing Software Platforms
How to add Artificial Intelligence Capabilities to Existing Software PlatformsHow to add Artificial Intelligence Capabilities to Existing Software Platforms
How to add Artificial Intelligence Capabilities to Existing Software Platforms
 
Msst 2019 v4
Msst 2019 v4Msst 2019 v4
Msst 2019 v4
 
Kubernetes Jakarta Meetup 010 - Service Mesh Observability with Kiali
Kubernetes Jakarta Meetup 010 - Service Mesh Observability with KialiKubernetes Jakarta Meetup 010 - Service Mesh Observability with Kiali
Kubernetes Jakarta Meetup 010 - Service Mesh Observability with Kiali
 
WSO2Con EU 2016: An Effective Device Strategy to Accelerate your Business
WSO2Con EU 2016: An Effective Device Strategy to  Accelerate your BusinessWSO2Con EU 2016: An Effective Device Strategy to  Accelerate your Business
WSO2Con EU 2016: An Effective Device Strategy to Accelerate your Business
 
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix WebinarFuture-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
 
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
Intro to Big Data and Apache Hadoop by Dr. Amr Awadallah at CLOUD WEEKEND '13...
 
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
Delivering Large Scale Real-time Graph Analytics with Dell Infrastructure and...
 
H2O World - Self Guiding Applications with Venkatesh Yadav
H2O World - Self Guiding Applications with Venkatesh YadavH2O World - Self Guiding Applications with Venkatesh Yadav
H2O World - Self Guiding Applications with Venkatesh Yadav
 
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
Digital Shift in Insurance: How is the Industry Responding with the Influx of...Digital Shift in Insurance: How is the Industry Responding with the Influx of...
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
 
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
 
Flight Delay Compensation: How SwissRe is exploring new territories in Busine...
Flight Delay Compensation: How SwissRe is exploring new territories in Busine...Flight Delay Compensation: How SwissRe is exploring new territories in Busine...
Flight Delay Compensation: How SwissRe is exploring new territories in Busine...
 
“The Data-Driven Engineering Revolution,” a Presentation from Edge Impulse
“The Data-Driven Engineering Revolution,” a Presentation from Edge Impulse“The Data-Driven Engineering Revolution,” a Presentation from Edge Impulse
“The Data-Driven Engineering Revolution,” a Presentation from Edge Impulse
 
NUS-ISS Learning Day 2018- Solving difficult problems in the age of digitalis...
NUS-ISS Learning Day 2018- Solving difficult problems in the age of digitalis...NUS-ISS Learning Day 2018- Solving difficult problems in the age of digitalis...
NUS-ISS Learning Day 2018- Solving difficult problems in the age of digitalis...
 
Fraud Analytics with Machine Learning and Big Data Engineering for Telecom
Fraud Analytics with Machine Learning and Big Data Engineering for TelecomFraud Analytics with Machine Learning and Big Data Engineering for Telecom
Fraud Analytics with Machine Learning and Big Data Engineering for Telecom
 
Pivotal Digital Transformation Forum: Data Science
Pivotal Digital Transformation Forum: Data Science Pivotal Digital Transformation Forum: Data Science
Pivotal Digital Transformation Forum: Data Science
 
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunenMeetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
Meetup 27/6/2018: AIOPS om de uitdagingen van een slimme stad te ondersteunen
 

Similar a Data driven approaches in a technology startup

Webinar effective mobile performance testing using real devices
Webinar effective mobile performance testing using real devicesWebinar effective mobile performance testing using real devices
Webinar effective mobile performance testing using real devices
Perfecto Mobile
 
Adapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
Adapting to Meet Today’s Trends and Technologies– Compliance vs. EnforcementAdapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
Adapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
Flexera
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Capgemini
 
SAP Leonardo Blockchain Services and Use-Cases
SAP Leonardo Blockchain Services and Use-CasesSAP Leonardo Blockchain Services and Use-Cases
SAP Leonardo Blockchain Services and Use-Cases
Nagesh Caparthy
 
Cloud cpr uncc cloud computing conference 2013
Cloud cpr   uncc cloud computing conference 2013Cloud cpr   uncc cloud computing conference 2013
Cloud cpr uncc cloud computing conference 2013
C5_LUCK
 

Similar a Data driven approaches in a technology startup (20)

Webinar effective mobile performance testing using real devices
Webinar effective mobile performance testing using real devicesWebinar effective mobile performance testing using real devices
Webinar effective mobile performance testing using real devices
 
How to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPOHow to Build an AI/ML Product and Sell it by SalesChoice CPO
How to Build an AI/ML Product and Sell it by SalesChoice CPO
 
How INOVVO Delivers Analysis that Leads to Greater User Retention and Loyalty...
How INOVVO Delivers Analysis that Leads to Greater User Retention and Loyalty...How INOVVO Delivers Analysis that Leads to Greater User Retention and Loyalty...
How INOVVO Delivers Analysis that Leads to Greater User Retention and Loyalty...
 
Test Everything: TrustRadius Delivers Customer Value with Experimentation
Test Everything: TrustRadius Delivers Customer Value with ExperimentationTest Everything: TrustRadius Delivers Customer Value with Experimentation
Test Everything: TrustRadius Delivers Customer Value with Experimentation
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
 
Developing Custom iOs Applications for Enterprise
Developing Custom iOs Applications for EnterpriseDeveloping Custom iOs Applications for Enterprise
Developing Custom iOs Applications for Enterprise
 
Emvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce DeckEmvigo Data Visualization - E Commerce Deck
Emvigo Data Visualization - E Commerce Deck
 
Adapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
Adapting to Meet Today’s Trends and Technologies– Compliance vs. EnforcementAdapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
Adapting to Meet Today’s Trends and Technologies– Compliance vs. Enforcement
 
Get your data analytics strategy right!
Get your data analytics strategy right!Get your data analytics strategy right!
Get your data analytics strategy right!
 
Monitoring in the DevOps Era
Monitoring in the DevOps EraMonitoring in the DevOps Era
Monitoring in the DevOps Era
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
 
SAP Leonardo Blockchain Services and Use-Cases
SAP Leonardo Blockchain Services and Use-CasesSAP Leonardo Blockchain Services and Use-Cases
SAP Leonardo Blockchain Services and Use-Cases
 
Zycus Online E- Auction
Zycus Online E- AuctionZycus Online E- Auction
Zycus Online E- Auction
 
Contingency Planning and Risk Mitigation Strategies for Cloud-based Technolog...
Contingency Planning and Risk Mitigation Strategies for Cloud-based Technolog...Contingency Planning and Risk Mitigation Strategies for Cloud-based Technolog...
Contingency Planning and Risk Mitigation Strategies for Cloud-based Technolog...
 
5 Lessons Learned in Product Management by Twitch Senior PM
5 Lessons Learned in Product Management by Twitch Senior PM5 Lessons Learned in Product Management by Twitch Senior PM
5 Lessons Learned in Product Management by Twitch Senior PM
 
Hardcore SEO & Social Media Tools - SMX Advanced 2012
Hardcore SEO & Social Media Tools - SMX Advanced 2012Hardcore SEO & Social Media Tools - SMX Advanced 2012
Hardcore SEO & Social Media Tools - SMX Advanced 2012
 
Oracle big data and rtd v5
Oracle big data and rtd v5Oracle big data and rtd v5
Oracle big data and rtd v5
 
Differentiating Digital Banking with API Monitoring
Differentiating Digital Banking with API MonitoringDifferentiating Digital Banking with API Monitoring
Differentiating Digital Banking with API Monitoring
 
Cloud cpr uncc cloud computing conference 2013
Cloud cpr   uncc cloud computing conference 2013Cloud cpr   uncc cloud computing conference 2013
Cloud cpr uncc cloud computing conference 2013
 
Being a digital communication superstar
Being a digital communication superstarBeing a digital communication superstar
Being a digital communication superstar
 

Más de Rakuten Group, Inc.

Más de Rakuten Group, Inc. (20)

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
 
楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
 
DataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みDataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組み
 
大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開
 
楽天における大規模データベースの運用
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用
 
楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー
 
楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割
 
Rakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdf
 
The Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdf
 
Supporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdf
 
Making Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfMaking Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdf
 
How We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfHow We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdf
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
OWASPTop10_Introduction
OWASPTop10_IntroductionOWASPTop10_Introduction
OWASPTop10_Introduction
 
Introduction of GORA API Group technology
Introduction of GORA API Group technologyIntroduction of GORA API Group technology
Introduction of GORA API Group technology
 
100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情
 
社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Data driven approaches in a technology startup

  • 1. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. Confidential Prepared by Ver. Data-driven approaches in a technology startup 1.0Michal Szczecinski
  • 2. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 2 Hong Kong Taiwan Singapore South Korea China China (300+ cities) Since 02.2015 08. 2017 merged with 58 Suyun Hong Kong Since 07.2013 Singapore Since 06.2014 South Korea (2 cities) Since 10.2015 Taiwan Since 11.2014 India India Since 03.2016 Established in 2013, GOGOVAN is the first app-based platform for delivering goods in Asia.
  • 3. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 3 What I will talk about? Startup context Goals of Analytics Why data matters Work cases Lessons learnt
  • 4. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 4 Hong Kong Oxford London
  • 5. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. “Data guy” • Business Intelligence • Data engineering • Data Science (data products) • Data quality • Digital Marketing/Growth • Product analytics • Financial modelling/forecasting • Strategy analysis • Big Data Research • Data compliance …... Established multi-team contribution: Corporate vs Startup Multidisciplinary, tech wizz, “all-knowing”…. :
  • 6. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 6 Goals of Analytics Underlying vision is to make GOGOVAN data-driven. 6 1. Decision support 2. Knowledge discovery 3. Optimization
  • 7. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 7 - Supporting all teams (product, operations, marketing, customer service, engineering, finance, management, legal and more…) - Supporting all countries - Everything related to data - Multiple outputs (in house build dashboards, etl jobs, interactive tools, notebooks, ML models, scientific papers, ad hoc queries, alerts, infrastructure and tools) - Multiple input - Data users across whole organisation Data team “Everything data” in GOGOVAN 7
  • 8. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 8 Why data matters? Just 3 examples (there is more…) read more: https://towardsdatascience.com/what-does-a-data-team-really-do-12484482e683
  • 9. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 9 1. Price - what user pays/what driver earns. 2. Time - response, arrival, completion. 3. Quality - customer experience, effort, reliability... Service level improving key components of our service 9
  • 10. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 10 1. Frontier - After certain point x as the volume of orders grows, the completion rate starts to fall exponentially. 2. Wall - Also there is a wall of soft limit of numbers of orders that can be completed no matter what is the volume of orders. 3. Improvement - In the whole history of GOGOVAN that wall has been overcome just once, very recently. Also this wall has been steadily raising. Completion rate growing business activity 10 *axes and details removed for data confidentiality purposes
  • 11. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 11 1. Transactions - is there any unusual activity? 2. Partners - do all partners play fair? 3. Systems - are systems working fine? 4. Community - what are people saying? 5. Safety - are people and goods safe? 6. Competition - what’s going on in other camps? Anomaly detection avoiding unexpected 11
  • 12. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 12 What are we working on? Applications - examples of projects and solutions.. Real use cases and tools (with transformed, hidden or masked details)
  • 13. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 13 Decision Support
  • 14. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 14 1. All-in-one place for data 2. Multi-use - reports, interactive tools,. Self service, dashboards, algorithms, docs, training videos etc. 3. Search 4. Tagging 5. Collaboration Data Platform Operating data services 14
  • 15. Main dashboard charts Goal: Provide decision support on all important areas of the company for the respective team members. Action: Get important metrics by different breakdowns and time periods. Monitor progress and Outcome: Lower Costs/More GMV/More Users
  • 16. Next Generation self service analytics Goal: Enable end users to to effectively analyse and retrieve the data. Action: Build custom reports, share comments and insights, optimised UX. Outcome: Lower costs/Better Service
  • 17. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 17 Knowledge discovery
  • 18. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 18 1. Focused on particular problem/question 2. Thousands of searchable and reproducible reports 3. Publishing tools 4. Auto Generated reports and alerts 5. Metadata and templates 6. Analytics Meetings Notebooks Scaling deep knowledge 18
  • 19. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 19 Real Time Heatmap 19 1. Interactive monitoring 2. Adopted - used by ops 3. Goal: Visualize drivers and orders 4. Action: identify idle drivers and pending orders, understand and affect distribution of supply/demand 5. Outcome: Higher GMV/Better service
  • 20. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 20 Marketplace analysis Monitoring and stimulating GOGOVAN ecosystem 20 1. Arrival time 2. Distribution of orders 3. Supply/demand proportion 4. Completion time 5. Utilization rate
  • 21. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 21 Optimization
  • 22. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 22 Predicting demand Algorithms Selected examples Predicting unmet demand Predicting order status Driver Matching Route Optimization Churn prediction
  • 23. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 23 1. Responding to questions, what was an impact of x ? 2. ARMA Exogenous Variable Model (ARMAX) 3. DOW, Weather, Holiday Demand prediction Causal inference 23
  • 24. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 24 1. Goal: Predict unmet demand and balance supply/demand. 2. Action: Know how many more drivers we need at particular regions at the particular time in order to fulfill expected demand. 3. Outcome: Better Service/Higher GMV Unmet demand prediction Balancing supply/demand 24
  • 25. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 25 1. Goal: Optimise supply and demand. 2. Action: Match drivers to orders better so that we optimise key operational KPIs. 3. Outcome: Lower Costs/Better Service/Higher GMV Dynamic Supply and Demand dispatching in spatially structured region based on big data analytics Matching best driver. 25
  • 26. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 26 1. Goal: Plan route in a way that utilizes drivers time and provide cost benefits for the customer. 2. Action: choose quickest route; avoid obstacles, traffic and hot spots, predict ETA, bundle orders so that is more cost efficient for the driver 3. Outcome: Higher GMV/Lower costs/Better Service 4. Scalable 5. Cost efficient 6. High performance 7. Customizable 8. In-house competitive advantage Route Optimization Increasing operations efficiency: route optimization. Bundling and scheduling. 26
  • 27. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 27 1. Interactive real time tool predicting if/how fast order will be picked up 2. Response time (percentiles, absolute) 3. Zero rated probability 4. Feature Importance 5. Action: identify risky orders, assign orders before they are cancelled by user, add bonus/subsidy, redirect bad performing orders to specified pool of drivers/incentivize drivers/notify user to add bonus in the app 6. Outcome: More revenue/More users/Improved experience for user Order status prediction Estimating attractiveness of the order 27
  • 28. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 28 1. Predicting churn 2. Identifying things that lead to churn 3. Prevent churn Predicting churn Engaging clients 28
  • 29. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 29 How to become a data-driven organisation? Lessons learnt read more : article coming soon “Principles for becoming data-driven”.
  • 30. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 30 30
  • 31. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 31 1. Goals: 1) cross system data integration 2) analytics abstraction 3) data analytics 4) real time data services 2. Minimal management cost 3. Scalable 4. Well integrated with data analytics tools 5. Universal, being able to support different type of systems and events 6. Facilitating productivity of data science team , with minimized maintenance effort and cognitive load 7. Ideally unified data science workflow across batch and real time Data Infrastructure (GOGOTRACK) Real Time analytics source 31
  • 32. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 32 1. Scaling 2. Traceability 3. Multiple models 4. Flexibility 5. Multiple consumers 6. Reproducibility 7. Performance and availability ML Logistics (GOGOMI) Operating data services 32
  • 33. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 33 Data-driven Framework
  • 34. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. 34 ML/AI Initiatives
  • 35. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. Ops Data Brain 35 Real-time Heatmap on steroids with ML recommendations for ops
  • 36. Copyrights: Proprietary and confidential. Not to be distributed or reproduced without permission. Confidential Prepared by Ver. Michal Szczecinski https://www.linkedin.com/in/michalszczecinski/ michal@gogotech.hk Thank you Michal Szczecinski