SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Market Propensity Modelling Using
XStreams
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
About XSTREAMS
Why XSTREAMS?
XSTREAMS Architecture
Technology Stack
Adv. Market Propensity
Legacy Design & Issues
Feature Engineering
Modelling on XStreams
???
Agenda
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
XSTREAMS: Drag and Drop Self Serve Platform
50+ Data Processing
Operators
45+ Transformer and
Estimators for
Feature Engineering
Train, Score and
Evaluate Model in
one Pipeline
Marketplace for
readily available
blueprints.
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
Why XSTREAMS?
Sink for Error
Handling
Timeseries and
Aggregated Metrics
Actionable Alerts
Scheduling
Capabilities
Auditing Support
Versioning Support
Granular Role
Based Authorization
Checkpointing
Support.
Common features
automatically
applied to all pipelines
created
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
5
Kafka
Pubsub
Flume
RabbitMQ
Amazon S3
HDFS
Kinesis
MQTT
Data Sources
HDFS Files
Hive
Kafka
ElasticSearch
Kinesis
WebSocket
Cassandra
BigQuery
PubSub
Data Sinks
Real Time Self Service ETL Platform
Hadoop Distributions Native / VM / Cloud
Cloudera MapR HDP
High Level Architecture
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
6
Source SinkUI/UX
Service Layer
Xstreams Core
Real Time Olap
Technology Stack
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
7
.
Marketing Propensity Business Use Case
Predict user purchase trends across different signals like Brand, Price , Size
based on custom and dynamic feature set that is composed on time based
event product category , sub category , age and gender.
Business Applications
Search and Browse for Recommendation
Enhance Browsing Experience
Discount Optimization
Campaign Management
Ad Monetization
Internal R &D (Eg. Acti Mirrors)
Futuristic Shopping Experience
Tagging Using App by Finding Customers
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
8
.
Overview of Sources/Features/EndPoints
ClickStream Dataset
CustomerId
ProductId
EventId
Time
Product Dataset
Product Category
Product Sub Category
Brand
Occasion
Age , Gender , Colour , Size
Demographic Dataset
Gender
Household Size
Income
#Children , Marital Status
Education
Source Tables and Attributes
Features
Product Category
Product Sub Category
Event Type
Age
Gender
Time
EndPoints/Singals
Brand
Price
Size
Occasion
Colour
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
9
.
Legacy Modelling Steps
Join ClickStream and Product Dataset
Pivot and Vectorize
Join ClickStream and Demographic Dataset
Pivot and Vectorize
Merge Above two Vectors
Apply Binary Classification Logistic Regression Model
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
10
.
Legacy Modelling Issues
Pivots creation was taking more than 20 hours
on whole dataset. So sampled 5% dataset was
used.
Default Vector Operator was taking longer time.
Modelling was done on subset of the features
combinations.
Skewed data for Purchase and Non Purchase label.
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
11
.
Feature Engineering Optimization on Xstreams
Complete Dataset for 30 million customers was used since
vectorization time was reduced from 18hours to 3 hours due
to custom sparse vectorize operator.
Removed skewed data label by redcing the non
purchase by 1:10 ratio.
Added unknown values for missing demographic
values.
All feature combination were not used instead of top
20.P1/P2 combination varied 60-2400
Market Propensity Pipeline on XStreams (Live Demo)
Copyright © 2015-2017
Exadatum Software Services Pvt. Ltd.
Thank You!
13

Más contenido relacionado

La actualidad más candente

Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not YearsReplatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not YearsVMware Tanzu
 
Gautham Pai K - Resume
Gautham Pai K - ResumeGautham Pai K - Resume
Gautham Pai K - ResumeGautham Pai
 
Build Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesBuild Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesImpetus Technologies
 
Kcom graph connect europe, 11 may 2017
Kcom   graph connect europe, 11 may 2017Kcom   graph connect europe, 11 may 2017
Kcom graph connect europe, 11 may 2017Andrew Smale
 
My CIO Says that We're Going All-in and Migrating to AWS, Now What
My CIO Says that We're Going All-in and Migrating to AWS, Now WhatMy CIO Says that We're Going All-in and Migrating to AWS, Now What
My CIO Says that We're Going All-in and Migrating to AWS, Now WhatAmazon Web Services
 
Techorama: Power BI Automation with MS Flow
Techorama: Power BI Automation with MS FlowTechorama: Power BI Automation with MS Flow
Techorama: Power BI Automation with MS FlowIda Bergum
 
Cloud Assessment and Readiness Tool (CART)
Cloud Assessment and Readiness Tool (CART)Cloud Assessment and Readiness Tool (CART)
Cloud Assessment and Readiness Tool (CART)HCL Technologies
 
Olivier Blais: Want to adopt AI in your business: good luck!
Olivier Blais: Want to adopt AI in your business: good luck!Olivier Blais: Want to adopt AI in your business: good luck!
Olivier Blais: Want to adopt AI in your business: good luck!Lviv Startup Club
 
AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019Rebekka Aalbers-de Jong
 
ROI Example
ROI ExampleROI Example
ROI ExamplePhilbo58
 
Roadshow Chicago - Introduction
Roadshow   Chicago - IntroductionRoadshow   Chicago - Introduction
Roadshow Chicago - IntroductionInfluxData
 
Market Move - SAP acquires Qualtrics - The quest for the XM category begins
Market Move - SAP acquires Qualtrics - The quest for the XM category beginsMarket Move - SAP acquires Qualtrics - The quest for the XM category begins
Market Move - SAP acquires Qualtrics - The quest for the XM category beginsHolger Mueller
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics Ruben Pertusa Lopez
 
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...Amazon Web Services
 
HUGIreland_VincentDeStocklin_DataScienceWorkflows
HUGIreland_VincentDeStocklin_DataScienceWorkflowsHUGIreland_VincentDeStocklin_DataScienceWorkflows
HUGIreland_VincentDeStocklin_DataScienceWorkflowsJohn Mulhall
 
Integration Services
Integration ServicesIntegration Services
Integration ServicesMai Hoang
 

La actualidad más candente (20)

Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not YearsReplatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
Replatform your Teradata to a Next-Gen Cloud Data Platform in Weeks, Not Years
 
Gautham Pai K - Resume
Gautham Pai K - ResumeGautham Pai K - Resume
Gautham Pai K - Resume
 
Build Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in MinutesBuild Spark-based ETL Workflows on Cloud in Minutes
Build Spark-based ETL Workflows on Cloud in Minutes
 
Kcom graph connect europe, 11 may 2017
Kcom   graph connect europe, 11 may 2017Kcom   graph connect europe, 11 may 2017
Kcom graph connect europe, 11 may 2017
 
My CIO Says that We're Going All-in and Migrating to AWS, Now What
My CIO Says that We're Going All-in and Migrating to AWS, Now WhatMy CIO Says that We're Going All-in and Migrating to AWS, Now What
My CIO Says that We're Going All-in and Migrating to AWS, Now What
 
Techorama: Power BI Automation with MS Flow
Techorama: Power BI Automation with MS FlowTechorama: Power BI Automation with MS Flow
Techorama: Power BI Automation with MS Flow
 
Power BI
Power BIPower BI
Power BI
 
Enterprise search solutions
Enterprise search solutionsEnterprise search solutions
Enterprise search solutions
 
Cloud Assessment and Readiness Tool (CART)
Cloud Assessment and Readiness Tool (CART)Cloud Assessment and Readiness Tool (CART)
Cloud Assessment and Readiness Tool (CART)
 
Olivier Blais: Want to adopt AI in your business: good luck!
Olivier Blais: Want to adopt AI in your business: good luck!Olivier Blais: Want to adopt AI in your business: good luck!
Olivier Blais: Want to adopt AI in your business: good luck!
 
AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019AI Builder Deepdive DynamicsPower! Brussels 2019
AI Builder Deepdive DynamicsPower! Brussels 2019
 
ROI Example
ROI ExampleROI Example
ROI Example
 
Roadshow Chicago - Introduction
Roadshow   Chicago - IntroductionRoadshow   Chicago - Introduction
Roadshow Chicago - Introduction
 
Market Move - SAP acquires Qualtrics - The quest for the XM category begins
Market Move - SAP acquires Qualtrics - The quest for the XM category beginsMarket Move - SAP acquires Qualtrics - The quest for the XM category begins
Market Move - SAP acquires Qualtrics - The quest for the XM category begins
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics
 
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...
AWS & Manufacturing: SKF Connects Smart Products with Smart Factories (MFG316...
 
HUGIreland_VincentDeStocklin_DataScienceWorkflows
HUGIreland_VincentDeStocklin_DataScienceWorkflowsHUGIreland_VincentDeStocklin_DataScienceWorkflows
HUGIreland_VincentDeStocklin_DataScienceWorkflows
 
Integration Services
Integration ServicesIntegration Services
Integration Services
 
Cloud Economics
Cloud EconomicsCloud Economics
Cloud Economics
 
Democratize ai with google cloud
Democratize ai with google cloudDemocratize ai with google cloud
Democratize ai with google cloud
 

Similar a Market Propensity Modeling Using XSTREAMS

雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)Amazon Web Services
 
Welcome and AWS Big Data Solution Overview
Welcome and AWS Big Data Solution OverviewWelcome and AWS Big Data Solution Overview
Welcome and AWS Big Data Solution OverviewAmazon Web Services
 
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...Khalid Salama
 
Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLCome costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLAmazon Web Services
 
How TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPTHow TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPTAmazon Web Services
 
Microservices oracle-meetup
Microservices oracle-meetupMicroservices oracle-meetup
Microservices oracle-meetupNitu Parimi
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Amazon Web Services
 
How can your business benefit from going Serverless
How can your business benefit from going ServerlessHow can your business benefit from going Serverless
How can your business benefit from going ServerlessAmazon Web Services
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...Amazon Web Services
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingAmazon Web Services
 
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...Amazon Web Services
 
Machine learning in the physical world by Kip Larson from AWS IoT
Machine learning in the physical world by  Kip Larson from AWS IoTMachine learning in the physical world by  Kip Larson from AWS IoT
Machine learning in the physical world by Kip Larson from AWS IoTBill Liu
 
Envisioning the Future Enterprise
Envisioning the Future EnterpriseEnvisioning the Future Enterprise
Envisioning the Future Enterprise WSO2
 
How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...NRB
 
How can your business benefit from going serverless?
How can your business benefit from going serverless?How can your business benefit from going serverless?
How can your business benefit from going serverless?Adrian Hornsby
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...Amazon Web Services
 
Enterprise Cloud Adoption
Enterprise Cloud Adoption Enterprise Cloud Adoption
Enterprise Cloud Adoption Tom Laszewski
 
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018Amazon Web Services
 
Digital Transformation using Predictive Analytics and Big Data
Digital Transformation using Predictive Analytics and Big DataDigital Transformation using Predictive Analytics and Big Data
Digital Transformation using Predictive Analytics and Big DataAmazon Web Services
 
Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureGet to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureAmazon Web Services
 

Similar a Market Propensity Modeling Using XSTREAMS (20)

雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
 
Welcome and AWS Big Data Solution Overview
Welcome and AWS Big Data Solution OverviewWelcome and AWS Big Data Solution Overview
Welcome and AWS Big Data Solution Overview
 
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
Operational Machine Learning: Using Microsoft Technologies for Applied Data S...
 
Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLCome costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
 
How TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPTHow TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPT
 
Microservices oracle-meetup
Microservices oracle-meetupMicroservices oracle-meetup
Microservices oracle-meetup
 
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
Architecting for Real-Time Insights with Amazon Kinesis (ANT310) - AWS re:Inv...
 
How can your business benefit from going Serverless
How can your business benefit from going ServerlessHow can your business benefit from going Serverless
How can your business benefit from going Serverless
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
 
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...
Ripping off the Bandage: Re-Architecting Traditional Three-Tier Monoliths to ...
 
Machine learning in the physical world by Kip Larson from AWS IoT
Machine learning in the physical world by  Kip Larson from AWS IoTMachine learning in the physical world by  Kip Larson from AWS IoT
Machine learning in the physical world by Kip Larson from AWS IoT
 
Envisioning the Future Enterprise
Envisioning the Future EnterpriseEnvisioning the Future Enterprise
Envisioning the Future Enterprise
 
How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...How to Revamp your Legacy Applications For More Agility and Better Service - ...
How to Revamp your Legacy Applications For More Agility and Better Service - ...
 
How can your business benefit from going serverless?
How can your business benefit from going serverless?How can your business benefit from going serverless?
How can your business benefit from going serverless?
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
 
Enterprise Cloud Adoption
Enterprise Cloud Adoption Enterprise Cloud Adoption
Enterprise Cloud Adoption
 
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
Amazon Kinesis - Building Serverless real-time solution - Tel Aviv Summit 2018
 
Digital Transformation using Predictive Analytics and Big Data
Digital Transformation using Predictive Analytics and Big DataDigital Transformation using Predictive Analytics and Big Data
Digital Transformation using Predictive Analytics and Big Data
 
Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data ArchitectureGet to Know Your Customers - Build and Innovate with a Modern Data Architecture
Get to Know Your Customers - Build and Innovate with a Modern Data Architecture
 

Último

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Último (20)

GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Market Propensity Modeling Using XSTREAMS

  • 2. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. About XSTREAMS Why XSTREAMS? XSTREAMS Architecture Technology Stack Adv. Market Propensity Legacy Design & Issues Feature Engineering Modelling on XStreams ??? Agenda
  • 3. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. XSTREAMS: Drag and Drop Self Serve Platform 50+ Data Processing Operators 45+ Transformer and Estimators for Feature Engineering Train, Score and Evaluate Model in one Pipeline Marketplace for readily available blueprints.
  • 4. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. Why XSTREAMS? Sink for Error Handling Timeseries and Aggregated Metrics Actionable Alerts Scheduling Capabilities Auditing Support Versioning Support Granular Role Based Authorization Checkpointing Support. Common features automatically applied to all pipelines created
  • 5. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 5 Kafka Pubsub Flume RabbitMQ Amazon S3 HDFS Kinesis MQTT Data Sources HDFS Files Hive Kafka ElasticSearch Kinesis WebSocket Cassandra BigQuery PubSub Data Sinks Real Time Self Service ETL Platform Hadoop Distributions Native / VM / Cloud Cloudera MapR HDP High Level Architecture
  • 6. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 6 Source SinkUI/UX Service Layer Xstreams Core Real Time Olap Technology Stack
  • 7. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 7 . Marketing Propensity Business Use Case Predict user purchase trends across different signals like Brand, Price , Size based on custom and dynamic feature set that is composed on time based event product category , sub category , age and gender. Business Applications Search and Browse for Recommendation Enhance Browsing Experience Discount Optimization Campaign Management Ad Monetization Internal R &D (Eg. Acti Mirrors) Futuristic Shopping Experience Tagging Using App by Finding Customers
  • 8. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 8 . Overview of Sources/Features/EndPoints ClickStream Dataset CustomerId ProductId EventId Time Product Dataset Product Category Product Sub Category Brand Occasion Age , Gender , Colour , Size Demographic Dataset Gender Household Size Income #Children , Marital Status Education Source Tables and Attributes Features Product Category Product Sub Category Event Type Age Gender Time EndPoints/Singals Brand Price Size Occasion Colour
  • 9. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 9 . Legacy Modelling Steps Join ClickStream and Product Dataset Pivot and Vectorize Join ClickStream and Demographic Dataset Pivot and Vectorize Merge Above two Vectors Apply Binary Classification Logistic Regression Model
  • 10. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 10 . Legacy Modelling Issues Pivots creation was taking more than 20 hours on whole dataset. So sampled 5% dataset was used. Default Vector Operator was taking longer time. Modelling was done on subset of the features combinations. Skewed data for Purchase and Non Purchase label.
  • 11. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. 11 . Feature Engineering Optimization on Xstreams Complete Dataset for 30 million customers was used since vectorization time was reduced from 18hours to 3 hours due to custom sparse vectorize operator. Removed skewed data label by redcing the non purchase by 1:10 ratio. Added unknown values for missing demographic values. All feature combination were not used instead of top 20.P1/P2 combination varied 60-2400
  • 12. Market Propensity Pipeline on XStreams (Live Demo)
  • 13. Copyright © 2015-2017 Exadatum Software Services Pvt. Ltd. Thank You! 13