SlideShare una empresa de Scribd logo
1 de 20
Titolo presentazione
sottotitolo
Milano, XX mese 20XX
A Big Data Analysis Framework for
Model-Based Web User Behavior
Analytics
Carlo Bernaschina, Marco Brambila, Andrea Mauri, Eric Umuhoza
6th June 2017
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
Web analytics has become the tool of choice to inform both
business users and designers.
Several tools exist that support analysis of Web server logs and
extract information on application usage.
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
DEIB. Data Science LabICWE2017 – June 6th 2017
Context
Web analytics has become the tool of choice towards taking
informed business and interaction design decisions.
Several tools exist that support analysis of Web server logs and
extract information on application usage.
• Usually unaware of the design structure and the actual
content managed by the application.
DEIB. Data Science LabICWE2017 – June 6th 2017
Objective
Provide valuable insights to designers and decision makers
Integration of two approaches:
• Web Log Analytics
• Model Driven Development
DEIB. Data Science LabICWE2017 – June 6th 2017
Model Driven Development
Software development paradigm where the models are the main
artefacts of the development process.
Main benefits:
• Model once, generate for any platform of choice
• Validation of the requirements
In this specific case:
• The model includes the structure of the web application.
DEIB. Data Science LabICWE2017 – June 6th 2017
Interaction Flow Modeling Language
Interaction Flow Modeling Language (IFML) is designed for
expressing the content, user interaction and control behavior of
the front-end of software applications
DEIB. Data Science LabICWE2017 – June 6th 2017
Overview
Application Server
Data Analyser
Storage
Analyzer
(Spark)
Code
Generator
Webratio
IFML Editor
(Enriched Analytics Model)
Web Server LogRTXLog
ModelCode
Data Visualization
Analysis
(b)
(a,f)
(d)
(e)
Application
DB
Web Server
(Tomcat)
Database
(c)
Modeling
Application
Deployment
Analysis Visualization
Execution
DEIB. Data Science LabICWE2017 – June 6th 2017
Logs Integration
(d)
(c)
(b)
Model
RTXLog
Web Server Log
Database
RTXLog
WebServerLog
Model
Database
EnrichedLog
GlobalLog
FinalLog
RTX.sessionId = WebServer.sessionId AND
RTX.timestamp = WebServer.timestamp
EnrichedLog.elementId = Model.elementId
GlobalLog.tabelName = Database.tableName
AND Database.attributeName=“OID”
AND GlobalLog.instanceID = Database,value
(a)
(a)
(a)
(a)
A denormalized view of the logs
DEIB. Data Science LabICWE2017 – June 6th 2017
Navigation Based Analyses
Includes information regarding how the users navigate the Web
site
Examples:
• Entrance Rate
• Bounce Rate
• Page Visit
• Residence Time
• Link Navigation
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Content Based Analyses
Comprehends information regarding the domain entities involved
in the user interaction, their types and their semantics.
Example (e-commerce website)
• Top K Visualized Books
• Top K Visualized Authors
• Top K Clicked Books
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Structure Based Analyses
Comprehends information regarding the kind of widget,
visualization, or even navigation pattern used in the user
interaction.
Example:
• Top K elements clicked by users when shown in a map
throughout the site
• Top K elements clicked when shown in the first three positions
of a list
• Top K elements clicked when an attribute of type image is
shown in the page versus an attribute of type currency.
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Data Visualization Tool
Traditional charts like pie charts, bar charts, navigation flow
charts and so on..
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor
Three types of visualization:
• Color: the analytics is shown through the change of color of
the corresponding model element
• Label: the analytics is shown with a label on the
corresponding model element
• Properties: the analytics is shown in a separate property panel
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Color
• Residence Time
• Page visit
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Label
• Link Out Ratio
• Link In Ratio
• Top 1 Clicked Entity
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Visual Feedback on Model Editor - Properties
• Top k clicked
entities
• Top k visualized
entities
• …
DEIB. Data Science LabICWE2017 – June 6th 2017
Future Work
Do not stop at the visualization!
• Effects of Model Restructuring on User Behavior
• Conversion Rate
• Number of user who reach payment Module
• Conversion Path: the path (with high contribution)
leading to conversion rate
• Model Optimization
DEIB. Data Science LabICWE2017 – June 6th 2017
Thanks for your attention
Questions?
Contact: andrea.mauri@polimi.it
Tool: www.ifmledit.org
Further Info: http://datascience.deib.polimi.it/bigdata-modeling-
weblogs/

Más contenido relacionado

La actualidad más candente

Data Visualizations with D3.js
Data Visualizations with D3.jsData Visualizations with D3.js
Data Visualizations with D3.jsBrian Greig
 
Resume xiaodan(vinci)
Resume xiaodan(vinci)Resume xiaodan(vinci)
Resume xiaodan(vinci)vinci105
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copySlava Kokaev
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...Big Data Value Association
 
This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017Neo4j
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Access database
Access databaseAccess database
Access databasephaocake
 
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & AnalyticsDataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & AnalyticsDr. Arif Wider
 
Building business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePointBuilding business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePointMicrosoft Tech Community
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLDATAVERSITY
 
Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slidesARDC
 
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data LakesAnalytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data LakesProvectus
 
Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nycMarty Loughlin
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Cambridge Semantics
 
20180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-120180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-1Duc Lai Trung Minh
 
Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26Martin Bém
 
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGraph-TA
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected Data World
 

La actualidad más candente (20)

Data Visualizations with D3.js
Data Visualizations with D3.jsData Visualizations with D3.js
Data Visualizations with D3.js
 
Resume xiaodan(vinci)
Resume xiaodan(vinci)Resume xiaodan(vinci)
Resume xiaodan(vinci)
 
Introduction BI Semantic Model with Sql Server Data Tools copy
Introduction BI Semantic Model with Sql Server Data Tools   copyIntroduction BI Semantic Model with Sql Server Data Tools   copy
Introduction BI Semantic Model with Sql Server Data Tools copy
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
 
This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017This week in Neo4j - 21st October 2017
This week in Neo4j - 21st October 2017
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016
 
Access database
Access databaseAccess database
Access database
 
Lju Lazarevic
Lju LazarevicLju Lazarevic
Lju Lazarevic
 
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & AnalyticsDataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
DataDevOps: A Manifesto for a DevOps-like Culture Shift in Data & Analytics
 
Building business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePointBuilding business processes with content in OneDrive and SharePoint
Building business processes with content in OneDrive and SharePoint
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slides
 
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data LakesAnalytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
Analytical Systems Evolution: From Excel to Big Data Platforms and Data Lakes
 
Edmc use cases 2018 nyc
Edmc use cases 2018   nycEdmc use cases 2018   nyc
Edmc use cases 2018 nyc
 
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020Fireside Chat with Bloor Research: State of the Graph Database Market 2020
Fireside Chat with Bloor Research: State of the Graph Database Market 2020
 
20180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-120180901 - 9th Meeting - Power BI Module 0-1
20180901 - 9th Meeting - Power BI Module 0-1
 
Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26Prague data management meetup 2017-09-26
Prague data management meetup 2017-09-26
 
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMSGRAPHITE — An Extensible Graph Traversal Framework for RDBMS
GRAPHITE — An Extensible Graph Traversal Framework for RDBMS
 
Connected data meetup group - introduction & scope
Connected data meetup group - introduction & scopeConnected data meetup group - introduction & scope
Connected data meetup group - introduction & scope
 

Similar a A Big Data Analysis Framework for Model-Based Web User Behavior Analytics

App dev and partner ecosystem for pink social connections 2017
App dev and partner ecosystem for pink   social connections 2017App dev and partner ecosystem for pink   social connections 2017
App dev and partner ecosystem for pink social connections 2017Heath McCarthy
 
Turning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem PinkTurning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem PinkLetsConnect
 
Calling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with PinkCalling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with PinkLetsConnect
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningKai Wähner
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021Axel Reichwein
 
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...LetsConnect
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDimitri Volkmann
 
SPFx- A modern development model for SharePoint
SPFx- A modern development model  for SharePointSPFx- A modern development model  for SharePoint
SPFx- A modern development model for SharePointKirti Prajapati
 
Choosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_FinalChoosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_FinalHeather Choi
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at ScaleWalid Mehanna
 
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi PlatformBuild Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi PlatformAgus Suhartono
 
Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms TIBCO Jaspersoft
 
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j GraphDay Seattle- Sept19-  in the enterpriseNeo4j GraphDay Seattle- Sept19-  in the enterprise
Neo4j GraphDay Seattle- Sept19- in the enterpriseNeo4j
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityOpen Cyber University of Korea
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"CSols, Inc.
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise ArchitectsNeo4j
 

Similar a A Big Data Analysis Framework for Model-Based Web User Behavior Analytics (20)

App dev and partner ecosystem for pink social connections 2017
App dev and partner ecosystem for pink   social connections 2017App dev and partner ecosystem for pink   social connections 2017
App dev and partner ecosystem for pink social connections 2017
 
Turning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem PinkTurning the IBM Collaboration Ecosystem Pink
Turning the IBM Collaboration Ecosystem Pink
 
Research Paper
Research PaperResearch Paper
Research Paper
 
Calling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with PinkCalling all Developers: Building Connections Apps and Integrating with Pink
Calling all Developers: Building Connections Apps and Integrating with Pink
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine LearningData Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
 
Koneksys Presentation March 2021
Koneksys Presentation March 2021Koneksys Presentation March 2021
Koneksys Presentation March 2021
 
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
Announcing the Connections Cloud Catalog: How to Get new Apps fresh out of th...
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoT
 
SPFx- A modern development model for SharePoint
SPFx- A modern development model  for SharePointSPFx- A modern development model  for SharePoint
SPFx- A modern development model for SharePoint
 
Choosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_FinalChoosing a Data Visualization Tool for Data Scientists_Final
Choosing a Data Visualization Tool for Data Scientists_Final
 
Data & Analytics at Scale
Data & Analytics at ScaleData & Analytics at Scale
Data & Analytics at Scale
 
Analyst Toolbox August 2017
Analyst Toolbox August 2017Analyst Toolbox August 2017
Analyst Toolbox August 2017
 
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi PlatformBuild Business Web Applications with PHPOpenbiz Framework and Cubi Platform
Build Business Web Applications with PHPOpenbiz Framework and Cubi Platform
 
Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms Charting libraries vs BI & Analytics Platforms
Charting libraries vs BI & Analytics Platforms
 
Bridging the Gap
Bridging the GapBridging the Gap
Bridging the Gap
 
Neo4j GraphDay Seattle- Sept19- in the enterprise
Neo4j GraphDay Seattle- Sept19-  in the enterpriseNeo4j GraphDay Seattle- Sept19-  in the enterprise
Neo4j GraphDay Seattle- Sept19- in the enterprise
 
Proof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics InteroperabilityProof of Concept for Learning Analytics Interoperability
Proof of Concept for Learning Analytics Interoperability
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 

Más de Andrea Mauri

TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebAndrea Mauri
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsAndrea Mauri
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsAndrea Mauri
 
An explorative approach for Crowdsourcing tasks design
An explorative approach for Crowdsourcing tasks design �An explorative approach for Crowdsourcing tasks design �
An explorative approach for Crowdsourcing tasks design Andrea Mauri
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesAndrea Mauri
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based CrowdsourcingAndrea Mauri
 

Más de Andrea Mauri (6)

TripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the WebTripleWave: Spreading RDF Streams on the Web
TripleWave: Spreading RDF Streams on the Web
 
Model Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring ApplicationsModel Driven Development of Social Media Environmental Monitoring Applications
Model Driven Development of Social Media Environmental Monitoring Applications
 
Methodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applicationsMethodologies for the Development of Crowd and Social-based applications
Methodologies for the Development of Crowd and Social-based applications
 
An explorative approach for Crowdsourcing tasks design
An explorative approach for Crowdsourcing tasks design �An explorative approach for Crowdsourcing tasks design �
An explorative approach for Crowdsourcing tasks design
 
Community Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing QueriesCommunity Profiling for Crowdsourcing Queries
Community Profiling for Crowdsourcing Queries
 
Community-based Crowdsourcing
Community-based CrowdsourcingCommunity-based Crowdsourcing
Community-based Crowdsourcing
 

Último

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 

Último (20)

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 

A Big Data Analysis Framework for Model-Based Web User Behavior Analytics

  • 1. Titolo presentazione sottotitolo Milano, XX mese 20XX A Big Data Analysis Framework for Model-Based Web User Behavior Analytics Carlo Bernaschina, Marco Brambila, Andrea Mauri, Eric Umuhoza 6th June 2017
  • 2. DEIB. Data Science LabICWE2017 – June 6th 2017 Context Web analytics has become the tool of choice to inform both business users and designers. Several tools exist that support analysis of Web server logs and extract information on application usage.
  • 3. DEIB. Data Science LabICWE2017 – June 6th 2017 Context
  • 4. DEIB. Data Science LabICWE2017 – June 6th 2017 Context
  • 5. DEIB. Data Science LabICWE2017 – June 6th 2017 Context Web analytics has become the tool of choice towards taking informed business and interaction design decisions. Several tools exist that support analysis of Web server logs and extract information on application usage. • Usually unaware of the design structure and the actual content managed by the application.
  • 6. DEIB. Data Science LabICWE2017 – June 6th 2017 Objective Provide valuable insights to designers and decision makers Integration of two approaches: • Web Log Analytics • Model Driven Development
  • 7. DEIB. Data Science LabICWE2017 – June 6th 2017 Model Driven Development Software development paradigm where the models are the main artefacts of the development process. Main benefits: • Model once, generate for any platform of choice • Validation of the requirements In this specific case: • The model includes the structure of the web application.
  • 8. DEIB. Data Science LabICWE2017 – June 6th 2017 Interaction Flow Modeling Language Interaction Flow Modeling Language (IFML) is designed for expressing the content, user interaction and control behavior of the front-end of software applications
  • 9. DEIB. Data Science LabICWE2017 – June 6th 2017 Overview Application Server Data Analyser Storage Analyzer (Spark) Code Generator Webratio IFML Editor (Enriched Analytics Model) Web Server LogRTXLog ModelCode Data Visualization Analysis (b) (a,f) (d) (e) Application DB Web Server (Tomcat) Database (c) Modeling Application Deployment Analysis Visualization Execution
  • 10. DEIB. Data Science LabICWE2017 – June 6th 2017 Logs Integration (d) (c) (b) Model RTXLog Web Server Log Database RTXLog WebServerLog Model Database EnrichedLog GlobalLog FinalLog RTX.sessionId = WebServer.sessionId AND RTX.timestamp = WebServer.timestamp EnrichedLog.elementId = Model.elementId GlobalLog.tabelName = Database.tableName AND Database.attributeName=“OID” AND GlobalLog.instanceID = Database,value (a) (a) (a) (a) A denormalized view of the logs
  • 11. DEIB. Data Science LabICWE2017 – June 6th 2017 Navigation Based Analyses Includes information regarding how the users navigate the Web site Examples: • Entrance Rate • Bounce Rate • Page Visit • Residence Time • Link Navigation • …
  • 12. DEIB. Data Science LabICWE2017 – June 6th 2017 Content Based Analyses Comprehends information regarding the domain entities involved in the user interaction, their types and their semantics. Example (e-commerce website) • Top K Visualized Books • Top K Visualized Authors • Top K Clicked Books • …
  • 13. DEIB. Data Science LabICWE2017 – June 6th 2017 Structure Based Analyses Comprehends information regarding the kind of widget, visualization, or even navigation pattern used in the user interaction. Example: • Top K elements clicked by users when shown in a map throughout the site • Top K elements clicked when shown in the first three positions of a list • Top K elements clicked when an attribute of type image is shown in the page versus an attribute of type currency. • …
  • 14. DEIB. Data Science LabICWE2017 – June 6th 2017 Data Visualization Tool Traditional charts like pie charts, bar charts, navigation flow charts and so on..
  • 15. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor Three types of visualization: • Color: the analytics is shown through the change of color of the corresponding model element • Label: the analytics is shown with a label on the corresponding model element • Properties: the analytics is shown in a separate property panel
  • 16. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Color • Residence Time • Page visit • …
  • 17. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Label • Link Out Ratio • Link In Ratio • Top 1 Clicked Entity • …
  • 18. DEIB. Data Science LabICWE2017 – June 6th 2017 Visual Feedback on Model Editor - Properties • Top k clicked entities • Top k visualized entities • …
  • 19. DEIB. Data Science LabICWE2017 – June 6th 2017 Future Work Do not stop at the visualization! • Effects of Model Restructuring on User Behavior • Conversion Rate • Number of user who reach payment Module • Conversion Path: the path (with high contribution) leading to conversion rate • Model Optimization
  • 20. DEIB. Data Science LabICWE2017 – June 6th 2017 Thanks for your attention Questions? Contact: andrea.mauri@polimi.it Tool: www.ifmledit.org Further Info: http://datascience.deib.polimi.it/bigdata-modeling- weblogs/

Notas del editor

  1. Business user - > customer needs Designers -> good web applications
  2. Here there is an example of this kind of tools. It shows statistics such as: residence time, page views, bounce rate and so on..
  3. It also offers a more complex view is the ”behavior flow”, that represents how the user move inside the web application.
  4. Design structure -> what is actually present in the page -> hidden in the source code Content -> entities involved -> hidden in the database
  5. Include and expose the structure of the web application.
  6. The runtime component log (RTX) that stores events and data produced and consumed by the application runtime for serving page requests. It traces the history of the ViewContainers, ViewComponents, and operations that are executed, along with the executed queries. Enriched log: At this stage of the process we know all the elements involved in the users navigation, but only limited to their id. Global Log: information regarding to the model elements involved in the user interaction. From this global, denormalized view one can generate any desired behaviour analysis. In particular every statistics is computed with one or more SQL-like queries on the FinalLog .
  7. Color allows to have a heatmap-like view of the statistic
  8. Label allows to attach a single piece of information to a model element
  9. Properties allows to visualize complex statistics that would be too cumbersome to visualize directly on the model editor.