BIG DATA
PLATFORM, TECHNOLOGY & TOOLS

Summary
• Intro – what is Big Data?
• Objectives
• Technology approach
• ETL, infrastructure, applications & tools
• Existing platforms and tools
• Evolution

What is Big Data?
• Big Data = 3V
  - High Volume
  - High Velocity
  - High Variety
• Includes: capture, curation, storage, search, sharing, transfer, analysis, visualization

Objectives
• Actionable analytics
  - A/B testing
  - Channel content automation and optimization
• Accountable marketing
  - Measure the impact of marketing initiatives
  - Using predictive technology
• Creative discovery
  - Using BI tools
  - Explore what questions could be asked

Brand Ecosystem
VOLUME / VELOCITY / VARIETY
• Web & E-commerce
• Social Media
• Mobile Applications
• Ad Serving
• Data & CRM
• Platforms & Services

Connecting the dots – Big Data Platform
BIG DATA PLATFORM
• Events and Data Capturing
• Distributed Infrastructure
• Platforms & Tools
• Analytics & Reporting
• Automation & Optimization

Big Data - High Level System Architecture
Brand Ecosystem: Web Platforms, Social Media, Mobile Applications, Ad Serving, Data & CRM, Platforms & Services
Events and Data Capturing – Tracking, Logging, ETL
Distributed Infrastructure
Platforms & Tools
Analytics & Reporting
Automation & Optimization

Big Data - Data Flow & Tools
DATA SOURCES
• Unstructured Data
• Log Files
• Exhaust Data
• Social Media
• Sensors, Devices
• DB Data
LOG SERVICES
DATA WAREHOUSE
ANALYTICS
REAL TIME DATA STORAGE
REPORTING
d3.js
AUTOMATION, OPTIMIZATION
Real Time APIs
A/B Testing

Big Data Roles
• Program manager
  - Project scope definition and planning, delivery, documentation and circulation of an end-to-end plan, driving a unified message to all stakeholders, providing actionable detail on future requirements, presenting program status and issues
• Infrastructure
  - IT Administrators – cluster configuration, management and maintenance
• Software
  - Software Engineers – programming and technical analysis for the main Big Data solution and related products
  - Software Architects – solution and application architecture for all related products (ETL, data warehouse, real time databases, platforms and tools)
  - Data Architects – distributed data storage architecture, database architecture for related platforms and tools
  - BI Developers – programming for distributed queries, predictive analysis tools, automation tools
• Analysis
  - Data Analysts – data analysis, reporting tools, cross-platform data analysis
  - BI Analysts – predictive multichannel analysis, BI tools
  - Data Scientists – Big Data algorithms for BI and predictive models

Big Data Components
• Events and Data Capturing
• Distributed Infrastructure
• Platforms & Tools
• Reporting & Analytics
• Automation & Optimization

Brand Ecosystem: Web & Ecommerce, Social Media, Mobile Applications, Ad Serving, Data & CRM, Platforms & Services
Events and Data Capturing
Distributed Infrastructure
Platforms & Tools
Analytics & Reporting
Automation & Optimization

Events and Data Capturing
Every user action or state change on each client platform will be logged using a common structure (JSON format):
• USER: uid, reg_uid
  - Unique identifier for each end user
  - When a user is known (logged in, across multiple platforms), merge previous activity (events) into a single thread
• EVENT: tstamp, client_id, app_id, obj_id, event_id
  - When the event occurred
  - What event is logged (platform, object, event)
• CONTEXT: ip, uagent, referrer, qstring, geo_coords
  - User context (application used)
  - IP address and geo-location
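
The common structure above could be sketched roughly as follows. This is an illustration under assumptions, not the platform's actual schema: the field names come from the slide, but the top-level keys ("user", "event", "context"), the field types, and the helper names build_record, to_json_line and stitch are hypothetical.

```python
import copy
import json
from dataclasses import asdict, dataclass
from typing import Optional, Tuple


@dataclass
class User:
    uid: str                       # unique identifier for each end user
    reg_uid: Optional[str] = None  # filled in once the user is known (logged in)


@dataclass
class Event:
    tstamp: float   # when the event occurred (Unix seconds is an assumption)
    client_id: str  # assumed to identify the client platform
    app_id: str
    obj_id: str     # assumed to identify the object acted on
    event_id: str   # assumed to identify what happened


@dataclass
class Context:
    ip: str
    uagent: str
    referrer: str = ""
    qstring: str = ""
    geo_coords: Optional[Tuple[float, float]] = None  # (lat, lon), assumed order


def build_record(user: User, event: Event, context: Context) -> dict:
    """Group the three field sets under hypothetical top-level keys."""
    return {
        "user": asdict(user),
        "event": asdict(event),
        "context": asdict(context),
    }


def to_json_line(record: dict) -> str:
    """Serialize one record as a single JSON line for a log file."""
    return json.dumps(record, sort_keys=True)


def stitch(records: list, uid: str, reg_uid: str) -> list:
    """Return copies of records with reg_uid set on anonymous events for uid."""
    stitched = []
    for record in records:
        record = copy.deepcopy(record)
        if record["user"]["uid"] == uid and record["user"]["reg_uid"] is None:
            record["user"]["reg_uid"] = reg_uid
        stitched.append(record)
    return stitched
```

Here stitch illustrates the "merge previous activity" idea: once a login links uid to reg_uid, earlier anonymous events with the same uid can be read as part of the same thread.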

Events and Data Capturing
Additional data to be captured to complement user-related events and states, such as:
• Sales information
• Context information – weather, events, etc.
• Other relevant data
Data is stored using a common structure (JSON), somewhat similar to user events but related to the context or the business client, not the user.
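
A context or business record might look something like the sketch below. The slide only says the structure is JSON and similar to user events, so every key here ("record", "payload", "source", "kind") and the function name build_context_record are assumptions made for illustration.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def build_context_record(source: str, kind: str, payload: dict,
                         tstamp: Optional[datetime] = None) -> dict:
    """Build a record that describes context or business data, with no user group."""
    if tstamp is None:
        tstamp = datetime.now(timezone.utc)
    return {
        "record": {
            "tstamp": tstamp.isoformat(),  # when the observation applies
            "source": source,              # hypothetical: where the data came from
            "kind": kind,                  # hypothetical: e.g. "sales" or "weather"
        },
        "payload": payload,                # free-form data for this kind
    }


# Example: a weather observation and a daily sales figure (made-up values).
noon = datetime(2014, 1, 15, 12, 0, tzinfo=timezone.utc)
weather = build_context_record("weather-feed", "weather",
                               {"city": "Montreal", "temp_c": -12}, noon)
sales = build_context_record("pos-export", "sales",
                             {"store": "S-01", "units": 42}, noon)
lines = [json.dumps(r, sort_keys=True) for r in (weather, sales)]
```

Keeping a shared timestamp field makes it possible to line these records up with user events from the same period during analysis.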


Events and Data Capturing
Shared libraries and protocols to be used across all platforms

LIBRARIES: Browser client library, Mobile client libraries, Log files import, Data import
PLATFORMS: Web & Ecommerce, Social Media, Mobile Applications, Ad Serving, Data & CRM, Platforms & Services
[Table: check marks showing which library applies to which platform; cell positions are not recoverable from the extracted text]

Distributed Infrastructure

Platforms & Tools
• CRM Marketing View
• Media Publishing Platform
• Other Platforms & Tools – related to social media, loyalty platforms, ecommerce, CRM, etc.

Platforms & Tools – CRM Marketing View
• Segment CRM users based on a group/segment definition schema
• Generic admin interface for managing segments and quality control
• Generic solution for any CRM platform
• Simplify CRM operations
• Simplify custom CRM dashboards and reports
• Integrates smoothly with other Big Data components
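
A group/segment definition schema could be as simple as named lists of rules evaluated against a user's CRM attributes. The sketch below is one possible shape, not the product's schema: the rule format (field, operator, value), the operator names, the sample segments and the attribute names are all assumptions.

```python
import operator

# Hypothetical operator vocabulary for segment rules.
OPS = {
    "eq": operator.eq,
    "ne": operator.ne,
    "gt": operator.gt,
    "gte": operator.ge,
    "lt": operator.lt,
    "lte": operator.le,
    "in": lambda value, options: value in options,
}

# Hypothetical segment definitions: every rule in a list must hold.
SEGMENTS = {
    "high_value": [("lifetime_spend", "gte", 500)],
    "lapsed_mobile": [
        ("days_since_last_visit", "gt", 90),
        ("channel", "eq", "mobile"),
    ],
}


def matches(user: dict, rules: list) -> bool:
    """True if the user has every referenced field and satisfies every rule."""
    for field, op, value in rules:
        if field not in user or not OPS[op](user[field], value):
            return False
    return True


def segments_for(user: dict, definitions: dict = SEGMENTS) -> list:
    """Return the sorted names of all segments the user belongs to."""
    return sorted(name for name, rules in definitions.items()
                  if matches(user, rules))
```

Because the definitions are plain data, a generic admin interface could edit them without code changes, and the same evaluator could run against records from any CRM platform once they are mapped to a common attribute dictionary.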

Platforms & Tools – Publishing Platform
• Generic scalable platform
• Easily add any type of input
• Manage real-time aggregation rules
• Automatically publish live banners, ads, etc.
• A/B testing for output media
• Integration with CRM and live feeds
• Integration with other Big Data components
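
One way to picture a real-time aggregation rule is a sliding-window count over incoming items, whose current leader is then published to a banner. The slides do not describe how the platform aggregates, so the class below is a hypothetical sketch; it also assumes events arrive in non-decreasing timestamp order.

```python
from collections import Counter, deque


class WindowedTopItems:
    """Count items seen within the last window_seconds and report the leaders."""

    def __init__(self, window_seconds: float):
        self.window = window_seconds
        self.events = deque()    # (tstamp, item) pairs, oldest first
        self.counts = Counter()  # item -> occurrences inside the window

    def add(self, tstamp: float, item: str) -> None:
        """Record one occurrence, then drop events that fell out of the window."""
        self.events.append((tstamp, item))
        self.counts[item] += 1
        self._expire(tstamp)

    def _expire(self, now: float) -> None:
        cutoff = now - self.window
        while self.events and self.events[0][0] <= cutoff:
            _, old_item = self.events.popleft()
            self.counts[old_item] -= 1
            if self.counts[old_item] == 0:
                del self.counts[old_item]

    def top(self, n: int = 1) -> list:
        """Return up to n items with the highest counts in the window."""
        return [item for item, _ in self.counts.most_common(n)]
```

A publishing step could poll top() and swap the live banner whenever the leading item changes.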

Platforms & Tools – Top Voice
• Social media brand influence platform
• Real time data synchronization
• Scalable infrastructure & services

Analytics & Reporting
• Big Data Ultimate Dashboards
• Trends & Semantic Analysis
• BI Applications & Tools


Analytics & Reporting – Dashboards
• Tableau Software platform
• A leader in data visualization
• Connects with relational databases
• Connects with data stores such as Hadoop, Google BigQuery, HP Vertica
• Rich and interactive dashboards and reports

Analytics & Reporting – Sentiment Analysis
• Nexalogy
• Processes unstructured text data
• Easily connects with social, CRM or any other brand proprietary data
• Finds relevant streams of conversations and data

Analytics & Reporting – BI Applications
• BI Tools
• Sophisticated reports and correlations
• Predictive technology
• Software solutions such as Mahout, HP Vertica, R, Platfora, Datameer, SAS, SPSS, PSPP, Pivotal

Automation & Optimization
• Automation services and processes
• Dynamic and personalized offers and content on websites, social media, mobile, ad banners, etc.
• Feed Big Data analytics into live input channel applications
• A/B testing
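
For A/B testing across channels, each user should see the same variant wherever they show up. A common way to get that without shared state is to hash a stable user identifier together with the experiment name. The sketch below assumes the uid from the event structure is that identifier; the function name assign_variant and the experiment names are hypothetical.

```python
import hashlib


def assign_variant(uid: str, experiment: str, variants=("A", "B")) -> str:
    """Deterministically map a user to one variant of an experiment."""
    key = f"{experiment}:{uid}".encode("utf-8")
    digest = hashlib.sha256(key).hexdigest()
    # Interpret the hash as an integer and bucket it by the number of variants.
    return variants[int(digest, 16) % len(variants)]


# The same user always lands in the same bucket for a given experiment.
first = assign_variant("u-1", "homepage-banner")
second = assign_variant("u-1", "homepage-banner")
```

Including the experiment name in the hash keeps assignments independent between experiments, so a user in variant A of one test is not systematically in variant A of the next.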

Big Data – Client Facing Tools
• Platforms and tools
  - Media Publishing Platform for real-time content automation
  - CRM Marketing View for cross-platform state marketing
  - Other tools integrated with Big Data
• Analytics & Reporting
  - Big Data Ultimate Dashboards using Tableau Software
  - Predictive models for content and campaign optimization
  - Possibility to expose query tools directly to end users
Big Data

BEFORE

AFTER

