Seminario Big Data
Ing. Roberto Messora
Lecco, 23 Novembre 2017
Big Data, Analytics, AI, Machine Learning, Deep Learning
Executive Overview
Big Data & Analytics Reference Architectures Conceptual View
Big Data & Analytics Reference Architectures Logical View
Big Data & Analytics Reference Architectures Technological View
Analytics Overview and Case Studies
Event Store
Domain Model
Cloudera
Agenda
Big Data, Analytics, AI, Machine Learning, Deep Learning
Data is often considered to be the crown jewels of an organization.
1) Most companies already use analytics in the form of reports and dashboards to help run
their business. This is largely based on well structured data from operational systems that
conform to pre-determined relationships (“a single version of the truth”).
2) Big Data, however, doesn’t follow this structured model. The streams are all different and it
is difficult to establish common relationships. But with its diversity and abundance come
opportunities to learn and to develop new ideas – ideas that can help change the business
(“a single version of the facts”)
The architectural challenge is to bring the two paradigms together. So, rather than approach Big
Data as a new technology silo, an organization should strive to create a unified information
architecture – one that enables it to leverage all types of data, as situations demand, to
promptly satisfy business needs.
The objective of this workshop is to describe a reference architecture (and its implementation)
that promotes a unified vision for information management and analytics.
Executive Overview
The architecture is organized into views that highlight three focus areas:
1. unified information management
2. real-time analytics
3. intelligent processes
They represent architecturally significant capabilities that are important to most organizations
today.
Big Data & Analytics Reference Architectures Conceptual View
Unified Information Management addresses the need to manage information holistically as
opposed to maintaining independently governed silos. At a high level this includes:
o High Volume Data Acquisition – The system must be able to acquire data despite high
volumes, velocity, and variety. It may not be necessary to persist all data that is received.
o Multi-Structured Data Organization and Discovery – The ability to navigate and search across
different forms of data can be enhanced by the capability to organize data of different
structures into a common schema.
o Low Latency Data Processing – Data processing can occur at many stages of the architecture.
In order to support the processing requirements of Big Data, the system must be fast and
efficient.
o Single Version of the Truth – When two people perform the same form of analysis they
should get the same result. As obvious as this seems, it isn’t necessarily a small feat,
especially if the two people belong to different departments or divisions of a company. Single
version of truth requires architecture consistency and governance.
Unified Information Management
Real-Time Analytics enables the business to leverage information and analysis as events are
unfolding. At a high level this includes:
o Speed of Thought Analysis – Analysis is often a journey of discovery, where the results of one
query determine the content of the next. The system must support this journey in an
expeditious manner. System performance must keep pace with the users’ thought process.
o Interactive Dashboards – Interactive dashboards allow the user to immediately react to
information being displayed, providing the ability to drill down and perform root cause
analysis of situations at hand.
o Advanced Analytics – Advanced forms of analytics, including data mining, machine learning,
and statistical analysis enable businesses to better understand past activities and spot trends
that can carry forward into the future. Applied in real-time, advanced analytics can enhance
customer interactions and buying decisions, detect fraud and waste, and enable the business
to make adjustments according to current conditions.
o Event Processing – Real-time processing of events enables immediate responses to existing
problems and opportunities. It filters through large quantities of streaming data, triggering
predefined responses to known data patterns.
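As a minimal sketch of the event-processing idea above (filtering a stream and triggering predefined responses to known patterns), the Python below scans a sequence of events and produces a response whenever a rule matches. The event fields, the rule and the threshold are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Tuple

Event = Dict[str, object]  # hypothetical event shape: {"id", "type", "amount"}

# A rule pairs a predicate (the known pattern) with a predefined response.
Rule = Tuple[Callable[[Event], bool], Callable[[Event], str]]


def process(events: Iterable[Event], rules: List[Rule]) -> Iterator[str]:
    """Yield one response for each (event, rule) pair whose pattern matches."""
    for event in events:
        for matches, respond in rules:
            if matches(event):
                yield respond(event)


# Hypothetical rule: flag payments above an illustrative threshold.
large_payment: Rule = (
    lambda e: e["type"] == "payment" and e["amount"] > 10_000,
    lambda e: f"review payment {e['id']}",
)

stream = [
    {"id": 1, "type": "payment", "amount": 120.0},
    {"id": 2, "type": "login", "amount": 0.0},
    {"id": 3, "type": "payment", "amount": 25_000.0},
]
alerts = list(process(stream, [large_payment]))
```

Because `process` is a generator, it could in principle consume an unbounded stream one event at a time rather than a finite list.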
Real-Time Analytics
A key objective for any Big Data and Analytics program is to execute business processes more
effectively and efficiently. This means channeling the intelligence one gains from analysis
directly into the processes that the business is performing. At a high level this includes:
o Application-Embedded Analysis – Many workers today can be classified as knowledge
workers; they routinely make decisions that affect business performance. Embedding analysis
into the applications they use helps them to make more informed decisions.
o Optimized Rules and Recommendations – With optimized rules and recommendations,
insight from analysis is used to influence the decision logic as the process is being executed.
o Guided User Navigation – Whenever possible the system should leverage the information
available in order to guide the user along the most appropriate path of investigation.
o Performance and Strategy Management – Analytics can also provide insight to guide and
support the performance and strategy management processes of a business. It can help to
ensure that strategy is based on sound analysis. Likewise, it can track business performance
versus objectives in order to provide insight on strategy achievement.
Intelligent Processes
Big Data & Analytics Reference Architectures Logical View
The high-level logical view defines a multi-tier architecture template that can be used to describe
many types of technology solutions.
This layer includes the hardware and platforms on which the Big Data and Analytics components run. As shared infrastructure, it can be used to support multiple concurrent implementations, in support of, or analogous to, Cloud Computing.
This layer includes infrastructure to support traditional databases, specialized Big Data management systems, and infrastructure that has been optimized for analytics.
At the bottom are data stores that have been commissioned for specific purposes (e.g. individual operational data stores, CMS, etc.). These data stores represent sources of data that are ingested (upward) into the Logical Data Warehouse (LDW). The LDW represents a collection of data that has been provisioned for historical and analytical purposes.
Above the LDW are components that provide processing and event detection for all forms of data.
At the top of the layer are components that virtualize all forms of data for universal consumption.
The Services Layer includes components that provide or perform commonly used services. Presentation Services and Information Services are types of Services in a Service Oriented Architecture (SOA). They can be defined, cataloged, used, and shared across solutions. Business Activity Monitoring, Business Rules, and Event Handling provide common services for the processing layer(s) above.
The Process Layer represents components that perform higher level processing activities. For the purpose of Big Data and Analytics, this layer calls out several types of applications that support analytical, intelligence gathering, and performance management processes.
The Interaction Layer is comprised of components used to support interaction with end users. Common artifacts for this layer include dashboards, reports, charts, graphs, and spreadsheets. In addition, this layer includes the tools used by analysts to perform analysis and discovery activities.
The results of analysis can be delivered via many different channels. The architecture calls out common IP network based channels such as desktops and laptops, common mobile network channels such as mobile phones and tablets, and other channels such as email, SMS, and hardcopy.
The architecture is supported by a number of components that affect all layers of the architecture. These include information and analysis modeling, monitoring, management, security, and governance.
Big Data & Analytics Reference Architectures Technological View
It lets you publish and subscribe to streams of records. In this respect it is similar to a message
queue or enterprise messaging system.
It lets you store streams of records in a fault-tolerant way.
It lets you process streams of records as they occur.
Apache Kafka
Apache Kafka™ is a distributed streaming platform.
Website: https://kafka.apache.org/
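To make the publish, store and process capabilities concrete, here is a toy in-memory model of an append-only log with per-consumer offsets. It is not the Kafka client API: the class and method names (`ToyLog`, `publish`, `poll`, `seek`) are invented for illustration, and there is no fault tolerance, partitioning or networking.

```python
from collections import defaultdict


class ToyLog:
    """In-memory stand-in for a single stream of records (illustration only)."""

    def __init__(self) -> None:
        self._records: list = []          # append-only record storage
        self._offsets = defaultdict(int)  # next offset to read, per consumer group

    def publish(self, record) -> int:
        """Append a record and return its offset."""
        self._records.append(record)
        return len(self._records) - 1

    def poll(self, group: str, max_records: int = 10) -> list:
        """Return unread records for a consumer group and advance its offset."""
        start = self._offsets[group]
        batch = self._records[start:start + max_records]
        self._offsets[group] = start + len(batch)
        return batch

    def seek(self, group: str, offset: int) -> None:
        """Move a group's offset so stored records can be read again."""
        self._offsets[group] = offset


log = ToyLog()
for reading in (21.5, 22.0, 23.1):
    log.publish(reading)

billing = log.poll("billing")               # each group reads the full stream
alerts = log.poll("alerts", max_records=2)  # groups progress independently
log.seek("billing", 0)                      # records are stored, so replay works
replayed = log.poll("billing")
```

The point of the sketch is the difference from a classic message queue: reading does not remove a record, so several subscribers can consume the same stream and any of them can rewind.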
Speed – up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk
Ease of use – API in Java, Scala, Python and R
Generality – powerful stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX and Spark Streaming
Runs Everywhere – Spark runs on Hadoop, Mesos, standalone, or in the cloud
Apache Spark
Apache Spark™ is a fast and general engine for large-scale data processing.
Website: http://spark.apache.org/
Reference Architectures - Hadoop Classic Batch Architecture
Characteristics:
• Batch oriented
• Massive Storage
• Multiuser jobs
• Data Warehouse Replacement
Reference Architectures – Lambda Architecture
Batch Layer: manages the master data set, an immutable, append-only set of raw data
Speed Layer: ingests streaming data or micro-batches and provides an «active partition» with a limited window of mutability
Serving Layer: stores the output from the batch and speed layers (BASE compliant)
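A minimal sketch of how the three layers could fit together, assuming a simple page-view counting job. The function names and the event shape are hypothetical; a real deployment would use a batch engine and a stream processor rather than in-process Python.

```python
from collections import Counter

# Master data set: immutable, append-only raw events (here, page names).
master_data = ("home", "cart", "home", "checkout")

# Events that arrived after the last batch run and are not yet in a batch view.
recent_events = ["home", "cart"]


def batch_view(events) -> Counter:
    """Batch layer: recompute the view from the full master data set."""
    return Counter(events)


def speed_view(events) -> Counter:
    """Speed layer: incremental view over the recent window only."""
    view = Counter()
    for event in events:
        view[event] += 1
    return view


def serve(batch: Counter, speed: Counter, key: str) -> int:
    """Serving layer: answer a query by merging both views."""
    return batch[key] + speed[key]


batch = batch_view(master_data)
speed = speed_view(recent_events)
home_views = serve(batch, speed, "home")
```

Even in this toy version the same counting logic exists twice, once per layer, which hints at the maintenance cost of running two code paths.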
Complexity
• Many moving parts
• Restatement is difficult
• Two code bases must be kept in sync
• Proper failure handling is complex
Reference Architectures – Kappa Architecture
Jay Kreps, a co-creator of Kafka and one of the first proponents of stream-based
architectures, jokingly called his alternative the “Kappa Architecture”.
Reference Architectures – «Fast Data» Architecture
There are more options today for where to deploy a solution than ever before. At a high level
the four options for deployment of architecture components are:
1) Public Cloud – In the public cloud model, a company rents resources from a third party. The
most advanced usage of public cloud is where the business functionality is provided by the
cloud provider (i.e., software-as-a-service). Public cloud might also be used as the platform
upon which the business functionality is built (i.e., platform-as-a-service), or the public
cloud may simply provide the infrastructure for the system (i.e., infrastructure-as-a-service).
2) Private Cloud - Private cloud is the same as public cloud, but the cloud is owned by a
company instead of being provided by a third party. Private clouds are ideal for hosting and
integrating very large data volumes while keeping data secure behind corporate firewalls.
3) Managed Services – In this model a company owns the components of the system, but
outsources some or all aspects of runtime operations.
4) Traditional IT – In this model a company owns and operates the system.
These various options for deployment are not mutually exclusive.
Deployment
Security
The keys to securing the enterprise Big Data platform are:
1) Authentication (Kerberos, LDAP, …)
2) Authorization (ACE, ACL, Sentry, …)
3) Encryption & Data Masking (Over-the-Wire Encryption, Encryption at Rest, Field-Level Encryption, Format-Preserving Encryption)
4) Auditing & Data Lineage
5) Disaster Recovery & Backup
Analytics - Data Science on Hadoop
Common Limitations
Analytics - Data Science
Notebooks combine code, output and narrative into a single document.
Notebooks
You can conduct an analysis by writing down code, results, ideas and thoughts.
You have multiple languages and versions in a single multi-tenant environment.
Easy to share
Easy version control
Data Science is the science of building data products.
OVERT DATA PRODUCTS
• Products where the data is clearly visible as part of the deliverable.
• Descriptive Analysis
• Dashboarding
• Reporting
COVERT DATA PRODUCTS
• Deliver results rather than data; data is hidden.
• Recommendation Engine
• …
Website: https://www.oreilly.com/ideas/evolution-of-data-products
Analytics - Data Science Data Products
BENEFITS
Analytics makes it possible to better manage the customer base and extract customer
value
Analyze customer profiles, behaviors and purchases and obtain a complete and strategic view
of the most recurrent customer behaviors
Develop a tailored proposition by customer segment to increase customer value along the
whole client lifecycle
Address marketing efforts based on customer insights and value
Drive consumer segments to exploit product portfolio at the right time of their customer
journey
Analytics will be carried out in order to offer actionable insights on
customers and will follow a multi-step approach
Business Objective & Question
Data Exploration/Understanding
• Simple exploratory analysis in order to understand the whole set of information available, identify problems in the data, and start observing relationships among variables.
• Use of data visualization techniques for exploring the set of information
Data Preparation
• Data is prepared for data mining and machine learning models
• Imputation of missing values, computation of new variables potentially useful for the business question, transformation of variables to make them meaningful for the problem to be solved
Modeling
• Models are implemented
• Available data is used and synthesized to answer the business question, by identifying relationships among target variable and input variables
• It may be a recursive process based also on sampling data and assessing models and results
Model Interpretation
• Model results are interpreted in order to be useful for business strategy and actions.
Business Actions
[Chart: Responders vs Non Responders; axes: % Accounts, Deciles]
[Figure: Customer Value over Customer Life Time, with business objectives that can be answered through specific statistical models and approaches]
OBJECTIVES
• Engage new customers: new customer identification and engagement
• Nurture & develop loyalty customers: clienteling & caring program
• Retain leaving customers: actions to retain leaving customers
ANALYTICAL MODEL
• Clickstream & Content Analysis
• Next Best Offer Analysis
• Segmentation (deterministic vs behavioural)
• Propensity Model
• Churn Model
Analysis: Propensity Models
What
The model assigns a propensity score to each customer and makes it possible to prioritize initiatives.
A propensity model makes it possible to estimate:
• Re-purchasing probability of customers
• Retargeting optimization: predict the likelihood of booking a flight for potential customers
• Up-selling propensity: reservation upgrade or ancillary services proposal
• Etc.
Why
Direct marketing investments to the customers with the highest propensity, in order to:
– Increase up-selling
– Increase cross-selling
– Increase active customers
– Increase redemption of marketing campaigns
Algorithms
• Regressions
• Decision Trees
• Random Forests
• Neural Networks
• Support Vector Machines
• Ensemble Models
• …
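As an illustration of scoring and prioritizing, the sketch below applies a logistic (regression-style) score to a few customers and ranks them from highest to lowest propensity. The feature names, coefficients and customer records are made up for the example and do not come from a fitted model.

```python
import math

# Hypothetical coefficients, standing in for an already-fitted logistic regression.
INTERCEPT = -2.0
WEIGHTS = {"purchases_last_year": 0.6, "opened_last_campaign": 1.2}


def propensity(customer: dict) -> float:
    """Return a score in (0, 1) via the logistic function."""
    z = INTERCEPT + sum(w * customer[name] for name, w in WEIGHTS.items())
    return 1.0 / (1.0 + math.exp(-z))


customers = [
    {"id": "A", "purchases_last_year": 0, "opened_last_campaign": 0},
    {"id": "B", "purchases_last_year": 4, "opened_last_campaign": 1},
    {"id": "C", "purchases_last_year": 2, "opened_last_campaign": 0},
]

# Rank customers from highest to lowest propensity to prioritize initiatives.
ranked = sorted(customers, key=propensity, reverse=True)
ranking = [c["id"] for c in ranked]
```

Any of the listed algorithm families could produce the score; only the final ranking step is shared across them.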
Analysis: Behavioural Segmentation
What
Behavioural segmentation follows a statistical clustering algorithm which:
• Identifies the most significant variables for the analysis
• Aggregates customers into mutually exclusive groups with similar behavioural patterns, by creating clusters whose members are as similar as possible
Customer affiliation to a specific cluster varies over time, based on the customer's behaviour
Why
• Get strategic insight on the customer base to increase loyalty and value
• Tailor contact strategy (“the right action for the right customer”)
• Enhance the website experience
• Increase the redemption rate for targeted marketing campaigns
Algorithms
• Data transformation
• Factor analysis
• Unsupervised Clustering models
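A minimal sketch of unsupervised clustering using plain k-means (Lloyd's algorithm) in standard-library Python. Choosing k-means is an assumption made for the example, since the material does not name a specific clustering algorithm. The two features (visits per month, average basket value), the data points and the starting centroids are invented.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Plain k-means: assign each point to its nearest centroid, then update."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # New centroid = mean of its members (keep the old one if empty).
        centroids = [
            tuple(sum(dim) / len(members) for dim in zip(*members))
            if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
    return centroids, clusters


# Hypothetical customers: (visits per month, average basket value).
customers = [(1, 20), (2, 25), (1, 22), (9, 80), (10, 95), (8, 90)]
centroids, clusters = kmeans(customers, centroids=[(0, 0), (10, 100)])
```

Each customer lands in exactly one cluster, which gives the mutually exclusive groups described above. In practice the features would first be put on a comparable scale, otherwise the variable with the largest range dominates the distance.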
Analysis: Churn Models
What
Churn analysis is a multivariate data mining technique that assigns a score to customer attrition.
It estimates the probability that a customer will stop buying from a company, either permanently or for a given period of time.
Historical data on customers leaving the company will be investigated in order to identify anticipatory signals. Information on flying behavior, enriched data (lifestyle, interests, motivation, SOW, price sensitivity) and customer hyper-profile will be used to compare churn vs loyal behavior.
Why
• Optimization of costs and marketing activities in customer retention
• Identification of high risk customers sorted by profitability
• Increase active customers
Algorithms
• Regressions
• Decision Trees
• Random Forests
• Neural Networks
• Support Vector Machines
• Ensemble Models
• …
The Database Log: the real database
The Event Store: an entity history
Event Driven Architecture: segregating Commands & Queries
Event Driven Architecture: Eventual Consistency
Event Driven Architecture: the Domain Model
Place the project's primary focus on the core domain and domain logic
Base complex designs on a model of the domain
Initiate a creative collaboration between technical and domain experts to iteratively refine a
conceptual model that addresses particular domain problems.
Concepts
– Context: the setting in which a word or statement appears that determines its meaning
– Domain: a sphere of knowledge (ontology), influence, or activity. The subject area to which
the user applies a program is the domain of the software
– Model: a system of abstractions that describes selected aspects of a domain and can be used
to solve problems related to that domain
– Ubiquitous Language: a language structured around the domain model and used by all team
members to connect all the activities of the team with the software
– Bounded context: explicitly define the context within which a model applies. Explicitly set
boundaries in terms of team organization, usage within specific parts of the application
– Context map: Identify each model in play on the project and define its bounded context. This
includes the implicit models of non-object-oriented subsystems. Name each bounded
context, and make the names part of the ubiquitous language. Describe the points of contact
between the models, outlining explicit translation for any communication
Domain Driven Design: Concepts
Entity: An object that is not defined by its attributes, but rather by a thread of continuity and its
identity
Value Object: an object that contains attributes but has no conceptual identity. They should be
treated as immutable
Aggregate: a collection of objects that are bound together by a root entity, otherwise known as
an aggregate root. The aggregate root guarantees the consistency of changes being made within
the aggregate by forbidding external objects from holding references to its members
Domain Event: a domain object that defines an event (something that happens). A domain
event is an event that domain experts care about
Service: when an operation does not conceptually belong to any object. Following the natural
contours of the problem, you can implement these operations in services
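The building blocks above can be sketched in Python with dataclasses. The `Order` aggregate, the `Money` value object and the `OrderLineAdded` event are hypothetical names chosen for the example; they are not part of the original material.

```python
from dataclasses import dataclass, field
from uuid import UUID, uuid4


@dataclass(frozen=True)
class Money:
    """Value object: defined only by its attributes, immutable."""
    amount: int      # minor units, e.g. cents
    currency: str


@dataclass(frozen=True)
class OrderLineAdded:
    """Domain event: something that happened and that domain experts care about."""
    order_id: UUID
    sku: str
    price: Money


@dataclass(eq=False)
class Order:
    """Aggregate root (an entity): identified by `id`, not by its attributes."""
    id: UUID = field(default_factory=uuid4)
    _lines: list = field(default_factory=list)
    events: list = field(default_factory=list)

    def add_line(self, sku: str, price: Money) -> None:
        # All changes go through the root so it can enforce consistency.
        if price.amount <= 0:
            raise ValueError("price must be positive")
        self._lines.append((sku, price))
        self.events.append(OrderLineAdded(self.id, sku, price))

    @property
    def lines(self) -> tuple:
        # Hand out a read-only copy so callers hold no reference to members.
        return tuple(self._lines)

    def __eq__(self, other) -> bool:
        # Entity equality follows identity, not attribute values.
        return isinstance(other, Order) and self.id == other.id


order = Order()
order.add_line("SKU-1", Money(1999, "EUR"))
```

Two `Money` instances with the same amount and currency compare equal, while two `Order` instances are equal only when they share an `id`, which mirrors the value object versus entity distinction.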
Domain Driven Design: Building Blocks
Domain Model: Service
Bounded Contexts: coordinate Domain Models
Cloudera Product Mapping View
Cloudera Manager
Cloudera Manager is an end-to-end application for managing CDH clusters. Cloudera Manager sets the
standard for enterprise deployment by delivering granular visibility into and control over every part of
the CDH cluster—empowering operators to improve performance, enhance quality of service,
increase compliance and reduce administrative costs.
Cloudera Navigator
Cloudera Navigator Optimizer
How can you assess the risk and true cost of offloading ETL and analytic workloads and understand
what it takes to get there?
o Cloudera Navigator Optimizer gives you the insights and risk-assessments you need to build out
a comprehensive strategy for Hadoop success. Simply upload your existing SQL workloads to
get started, and Navigator Optimizer will identify relative risks and development costs for
offloading these to Hadoop based on compatibility and complexity.
o To efficiently optimize performance for the latest technologies, like Hive and Impala, you need
visibility into what users are doing with the data and when the queries themselves are to
blame. Cloudera Navigator Optimizer gives you that visibility and lets you focus optimization
efforts on critical areas and best practices.
Cloudera Security
Cloudera Data Science Workbench
What is Cloudera Data Science Workbench?
Data Science on Hadoop
Cloudera Data Science Workbench Architecture
Cloudera Product Mapping View
Cloudera Enterprise is available on a subscription basis in five editions, each designed for your
specific needs.
– Essentials provides superior support and advanced management for core Apache Hadoop
– Data Science and Engineering for programmatic preparation and predictive modeling
– Operational DB for online applications and real-time serving
– Analytic DB for BI and SQL analytics
– The Enterprise Data Hub gives you everything you need to become information-driven, with
complete use of the platform.
Data Engineering in the Cloud
BI/Analytics in the Cloud
67

Más contenido relacionado

La actualidad más candente

Rebooting BI and EPM [english]
Rebooting BI and EPM [english]Rebooting BI and EPM [english]
Rebooting BI and EPM [english]Jean-Michel Franco
 
intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0Anthony Paulus
 
Data-Related Presentations
Data-Related PresentationsData-Related Presentations
Data-Related PresentationsAlan McSweeney
 
Basic analtyics & advanced analtyics
Basic analtyics & advanced analtyicsBasic analtyics & advanced analtyics
Basic analtyics & advanced analtyicsDEEPIKA T
 
Business intelligence concepts & application
Business intelligence concepts & applicationBusiness intelligence concepts & application
Business intelligence concepts & applicationnandini patil
 
Business Intelligence Key Factors - How to successfully help decision making ...
Business Intelligence Key Factors - How to successfully help decision making ...Business Intelligence Key Factors - How to successfully help decision making ...
Business Intelligence Key Factors - How to successfully help decision making ...Cristian Golban
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Optimus BT
 
Business Intelligence Module 1
Business Intelligence Module 1Business Intelligence Module 1
Business Intelligence Module 1Home
 
The Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global CustodiansThe Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global CustodiansCognizant
 
Data mining Introduction
Data mining IntroductionData mining Introduction
Data mining IntroductionVijayasankariS
 
How to Create and Manage a Successful Analytics Organization
How to Create and Manage a Successful Analytics OrganizationHow to Create and Manage a Successful Analytics Organization
How to Create and Manage a Successful Analytics OrganizationDATAVERSITY
 
Business Intelligence 3.0 Revolution
Business Intelligence 3.0 RevolutionBusiness Intelligence 3.0 Revolution
Business Intelligence 3.0 Revolutionwww.panorama.com
 
MIS: Business Intelligence
MIS: Business IntelligenceMIS: Business Intelligence
MIS: Business IntelligenceJonathan Coleman
 
Business intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationBusiness intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationMuthu Natarajan
 

La actualidad más candente (19)

Rebooting BI and EPM [english]
Rebooting BI and EPM [english]Rebooting BI and EPM [english]
Rebooting BI and EPM [english]
 
intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0intro_to_business_analytics_and_data_science_ver 1.0
intro_to_business_analytics_and_data_science_ver 1.0
 
Data-Related Presentations
Data-Related PresentationsData-Related Presentations
Data-Related Presentations
 
Basic analtyics & advanced analtyics
Basic analtyics & advanced analtyicsBasic analtyics & advanced analtyics
Basic analtyics & advanced analtyics
 
Business process based analytics
Business process based analyticsBusiness process based analytics
Business process based analytics
 
Business intelligence concepts & application
Business intelligence concepts & applicationBusiness intelligence concepts & application
Business intelligence concepts & application
 
BI Presentation
BI PresentationBI Presentation
BI Presentation
 
Business Intelligence Key Factors - How to successfully help decision making ...
Business Intelligence Key Factors - How to successfully help decision making ...Business Intelligence Key Factors - How to successfully help decision making ...
Business Intelligence Key Factors - How to successfully help decision making ...
 
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
Business Intelligence, Portals, Dashboards and Operational Matrix with ShareP...
 
Business Intelligence Module 1
Business Intelligence Module 1Business Intelligence Module 1
Business Intelligence Module 1
 
The Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global CustodiansThe Economic Value of Data: A New Revenue Stream for Global Custodians
The Economic Value of Data: A New Revenue Stream for Global Custodians
 
Data mining Introduction
Data mining IntroductionData mining Introduction
Data mining Introduction
 
Gartner Predicts 2018
Gartner Predicts 2018Gartner Predicts 2018
Gartner Predicts 2018
 
Unit 4 Advanced Data Analytics
Unit 4 Advanced Data AnalyticsUnit 4 Advanced Data Analytics
Unit 4 Advanced Data Analytics
 
How to Create and Manage a Successful Analytics Organization
How to Create and Manage a Successful Analytics OrganizationHow to Create and Manage a Successful Analytics Organization
How to Create and Manage a Successful Analytics Organization
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Business Intelligence 3.0 Revolution
Business Intelligence 3.0 RevolutionBusiness Intelligence 3.0 Revolution
Business Intelligence 3.0 Revolution
 
MIS: Business Intelligence
MIS: Business IntelligenceMIS: Business Intelligence
MIS: Business Intelligence
 
Business intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationBusiness intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data Visualization
 

Seminario Big Data

  • 1. Seminario Big Data (Big Data Seminar). Ing. Roberto Messora. Lecco, 23 November 2017
  • 2. Agenda
      o Big Data, Analytics, AI, Machine Learning, Deep Learning
      o Executive Overview
      o Big Data & Analytics Reference Architectures Conceptual View
      o Big Data & Analytics Reference Architectures Logical View
      o Big Data & Analytics Reference Architectures Technological View
      o Analytics Overview and Case Studies
      o Event Store
      o Domain Model
      o Cloudera
  • 4. Big Data, Analytics, AI, Machine Learning, Deep Learning
  • 5. Big Data, Analytics, AI, Machine Learning, Deep Learning
  • 6. Big Data, Analytics, AI, Machine Learning, Deep Learning
  • 8. Executive Overview
      Data is often considered to be the crown jewels of an organization.
      1) Most companies already use analytics in the form of reports and dashboards to help run their business. This is largely based on well-structured data from operational systems that conform to pre-determined relationships (“a single version of the truth”).
      2) Big Data, however, doesn’t follow this structured model. The streams are all different and it is difficult to establish common relationships. But with its diversity and abundance come opportunities to learn and to develop new ideas – ideas that can help change the business (“a single version of the facts”).
      The architectural challenge is to bring the two paradigms together. So, rather than approach Big Data as a new technology silo, an organization should strive to create a unified information architecture – one that enables it to leverage all types of data, as situations demand, to promptly satisfy business needs.
      The objective of this workshop is to describe a reference architecture (and its implementation) that promotes a unified vision for information management and analytics.
  • 11. Big Data & Analytics Reference Architectures Conceptual View
      The architecture is organized into views that highlight three focus areas:
      1. unified information management
      2. real-time analytics
      3. intelligent processes
      They represent architecturally significant capabilities that are important to most organizations today.
  • 12. Unified Information Management
      Unified Information Management addresses the need to manage information holistically as opposed to maintaining independently governed silos. At a high level this includes:
      o High Volume Data Acquisition – The system must be able to acquire data despite high volumes, velocity, and variety. It may not be necessary to persist all data that is received.
      o Multi-Structured Data Organization and Discovery – The ability to navigate and search across different forms of data can be enhanced by the capability to organize data of different structures into a common schema.
      o Low Latency Data Processing – Data processing can occur at many stages of the architecture. In order to support the processing requirements of Big Data, the system must be fast and efficient.
      o Single Version of the Truth – When two people perform the same form of analysis they should get the same result. As obvious as this seems, it isn’t necessarily a small feat, especially if the two people belong to different departments or divisions of a company. Single version of truth requires architecture consistency and governance.
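The idea of organizing data of different structures into a common schema can be illustrated with a small Python sketch. It is not part of the original slides: the `CustomerEvent` schema, the two source formats, and the field names are all hypothetical and chosen only to show the principle.

```python
import csv
import io
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomerEvent:
    """Hypothetical common schema shared by all sources."""
    customer_id: str
    action: str
    source: str


def from_csv(text: str) -> list[CustomerEvent]:
    """Map rows of a structured export (columns: id, action) to the schema."""
    rows = csv.DictReader(io.StringIO(text))
    return [CustomerEvent(r["id"], r["action"], "crm_csv") for r in rows]


def from_json(text: str) -> list[CustomerEvent]:
    """Map semi-structured JSON documents to the same schema."""
    docs = json.loads(text)
    return [
        CustomerEvent(str(d["user"]["id"]), d["event"], "web_json") for d in docs
    ]


# Hypothetical sample inputs, one per source format.
csv_export = "id,action\n42,purchase\n"
json_feed = '[{"user": {"id": 42}, "event": "page_view"}]'

events = from_csv(csv_export) + from_json(json_feed)

# With one schema, a single query spans both sources.
actions_for_42 = [e.action for e in events if e.customer_id == "42"]
```

Once every source is mapped to the same record type, search and navigation no longer need to know where a record came from; the `source` field keeps that lineage available when it matters.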
  • 13. Real-Time Analytics
      Real-Time Analytics enables the business to leverage information and analysis as events are unfolding. At a high level this includes:
      o Speed of Thought Analysis – Analysis is often a journey of discovery, where the results of one query determine the content of the next. The system must support this journey in an expeditious manner. System performance must keep pace with the users’ thought process.
      o Interactive Dashboards – Interactive dashboards allow the user to immediately react to information being displayed, providing the ability to drill down and perform root cause analysis of situations at hand.
      o Advanced Analytics – Advanced forms of analytics, including data mining, machine learning, and statistical analysis enable businesses to better understand past activities and spot trends that can carry forward into the future. Applied in real-time, advanced analytics can enhance customer interactions and buying decisions, detect fraud and waste, and enable the business to make adjustments according to current conditions.
      o Event Processing – Real-time processing of events enables immediate responses to existing problems and opportunities. It filters through large quantities of streaming data, triggering predefined responses to known data patterns.
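Event processing as described above (filter a stream, trigger predefined responses to known patterns) can be sketched in a few lines of Python. This is a minimal illustration rather than anything taken from the slides: the `Rule` class, the `process` function, and the sample payment events are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator

Event = dict  # hypothetical event shape: a plain dict of fields


@dataclass
class Rule:
    """A known data pattern paired with a predefined response."""
    name: str
    matches: Callable[[Event], bool]
    respond: Callable[[Event], str]


def process(events: Iterable[Event], rules: list[Rule]) -> Iterator[str]:
    """Scan the stream once and yield a response for every rule that fires."""
    for event in events:
        for rule in rules:
            if rule.matches(event):
                yield rule.respond(event)


# Hypothetical rule: flag unusually large payments for review.
large_payment = Rule(
    name="large_payment",
    matches=lambda e: e.get("type") == "payment" and e.get("amount", 0) > 1000,
    respond=lambda e: f"review payment {e['id']}",
)

stream = [
    {"id": 1, "type": "payment", "amount": 50},
    {"id": 2, "type": "payment", "amount": 5000},
    {"id": 3, "type": "login"},
]

alerts = list(process(stream, [large_payment]))
```

Because `process` is a generator, it handles events one at a time and would work equally well over an unbounded source; a production system would typically add windowing and state, which this sketch leaves out.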
  • 14. Intelligent Processes
      A key objective for any Big Data and Analytics program is to execute business processes more effectively and efficiently. This means channeling the intelligence one gains from analysis directly into the processes that the business is performing. At a high level this includes:
      o Application-Embedded Analysis – Many workers today can be classified as knowledge workers; they routinely make decisions that affect business performance. Embedding analysis into the applications they use helps them to make more informed decisions.
      o Optimized Rules and Recommendations – With optimized rules and recommendations, insight from analysis is used to influence the decision logic as the process is being executed.
      o Guided User Navigation – Whenever possible the system should leverage the information available in order to guide the user along the most appropriate path of investigation.
      o Performance and Strategy Management – Analytics can also provide insight to guide and support the performance and strategy management processes of a business. It can help to ensure that strategy is based on sound analysis. Likewise, it can track business performance versus objectives in order to provide insight on strategy achievement.
  • 16. Big Data & Analytics Reference Architectures Logical View
      The high-level logical view defines a multi-tier architecture template that can be used to describe many types of technology solutions.
  • 17. Big Data & Analytics Reference Architectures Logical View
      This layer includes the hardware and platforms on which the Big Data and Analytics components run. As shared infrastructure, it can be used to support multiple concurrent implementations, in support of, or analogous to, Cloud Computing. This layer includes infrastructure to support traditional databases, specialized Big Data management systems, and infrastructure that has been optimized for analytics.
  • 18. Big Data & Analytics Reference Architectures Logical View
      At the bottom are data stores that have been commissioned for specific purposes (e.g., individual operational data stores, CMS, etc.). These data stores represent sources of data that are ingested (upward) into the Logical Data Warehouse (LDW). The LDW represents a collection of data that has been provisioned for historical and analytical purposes. Above the LDW are components that provide processing and event detection for all forms of data. At the top of the layer are components that virtualize all forms of data for universal consumption.
  • 19. Big Data & Analytics Reference Architectures Logical View
      The Services Layer includes components that provide or perform commonly used services. Presentation Services and Information Services are types of Services in a Service Oriented Architecture (SOA). They can be defined, cataloged, used, and shared across solutions. Business Activity Monitoring, Business Rules, and Event Handling provide common services for the processing layer(s) above.
  • 20. Big Data & Analytics Reference Architectures Logical View
      The Process Layer represents components that perform higher level processing activities. For the purpose of Big Data and Analytics, this layer calls out several types of applications that support analytical, intelligence gathering, and performance management processes.
      The Interaction Layer is comprised of components used to support interaction with end users. Common artifacts for this layer include dashboards, reports, charts, graphs, and spreadsheets. In addition, this layer includes the tools used by analysts to perform analysis and discovery activities.
  • 21. Big Data & Analytics Reference Architectures Logical View
      The results of analysis can be delivered via many different channels. The architecture calls out common IP network based channels such as desktops and laptops, common mobile network channels such as mobile phones and tablets, and other channels such as email, SMS, and hardcopy.
      The architecture is supported by a number of components that affect all layers of the architecture. These include information and analysis modeling, monitoring, management, security, and governance.
  • 23. Big Data & Analytics Reference Architectures Technological View
  • 24. Apache Kafka: Apache Kafka™ is a distributed streaming platform. It lets you publish and subscribe to streams of records; in this respect it is similar to a message queue or enterprise messaging system. It lets you store streams of records in a fault-tolerant way. It lets you process streams of records as they occur. Website: https://kafka.apache.org/
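The publish, store, and subscribe capabilities can be illustrated with a toy in-memory log. This is a minimal sketch under stated assumptions, not the Kafka client API: the class `TopicLog`, its methods, and the group names are hypothetical and stand in for a single topic partition with per-consumer-group offsets.

```python
from collections import defaultdict


class TopicLog:
    """Toy in-memory stand-in for one topic partition (not the Kafka API)."""

    def __init__(self):
        self._records = []                  # append-only list of records
        self._offsets = defaultdict(int)    # next offset to read, per consumer group

    def publish(self, record):
        """Append a record and return the offset it was stored at."""
        self._records.append(record)
        return len(self._records) - 1

    def poll(self, group, max_records=10):
        """Return unread records for a consumer group and advance its offset."""
        start = self._offsets[group]
        batch = self._records[start:start + max_records]
        self._offsets[group] = start + len(batch)
        return batch

    def seek(self, group, offset):
        """Move a group's read position; stored records are never deleted."""
        self._offsets[group] = offset


log = TopicLog()
for event in ("login", "search", "purchase"):
    log.publish(event)

billing_view = log.poll("billing")       # each group reads the full stream
analytics_view = log.poll("analytics")
log.seek("analytics", 0)                 # replay from the beginning
replayed = log.poll("analytics")
```

Because records stay in the log after being read, each consumer group keeps its own position and can replay the stream, which is the property that distinguishes this model from a queue that deletes messages on delivery.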
  • 25. Apache Spark: Apache Spark™ is a fast and general engine for large-scale data processing. Speed: up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Ease of use: APIs in Java, Scala, Python and R. Generality: a powerful stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX and Spark Streaming. Runs everywhere: Spark runs on Hadoop, Mesos, standalone, or in the cloud. Website: http://spark.apache.org/
  • 26. Reference Architectures - Hadoop Classic Batch Architecture. Characteristics: batch oriented; massive storage; multiuser jobs; data warehouse replacement.
  • 27. Reference Architectures – Lambda Architecture. Batch Layer: manages the master data set, an immutable, append-only set of raw data. Speed Layer: ingests streaming data or micro-batches and provides an «active partition» with a limited window of mutability. Serving Layer: stores the output of the batch and speed layers (BASE compliant).
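The interplay of the three layers can be sketched in a few lines. This is an illustrative toy under stated assumptions, not a reference implementation: the page-view counting use case and the names `build_batch_view`, `SpeedLayer`, and `query` are hypothetical.

```python
from collections import Counter


def build_batch_view(master_dataset):
    """Batch layer: recompute page-view counts from the whole master data set."""
    return Counter(event["page"] for event in master_dataset)


class SpeedLayer:
    """Speed layer: incrementally counts events not yet covered by a batch run."""

    def __init__(self):
        self.realtime_view = Counter()

    def ingest(self, event):
        self.realtime_view[event["page"]] += 1

    def reset(self):
        """Discard the real-time view once a new batch run covers those events."""
        self.realtime_view.clear()


def query(page, batch_view, realtime_view):
    """Serving layer: merge the batch and real-time views at read time."""
    return batch_view[page] + realtime_view[page]


master_dataset = [{"page": "/home"}, {"page": "/home"}, {"page": "/cart"}]
batch_view = build_batch_view(master_dataset)

speed = SpeedLayer()
new_event = {"page": "/home"}
master_dataset.append(new_event)    # raw data is only ever appended
speed.ingest(new_event)             # and also counted incrementally

home_views = query("/home", batch_view, speed.realtime_view)

# A later batch run absorbs the new event, so the speed layer can be reset.
batch_view = build_batch_view(master_dataset)
speed.reset()
home_views_after_batch = query("/home", batch_view, speed.realtime_view)
```

Note that the counting logic exists twice, once in the batch function and once in the speed layer, which is the duplication that makes the architecture costly to maintain.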
  • 28. Reference Architectures – Lambda Architecture. Complexity: many moving parts; restatement is difficult; two code bases must be kept in sync; proper failure handling is complex.
  • 29. Reference Architectures – Kappa Architecture. Jay Kreps, a co-creator of Kafka and one of the first proponents of stream-based architectures, jokingly called his alternative the “Kappa Architecture”.
  • 30. Reference Architectures – «Fast Data» Architecture
  • 31. Deployment: There are more options today than ever before for where to deploy a solution. At a high level, the four options for deployment of architecture components are: 1) Public Cloud: in the public cloud model, a company rents resources from a third party. The most advanced usage of public cloud is where the business functionality is provided by the cloud provider (i.e., software-as-a-service). Public cloud might also be used as the platform upon which the business functionality is built (i.e., platform-as-a-service), or the public cloud may simply provide the infrastructure for the system (i.e., infrastructure-as-a-service). 2) Private Cloud: private cloud is the same as public cloud, but the cloud is owned by a company instead of being provided by a third party. Private clouds are ideal for hosting and integrating very large data volumes while keeping data secure behind corporate firewalls. 3) Managed Services: in this model a company owns the components of the system, but outsources some or all aspects of runtime operations. 4) Traditional IT: in this model a company owns and operates the system. These deployment options are not mutually exclusive.
  • 32. Security: The keys to securing the enterprise Big Data platform are: 1) Authentication (Kerberos, LDAP, …) 2) Authorization (ACE, ACL, Sentry, …) 3) Encryption & Data Masking (Over-the-Wire Encryption, Encryption at Rest, Field-Level Encryption, Format-Preserving Encryption) 4) Auditing & Data Lineage 5) Disaster Recovery & Backup
  • 34. Analytics - Data Science on Hadoop: Common Limitations
  • 35. Analytics - Data Science: Notebooks. Notebooks combine code, output and narrative into a single document. You can conduct an analysis by writing down code, results, ideas and thoughts. You have multiple languages and versions in a single multi-tenant environment. Easy to share. Easy version control.
  • 36. Analytics - Data Science: Data Products. Data Science is the science of building data products. Overt data products: products where the data is clearly visible as part of the deliverable (descriptive analysis, dashboarding, reporting). Covert data products: deliver results rather than data; data is hidden (recommendation engine, …). Website: https://www.oreilly.com/ideas/evolution-of-data-products
  • 37. Benefits: Analytics makes it possible to better manage the customer base and extract customer value. Analyze customer profiles, behaviors and purchases to obtain a complete and strategic view of the most recurrent customer behaviors. Develop a tailored proposition by customer segment to increase customer value along the whole client lifecycle. Address marketing efforts based on customer insights and value. Drive consumer segments to exploit the product portfolio at the right time of their customer journey.
  • 38. Analytics will be carried out to offer actionable insights on customers and will follow a multi-step approach: 1) Business Objective & Question 2) Data Exploration/Understanding 3) Data Preparation 4) Modeling 5) Model Interpretation 6) Business Actions. Data Exploration/Understanding: simple exploratory analysis to understand the whole set of available information, identify problems in the data, and start observing relationships among variables; data visualization techniques are used to explore the information. Data Preparation: data is prepared for data mining and machine learning models through imputation of missing values, computation of new variables potentially useful for the business question, and transformation of variables to make them meaningful for the problem to be solved. Modeling: models are implemented; available data is used and synthesized to answer the business question by identifying relationships between the target variable and the input variables. It may be a recursive process, also based on sampling data and assessing models and results. Model Interpretation: model results are interpreted so that they are useful for business strategy and actions. [Chart: % of accounts by decile, responders vs. non-responders]
  • 39. Objectives and Analytical Models. Objectives along the customer lifetime (chart axes: customer value vs. customer lifetime): engage new customers (new customer identification and engagement); nurture & develop loyal customers (Clienteling & Caring Program); retain leaving customers (actions to retain leaving customers). These objectives can be answered through specific statistical models and approaches: Churn Model; Clickstream & Content Analysis; Next Best Offer Analysis; Segmentation (deterministic vs. behavioural); Propensity Model.
  • 40. Analysis: Propensity Models. What: the model assigns a propensity score to each customer and makes it possible to prioritize initiatives. A propensity model can estimate: the re-purchasing probability of customers; retargeting optimization (predict the likelihood of booking a flight for potential customers); up-selling propensity (reservation upgrade or ancillary services proposal); etc. Why: address marketing investments on the customers with the highest propensity, in order to increase up-selling, increase cross-selling, increase active customers, and increase redemption of marketing campaigns. Algorithms: Regressions, Decision Trees, Random Forests, Neural Networks, Support Vector Machines, Ensemble Models, …
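As an illustration of scoring and prioritizing, the sketch below applies a logistic regression scoring function and ranks customers by score. Everything specific here is an assumption made for the example: the feature names, the coefficients in `WEIGHTS` and `INTERCEPT` (which a real project would obtain by fitting a model on historical data), and the sample customers.

```python
import math

# Hypothetical coefficients, as if produced by an already fitted logistic regression.
WEIGHTS = {"purchases_last_year": 0.8, "days_since_last_visit": -0.05}
INTERCEPT = -1.0


def propensity(customer):
    """Logistic score in (0, 1): estimated probability of a re-purchase."""
    z = INTERCEPT + sum(w * customer[name] for name, w in WEIGHTS.items())
    return 1.0 / (1.0 + math.exp(-z))


def prioritize(customers, top_n):
    """Return the ids of the top_n customers by descending propensity."""
    ranked = sorted(customers, key=propensity, reverse=True)
    return [c["id"] for c in ranked[:top_n]]


# Made-up customers, for illustration only.
customers = [
    {"id": "A", "purchases_last_year": 5, "days_since_last_visit": 10},
    {"id": "B", "purchases_last_year": 0, "days_since_last_visit": 90},
    {"id": "C", "purchases_last_year": 2, "days_since_last_visit": 30},
]
targets = prioritize(customers, top_n=2)
```

The same ranking step applies whichever algorithm from the list produces the score; only the `propensity` function changes.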
  • 41. Analysis: Behavioural Segmentation. What: behavioral segmentation follows a statistical clustering algorithm which identifies the most significant variables for the analysis and aggregates customers into mutually exclusive groups with similar behavioral patterns, creating clusters whose members are as similar as possible. A customer's affiliation to a specific cluster varies over time, based on their behavior. Why: get strategic insight on the customer base to increase loyalty and value; tailor the contact strategy (“the right action for the right customer”); enhance the website experience; increase the redemption rate of targeted marketing campaigns. Algorithms: data transformation, factor analysis, unsupervised clustering models.
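The slide does not name a specific clustering algorithm; as one common example of an unsupervised clustering model, the sketch below implements plain k-means in pure Python. The two features, the sample values, and the deterministic seeding from the first `k` points are assumptions made to keep the example small and reproducible.

```python
import math


def kmeans(points, k, iterations=20):
    """Plain k-means; the first k points seed the centroids (deterministic)."""
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each customer joins the nearest centroid.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return labels, centroids


# Hypothetical features per customer: (orders per year, average basket value).
customers = [(1, 20), (12, 200), (2, 25), (11, 180), (1, 30), (13, 210)]
labels, centroids = kmeans(customers, k=2)
```

Each customer ends up in exactly one cluster, matching the "mutually exclusive groups" requirement. In practice the features would first be rescaled (the data transformation step), since raw k-means is dominated by whichever variable has the largest range.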
  • 42. Analysis: Churn Models. What: churn analysis is a multivariate data mining technique that assigns a score to customer attrition. It estimates the probability that a customer will no longer buy from a company, either permanently or for a given period of time. Historical data on customers leaving the company will be investigated in order to identify anticipatory signals. Information on flying behavior, enriched data (lifestyle, interests, motivation, SOW, price sensitivity) and the customer hyper-profile will be used to compare churn vs. loyal behavior. Why: optimization of costs and marketing activities in customer retention; identification of high-risk customers sorted by profitability; increase in active customers. Algorithms: Regressions, Decision Trees, Random Forests, Neural Networks, Support Vector Machines, Ensemble Models, …
  • 44. The Database Log: the real database
  • 45. The Event Store: an entity history
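The idea of an event store as an entity history can be sketched as follows: state is never stored directly but rebuilt by replaying the events recorded for an entity. This is a minimal sketch, assuming a hypothetical bank-account domain; `Event`, `EventStore`, and `balance` are illustrative names, not part of any product mentioned in the deck.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    entity_id: str
    kind: str       # "deposited" or "withdrawn" in this toy example
    amount: int


class EventStore:
    """Append-only store; events are never updated or deleted."""

    def __init__(self):
        self._events = []

    def append(self, event):
        self._events.append(event)

    def history(self, entity_id):
        """Return the ordered events of one entity."""
        return [e for e in self._events if e.entity_id == entity_id]


def balance(events):
    """Fold an entity's history into its current state."""
    total = 0
    for e in events:
        total += e.amount if e.kind == "deposited" else -e.amount
    return total


store = EventStore()
store.append(Event("acct-1", "deposited", 100))
store.append(Event("acct-1", "withdrawn", 30))
store.append(Event("acct-2", "deposited", 5))

current = balance(store.history("acct-1"))
as_of_first_event = balance(store.history("acct-1")[:1])
```

Replaying only a prefix of the history yields the state at an earlier point in time, which a store that keeps only the latest value cannot provide.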
  • 46. Event Driven Architecture: segregating Command & Queries
  • 47. Event Driven Architecture: Eventual Consistency
  • 48. Event Driven Architecture: the Domain Model
  • 50. Domain Driven Design: Concepts. Place the project's primary focus on the core domain and domain logic. Base complex designs on a model of the domain. Initiate a creative collaboration between technical and domain experts to iteratively refine a conceptual model that addresses particular domain problems. Concepts: Context: the setting in which a word or statement appears that determines its meaning. Domain: a sphere of knowledge (ontology), influence, or activity; the subject area to which the user applies a program is the domain of the software. Model: a system of abstractions that describes selected aspects of a domain and can be used to solve problems related to that domain. Ubiquitous Language: a language structured around the domain model and used by all team members to connect all the activities of the team with the software. Bounded context: explicitly define the context within which a model applies; explicitly set boundaries in terms of team organization and usage within specific parts of the application. Context map: identify each model in play on the project and define its bounded context, including the implicit models of non-object-oriented subsystems; name each bounded context, and make the names part of the ubiquitous language; describe the points of contact between the models, outlining explicit translation for any communication.
  • 51. Domain Driven Design: Building Blocks. Entity: an object that is not defined by its attributes, but rather by a thread of continuity and its identity. Value Object: an object that contains attributes but has no conceptual identity; value objects should be treated as immutable. Aggregate: a collection of objects that are bound together by a root entity, otherwise known as an aggregate root. The aggregate root guarantees the consistency of changes being made within the aggregate by forbidding external objects from holding references to its members. Domain Event: a domain object that defines an event (something that happens); a domain event is an event that domain experts care about. Service: used when an operation does not conceptually belong to any object; following the natural contours of the problem, you can implement these operations in services.
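Four of these building blocks can be shown together in a small sketch. The order-management domain and the names `Money`, `OrderLineAdded`, and `Order` are hypothetical, chosen only for illustration; the Service building block is left out to keep the example short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Money:
    """Value object: no identity, compared by attributes, immutable."""
    amount: int
    currency: str


@dataclass(frozen=True)
class OrderLineAdded:
    """Domain event: something that happened and that domain experts care about."""
    order_id: str
    sku: str


class Order:
    """Entity and aggregate root: identified by order_id, guards its lines."""

    def __init__(self, order_id):
        self.order_id = order_id
        self._lines = []     # aggregate members, reachable only through the root
        self.events = []     # domain events raised by this aggregate

    def add_line(self, sku, price):
        # Invariant enforced by the root: one line per SKU.
        if any(existing_sku == sku for existing_sku, _ in self._lines):
            raise ValueError(f"{sku} already in order")
        self._lines.append((sku, price))
        self.events.append(OrderLineAdded(self.order_id, sku))

    @property
    def lines(self):
        return tuple(self._lines)   # read-only copy; the internal list is not exposed

    def __eq__(self, other):
        # Entities are equal when their identities match, whatever their attributes.
        return isinstance(other, Order) and other.order_id == self.order_id

    def __hash__(self):
        return hash(self.order_id)


order = Order("o-1")
order.add_line("sku-1", Money(10, "EUR"))
```

Two `Money(10, "EUR")` instances are interchangeable, while two `Order` objects are the same order only if they share an `order_id`; all changes to the lines go through `add_line`, so the root can enforce its invariants.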
  • 53. Bounded Contexts: coordinate Domain Models
  • 56. Cloudera Manager: Cloudera Manager is an end-to-end application for managing CDH clusters. Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of the CDH cluster, empowering operators to improve performance, enhance quality of service, increase compliance, and reduce administrative costs.
  • 58. Cloudera Navigator Optimizer: How can you assess the risk and true cost of offloading ETL and analytic workloads and understand what it takes to get there? Cloudera Navigator Optimizer gives you the insights and risk assessments you need to build out a comprehensive strategy for Hadoop success. Simply upload your existing SQL workloads to get started, and Navigator Optimizer will identify relative risks and development costs for offloading these to Hadoop based on compatibility and complexity. To efficiently optimize performance for the latest technologies, like Hive and Impala, you need visibility into what users are doing with the data and when the queries themselves are to blame. Cloudera Navigator Optimizer gives you that visibility and lets you focus optimization efforts on critical areas and best practices.
  • 60. Cloudera Data Science Workbench: What is Cloudera Data Science Workbench?
  • 61. Cloudera Data Science Workbench: Data Science on Hadoop
  • 62. Cloudera Data Science Workbench Architecture
  • 63. Cloudera Data Science Workbench Architecture
  • 64. Cloudera Product Mapping View: Cloudera Enterprise is available on a subscription basis in five editions, each designed for your specific needs. 1) Essentials provides superior support and advanced management for core Apache Hadoop. 2) Data Science and Engineering is for programmatic preparation and predictive modeling. 3) Operational DB is for online applications and real-time serving. 4) Analytic DB is for BI and SQL analytics. 5) The Enterprise Data Hub gives you everything you need to become information-driven, with complete use of the platform.
  • 65. Data Engineering in the Cloud