SlideShare una empresa de Scribd logo
1 de 49
Descargar para leer sin conexión
Take it to the Limit:
An Information Architecture
for Beyond Hadoop
Jeff Pollock
VP, Product Management
February, 2015
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for
information purposes only, and may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality, and should not be relied upon
in making purchasing decisions. The development, release, and timing of any features or
functionality described for Oracle’s products remains at the sole discretion of Oracle.
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Jeff Pollock speaker bio:
• Product Lead for Big Data
Integration & Governance
• Oracle Middleware
• IBM InfoSphere
• Chief Technical Officer
• Cerebra – Semantic technology startup
• Modulant – Semantic technology startup
• Software Engineer
• Ernst & Young LLP – Java and E-Commerce
• Amgen – Information services
• Author
• Semantic Web for Dummies, 2006
• Adaptive Information, 2004
In this session you’ll learn about how to apply Data Discovery and Deep Data
Storage for new breakthroughs in data warehousing. We’ll discuss the benefits of
using Hadoop technologies like Spark, Kafka, and Hive together with enterprise
information architecture and data governance best practices.
Come hear the latest about Oracle’s Big Data architecture with special emphasis
on new product updates in Oracle Big Data Discovery and Oracle Data Integration
offerings – including the ability to replicate data in real-time with Hadoop as well
as new support for Flume, Kafka, Pig, Oozie, and Spark. You’ll also get the latest
information on Oracle Big Data SQL which can access all sources in NoSQL, HBase,
HDFS, and relational databases in a single query.
The gravity of big data on the information management industry is evolving the
way we collect, store, move, and analyze information. It is both pulling and
pushing the world of business analytics to new breakthroughs in real-time
streaming, predictive capabilities, virtualized data access, and self-service modes
of operation. Join us in this session as we help bridge the gap between old and
new by explaining how to apply data discovery, data governance, and SQL
interoperability in this brave new world of Hadoop technology.
Take it to the Limit: An Information
Architecture for Beyond Hadoop
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Customer
Experience
Operational
Improvement
New Business
Models
44% 30% 26%
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Traditional
Currency
Photo
Film
Postal
Services
Printing
Press
Record
Industry
Digital
Currency
Web
Publishing
Email Digital
Camera
#StrataHadoop - Oracle Big Data Architecture
Digital
Download
Entire Industries are Being Disrupted
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Bigger
Smarter
Easier
CheaperBetter
Faster VALUE
FROM DATA
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Connected by Data
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Business Data as a Platform
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
New Concepts for a Modern Big Data Platform Architecture
Polyglot
Fit for Purpose Data
Lambda
Speed Layer
Batch Layer
Data
Sources
Data
Services
Kappa
Data
Services
Data PipelineData
Sources
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Revenge of the Data Nerds!
Lambda
Fit for Purpose Data
Lambda
Speed Layer
Batch Layer
Data
Sources
Data
Services
Lambda
Data
Services
Data PipelineData
Sources
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Execution
Innovation
#StrataHadoop - Oracle Big Data Architecture
4th Generation Data Architecture for Big Data
WarehouseData FactoryReservoir
Data Streaming
Data Platform
Discovery Lab
Analytics
APIs
Enterprise
Data
Other Data
Sources
Data
Streams
Business
Data
Social/Log
Data
Model First
Analytics
• Reporting-oriented
• Often enterprise wide
in scope, cross LoB
• “you know the
questions to ask”
Reports &
Dashboards
Data First
Analytics
• Data Exploration
• Highly visual and/or
interactive
• “you don’t know the
questions to ask”
Discovery
• Telematics
• Industry Services
• Internet of Things
• Sentiment
Data
Services
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Execution
Innovation
#StrataHadoop - Oracle Big Data Architecture
Comprehensive Oracle Solution for Big Data
WarehouseFactoryReservoir
Data Streaming
Data Platform
Discovery Lab
Analytics
APIs
Enterprise
Data
Other Data
Sources
Data
Streams
Business
Data
Social/Log
Data
Model First
Analytics
• Reporting-oriented
• Often enterprise wide
in scope, cross LoB
• “you know the
questions to ask”
Reports &
Dashboards
Data First
Analytics
• Data Exploration
• Highly visual and/or
interactive
• “you don’t know the
questions to ask”
Discovery
• Telematics
• Industry Services
• Internet of Things
• Sentiment
Data
Services
Apache
Oracle
NoSQL
Oracle
CAF & OEP
Oracle Data Integration & Governance
Oracle Database
& Big Data SQL
Oracle
R
Oracle
Big Data
Discovery
Oracle
Business
Intelligence
Oracle
Big Data
Discovery
Apache
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Execution
Innovation
#StrataHadoop - Oracle Big Data Architecture
Integrated Oracle Engineered Systems for Big Data
Data Streaming
Data Platform
Discovery Lab
Analytics
APIs
Enterprise
Data
Other Data
Sources
Data
Streams
Business
Data
Social/Log
Data
Model First
Analytics
• Reporting-oriented
• Often enterprise wide
in scope, cross LoB
• “you know the
questions to ask”
Reports &
Dashboards
Data First
Analytics
• Data Exploration
• Highly visual and/or
interactive
• “you don’t know the
questions to ask”
Discovery
• Telematics
• Industry Services
• Internet of Things
• Sentiment
Data
Services
APIs
Analytics
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Execution
Innovation
#StrataHadoop - Oracle Big Data Architecture
Visionary Oracle Cloud for Big Data
Data Platform
Discovery Lab
Analytics
APIs
Enterprise
Data
Other Data
Sources
Data
Streams
Business
Data
Social/Log
Data
Model First
Analytics
• Reporting-oriented
• Often enterprise wide
in scope, cross LoB
• “you know the
questions to ask”
Reports &
Dashboards
Data First
Analytics
• Data Exploration
• Highly visual and/or
interactive
• “you don’t know the
questions to ask”
Discovery
• Telematics
• Industry Services
• Internet of Things
• Sentiment
Data
Services
Data Streaming
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Oracle Brings Business Value to Big Data
Enterprise-Grade Capabilities
Discover and
Predict – Fast
Govern and Secure
All Data
Simplify Access to
All Data
Performance Integration Availability Scalability Manageability
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Enterprise
Productivity
Disruptive
Technology
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
BIG DATA DISCOVERY
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Data Discovery Accelerates Time-to-Value From Data
80% effort typically
spent on evaluating
and preparing data
Data Uncertainty
• Not familiar and overwhelming
• Potential value not obvious
• Requires significant manipulation
Overly dependent on
scarce and highly
skilled resources
Tool Complexity
• Early Hadoop tools only for experts
• Existing BI tools not designed for Hadoop
• Emerging solutions lack broad capabilities
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Announcing: Oracle Big Data Discovery
The Visual Face of Hadoop
find explore transform discover share
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Find
20
• Access a rich,
interactive catalog of
all data in Hadoop
• Familiar search and
guided navigation for
ease of use
• See data set
summaries, user
annotation and
recommendations
• Provision personal
and enterprise data
to Hadoop via self-
service
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Explore
21
• Visualize all
attributes by type
• Sort attributes by
information
potential
• Assess attribute
statistics, data
quality and outliers
• Use scratch pad to
uncover
correlations
between attributes
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 2222
• Intuitive, user
driven data
wrangling
• Extensive library of
powerful data
transformations and
enrichments
• Preview results,
undo, commit and
replay transforms
• Test on sample data
then apply to full
data set in Hadoop
Transform
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
• Join and blend data
for deeper
perspectives
• Compose project
pages via drag and
drop
• Use powerful search
and guided
navigation to ask
questions
• See new patterns in
rich, interactive data
visualizations
Discovery
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 24
• Share projects,
bookmarks and
snapshots with
others
• Build galleries and
tell big data stories
• Collaborate and
iterate as a team
• Publish blended
data to HDFS for
leverage in other
tools
Share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Open Platform: Core Technical Innovations on Hadoop
Oracle Big Data Discovery Workloads
Hadoop Cluster
(BDA or Commodity Hardware)
BDD node
data node
data node
data node
data node
name node
Data Processing, Workflow & Monitoring
• Profiling: catalog entry creation, data type &
language detection, schema configuration
• Sampling: dgraph (index) file creation
• Transforms: >100 functions
• Enrichments: location (geo), text (cleanup,
sentiment, entity, key-phrase, whitelist tagging)
Self-Service Provisioning & Data Transfer
• Personal Data: Upload CSV and XLS to HDFS
In-Memory Discovery Indexes
• DGraph: Search, Guided Navigation, Analytics
Studio
• Web UI: Find, Explore, Transform, Discover, Share
Hadoop 2.x
Filesystem
Workload Mgmt
(YARN)
Metadata
(HCatalog)
Other Hadoop Workloads
MR
Spark
Hive
Pig
Oracle Big Data SQL
(Oracle BDA)
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Radical Simplicity: Discovery Lab in a Box
Big Data Discovery with the Big Data Appliance X5
• Ready to Go – just add Data
• Pre-configured, Tuned
• Integrated Management
• Run any 3rd party software
• Install Exalytics into a BDA Starter Rack
– In-memory Database & Analytics
– OBIEE Certified with 12c & Big Data SQL
– Big Data Discovery Certified
– Oracle Data Integration Certified
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
BIG DATA APPLIANCE
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
18 Sun Oracle X5-2L Servers with per server:
• 2 * 18-core Intel® Xeon® Haswell-EP - fastest Intel processor ever shipped
• 128 GB Memory – DDR4 upgradeable from 128GB (8x16GB) to 768 GB
• 48TB Disk space
Integrated Software (4.1):
• Oracle Linux6.5, Oracle JDK 7u72
• Cloudera Distribution of Apache Hadoop 5.3 – EDH Edition, Cloudera Manager 5.3
• Oracle Big Data SQL 1.1, Oracle R Distribution 3.1.1-2, Oracle NoSQL Database CE 3.2.4
Announcing: BDA X5-2 Update
Faster Processors, More Cores, More Memory, Same Price
BDA X5-2 vs. X4-2 Full Rack
2.25x More Cores 648 cores – Intel® Xeon® E5-2699 v3
2x More Memory Default 3.2TB DDR4-2133MHz
50% More DIMM Slots Up to 13.2TB DDR4-2133MHz
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Easier Adoption of New Big Data Technologies
INTEGRATION SKILLS SECURITY
Engineered
Systems
SQL on
All Data
Database
Security on
All Data
SQL
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Standard, Modular and Highly Extensible Appliance
 Starter Rack is a fully cabled
and configured for growth with 6
servers
 In-Rack Expansion delivers 6
server modular expansion block
 Full Rack delivers optimal blend
of capacity and expansion
options
 Grow by adding rack – up to 18
racks without additional switches
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Driving Business Value from Technology Innovation
Use the Right Tool for the Job and benefit from the Powerof “AND”
RelationalHadoop NoSQL Graph
Run the Business
 Integrate existing
systems
 Mission-critical
tasks
 Use existing
investments
 Ensure skills
relevance
Change the Business
 Disrupt competitors
 Disintermediate
supply chains
 Leverage new
paradigms
 Exploit new analyses
Scale the Business
 Serve data
faster
 Persist data streams
 Meet mobile and
device challenges
 Scale-out
economically
Link the Business
 Associate complex
business entities
 Link to Open Data
 Share data sets via
Ontology
 Evolve data and
schema together
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Proven, Cost Effective Solution
“Oracle Big Data Appliance is
an excellent choice for
customers looking to work
with the full suite of
Cloudera’s leading Hadoop-
based technology. It’s more
cost-effective and quicker to
deploy than a DIY cluster.”
⁻ Mike Olson, Cloudera founder, Chief
Strategy Officer, and Chairman of the
Board
Source:
ESG White Paper
21%
Cost Savings
33%
Faster Time
to Value
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
BIG DATA INTEGRATION
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Comprehensive Integration and Governance
Fast
Load
Speed Layer
Batch Layer
Oracle Data Integrator
(Transform)
Oracle GoldenGate
(Move & Ingest)
Data
Governance
Foundation
Enterprise Data Quality
(Profile & Cleanse)
Enterprise Metadata Management & Business Glossary
(Business Glossary, Data Lineage, Impact Analysis and Data Provenance)
Veridata
(Verify)
Data Enrichment
(Prepare)
Real-Time Data Movement
– Low impact capture, stage in Hadoop
– Continuous data availability
Data Transformation
– Bulk data movement
– Pushdown data processing
Data Governance
– Prepare unstructured data
– Profile data with sampling
– Clean data in real time or batch
– Verify data for consistency
– Trace lineage of all data
– Define glossary of business terms
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Staging
#StrataHadoop - Oracle Big Data Architecture
Open Lambda Architecture with Oracle Big Data Integration
Sqoop
HDFS
Hive
Flume
Capture
Trail
Route
Deliver
Pump
Transformation
Data StreamingKafka (MPP Pub/Sub)
Storm and Trident
Spark Streaming
HBase
Discovery Sandbox/s
ROracle GoldenGate
Oracle Data Integrator
Oracle Data Governance
Oracle Data Enrichment
Model First
Analytics
• Reporting-oriented
• Often enterprise wide
in scope, cross LoB
• “you know the
questions to ask”
Data First
Analytics
• Data Exploration
• Highly visual and/or
interactive
• “you don’t know the
questions to ask”
• Telematics
• Industry Services
• Internet of Things
• Sentiment
Reports &
Dashboards
Discovery
Data
Services
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Capture
Trail
Route
Deliver
Pump
#StrataHadoop - Oracle Big Data Architecture
Announcing: Oracle GoldenGate for Big Data
New DB/
HW/OS/APP
Zero Downtime Upgrades
& Data Migration
Fully Active
Distributed DB
High Availability
& Disaster Recovery
Application
Offloading
Query & Report Offloading
Big Data, DW
& Marts
Real-time BI, Hadoop Data
Staging, Data Ingestion
Event Driven Architecture,
SOA/JMS, Coherence
Message Bus
& Data Grid
Data Synchronization
Across the Enterprise
Global Data
Centers
Real-time Analytics
& Massive Parallelization
Data
Streaming
GoldenGate
Real-time
Data Delivery
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Oracle Data Integrator for Transformation on Big Data
Flume
Hive on MR, Tez, Spark
Logs
OLTP DB
SQOOP
OGG
Pig on MR, Tez, Spark
Oracle Data Integrator
SQOOP
Any DW
OGG
Spark
OEMM
Metadata Mgmt
& Lineage
API/File
Hive/HCat,
HDFS,HBase
Hive/HCat,
HDFS,HBase
NoSQL
Kafka
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Business Value of ODI: Low Cost and High Dev Efficiency
Oracle Confidential, under Non-Disclosure 38
No ETL engine is
required
Separation of
Logical and
Physical design
Physical exec on
SQL, Hive, Pig, or
Spark
Runtime exec in
Oozie or via ODI
Java Agent
Rich set of pre-
built operators
User defined
functions
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Load to Oracle
OLH/OSCH
#StrataHadoop - Oracle Big Data Architecture
Oracle Data Integration on Engineered Systems
Transform
ODI
Hive/HDFS
Federate Hive/HDFS to Oracle
Big Data SQL
Oracle DB
OLTP
Load from Oracle
CopyToBDA
Hive/HDFS
OGGOGG
Hive/HDFS
SQOOP
Flume
Kafka
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
BIG DATA GOVERNANCE
#StrataHadoop - Oracle Big Data Architecture
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Data Governance is Not Easy, Hadoop is No Silver Bullet!
Data
Governance
Metadata
Management
Business
Glossary
Data
Profiling
Data
Cleansing
Data
Archiving
Data Privacy
PEOPLE
PROCESS TECHNOLOGY
…people and process first, …tools and capabilities next, …and, there is no magic!
“…the overall impact of poor-
quality data on the whole
dataset remains the same. In
addition, much of the data that
organizations use in a big data
context comes from outside, or
is of unknown structure and
origin. This means that the
likelihood of data quality issues
is even higher than before. So
data quality is actually more
important in the world of big
data."
- Ted Friedman, Gartner
http://www.gartner.com/newsroom/id/2854917
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Operational Data Preparation for Data Without Schema
Data Discovery
& Visualization
Enterprise
Reporting
Internet
Logs
Unstructured &
Structured Data
90% of time is
spent WRANGLING
DATA
MONTHS of effort
spent on each new
dataset
PROGRAMERS writing scripts
or complex ETL
Enterprise
ETL & Data
Integration
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Internet
Logs
Unstructured &
Structured Data
Data Discovery
& Visualization
Enterprise
Reporting
Enterprise
ETL & Data
Integration
Operational Data Preparation for Data Without Schema
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Industry Proven Data Privacy for All Data Types
extend Oracle
Database Security to
Hadoop, NoSQL and
Graph data
authentication and
authorization, audit
and encryption on
Hadoop
end to end data
transparency and
lifecycle management
Oracle Database Security
Industry Standard Access Controls and Lifecycle Controls
Relational
Hadoop
NoSQL
Graph
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Comprehensive Metadata Management for Big Data
ETL
BI
Dashboards
App
ETL
ETL
Sys Admin
Executive
BI Developer
Application User
CDC
Data Steward
ETL
Developer
Data Scientist
GG
Oracle Enterprise Metadata Management
Glossary & Catalog Harvest & Stitch Lineage
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Most Open & Heterogeneous Big Data Solution
 Hadoop HBase
 Hadoop Hive/Flume
 HP Enscribe
 HP NonStop
 HP Neoview
 Hypersonic SQL
 IBM DB2 i Series
 IBM DB2 UDB
 IBM DB2 z Series
 IBM Informix
 IBM Netezza
 JMS / MQ
 Microsoft Access
 Microsoft SQLServer
 MySQL
 Pivotal Greenplum
 PostgreSQL
 Salesforce.com
 SAP BW / BI
 SAP ERP / ECC
 SAS
 SQL/MP
 SQL/MX
 Sybase ASE
 Sybase IQ
 Teradata
 Adaptive
 Altova
 Apache Hcatalog
 Apache Hive/HQL
 Borland
 CA ERwin
 Cloudera Impala
 COBOL Copybook
 DataStax
 Embarcadero
 EMC ProActivity
 GentleWare
 Google BigQuery
 Grandite
 Hadapt Hive
 Hortonworks Hive
 IBM Cognos
 IBM DB2
 IBM DataStage
 IBM Discovery
 IBM Federation Server
 IBM Lotus Notes
 IBM Netezza
 IBM Rational Rose
 IBM Rational Architect
 Informatica Metadata Mgr.
 Informatica PowerCenter
 CoSORT
 ISO SQL Standard (DDL)
 MapR Hadoop Hive
 MicroFocus
 Microsoft Access
 Microsoft Office Excel
 Microsoft Visio
 Microsoft SQL Server
 Microsoft SSIS
 Microsoft Visual Studio
 Microstrategy
 Magic Draw
 OMG CWM Standard
 OMG UML Standard
 Oracle BI Answers
 Oracle BI Enterprise Edition
 Oracle BI Server
 Oracle DAC
 Oracle Data Integrator
 Oracle Data Modeler
 Oracle Database
 Oracle Designer
 Oracle Hyperion Applications
 Oracle Hyperion Essbase
 Oracle Warehouse Builder
 Pivotal Greenplum
 PostgreSQL
 QlikView
 SAP BO Crystal Reports
 SAP BO Designer
 SAP BO Desktop Intelligence
 SAP BO Repository
 SAP BO Data Integrator
 SAP BO Data Steward
 SAP Master Data Management
 SAP Sybase PowerDesigner
 SAP Sybase ASE Database
 SAS Data Integration Studio
 SAS BI Server
 SAS Information Map
 SAS Metadata Management
 SAS OLAP Server
 Select
 Sparx Architect
 Syncsort
 Tableau
 Talend
 Teradata
 Tigris
 Visible
 W3C DTD & XSD Schema
Operational Integration (Movement / Transformation) Metadata Harvesting (Glossary, Lineage & Impact Analysis)
 Oracle Database
 Oracle Exadata
 Oracle Big Data Appliance
 Oracle TimesTen
 Oracle OLAP
 Oracle Business Intelligence
 Oracle BI Applications
 Oracle E-Business Suite
 Oracle JD Edwards Enterprise One
 Oracle JD Edwards World
 Oracle Fusion Applications
 Oracle Governance Risk and Compliance
 Oracle Fusion AIA
 Oracle Retail Applications
 Oracle Agile BI / DW
 Oracle Agile PLM for Process
 Oracle iFlex FlexCUBE
 Oracle iFlex Mantas
 Oracle Hyperion Applications
 Oracle PeopleSoft
 Oracle Siebel CRM / OnDemand
 Oracle Communications
 Oracle WebLogic Server
 Oracle Coherence Data Grid
 Oracle SOA Suite
 Oracle Enterprise Service Bus
+ open APIs and standards
based meta-model
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Land Grab of the Future is
Data Capital
47Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
It’s yours for the taking
#StrataHadoop - Oracle Big Data Architecture
Big Data at Oracle - Strata 2015 San Jose

Más contenido relacionado

La actualidad más candente

2009.10.22 S308460 Cloud Data Services
2009.10.22 S308460  Cloud Data Services2009.10.22 S308460  Cloud Data Services
2009.10.22 S308460 Cloud Data ServicesJeffrey T. Pollock
 
Big Data Discovery
Big Data DiscoveryBig Data Discovery
Big Data DiscoveryHarald Erb
 
Oracle Data Integration - Overview
Oracle Data Integration - OverviewOracle Data Integration - Overview
Oracle Data Integration - OverviewJeffrey T. Pollock
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Hortonworks Oracle Big Data Integration
Hortonworks Oracle Big Data Integration Hortonworks Oracle Big Data Integration
Hortonworks Oracle Big Data Integration Hortonworks
 
Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25Hortonworks
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseDataWorks Summit
 
Flash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonFlash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonJeffrey T. Pollock
 
Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesDataWorks Summit
 
10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data LakeVMware Tanzu
 
Teradata Aster Discovery Platform
Teradata Aster Discovery PlatformTeradata Aster Discovery Platform
Teradata Aster Discovery PlatformScott Antony
 
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightBig Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightHortonworks
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 
The Next Generation of Big Data Analytics
The Next Generation of Big Data AnalyticsThe Next Generation of Big Data Analytics
The Next Generation of Big Data AnalyticsHortonworks
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data editionMark Kerzner
 
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...DataWorks Summit/Hadoop Summit
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHortonworks
 

La actualidad más candente (20)

2009.10.22 S308460 Cloud Data Services
2009.10.22 S308460  Cloud Data Services2009.10.22 S308460  Cloud Data Services
2009.10.22 S308460 Cloud Data Services
 
Big Data Discovery
Big Data DiscoveryBig Data Discovery
Big Data Discovery
 
Oracle Data Integration - Overview
Oracle Data Integration - OverviewOracle Data Integration - Overview
Oracle Data Integration - Overview
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Hortonworks Oracle Big Data Integration
Hortonworks Oracle Big Data Integration Hortonworks Oracle Big Data Integration
Hortonworks Oracle Big Data Integration
 
Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25Dataguise hortonworks insurance_feb25
Dataguise hortonworks insurance_feb25
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data WarehouseHybrid Data Architecture: Integrating Hadoop with a Data Warehouse
Hybrid Data Architecture: Integrating Hadoop with a Data Warehouse
 
Flash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonFlash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lon
 
Extending Hortonworks with Oracle's Big Data Platform
Extending Hortonworks with Oracle's Big Data PlatformExtending Hortonworks with Oracle's Big Data Platform
Extending Hortonworks with Oracle's Big Data Platform
 
Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group Hortonworks and Clarity Solution Group
Hortonworks and Clarity Solution Group
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake10 Amazing Things To Do With a Hadoop-Based Data Lake
10 Amazing Things To Do With a Hadoop-Based Data Lake
 
Teradata Aster Discovery Platform
Teradata Aster Discovery PlatformTeradata Aster Discovery Platform
Teradata Aster Discovery Platform
 
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsightBig Data, Hadoop, Hortonworks and Microsoft HDInsight
Big Data, Hadoop, Hortonworks and Microsoft HDInsight
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
The Next Generation of Big Data Analytics
The Next Generation of Big Data AnalyticsThe Next Generation of Big Data Analytics
The Next Generation of Big Data Analytics
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
Near Real-time Outlier Detection and Interpretation - Part 1 by Robert Thorma...
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data Processing
 

Destacado

Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Jeffrey T. Pollock
 
Oracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast ChartsOracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast ChartsJeffrey T. Pollock
 
CDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsCDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsJeffrey T. Pollock
 
Intelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockIntelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockJeffrey T. Pollock
 
Brief lessons from the greatest product managers
Brief lessons from the greatest product managersBrief lessons from the greatest product managers
Brief lessons from the greatest product managersJeffrey T. Pollock
 
Oracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldOracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldJeffrey T. Pollock
 

Destacado (6)

Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)Tapping into the Big Data Reservoir (CON7934)
Tapping into the Big Data Reservoir (CON7934)
 
Oracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast ChartsOracle Big Data Governance Webcast Charts
Oracle Big Data Governance Webcast Charts
 
CDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsCDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and Trends
 
Intelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff PollockIntelligent Integration OOW2017 - Jeff Pollock
Intelligent Integration OOW2017 - Jeff Pollock
 
Brief lessons from the greatest product managers
Brief lessons from the greatest product managersBrief lessons from the greatest product managers
Brief lessons from the greatest product managers
 
Oracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldOracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorld
 

Similar a Big Data at Oracle - Strata 2015 San Jose

Tame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationTame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationMichael Rainey
 
Turning Relational Database Tables into Hadoop Datasources by Kuassi Mensah
Turning Relational Database Tables into Hadoop Datasources by Kuassi MensahTurning Relational Database Tables into Hadoop Datasources by Kuassi Mensah
Turning Relational Database Tables into Hadoop Datasources by Kuassi MensahData Con LA
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Rittman Analytics
 
Unlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQLUnlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQLMatt Lord
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...Jürgen Ambrosi
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleHarald Erb
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationInside Analysis
 
Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Inside Analysis
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationInside Analysis
 
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...DataWorks Summit
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderDataconomy Media
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...jdijcks
 
A7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloudA7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloudDr. Wilfred Lin (Ph.D.)
 
Oracle BI Big Data and Bics
Oracle BI Big Data and BicsOracle BI Big Data and Bics
Oracle BI Big Data and BicsDarren Grogan
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Dave Segleau
 
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...semanticsconference
 
Back to school: Big Data IDEA 101
Back to school: Big Data IDEA 101Back to school: Big Data IDEA 101
Back to school: Big Data IDEA 101Adam Doyle
 

Similar a Big Data at Oracle - Strata 2015 San Jose (20)

Oracle big data discovery 994294
Oracle big data discovery   994294Oracle big data discovery   994294
Oracle big data discovery 994294
 
Tame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data IntegrationTame Big Data with Oracle Data Integration
Tame Big Data with Oracle Data Integration
 
Turning Relational Database Tables into Hadoop Datasources by Kuassi Mensah
Turning Relational Database Tables into Hadoop Datasources by Kuassi MensahTurning Relational Database Tables into Hadoop Datasources by Kuassi Mensah
Turning Relational Database Tables into Hadoop Datasources by Kuassi Mensah
 
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
Data Integration for Big Data (OOW 2016, Co-Presented With Oracle)
 
Unlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQLUnlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQL
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
Oracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by ExampleOracle Unified Information Architeture + Analytics by Example
Oracle Unified Information Architeture + Analytics by Example
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop Acceleration
 
Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
 
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
A7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloudA7 storytelling with_oracle_analytics_cloud
A7 storytelling with_oracle_analytics_cloud
 
Oracle BI Big Data and Bics
Oracle BI Big Data and BicsOracle BI Big Data and Bics
Oracle BI Big Data and Bics
 
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
 
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
 
Back to school: Big Data IDEA 101
Back to school: Big Data IDEA 101Back to school: Big Data IDEA 101
Back to school: Big Data IDEA 101
 

Más de Jeffrey T. Pollock

2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data IntegrationJeffrey T. Pollock
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Microservices Patterns with GoldenGate
Microservices Patterns with GoldenGateMicroservices Patterns with GoldenGate
Microservices Patterns with GoldenGateJeffrey T. Pollock
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaJeffrey T. Pollock
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Version Control Training - First Lego League
Version Control Training - First Lego LeagueVersion Control Training - First Lego League
Version Control Training - First Lego LeagueJeffrey T. Pollock
 
Oracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionOracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionJeffrey T. Pollock
 
GoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenGoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenJeffrey T. Pollock
 

Más de Jeffrey T. Pollock (10)

2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration2017 OpenWorld Keynote for Data Integration
2017 OpenWorld Keynote for Data Integration
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Microservices Patterns with GoldenGate
Microservices Patterns with GoldenGateMicroservices Patterns with GoldenGate
Microservices Patterns with GoldenGate
 
Webinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafkaWebinar future dataintegration-datamesh-and-goldengatekafka
Webinar future dataintegration-datamesh-and-goldengatekafka
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Version Control Training - First Lego League
Version Control Training - First Lego LeagueVersion Control Training - First Lego League
Version Control Training - First Lego League
 
Oracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer IntroductionOracle Stream Analytics - Developer Introduction
Oracle Stream Analytics - Developer Introduction
 
GoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenGoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest Rakuten
 
Semantic Web For Dummies
Semantic Web For DummiesSemantic Web For Dummies
Semantic Web For Dummies
 

Último

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 

Último (20)

Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Exploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the ProcessExploring iOS App Development: Simplifying the Process
Exploring iOS App Development: Simplifying the Process
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 

Big Data at Oracle - Strata 2015 San Jose

  • 1. Take it to the Limit: An Information Architecture for Beyond Hadoop Jeff Pollock VP, Product Management February, 2015
  • 2. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. #StrataHadoop - Oracle Big Data Architecture
  • 3. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Jeff Pollock speaker bio: • Product Lead for Big Data Integration & Governance • Oracle Middleware • IBM InfoSphere • Chief Technical Officer • Cerebra – Semantic technology startup • Modulant – Semantic technology startup • Software Engineer • Ernst & Young LLP – Java and E-Commerce • Amgen – Information services • Author • Semantic Web for Dummies, 2006 • Adaptive Information, 2004 In this session you’ll learn about how to apply Data Discovery and Deep Data Storage for new breakthroughs in data warehousing. We’ll discuss the benefits of using Hadoop technologies like Spark, Kafka, and Hive together with enterprise information architecture and data governance best practices. Come hear the latest about Oracle’s Big Data architecture with special emphasis on new product updates in Oracle Big Data Discovery and Oracle Data Integration offerings – including the ability to replicate data in real-time with Hadoop as well as new support for Flume, Kafka, Pig, Oozie, and Spark. You’ll also get the latest information on Oracle Big Data SQL which can access all sources in NoSQL, HBase, HDFS, and relational databases in a single query. The gravity of big data on the information management industry is evolving the way we collect, store, move, and analyze information. It is both pulling and pushing the world of business analytics to new breakthroughs in real-time streaming, predictive capabilities, virtualized data access, and self-service modes of operation. Join us in this session as we help bridge the gap between old and new by explaining how to apply data discovery, data governance, and SQL interoperability in this brave new world of Hadoop technology. Take it to the Limit: An Information Architecture for Beyond Hadoop #StrataHadoop - Oracle Big Data Architecture
  • 4. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Customer Experience Operational Improvement New Business Models 44% 30% 26% #StrataHadoop - Oracle Big Data Architecture
  • 5. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Traditional Currency Photo Film Postal Services Printing Press Record Industry Digital Currency Web Publishing Email Digital Camera #StrataHadoop - Oracle Big Data Architecture Digital Download Entire Industries are Being Disrupted
  • 6. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Bigger Smarter Easier CheaperBetter Faster VALUE FROM DATA #StrataHadoop - Oracle Big Data Architecture
  • 7. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Connected by Data #StrataHadoop - Oracle Big Data Architecture
  • 8. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Business Data as a Platform #StrataHadoop - Oracle Big Data Architecture
  • 9. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture New Concepts for a Modern Big Data Platform Architecture Polyglot Fit for Purpose Data Lambda Speed Layer Batch Layer Data Sources Data Services Kappa Data Services Data PipelineData Sources
  • 10. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Revenge of the Data Nerds! Lambda Fit for Purpose Data Lambda Speed Layer Batch Layer Data Sources Data Services Lambda Data Services Data PipelineData Sources
  • 11. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Execution Innovation #StrataHadoop - Oracle Big Data Architecture 4th Generation Data Architecture for Big Data WarehouseData FactoryReservoir Data Streaming Data Platform Discovery Lab Analytics APIs Enterprise Data Other Data Sources Data Streams Business Data Social/Log Data Model First Analytics • Reporting-oriented • Often enterprise wide in scope, cross LoB • “you know the questions to ask” Reports & Dashboards Data First Analytics • Data Exploration • Highly visual and/or interactive • “you don’t know the questions to ask” Discovery • Telematics • Industry Services • Internet of Things • Sentiment Data Services
  • 12. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Execution Innovation #StrataHadoop - Oracle Big Data Architecture Comprehensive Oracle Solution for Big Data WarehouseFactoryReservoir Data Streaming Data Platform Discovery Lab Analytics APIs Enterprise Data Other Data Sources Data Streams Business Data Social/Log Data Model First Analytics • Reporting-oriented • Often enterprise wide in scope, cross LoB • “you know the questions to ask” Reports & Dashboards Data First Analytics • Data Exploration • Highly visual and/or interactive • “you don’t know the questions to ask” Discovery • Telematics • Industry Services • Internet of Things • Sentiment Data Services Apache Oracle NoSQL Oracle CAF & OEP Oracle Data Integration & Governance Oracle Database & Big Data SQL Oracle R Oracle Big Data Discovery Oracle Business Intelligence Oracle Big Data Discovery Apache
  • 13. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Execution Innovation #StrataHadoop - Oracle Big Data Architecture Integrated Oracle Engineered Systems for Big Data Data Streaming Data Platform Discovery Lab Analytics APIs Enterprise Data Other Data Sources Data Streams Business Data Social/Log Data Model First Analytics • Reporting-oriented • Often enterprise wide in scope, cross LoB • “you know the questions to ask” Reports & Dashboards Data First Analytics • Data Exploration • Highly visual and/or interactive • “you don’t know the questions to ask” Discovery • Telematics • Industry Services • Internet of Things • Sentiment Data Services APIs Analytics
  • 14. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Execution Innovation #StrataHadoop - Oracle Big Data Architecture Visionary Oracle Cloud for Big Data Data Platform Discovery Lab Analytics APIs Enterprise Data Other Data Sources Data Streams Business Data Social/Log Data Model First Analytics • Reporting-oriented • Often enterprise wide in scope, cross LoB • “you know the questions to ask” Reports & Dashboards Data First Analytics • Data Exploration • Highly visual and/or interactive • “you don’t know the questions to ask” Discovery • Telematics • Industry Services • Internet of Things • Sentiment Data Services Data Streaming
  • 15. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Oracle Brings Business Value to Big Data Enterprise-Grade Capabilities Discover and Predict – Fast Govern and Secure All Data Simplify Access to All Data Performance Integration Availability Scalability Manageability
  • 16. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Enterprise Productivity Disruptive Technology
  • 17. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | BIG DATA DISCOVERY #StrataHadoop - Oracle Big Data Architecture
  • 18. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Data Discovery Accelerates Time-to-Value From Data 80% effort typically spent on evaluating and preparing data Data Uncertainty • Not familiar and overwhelming • Potential value not obvious • Requires significant manipulation Overly dependent on scarce and highly skilled resources Tool Complexity • Early Hadoop tools only for experts • Existing BI tools not designed for Hadoop • Emerging solutions lack broad capabilities
  • 19. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Announcing: Oracle Big Data Discovery The Visual Face of Hadoop find explore transform discover share #StrataHadoop - Oracle Big Data Architecture
  • 20. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Find 20 • Access a rich, interactive catalog of all data in Hadoop • Familiar search and guided navigation for ease of use • See data set summaries, user annotation and recommendations • Provision personal and enterprise data to Hadoop via self- service
  • 21. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Explore 21 • Visualize all attributes by type • Sort attributes by information potential • Assess attribute statistics, data quality and outliers • Use scratch pad to uncover correlations between attributes
  • 22. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 2222 • Intuitive, user driven data wrangling • Extensive library of powerful data transformations and enrichments • Preview results, undo, commit and replay transforms • Test on sample data then apply to full data set in Hadoop Transform
  • 23. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | • Join and blend data for deeper perspectives • Compose project pages via drag and drop • Use powerful search and guided navigation to ask questions • See new patterns in rich, interactive data visualizations Discovery #StrataHadoop - Oracle Big Data Architecture
  • 24. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 24 • Share projects, bookmarks and snapshots with others • Build galleries and tell big data stories • Collaborate and iterate as a team • Publish blended data to HDFS for leverage in other tools Share
  • 25. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Open Platform: Core Technical Innovations on Hadoop Oracle Big Data Discovery Workloads Hadoop Cluster (BDA or Commodity Hardware) BDD node data node data node data node data node name node Data Processing, Workflow & Monitoring • Profiling: catalog entry creation, data type & language detection, schema configuration • Sampling: dgraph (index) file creation • Transforms: >100 functions • Enrichments: location (geo), text (cleanup, sentiment, entity, key-phrase, whitelist tagging) Self-Service Provisioning & Data Transfer • Personal Data: Upload CSV and XLS to HDFS In-Memory Discovery Indexes • DGraph: Search, Guided Navigation, Analytics Studio • Web UI: Find, Explore, Transform, Discover, Share Hadoop 2.x Filesystem Workload Mgmt (YARN) Metadata (HCatalog) Other Hadoop Workloads MR Spark Hive Pig Oracle Big Data SQL (Oracle BDA)
  • 26. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Radical Simplicity: Discovery Lab in a Box Big Data Discovery with the Big Data Appliance X5 • Ready to Go – just add Data • Pre-configured, Tuned • Integrated Management • Run any 3rd party software • Install Exalytics into a BDA Starter Rack – In-memory Database & Analytics – OBIEE Certified with 12c & Big Data SQL – Big Data Discovery Certified – Oracle Data Integration Certified
  • 27. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | BIG DATA APPLIANCE #StrataHadoop - Oracle Big Data Architecture
  • 28. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 18 Sun Oracle X5-2L Servers with per server: • 2 * 18-core Intel® Xeon® Haswell-EP - fastest Intel processor ever shipped • 128 GB Memory – DDR4 upgradeable from 128GB (8x16GB) to 768 GB • 48TB Disk space Integrated Software (4.1): • Oracle Linux6.5, Oracle JDK 7u72 • Cloudera Distribution of Apache Hadoop 5.3 – EDH Edition, Cloudera Manager 5.3 • Oracle Big Data SQL 1.1, Oracle R Distribution 3.1.1-2, Oracle NoSQL Database CE 3.2.4 Announcing: BDA X5-2 Update Faster Processors, More Cores, More Memory, Same Price BDA X5-2 vs. X4-2 Full Rack 2.25x More Cores 648 cores – Intel® Xeon® E5-2699 v3 2x More Memory Default 3.2TB DDR4-2133MHz 50% More DIMM Slots Up to 13.2TB DDR4-2133MHz #StrataHadoop - Oracle Big Data Architecture
  • 29. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Easier Adoption of New Big Data Technologies INTEGRATION SKILLS SECURITY Engineered Systems SQL on All Data Database Security on All Data SQL
  • 30. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Standard, Modular and Highly Extensible Appliance  Starter Rack is a fully cabled and configured for growth with 6 servers  In-Rack Expansion delivers 6 server modular expansion block  Full Rack delivers optimal blend of capacity and expansion options  Grow by adding rack – up to 18 racks without additional switches #StrataHadoop - Oracle Big Data Architecture
  • 31. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Driving Business Value from Technology Innovation Use the Right Tool for the Job and benefit from the Powerof “AND” RelationalHadoop NoSQL Graph Run the Business  Integrate existing systems  Mission-critical tasks  Use existing investments  Ensure skills relevance Change the Business  Disrupt competitors  Disintermediate supply chains  Leverage new paradigms  Exploit new analyses Scale the Business  Serve data faster  Persist data streams  Meet mobile and device challenges  Scale-out economically Link the Business  Associate complex business entities  Link to Open Data  Share data sets via Ontology  Evolve data and schema together
  • 32. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Proven, Cost Effective Solution “Oracle Big Data Appliance is an excellent choice for customers looking to work with the full suite of Cloudera’s leading Hadoop- based technology. It’s more cost-effective and quicker to deploy than a DIY cluster.” ⁻ Mike Olson, Cloudera founder, Chief Strategy Officer, and Chairman of the Board Source: ESG White Paper 21% Cost Savings 33% Faster Time to Value
  • 33. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | BIG DATA INTEGRATION #StrataHadoop - Oracle Big Data Architecture
  • 34. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Comprehensive Integration and Governance Fast Load Speed Layer Batch Layer Oracle Data Integrator (Transform) Oracle GoldenGate (Move & Ingest) Data Governance Foundation Enterprise Data Quality (Profile & Cleanse) Enterprise Metadata Management & Business Glossary (Business Glossary, Data Lineage, Impact Analysis and Data Provenance) Veridata (Verify) Data Enrichment (Prepare) Real-Time Data Movement – Low impact capture, stage in Hadoop – Continuous data availability Data Transformation – Bulk data movement – Pushdown data processing Data Governance – Prepare unstructured data – Profile data with sampling – Clean data in real time or batch – Verify data for consistency – Trace lineage of all data – Define glossary of business terms
  • 35. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Staging #StrataHadoop - Oracle Big Data Architecture Open Lambda Architecture with Oracle Big Data Integration Sqoop HDFS Hive Flume Capture Trail Route Deliver Pump Transformation Data StreamingKafka (MPP Pub/Sub) Storm and Trident Spark Streaming HBase Discovery Sandbox/s ROracle GoldenGate Oracle Data Integrator Oracle Data Governance Oracle Data Enrichment Model First Analytics • Reporting-oriented • Often enterprise wide in scope, cross LoB • “you know the questions to ask” Data First Analytics • Data Exploration • Highly visual and/or interactive • “you don’t know the questions to ask” • Telematics • Industry Services • Internet of Things • Sentiment Reports & Dashboards Discovery Data Services
  • 36. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Capture Trail Route Deliver Pump #StrataHadoop - Oracle Big Data Architecture Announcing: Oracle GoldenGate for Big Data New DB/ HW/OS/APP Zero Downtime Upgrades & Data Migration Fully Active Distributed DB High Availability & Disaster Recovery Application Offloading Query & Report Offloading Big Data, DW & Marts Real-time BI, Hadoop Data Staging, Data Ingestion Event Driven Architecture, SOA/JMS, Coherence Message Bus & Data Grid Data Synchronization Across the Enterprise Global Data Centers Real-time Analytics & Massive Parallelization Data Streaming GoldenGate Real-time Data Delivery
  • 37. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Oracle Data Integrator for Transformation on Big Data Flume Hive on MR, Tez, Spark Logs OLTP DB SQOOP OGG Pig on MR, Tez, Spark Oracle Data Integrator SQOOP Any DW OGG Spark OEMM Metadata Mgmt & Lineage API/File Hive/HCat, HDFS,HBase Hive/HCat, HDFS,HBase NoSQL Kafka
  • 38. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Business Value of ODI: Low Cost and High Dev Efficiency Oracle Confidential, under Non-Disclosure 38 No ETL engine is required Separation of Logical and Physical design Physical exec on SQL, Hive, Pig, or Spark Runtime exec in Oozie or via ODI Java Agent Rich set of pre- built operators User defined functions
  • 39. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Load to Oracle OLH/OSCH #StrataHadoop - Oracle Big Data Architecture Oracle Data Integration on Engineered Systems Transform ODI Hive/HDFS Federate Hive/HDFS to Oracle Big Data SQL Oracle DB OLTP Load from Oracle CopyToBDA Hive/HDFS OGGOGG Hive/HDFS SQOOP Flume Kafka
  • 40. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | BIG DATA GOVERNANCE #StrataHadoop - Oracle Big Data Architecture
  • 41. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Data Governance is Not Easy, Hadoop is No Silver Bullet! Data Governance Metadata Management Business Glossary Data Profiling Data Cleansing Data Archiving Data Privacy PEOPLE PROCESS TECHNOLOGY …people and process first, …tools and capabilities next, …and, there is no magic! “…the overall impact of poor- quality data on the whole dataset remains the same. In addition, much of the data that organizations use in a big data context comes from outside, or is of unknown structure and origin. This means that the likelihood of data quality issues is even higher than before. So data quality is actually more important in the world of big data." - Ted Friedman, Gartner http://www.gartner.com/newsroom/id/2854917
  • 42. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Operational Data Preparation for Data Without Schema Data Discovery & Visualization Enterprise Reporting Internet Logs Unstructured & Structured Data 90% of time is spent WRANGLING DATA MONTHS of effort spent on each new dataset PROGRAMERS writing scripts or complex ETL Enterprise ETL & Data Integration
  • 43. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Internet Logs Unstructured & Structured Data Data Discovery & Visualization Enterprise Reporting Enterprise ETL & Data Integration Operational Data Preparation for Data Without Schema
  • 44. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Industry Proven Data Privacy for All Data Types extend Oracle Database Security to Hadoop, NoSQL and Graph data authentication and authorization, audit and encryption on Hadoop end to end data transparency and lifecycle management Oracle Database Security Industry Standard Access Controls and Lifecycle Controls Relational Hadoop NoSQL Graph
  • 45. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Comprehensive Metadata Management for Big Data ETL BI Dashboards App ETL ETL Sys Admin Executive BI Developer Application User CDC Data Steward ETL Developer Data Scientist GG Oracle Enterprise Metadata Management Glossary & Catalog Harvest & Stitch Lineage
  • 46. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture Most Open & Heterogeneous Big Data Solution  Hadoop HBase  Hadoop Hive/Flume  HP Enscribe  HP NonStop  HP Neoview  Hypersonic SQL  IBM DB2 i Series  IBM DB2 UDB  IBM DB2 z Series  IBM Informix  IBM Netezza  JMS / MQ  Microsoft Access  Microsoft SQLServer  MySQL  Pivotal Greenplum  PostgreSQL  Salesforce.com  SAP BW / BI  SAP ERP / ECC  SAS  SQL/MP  SQL/MX  Sybase ASE  Sybase IQ  Teradata  Adaptive  Altova  Apache Hcatalog  Apache Hive/HQL  Borland  CA ERwin  Cloudera Impala  COBOL Copybook  DataStax  Embarcadero  EMC ProActivity  GentleWare  Google BigQuery  Grandite  Hadapt Hive  Hortonworks Hive  IBM Cognos  IBM DB2  IBM DataStage  IBM Discovery  IBM Federation Server  IBM Lotus Notes  IBM Netezza  IBM Rational Rose  IBM Rational Architect  Informatica Metadata Mgr.  Informatica PowerCenter  CoSORT  ISO SQL Standard (DDL)  MapR Hadoop Hive  MicroFocus  Microsoft Access  Microsoft Office Excel  Microsoft Visio  Microsoft SQL Server  Microsoft SSIS  Microsoft Visual Studio  Microstrategy  Magic Draw  OMG CWM Standard  OMG UML Standard  Oracle BI Answers  Oracle BI Enterprise Edition  Oracle BI Server  Oracle DAC  Oracle Data Integrator  Oracle Data Modeler  Oracle Database  Oracle Designer  Oracle Hyperion Applications  Oracle Hyperion Essbase  Oracle Warehouse Builder  Pivotal Greenplum  PostgreSQL  QlikView  SAP BO Crystal Reports  SAP BO Designer  SAP BO Desktop Intelligence  SAP BO Repository  SAP BO Data Integrator  SAP BO Data Steward  SAP Master Data Management  SAP Sybase PowerDesigner  SAP Sybase ASE Database  SAS Data Integration Studio  SAS BI Server  SAS Information Map  SAS Metadata Management  SAS OLAP Server  Select  Sparx Architect  Syncsort  Tableau  Talend  Teradata  Tigris  Visible  W3C DTD & XSD Schema Operational Integration (Movement / Transformation) Metadata Harvesting (Glossary, Lineage & Impact Analysis)  Oracle Database  Oracle Exadata  Oracle Big Data Appliance  Oracle TimesTen  Oracle OLAP  Oracle Business Intelligence  Oracle BI Applications  Oracle E-Business Suite  Oracle JD Edwards Enterprise One  Oracle JD Edwards World  Oracle Fusion Applications  Oracle Governance Risk and Compliance  Oracle Fusion AIA  Oracle Retail Applications  Oracle Agile BI / DW  Oracle Agile PLM for Process  Oracle iFlex FlexCUBE  Oracle iFlex Mantas  Oracle Hyperion Applications  Oracle PeopleSoft  Oracle Siebel CRM / OnDemand  Oracle Communications  Oracle WebLogic Server  Oracle Coherence Data Grid  Oracle SOA Suite  Oracle Enterprise Service Bus + open APIs and standards based meta-model
  • 47. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Land Grab of the Future is Data Capital 47Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | #StrataHadoop - Oracle Big Data Architecture
  • 48. Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | It’s yours for the taking #StrataHadoop - Oracle Big Data Architecture