BIG DATA & ORACLE
TECHNOLOGIES
KIEV
OCT 2013
PRACTIC CONSULTING
Alliance of Professional IT & Management Consultants
HTTP://PRACTIC-CONSULTING.COM
Agenda
WHAT
• ABOUT BIG DATA
WHEN
• INDUSTRY EXAMPLES OF BIG DATA
HOW
• ORACLE NOSQL
• ORACLE R
• ORACLE ENDECA
WHAT IS BIG DATA?
PART I
What Is Big Data?
Big Data is data that has grown too large to be processed using conventional methods
Big Data is the new generation of data warehousing and business analysis systems
A Wider Variety of Data
Internet Data
• Clickstream
• Social media
• Social media stream
• Web site logs
Research Data
• Experiments
• Observations
• Surveys
• Marketplace data
Healthcare Data
• Treatment data
• Telehealth
• National Electronic Health Records
• Procedures
Image Data
• Image
• Video
• Satellite image
• Surveillance
Device Data
• RF Devices
• Sensors
• EDI
• Telemetry
Why Is Big Data Important?
Big Data: just another buzzword or a powerful business & science enabler?
SQL Analytics
• Count
• Mean
• OLAP
Descriptive Analytics
• Univariate distribution
• Central tendency
• Dispersion
Data Mining
• Association rules
• Clustering
• Feature extraction
Predictive Analytics
• Classification
• Regression
• Forecasting
• Spatial
• Machine Learning
• Text Analytics
Simulation
• Monte Carlo
• Agent-based modeling
• Discrete event modeling
Optimization
• Linear Optimization
• Non-Linear Optimization
Business Intelligence Advanced Analytics
INDUSTRY EXAMPLES
OF BIG DATA
PART II
Marketing & Sales + Big Data
TO DELIVER AN ANSWER: 100 milliseconds
COUNT OF ADS: 100,000 per SECOND
ADVERTISING PLATFORM (http://www.dataxu.com/)
Clickstream, Behavior
Retail + Big Data
CAPTURE: 1,000 tweets per SECOND
INCREASE OF DATA: +10 TB per DAY
WAL-MART ONLINE MARKETING (http://www.walmart.com/)
Social Media
Health Care + Big Data
INCREASE OF DATA EACH MONTH: +10 TB
PATIENTS INVOLVED: 10,000
CANCER GENOMICS HUB (https://cghub.ucsc.edu/index.html/)
DNA and RNA data
Science + Big Data
SEVEN TELESCOPES CAPTURE: 2 MB per SECOND
IN THE NEXT 10-15 YEARS ALL TELESCOPES WILL RECEIVE: 30 TB per SECOND
THE CATALOG OF THE UNIVERSE (http://www.skatelescope.org/)
Data from Telescope
ORACLE TECHNOLOGIES
PART III
Oracle NoSQL
Big Data Storage Choices
Hadoop Distributed File System (HDFS) | Oracle NoSQL Database
File System | Database
Parallel scanning | Indexed storage
No inherent structure | Simple data structure
High volume writes | High volume random reads and writes
Batch Oriented | Real-Time
Oracle NoSQL
• RDBMS
– High value, high density, complex data
– Complex data relationships
– Schema-centric
– Designed to scale up & out
– Lots of general purpose features/functionality
→ High overhead ($ per operation)
• NoSQL architectures
– Low value, low density, simple data
– Very simple relationships
– Schema-free, unstructured or semi-structured data
– Distributed storage and processing
– Stripped down, special purpose data store
→ Lower overhead ($ per operation)
Oracle NoSQL
A Distributed, Scalable Key-Value Database
Simple Data Model
Small, distributed footprint
Highly scalable, available
Transparent load balancing
Integrates with Oracle Stack
[Figure: Application + NoSQL Database Driver (two instances); Storage Nodes in Datacenter A and Datacenter B]
Oracle NoSQL
Key-value pairs
• Simple data model – key-value pair (major+minor-key paradigm)
• Simple operations – read/insert/update/delete, read-modify-write (RMW) support
• Scope of transaction – records within a major key, single API call
• Unordered scan of all data (non-transactional)
Major key (String): userid
Sub key (String): address, subscriptions
Value (Byte Array): email id, phone #, expiration date
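The major+minor key model above can be sketched with a small in-memory toy. This is only an illustration of the data model, not the Oracle NoSQL Database API: the class `TinyKVStore`, its method names, and the sample user and values are all hypothetical.

```python
from collections import defaultdict


class TinyKVStore:
    """Toy in-memory store addressed by (major key, minor key); values are bytes."""

    def __init__(self):
        # major key -> {minor key path -> value bytes}
        self._data = defaultdict(dict)

    def put(self, major, minor, value: bytes):
        """Insert or update one record."""
        self._data[major][minor] = value

    def get(self, major, minor):
        """Read one record, or None if it does not exist."""
        return self._data.get(major, {}).get(minor)

    def delete(self, major, minor):
        """Delete one record; return True if something was removed."""
        return self._data.get(major, {}).pop(minor, None) is not None

    def multi_get(self, major):
        """Return every record under one major key in a single call."""
        return dict(self._data.get(major, {}))


store = TinyKVStore()
# Major key: userid. Minor key: a path such as address/email id.
store.put("user42", ("address", "email id"), b"user42@example.com")
store.put("user42", ("address", "phone #"), b"555-0100")
store.put("user42", ("subscriptions", "expiration date"), b"2014-10-01")

profile = store.multi_get("user42")
print(sorted(profile))
```

Grouping records under one major key is what makes it natural to scope an operation to "records within a major key" in one call, as `multi_get` does here.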
Oracle NoSQL
Online Display Advertising
Oracle NoSQL
Getting Started with Oracle NoSQL DB
1. Download from OTN:
www.oracle.com/technetwork/products/nosqldb/downloads/index.html
2. Review Quick Start & Getting Started Guide
3. Review Programmatic API Guide
4. Start writing Java code
What is R?
• R is an open source language and environment for statistical computing and graphics
http://www.R-project.org/
• Started in 1994 as an alternative to SAS, SPSS & other proprietary statistical environments
• The R environment
– R is an integrated suite of software facilities for data manipulation, calculation and graphical display
• Around 2 million R users worldwide
– Widely taught in universities
– Many corporate analysts know and use R
• Thousands of open source R packages to enhance productivity, such as:
– Bioinformatics
– Spatial Statistics
– Financial Market Analysis
Why do statisticians and data analysts use R?
The R environment is:
• Powerful
• Extensible
• Graphical
• Extensive statistics
• Out-of-the-box (OOTB) functionality with many ‘knobs’ but smart defaults
• Ease of installation and use
• Free
Limitations of R
• R is a client and server bundled together as one executable
– Single user tool, like Excel
– Single-threaded
– Cannot leverage multi-CPU capacity without use of special packages and coding
• R requires data to be loaded into memory first
– Loading data may not be a limitation given RAM available on laptops/desktops
– R’s call by value semantics means that as data flows into functions, for each function invocation, a complete copy of the data is made
– As a result you can quickly run into memory limits
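A back-of-envelope calculation shows how quickly copies add up. The sketch below assumes a purely numeric table stored as 8-byte doubles (R's numeric type). The table size and the number of simultaneous copies are hypothetical, and `estimated_peak_bytes` is an illustrative helper, not part of any library.

```python
GIB = 1024 ** 3


def estimated_peak_bytes(rows, cols, bytes_per_value=8, live_copies=1):
    """Rough memory needed when `live_copies` full copies of a numeric table coexist."""
    return rows * cols * bytes_per_value * live_copies


# Hypothetical table: 50 million rows by 20 numeric columns.
one_copy = estimated_peak_bytes(50_000_000, 20)
# Assumption: a chain of function calls keeps three full copies alive at once.
three_copies = estimated_peak_bytes(50_000_000, 20, live_copies=3)

print(f"one copy:     {one_copy / GIB:.1f} GiB")
print(f"three copies: {three_copies / GIB:.1f} GiB")
```

Under these assumptions a single copy is 8,000,000,000 bytes, so a table that fits comfortably in memory once can exceed it after a few copies.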
Oracle R Connector for Hadoop
• Provides transparent access to Hadoop Cluster, which consists of MapReduce and HDFS-resident data
• R users not required to learn new language or interface to work with Hadoop
• R users can execute jobs on a Hadoop cluster without requiring knowledge of Hadoop internals, Hadoop CLI, or IT infrastructure
• Ability to leverage open source contributed R packages to work on HDFS-resident data
Oracle R Enterprise
• Provides familiar R environment to operate on database-resident data
• Overloads base R functions for scalable execution in Oracle Database
– Automatically generates SQL from R and submits query to database
– Leverages table parallelism where applicable
• Enables embedded execution of R scripts at Oracle Database server
– Provides database-controlled data-parallel execution framework
– Enables leveraging CRAN open source R packages
• Enables integration of structured results and graphics with OBIEE dashboards and BI Publisher documents
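The idea of overloading a familiar function so that it generates SQL and runs in the database can be sketched in a few lines. This is not Oracle R Enterprise or its API: the example uses Python with SQLite standing in for Oracle Database, and the class `RemoteColumn`, the table `sales`, and its data are all hypothetical.

```python
import sqlite3


class RemoteColumn:
    """Proxy for a database column; aggregates are translated to SQL, not run locally."""

    def __init__(self, conn, table, column):
        self.conn = conn
        self.table = table
        self.column = column

    def _aggregate(self, func):
        # Identifiers are hard-coded by trusted code here; never build SQL
        # from untrusted input this way.
        sql = f"SELECT {func}({self.column}) FROM {self.table}"
        return sql, self.conn.execute(sql).fetchone()[0]

    def mean(self):
        """Return (generated SQL, result) for the column average."""
        return self._aggregate("AVG")

    def count(self):
        """Return (generated SQL, result) for the non-null row count."""
        return self._aggregate("COUNT")


# SQLite in-memory database as a stand-in for a real database server.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (amount REAL)")
conn.executemany("INSERT INTO sales VALUES (?)", [(10.0,), (20.0,), (30.0,)])

amount = RemoteColumn(conn, "sales", "amount")
sql, value = amount.mean()
print(sql, value)
```

Only the aggregate result crosses back to the client, so the data never has to be loaded into the analyst's memory.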
Oracle R Links
• Blog: https://blogs.oracle.com/R/
• Forum: https://forums.oracle.com/forums/forum.jspa?forumID=1397
• Oracle R Distribution:
http://www.oracle.com/technetwork/indexes/downloads/r-distribution-1532464.html
• ROracle:
http://cran.r-project.org/web/packages/ROracle
• Oracle R Enterprise:
http://www.oracle.com/technetwork/database/options/advanced-analytics/r-enterprise
• Oracle R Connector for Hadoop:
http://www.oracle.com/us/products/database/big-data-connectors/overview
Other Oracle Big Data Products
Oracle Endeca Information Discovery
http://www.oracle.com/us/solutions/business-analytics/business-intelligence/endeca/overview/index.html
Oracle Data Integrator Application Adapter for Hadoop
http://www.oracle.com/us/products/middleware/data-integration/hadoop/overview/index.html
Oracle Loader for Hadoop
http://www.oracle.com/technetwork/bdc/hadoop-loader/learnmore/index.html
The End
The best way to predict the future is to
create it!
- Peter F. Drucker

Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 

Último (20)

Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 

ORACLE BIG DATA & ANALYTICS TECHNOLOGIES FOR KIEV EVENT

  • 1. BIG DATA & ORACLE TECHNOLOGIES KIEV OCT 2013 PRACTIC CONSULTING Alliance of Professional IT & Management Consultants HTTP://PRACTIC-CONSULTING.COM
  • 2. Agenda • WHAT: About Big Data • WHEN: Industry examples of Big Data • HOW: Oracle NoSQL, Oracle R, Oracle Endeca
  • 3. WHAT IS BIG DATA? PART I
  • 4. What Is Big Data? • Big Data is data that has grown large enough that it cannot be processed using conventional methods • Big Data is the new generation of data warehousing and business analysis systems
  • 5. A Wider Variety of Data • Internet Data: clickstream, social media, social media stream, web site logs • Research Data: experiments, observations, surveys, marketplace data • Healthcare Data: treatment data, telehealth, national electronic health records, procedures • Image Data: image, video, satellite image, surveillance • Device Data: RF devices, sensors, EDI, telemetry
  • 6. Why Is Big Data Important? Big Data: just another buzzword, or a powerful business & science enabler? • SQL Analytics: count, mean, OLAP • Descriptive Analytics: univariate distribution, central tendency, dispersion • Data Mining: association rules, clustering, feature extraction • Predictive Analytics: classification, regression, forecasting, spatial, machine learning, text analytics • Simulation: Monte Carlo, agent-based modeling, discrete event modeling • Optimization: linear optimization, non-linear optimization • The categories span a range from Business Intelligence to Advanced Analytics
  • 8. Marketing & Sales + Big Data • Advertising platform: http://www.dataxu.com/ • Time to deliver an answer: 100 milliseconds • Count of ads: 100,000 per second • Data: clickstream, behavior
  • 9. Retail + Big Data • Wal-Mart online marketing: http://www.walmart.com/ • Capture: 1,000 tweets per second • Increase of data: +10 TB per day • Data: social media
  • 10. Health Care + Big Data • Cancer Genomics Hub: https://cghub.ucsc.edu/index.html/ • Increase of data each month: +10 TB • Patients involved: 10,000 • Data: DNA and RNA data
  • 11. Science + Big Data • The catalog of the universe: http://www.skatelescope.org/ • Seven telescopes capture 2 MB per second • In the next 10-15 years all telescopes will receive 30 TB per second • Data: data from telescopes
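To put the rates quoted in the industry examples on a common footing, here is a rough back-of-the-envelope sketch. It assumes decimal units (1 TB = 10^12 bytes), treats the quoted figures as steady averages, and reads the 100 ms answer time as per-request latency; the function names are illustrative, not part of any product.

```python
# Rough unit conversions for the data rates quoted on the slides.
# Assumption: decimal units (1 TB = 10**12 bytes, 1 MB = 10**6 bytes).
SECONDS_PER_DAY = 24 * 60 * 60
TB = 10**12
MB = 10**6


def per_day_to_mb_per_second(terabytes_per_day: float) -> float:
    """Average rate in MB/s for a given daily volume in TB."""
    return terabytes_per_day * TB / SECONDS_PER_DAY / MB


def requests_in_flight(requests_per_second: float, latency_seconds: float) -> float:
    """Little's law: average concurrency = arrival rate x time in system."""
    return requests_per_second * latency_seconds


# Slide 9: +10 TB per day is roughly 115.7 MB/s sustained.
retail_rate = per_day_to_mb_per_second(10)

# Slide 8: 100,000 ads per second at 100 ms each means about
# 10,000 requests being processed at any given moment.
ads_in_flight = requests_in_flight(100_000, 0.1)

# Slide 11: 2 MB per second adds up to about 0.17 TB per day.
telescope_tb_per_day = 2 * MB * SECONDS_PER_DAY / TB

print(f"{retail_rate:.1f} MB/s, {ads_in_flight:.0f} in flight, "
      f"{telescope_tb_per_day:.4f} TB/day")
```

The point of the conversion is that the examples differ by many orders of magnitude once expressed in the same unit, so they call for different storage and processing choices.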
  • 13. Oracle NoSQL: Big Data Storage Choices • Hadoop Distributed File System (HDFS): file system; parallel scanning; no inherent structure; high-volume writes; batch oriented • Oracle NoSQL Database: database; indexed storage; simple data structure; high-volume random reads and writes; real-time
  • 14. Oracle NoSQL • RDBMS – High value, high density, complex data – Complex data relationships – Schema-centric – Designed to scale up & out – Lots of general-purpose features/functionality – Result: high overhead ($ per operation) • NoSQL architectures – Low value, low density, simple data – Very simple relationships – Schema-free, unstructured or semi-structured data – Distributed storage and processing – Stripped-down, special-purpose data store – Result: lower overhead ($ per operation)
  • 15. Oracle NoSQL: A Distributed, Scalable Key-Value Database • Simple data model • Small, distributed footprint • Highly scalable, available • Transparent load balancing • Integrates with the Oracle stack • Diagram: each application uses a NoSQL Database Driver to reach storage nodes in Datacenter A and Datacenter B
  • 16. Oracle NoSQL: Key-value pairs • Simple data model – key-value pair (major + minor key paradigm) • Simple operations – read/insert/update/delete, read-modify-write (RMW) support • Scope of transaction – records within a major key, single API call • Unordered scan of all data (non-transactional) • Example – major key: userid; sub key: address, subscriptions; value: email id, phone #, expiration date • Keys are strings; values are byte arrays
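The major + minor key paradigm described above can be sketched with a toy in-memory store. This is a hypothetical illustration only: the class `ToyKeyValueStore` and its method names are invented for this example and are not the Oracle NoSQL Database API, and the sample keys and values are made up.

```python
from typing import Dict, Optional


class ToyKeyValueStore:
    """Hypothetical in-memory stand-in; NOT the Oracle NoSQL Database API."""

    def __init__(self) -> None:
        # major key -> {minor key -> value stored as bytes}
        self._records: Dict[str, Dict[str, bytes]] = {}

    def put(self, major: str, minor: str, value: bytes) -> None:
        """Insert or update one record."""
        self._records.setdefault(major, {})[minor] = value

    def get(self, major: str, minor: str) -> Optional[bytes]:
        """Read one record, or None if it does not exist."""
        return self._records.get(major, {}).get(minor)

    def delete(self, major: str, minor: str) -> bool:
        """Delete one record; return True if something was removed."""
        group = self._records.get(major, {})
        return group.pop(minor, None) is not None

    def multi_get(self, major: str) -> Dict[str, bytes]:
        """All records under one major key, returned as a copy."""
        return dict(self._records.get(major, {}))


store = ToyKeyValueStore()
store.put("user42", "address", b"10 Main St")
store.put("user42", "subscriptions", b"news;sport")
store.put("user7", "address", b"5 Side Rd")

# Everything stored under the same major key comes back in one call.
user42 = store.multi_get("user42")
print(sorted(user42))
```

Grouping records under a major key is what makes the "records within a major key, single API call" transaction scope natural: in a distributed store, all records sharing a major key can be kept together, so one call can touch them consistently.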
  • 17. Oracle NoSQL On Line Display Advertising
  • 18. Oracle NoSQL: Getting Started with Oracle NoSQL DB 1. Download from OTN: www.oracle.com/technetwork/products/nosqldb/downloads/index.html 2. Review the Quick Start & Getting Started Guide 3. Review the Programmatic API Guide 4. Start writing Java code
  • 19. What is R? • R is an open source language and environment for statistical computing and graphics (http://www.R-project.org/) • Started in 1994 as an alternative to SAS, SPSS and other proprietary statistical environments • The R environment – R is an integrated suite of software facilities for data manipulation, calculation and graphical display • Around 2 million R users worldwide – Widely taught in universities – Many corporate analysts know and use R • Thousands of open source R packages to enhance productivity, such as: – Bioinformatics – Spatial statistics – Financial market analysis
  • 20. Why do statisticians and data analysts use R? The R environment is: • Powerful • Extensible • Graphical • Extensive in statistics • Out-of-the-box (OOTB) functionality with many 'knobs' but smart defaults • Easy to install and use • Free
  • 21. Limitations of R • R is a client and server bundled together as one executable – Single-user tool, like Excel – Single-threaded – Cannot leverage multi-CPU capacity without special packages and coding • R requires data to be loaded into memory first – Loading data may not be a limitation given the RAM available on laptops/desktops – R's call-by-value semantics mean that, as data flows into functions, a function invocation can make a complete copy of the data (R copies an argument when the function modifies it) – As a result you can quickly run into memory limits
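To get a feel for how quickly the in-memory limits above are reached, here is a rough estimate. It assumes a purely numeric data set (R stores numeric values as 8-byte doubles) and a worst case in which each extra copy duplicates the whole data set; the row and column counts are made-up illustration values, and the function name is hypothetical.

```python
BYTES_PER_DOUBLE = 8  # R stores numeric values as 8-byte doubles


def peak_memory_gb(rows: int, cols: int, extra_copies: int) -> float:
    """Memory in GB (10**9 bytes) for the data plus full extra copies."""
    one_copy = rows * cols * BYTES_PER_DOUBLE
    return one_copy * (1 + extra_copies) / 10**9


# Illustrative data set: 50 million rows, 20 numeric columns.
original = peak_memory_gb(50_000_000, 20, extra_copies=0)
with_two_copies = peak_memory_gb(50_000_000, 20, extra_copies=2)

print(f"data alone: {original:.1f} GB, with two copies: {with_two_copies:.1f} GB")
```

Under these assumptions a data set that fits comfortably on its own can exceed a typical workstation's RAM after only a couple of copies.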
  • 22. Oracle R Connector for Hadoop • Provides transparent access to Hadoop Cluster, which consists of MapReduce and HDFS-resident data • R users not required to learn new language or interface to work with Hadoop • R users can execute jobs on a Hadoop cluster without requiring knowledge of Hadoop internals, Hadoop CLI, or IT infrastructure • Ability to leverage open source contributed R packages to work on HDFS-resident data
  • 23. Oracle R Enterprise • Provides a familiar R environment to operate on database-resident data • Overloads base R functions for scalable execution in Oracle Database – Automatically generates SQL from R and submits the query to the database – Leverages table parallelism where applicable • Enables embedded execution of R scripts at the Oracle Database server – Provides a database-controlled, data-parallel execution framework – Enables leveraging CRAN open source R packages • Enables integration of structured results and graphics with OBIEE dashboards and BI Publisher documents
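The idea of generating SQL from ordinary-looking data operations can be illustrated with a toy sketch. This is not Oracle R Enterprise and not its API: the class `LazyTable`, its methods, and the `sales` table are hypothetical, and the sketch is written in Python rather than R purely for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class LazyTable:
    """Hypothetical proxy that records operations instead of loading data."""

    name: str
    columns: List[str] = field(default_factory=lambda: ["*"])
    conditions: List[str] = field(default_factory=list)

    def select(self, *columns: str) -> "LazyTable":
        """Return a new proxy restricted to the given columns."""
        return LazyTable(self.name, list(columns), list(self.conditions))

    def where(self, condition: str) -> "LazyTable":
        """Return a new proxy with one more filter condition."""
        # Raw condition strings keep the sketch short; real code
        # should use bind parameters instead of string building.
        return LazyTable(self.name, list(self.columns), self.conditions + [condition])

    def to_sql(self) -> str:
        """Translate the recorded operations into a single SQL query."""
        sql = f"SELECT {', '.join(self.columns)} FROM {self.name}"
        if self.conditions:
            sql += " WHERE " + " AND ".join(self.conditions)
        return sql


query = LazyTable("sales").where("amount > 100").select("region", "amount")
print(query.to_sql())
```

No rows are touched until the generated query is sent to the database, which is the general reason this style lets the database do the heavy lifting on data that never has to fit in client memory.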
  • 24. Oracle R Links • Blog: https://blogs.oracle.com/R/ • Forum: https://forums.oracle.com/forums/forum.jspa?forumID=1397 • Oracle R Distribution: http://www.oracle.com/technetwork/indexes/downloads/r-distribution-1532464.html • ROracle: http://cran.r-project.org/web/packages/ROracle • Oracle R Enterprise: http://www.oracle.com/technetwork/database/options/advanced-analytics/r-enterprise • Oracle R Connector for Hadoop: http://www.oracle.com/us/products/database/big-data-connectors/overview
  • 25. Other Oracle Big Data Products • Oracle Endeca Information Discovery: http://www.oracle.com/us/solutions/business-analytics/business-intelligence/endeca/overview/index.html • Oracle Data Integrator Application Adapter for Hadoop: http://www.oracle.com/us/products/middleware/data-integration/hadoop/overview/index.html • Oracle Loader for Hadoop: http://www.oracle.com/technetwork/bdc/hadoop-loader/learnmore/index.html
  • 26. The End The best way to predict the future is to create it! - Peter F. Drucker

Editor's notes

  1. http://www.slideshare.net/cloudera/20100806-cloudera-10-hadoopable-problems-webinar-4931616
  2. http://www.youtube.com/watch?v=sANatTx87r4