SlideShare una empresa de Scribd logo
1 de 20
© 2013 IBM Corporation
IBM DB2 Analytics Accelerator (IDAA)
Near Real-Time Analytics with IDAA
March 2013
Daniel Martin (danmartin@de.ibm.com) – IBM Software Group, Information Management
© 2013 IBM Corporation
Disclaimer
© Copyright IBM Corporation 2012. All rights reserved.
U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with
IBM Corp.
IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without
notice at IBM’s sole discretion. Information regarding potential future products is intended to outline
our general product direction and it should not be relied on in making a purchasing decision. The
information mentioned regarding potential future products is not a commitment, promise, or legal
obligation to deliver any material, code or functionality. Information about potential future products may
not be incorporated into any contract. The development, release, and timing of any future features or
functionality described for our products remains at our sole discretion.
IBM, the IBM logo, ibm.com, DB2, and DB2 for z/OS are trademarks or registered trademarks of International Business Machines
Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first
occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law
trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law
trademarks in other countries. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at
www.ibm.com/legal/copytrade.shtml
Other company, product, or service names may be trademarks or service marks of others.
© 2013 IBM Corporation3 03/20/13
Introduction & Overview
© 2013 IBM Corporation
Concept: Transparently accelerate analytical
queries by dynamically offloading (DB2 optimizer
decides) to a data warehouse appliance: no
application change!
• Transparency: applications connected to
DB2 are entirely unaware of the
Accelerator
• Integration: Deep integration with DB2
(security, monitoring, backup, ...)
• Self-managed workloads: queries are
executed in the most efficient location
• Simplified administration: appliance
hands-free operations, eliminating most
database tuning tasks
• Performance: Unprecedented response
times for both, OLTP and OLAP queries
IBM DB2 Analytics Accelerator (IDAA)
© 2013 IBM Corporation5
“Host” Computers
Snippet BladesTM
(S-Blades, SPUs)
Disk Enclosures
IDAA Server
SQL Compiler, Query Plan, Optimizer,
Administration
2 front/end hosts, IBM 3650M3 or 3850X5
clustered active-passive
2 Nehalem-EP Quad-core 2.4GHz per host
Processor &
streaming DB logic
High-performance database
engine streaming joins,
aggregations, sorts, etc.
e.g. TF12: 12 back/end SPUs
(more details on following charts)
Slice of User Data
Swap and Mirror partitions
High speed data streaming
High compression rate
EXP3000 JBOD Enclosures
12 x 3.5” 1TB, 7200RPM, SAS (3Gb/s)
max 116MB/s (200-500MB/s
compressed data)
e.g. TF12:
8 enclosures → 96 HDDs
32TB uncompressed user data (→ 128TB)
9 GB/s scan rate (~36GB/s w. compression)
Powered by IBM Netezza
© 2013 IBM Corporation6
DB2 for z/OS
Optimizer
ISAOptDRDARequestor
Smart Analytics Optimizer
Application
Application
Interface
Queries executed with Smart Analytics Optimizer
Queries executed without Smart Analytics Optimizer
Heartbeat (Smart Analytics Optimizer availability and performance indicators)
Query execution run-time for queries
that cannot be or should not be off-
loaded to ISAOpt
SPU
CPU FPGA
Memory
SPU
CPU FPGA
Memory
SPU
CPU FPGA
Memory
SPU
CPU FPGA
Memory
SMPHost
Heartbeat
IDAA Query Execution
© 2013 IBM Corporation7 03/20/13
Integrating Replication - Requirements
 The Incremental Update capability is part of the base offering for all customers, and not a separately
orderable feature
 Fully integrated into IDAA
– Managed via IDAA Studio
– Integrated into IDAA software update
– Integrated into IDAA HA concepts
– Automated scheduling of maintenance operations (RUNSTATS / REORG) on IDAA
– Automation possible via Stored Procedure
© 2013 IBM Corporation8 03/20/13
Complementing Existing Synchronization Options
 There are different options to synchronize tables between DB2 and IDAA
– Choice depends on IDAA usage scenarios, update frequency, affinity to partitions, etc.
Synchronization options Use cases, characteristics and requirements
Full table refresh
The entire content of a database table is refreshed for
accelerator processing
 Existing ETL process replaces entire table
 Multiple sources or complex transformations
 Smaller, un-partitioned tables
 Reporting based on consistent snapshot (“check point”)
Table partition refresh
For a partitioned database table, selected partitions can be
refreshed for accelerator processing
 Optimization for (time-) partitioned warehouse tables, appending changes “at the end”
 More efficient than full table refresh for larger tables
 Reporting based on consistent snapshot (“check point”)
Incremental update
Log-based capturing of changes and propagation to IDAA
with low latency (typically few minutes)
 Scattered updates after “bulk” load
 Reporting on continuously updated data (e.g., an ODS), considering most recent
changes
 More efficient for smaller updates than full table refresh
© 2013 IBM Corporation9 03/20/13
Reporting and Analytics on Continuously Changing Data
 With continuously changing data, users may experience different results for subsequent query
execution
– Users need to understand this behavior
 Can use “waitForReplication” Accelerator SP subcommand
– Wait until all committed data at the time of SP invocation has been applied to the target
Time
Users submitting queries
Updates to database
waitForReplication() waitForReplication()
© 2013 IBM Corporation10 03/20/13
Architecture
© 2013 IBM Corporation11 03/20/13
IBM Puredata System for AnalyticsIBM Puredata System for Analytics
Architecture
DB2 for z/OSDB2 for z/OS
insert
delete
update
Engine for
DB2 z/OS
(Log reading)
Engine for
DB2 z/OS
(Log reading)
IDAA
Database
IDAA
Database
Engine for IBM Netezza
(stage + apply changes)
Engine for IBM Netezza
(stage + apply changes)
APIAPI
IDAA ServerIDAA Server
Access Server
(manage engines and
subscriptions)
Access Server
(manage engines and
subscriptions)
(private network
10G fiber)
Catalog
information
Catalog
information
<xml>
IDAA Stored Procedures
ACCEL_CONTROL_ACCELERATOR
ACCEL_ENABLE_REPLICATION
...
IDAA Stored Procedures
ACCEL_CONTROL_ACCELERATOR
ACCEL_ENABLE_REPLICATION
...
JCLJCL
Automation Code
(creates data sources,
subscriptions)
Automation Code
(creates data sources,
subscriptions)
IDAA StudioIDAA Studio
© 2013 IBM Corporation12 03/20/13
Properties of this Architecture
 Optimized for throughput
– During normal operation, no disk I/O involved
• DB2 → log buffer → capture staging space → network → apply staging space → IDAA
– Changes within the apply staging space are consolidated on the target
• More than one change to the same row results in a single change
– Mini-batches to leverage Netezza bulk load interface
• The source sends a UR to the target once the commit log record was read
• The target applies all URs that arrived during a 60s window (or if size limit reached)
– UPDATEs are decomposed into <DELETE, INSERT> pairs (and merged with “regular”
DELETE and INSERT batches)
 Use of parallel UNLOAD with DB2 INTERNAL format to establish the initial snapshot of a table
– Replication continues from this snapshot (capture point automatically managed)
 IDAA schedules REORG automatically as a low prio task in the background as a threshold of
“disorganization” is reached on Netezza
 Simple identity mapping of tables
– No user-exits
– No transformations
 Based on “production” components
© 2013 IBM Corporation13 03/20/13
Incremental Update - Table Refresh Integration
Using IDAA table-refresh for taking the initial snapshot or re-syncing after bulk changes
Use case Details Operations
Enable incremental update on a
newly added table (state:
INITIAL_LOAD_PENDING)
Lock mode TABLE or TABLESET used
for the load to prevent in-flight changes
while the UNLOADs are running
● Enable replication for table
● Load table (sets capture point when
load completed)
● Start replication
Re-load a loaded, replicated
table, e.g. because of non-
logged operation on source
table
Assumption: table is synchronized after
re-load, replication will continue from
this new “snapshot”
● Full reload or partition-reload the table
(sets new capture point when the load
completed)
© 2013 IBM Corporation14 03/20/13
User Interface
Incremental update UI elements only visible if function was enabled on the DB2 subsystem
 Start / stop replication process (per subsystem-accelerator pair)
 Enable / disable replication (per table)
 Trace collection
 Information on replication latency and events
© 2013 IBM Corporation15 03/20/13
High-Availability Setup
 Capture side
– One active capture engine per DS-Group
• Multiple stand-by instances, coordinated via ENQ
• Shared metadata
– z/OS Communication Server migrates D-VIPA in case of fail-over
 Apply side (appliance internal)
– Integration into cluster management (active-standby)
– Mirrored disk between active and standby host (shared metadata)
– All components are migrated to the standby host and restarted
– replication will continue automatically where it left off
Member 1
Capture
(active)
Member 2
LPAR 2
LPAR 1
DS Group
Capture
(hot-standby)
catalog
D-VIPA
D-VIPA
© 2013 IBM Corporation16 03/20/13
Replication Tuning
 Replication on the target system produces DELETE statements with predicates on the unique columns
(index or constraint) of the source table
– Can use “clustered base tables” for more efficient location of rows to be deleted
– Caveat: may conflict with tuning objectives (e.g. table already clustered on time columns)
 If multiple unique constraints are available, we automatically select the “best” set of columns
– The set with the minimal number of columns (partially) matching existing clustering columns
 If tables are not clustered yet, the system suggests to cluster on source table columns with unique
index or unique constraint
© 2013 IBM Corporation17 03/20/13
Evaluation
© 2013 IBM Corporation18 03/20/13
Impact on Concurrently Running Queries
 Validated that incremental update has only minor impact on query response time
– “No” workload:
• 10x parallel queries: 5 streaming, 5 aggregation / group by
– “Medium” workload:
• 10x parallel queries: 5 streaming, 5 aggregation / group by
• Replication from 1 subsystem: 300.000 rows/minute / 5.000 rows/s
– “Full” workload
• 10x parallel queries: 5 streaming, 5 aggregation / group by
• Replication from 2 subsystems: 2.0 mio rows/minute, 33.333 rows/s
© 2013 IBM Corporation19 03/20/13
Table Refresh “Best Practices”
© 2013 IBM Corporation20 03/20/13

Más contenido relacionado

La actualidad más candente

Nové vlastnosti Oracle Database Appliance
Nové vlastnosti Oracle Database ApplianceNové vlastnosti Oracle Database Appliance
Nové vlastnosti Oracle Database ApplianceMarketingArrowECS_CZ
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems IBM Power Systems
 
Understanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageUnderstanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageIBM Power Systems
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specificationsinside-BigData.com
 
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...Shawn Wells
 
Overcoming write availability challenges of PostgreSQL
Overcoming write availability challenges of PostgreSQLOvercoming write availability challenges of PostgreSQL
Overcoming write availability challenges of PostgreSQLEDB
 
Expert Guide to Migrating Legacy Databases to Postgres
Expert Guide to Migrating Legacy Databases to PostgresExpert Guide to Migrating Legacy Databases to Postgres
Expert Guide to Migrating Legacy Databases to PostgresEDB
 
Public Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQLPublic Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQLEDB
 
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAATemporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAACuneyt Goksu
 
An Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLAn Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLEDB
 
How to Design for Database High Availability
How to Design for Database High AvailabilityHow to Design for Database High Availability
How to Design for Database High AvailabilityEDB
 
MOUG17 Keynote: Oracle OpenWorld Major Announcements
MOUG17 Keynote: Oracle OpenWorld Major AnnouncementsMOUG17 Keynote: Oracle OpenWorld Major Announcements
MOUG17 Keynote: Oracle OpenWorld Major AnnouncementsMonica Li
 
Db2 family and v11.1.4.4
Db2 family and v11.1.4.4Db2 family and v11.1.4.4
Db2 family and v11.1.4.4ModusOptimum
 
HDT for Mainframe Considerations: Simplified Tiered Storage
HDT for Mainframe Considerations: Simplified Tiered StorageHDT for Mainframe Considerations: Simplified Tiered Storage
HDT for Mainframe Considerations: Simplified Tiered StorageHitachi Vantara
 
Superior Cloud Economics with Power Systems
Superior Cloud Economics with Power Systems Superior Cloud Economics with Power Systems
Superior Cloud Economics with Power Systems IBM Power Systems
 
Expert summit SQL Server 2016
Expert summit   SQL Server 2016Expert summit   SQL Server 2016
Expert summit SQL Server 2016Łukasz Grala
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascaleinside-BigData.com
 
Migrating from Oracle to Postgres
Migrating from Oracle to PostgresMigrating from Oracle to Postgres
Migrating from Oracle to PostgresEDB
 
Beginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for PostgresBeginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for PostgresEDB
 

La actualidad más candente (20)

Nové vlastnosti Oracle Database Appliance
Nové vlastnosti Oracle Database ApplianceNové vlastnosti Oracle Database Appliance
Nové vlastnosti Oracle Database Appliance
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems
 
Understanding the IBM Power Systems Advantage
Understanding the IBM Power Systems AdvantageUnderstanding the IBM Power Systems Advantage
Understanding the IBM Power Systems Advantage
 
IBM POWER8 as an HPC platform
IBM POWER8 as an HPC platformIBM POWER8 as an HPC platform
IBM POWER8 as an HPC platform
 
IBM Power9 Features and Specifications
IBM Power9 Features and SpecificationsIBM Power9 Features and Specifications
IBM Power9 Features and Specifications
 
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...
2017-02-21 AFCEA West Building Continuous Integration & Deployment (CI/CD) Pi...
 
Overcoming write availability challenges of PostgreSQL
Overcoming write availability challenges of PostgreSQLOvercoming write availability challenges of PostgreSQL
Overcoming write availability challenges of PostgreSQL
 
Expert Guide to Migrating Legacy Databases to Postgres
Expert Guide to Migrating Legacy Databases to PostgresExpert Guide to Migrating Legacy Databases to Postgres
Expert Guide to Migrating Legacy Databases to Postgres
 
Public Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQLPublic Sector Virtual Town Hall: High Availability for PostgreSQL
Public Sector Virtual Town Hall: High Availability for PostgreSQL
 
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAATemporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
 
An Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQLAn Expert Guide to Migrating Legacy Databases to PostgreSQL
An Expert Guide to Migrating Legacy Databases to PostgreSQL
 
How to Design for Database High Availability
How to Design for Database High AvailabilityHow to Design for Database High Availability
How to Design for Database High Availability
 
MOUG17 Keynote: Oracle OpenWorld Major Announcements
MOUG17 Keynote: Oracle OpenWorld Major AnnouncementsMOUG17 Keynote: Oracle OpenWorld Major Announcements
MOUG17 Keynote: Oracle OpenWorld Major Announcements
 
Db2 family and v11.1.4.4
Db2 family and v11.1.4.4Db2 family and v11.1.4.4
Db2 family and v11.1.4.4
 
HDT for Mainframe Considerations: Simplified Tiered Storage
HDT for Mainframe Considerations: Simplified Tiered StorageHDT for Mainframe Considerations: Simplified Tiered Storage
HDT for Mainframe Considerations: Simplified Tiered Storage
 
Superior Cloud Economics with Power Systems
Superior Cloud Economics with Power Systems Superior Cloud Economics with Power Systems
Superior Cloud Economics with Power Systems
 
Expert summit SQL Server 2016
Expert summit   SQL Server 2016Expert summit   SQL Server 2016
Expert summit SQL Server 2016
 
Co-Design Architecture for Exascale
Co-Design Architecture for ExascaleCo-Design Architecture for Exascale
Co-Design Architecture for Exascale
 
Migrating from Oracle to Postgres
Migrating from Oracle to PostgresMigrating from Oracle to Postgres
Migrating from Oracle to Postgres
 
Beginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for PostgresBeginner's Guide to High Availability for Postgres
Beginner's Guide to High Availability for Postgres
 

Destacado (9)

Job center
Job centerJob center
Job center
 
Presentation of nouns
Presentation of nounsPresentation of nouns
Presentation of nouns
 
Poetic devices
Poetic devicesPoetic devices
Poetic devices
 
Remembrance Day
Remembrance DayRemembrance Day
Remembrance Day
 
Persuasive writing g7
Persuasive writing g7  Persuasive writing g7
Persuasive writing g7
 
Job centre presentation
Job centre presentationJob centre presentation
Job centre presentation
 
Nouns (1)
Nouns (1)Nouns (1)
Nouns (1)
 
Singular and plural nouns ppt
Singular and plural nouns pptSingular and plural nouns ppt
Singular and plural nouns ppt
 
10 facts about jobs in the future
10 facts about jobs in the future10 facts about jobs in the future
10 facts about jobs in the future
 

Similar a EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator

IBM Analytics Accelerator Trends & Directions Namk Hrle
IBM Analytics Accelerator  Trends & Directions Namk Hrle IBM Analytics Accelerator  Trends & Directions Namk Hrle
IBM Analytics Accelerator Trends & Directions Namk Hrle Surekha Parekh
 
IBM DB2 Analytics Accelerator Trends & Directions by Namik Hrle
IBM DB2 Analytics Accelerator  Trends & Directions by Namik Hrle IBM DB2 Analytics Accelerator  Trends & Directions by Namik Hrle
IBM DB2 Analytics Accelerator Trends & Directions by Namik Hrle Surekha Parekh
 
Ibm db2 analytics accelerator high availability and disaster recovery
Ibm db2 analytics accelerator  high availability and disaster recoveryIbm db2 analytics accelerator  high availability and disaster recovery
Ibm db2 analytics accelerator high availability and disaster recoverybupbechanhgmail
 
8392-exadatamaa-1887964.pptx
8392-exadatamaa-1887964.pptx8392-exadatamaa-1887964.pptx
8392-exadatamaa-1887964.pptxRaniVuppal
 
Oracle Database 12c Multitenant for Consolidation
Oracle Database 12c Multitenant for ConsolidationOracle Database 12c Multitenant for Consolidation
Oracle Database 12c Multitenant for ConsolidationYudi Herdiana
 
Informix warehouse and accelerator overview
Informix warehouse and accelerator overviewInformix warehouse and accelerator overview
Informix warehouse and accelerator overviewKeshav Murthy
 
Oracle MAA Best Practices - Applications Considerations
Oracle MAA Best Practices - Applications ConsiderationsOracle MAA Best Practices - Applications Considerations
Oracle MAA Best Practices - Applications ConsiderationsMarkus Michalewicz
 
Consolidate your SAP System landscape Teched && d-code 2014
Consolidate your SAP System landscape Teched && d-code 2014Consolidate your SAP System landscape Teched && d-code 2014
Consolidate your SAP System landscape Teched && d-code 2014Goetz Lessmann
 
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...Informatik Aktuell
 
System z Technology Summit Streamlining Utilities
System z Technology Summit Streamlining UtilitiesSystem z Technology Summit Streamlining Utilities
System z Technology Summit Streamlining UtilitiesSurekha Parekh
 
DB2 pureScale Overview Sept 2010
DB2 pureScale Overview Sept 2010DB2 pureScale Overview Sept 2010
DB2 pureScale Overview Sept 2010Laura Hood
 
Intro to goldilocks inmemory db - low latency
Intro to goldilocks inmemory db - low latencyIntro to goldilocks inmemory db - low latency
Intro to goldilocks inmemory db - low latencyDongpyo Lee
 
오라클 DR 및 복제 솔루션(Dbvisit 소개)
오라클 DR 및 복제 솔루션(Dbvisit 소개)오라클 DR 및 복제 솔루션(Dbvisit 소개)
오라클 DR 및 복제 솔루션(Dbvisit 소개)Linux Foundation Korea
 
The Central View of your Data with Postgres
The Central View of your Data with PostgresThe Central View of your Data with Postgres
The Central View of your Data with PostgresEDB
 
Présentation IBM DB2 Blu - Fabrizio DANUSSO
Présentation IBM DB2 Blu - Fabrizio DANUSSOPrésentation IBM DB2 Blu - Fabrizio DANUSSO
Présentation IBM DB2 Blu - Fabrizio DANUSSOIBMInfoSphereUGFR
 
Db2 analytics accelerator technical update
Db2 analytics accelerator  technical updateDb2 analytics accelerator  technical update
Db2 analytics accelerator technical updateCuneyt Goksu
 

Similar a EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator (20)

IBM Analytics Accelerator Trends & Directions Namk Hrle
IBM Analytics Accelerator  Trends & Directions Namk Hrle IBM Analytics Accelerator  Trends & Directions Namk Hrle
IBM Analytics Accelerator Trends & Directions Namk Hrle
 
IBM DB2 Analytics Accelerator Trends & Directions by Namik Hrle
IBM DB2 Analytics Accelerator  Trends & Directions by Namik Hrle IBM DB2 Analytics Accelerator  Trends & Directions by Namik Hrle
IBM DB2 Analytics Accelerator Trends & Directions by Namik Hrle
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
Ibm db2 analytics accelerator high availability and disaster recovery
Ibm db2 analytics accelerator  high availability and disaster recoveryIbm db2 analytics accelerator  high availability and disaster recovery
Ibm db2 analytics accelerator high availability and disaster recovery
 
13721876
1372187613721876
13721876
 
8392-exadatamaa-1887964.pptx
8392-exadatamaa-1887964.pptx8392-exadatamaa-1887964.pptx
8392-exadatamaa-1887964.pptx
 
Oracle Database 12c Multitenant for Consolidation
Oracle Database 12c Multitenant for ConsolidationOracle Database 12c Multitenant for Consolidation
Oracle Database 12c Multitenant for Consolidation
 
Informix warehouse and accelerator overview
Informix warehouse and accelerator overviewInformix warehouse and accelerator overview
Informix warehouse and accelerator overview
 
Oracle MAA Best Practices - Applications Considerations
Oracle MAA Best Practices - Applications ConsiderationsOracle MAA Best Practices - Applications Considerations
Oracle MAA Best Practices - Applications Considerations
 
Maximize Availability With Oracle Database 12c
Maximize Availability With Oracle Database 12cMaximize Availability With Oracle Database 12c
Maximize Availability With Oracle Database 12c
 
Consolidate your SAP System landscape Teched && d-code 2014
Consolidate your SAP System landscape Teched && d-code 2014Consolidate your SAP System landscape Teched && d-code 2014
Consolidate your SAP System landscape Teched && d-code 2014
 
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...
Stephan Hummel – IT-Tage 2015 – DB2 In-Memory - Eine Technologie nicht nur fü...
 
System z Technology Summit Streamlining Utilities
System z Technology Summit Streamlining UtilitiesSystem z Technology Summit Streamlining Utilities
System z Technology Summit Streamlining Utilities
 
DB2 pureScale Overview Sept 2010
DB2 pureScale Overview Sept 2010DB2 pureScale Overview Sept 2010
DB2 pureScale Overview Sept 2010
 
Intro to goldilocks inmemory db - low latency
Intro to goldilocks inmemory db - low latencyIntro to goldilocks inmemory db - low latency
Intro to goldilocks inmemory db - low latency
 
Greenplum feature
Greenplum featureGreenplum feature
Greenplum feature
 
오라클 DR 및 복제 솔루션(Dbvisit 소개)
오라클 DR 및 복제 솔루션(Dbvisit 소개)오라클 DR 및 복제 솔루션(Dbvisit 소개)
오라클 DR 및 복제 솔루션(Dbvisit 소개)
 
The Central View of your Data with Postgres
The Central View of your Data with PostgresThe Central View of your Data with Postgres
The Central View of your Data with Postgres
 
Présentation IBM DB2 Blu - Fabrizio DANUSSO
Présentation IBM DB2 Blu - Fabrizio DANUSSOPrésentation IBM DB2 Blu - Fabrizio DANUSSO
Présentation IBM DB2 Blu - Fabrizio DANUSSO
 
Db2 analytics accelerator technical update
Db2 analytics accelerator  technical updateDb2 analytics accelerator  technical update
Db2 analytics accelerator technical update
 

Último

AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdfPearlKirahMaeRagusta1
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...kalichargn70th171
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfproinshot.com
 

Último (20)

AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 

EDBT 2013 - Near Realtime Analytics with IBM DB2 Analytics Accelerator

  • 1. © 2013 IBM Corporation IBM DB2 Analytics Accelerator (IDAA) Near Real-Time Analytics with IDAA March 2013 Daniel Martin (danmartin@de.ibm.com) – IBM Software Group, Information Management
  • 2. © 2013 IBM Corporation Disclaimer © Copyright IBM Corporation 2012. All rights reserved. U.S. Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. IBM’s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice at IBM’s sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. IBM, the IBM logo, ibm.com, DB2, and DB2 for z/OS are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at “Copyright and trademark information” at www.ibm.com/legal/copytrade.shtml Other company, product, or service names may be trademarks or service marks of others.
  • 3. © 2013 IBM Corporation3 03/20/13 Introduction & Overview
  • 4. © 2013 IBM Corporation Concept: Transparently accelerate analytical queries by dynamically offloading (DB2 optimizer decides) to a data warehouse appliance: no application change! • Transparency: applications connected to DB2 are entirely unaware of the Accelerator • Integration: Deep integration with DB2 (security, monitoring, backup, ...) • Self-managed workloads: queries are executed in the most efficient location • Simplified administration: appliance hands-free operations, eliminating most database tuning tasks • Performance: Unprecedented response times for both, OLTP and OLAP queries IBM DB2 Analytics Accelerator (IDAA)
  • 5. © 2013 IBM Corporation5 “Host” Computers Snippet BladesTM (S-Blades, SPUs) Disk Enclosures IDAA Server SQL Compiler, Query Plan, Optimizer, Administration 2 front/end hosts, IBM 3650M3 or 3850X5 clustered active-passive 2 Nehalem-EP Quad-core 2.4GHz per host Processor & streaming DB logic High-performance database engine streaming joins, aggregations, sorts, etc. e.g. TF12: 12 back/end SPUs (more details on following charts) Slice of User Data Swap and Mirror partitions High speed data streaming High compression rate EXP3000 JBOD Enclosures 12 x 3.5” 1TB, 7200RPM, SAS (3Gb/s) max 116MB/s (200-500MB/s compressed data) e.g. TF12: 8 enclosures → 96 HDDs 32TB uncompressed user data (→ 128TB) 9 GB/s scan rate (~36GB/s w. compression) Powered by IBM Netezza
  • 6. © 2013 IBM Corporation6 DB2 for z/OS Optimizer ISAOptDRDARequestor Smart Analytics Optimizer Application Application Interface Queries executed with Smart Analytics Optimizer Queries executed without Smart Analytics Optimizer Heartbeat (Smart Analytics Optimizer availability and performance indicators) Query execution run-time for queries that cannot be or should not be off- loaded to ISAOpt SPU CPU FPGA Memory SPU CPU FPGA Memory SPU CPU FPGA Memory SPU CPU FPGA Memory SMPHost Heartbeat IDAA Query Execution
  • 7. © 2013 IBM Corporation7 03/20/13 Integrating Replication - Requirements  The Incremental Update capability is part of the base offering for all customers, and not a separately orderable feature  Fully integrated into IDAA – Managed via IDAA Studio – Integrated into IDAA software update – Integrated into IDAA HA concepts – Automated scheduling of maintenance operations (RUNSTATS / REORG) on IDAA – Automation possible via Stored Procedure
  • 8. © 2013 IBM Corporation8 03/20/13 Complementing Existing Synchronization Options  There are different options to synchronize tables between DB2 and IDAA – Choice depends on IDAA usage scenarios, update frequency, affinity to partitions, etc. Synchronization options Use cases, characteristics and requirements Full table refresh The entire content of a database table is refreshed for accelerator processing  Existing ETL process replaces entire table  Multiple sources or complex transformations  Smaller, un-partitioned tables  Reporting based on consistent snapshot (“check point”) Table partition refresh For a partitioned database table, selected partitions can be refreshed for accelerator processing  Optimization for (time-) partitioned warehouse tables, appending changes “at the end”  More efficient than full table refresh for larger tables  Reporting based on consistent snapshot (“check point”) Incremental update Log-based capturing of changes and propagation to IDAA with low latency (typically few minutes)  Scattered updates after “bulk” load  Reporting on continuously updated data (e.g., an ODS), considering most recent changes  More efficient for smaller updates than full table refresh
  • 9. © 2013 IBM Corporation9 03/20/13 Reporting and Analytics on Continuously Changing Data  With continuously changing data, users may experience different results for subsequent query execution – Users need to understand this behavior  Can use “waitForReplication” Accelerator SP subcommand – Wait until all committed data at the time of SP invocation has been applied to the target Time Users submitting queries Updates to database waitForReplication() waitForReplication()
  • 10. © 2013 IBM Corporation10 03/20/13 Architecture
  • 11. © 2013 IBM Corporation11 03/20/13 IBM Puredata System for AnalyticsIBM Puredata System for Analytics Architecture DB2 for z/OSDB2 for z/OS insert delete update Engine for DB2 z/OS (Log reading) Engine for DB2 z/OS (Log reading) IDAA Database IDAA Database Engine for IBM Netezza (stage + apply changes) Engine for IBM Netezza (stage + apply changes) APIAPI IDAA ServerIDAA Server Access Server (manage engines and subscriptions) Access Server (manage engines and subscriptions) (private network 10G fiber) Catalog information Catalog information <xml> IDAA Stored Procedures ACCEL_CONTROL_ACCELERATOR ACCEL_ENABLE_REPLICATION ... IDAA Stored Procedures ACCEL_CONTROL_ACCELERATOR ACCEL_ENABLE_REPLICATION ... JCLJCL Automation Code (creates data sources, subscriptions) Automation Code (creates data sources, subscriptions) IDAA StudioIDAA Studio
  • 12. © 2013 IBM Corporation12 03/20/13 Properties of this Architecture  Optimized for throughput – During normal operation, no disk I/O involved • DB2 → log buffer → capture staging space → network → apply staging space → IDAA – Changes within the apply staging space are consolidated on the target • More than one change to the same row results in a single change – Mini-batches to leverage Netezza bulk load interface • The source sends a UR to the target once the commit log record was read • The target applies all URs that arrived during a 60s window (or if size limit reached) – UPDATEs are decomposed into <DELETE, INSERT> pairs (and merged with “regular” DELETE and INSERT batches)  Use of parallel UNLOAD with DB2 INTERNAL format to establish the initial snapshot of a table – Replication continues from this snapshot (capture point automatically managed)  IDAA schedules REORG automatically as a low prio task in the background as a threshold of “disorganization” is reached on Netezza  Simple identity mapping of tables – No user-exits – No transformations  Based on “production” components
  • 13. © 2013 IBM Corporation13 03/20/13 Incremental Update - Table Refresh Integration Using IDAA table-refresh for taking the initial snapshot or re-syncing after bulk changes Use case Details Operations Enable incremental update on a newly added table (state: INITIAL_LOAD_PENDING) Lock mode TABLE or TABLESET used for the load to prevent in-flight changes while the UNLOADs are running ● Enable replication for table ● Load table (sets capture point when load completed) ● Start replication Re-load a loaded, replicated table, e.g. because of non- logged operation on source table Assumption: table is synchronized after re-load, replication will continue from this new “snapshot” ● Full reload or partition-reload the table (sets new capture point when the load completed)
  • 14. © 2013 IBM Corporation14 03/20/13 User Interface Incremental update UI elements only visible if function was enabled on the DB2 subsystem  Start / stop replication process (per subsystem-accelerator pair)  Enable / disable replication (per table)  Trace collection  Information on replication latency and events
  • 15. © 2013 IBM Corporation15 03/20/13 High-Availability Setup  Capture side – One active capture engine per DS-Group • Multiple stand-by instances, coordinated via ENQ • Shared metadata – z/OS Communication Server migrates D-VIPA in case of fail-over  Apply side (appliance internal) – Integration into cluster management (active-standby) – Mirrored disk between active and standby host (shared metadata) – All components are migrated to the standby host and restarted – replication will continue automatically where it left off Member 1 Capture (active) Member 2 LPAR 2 LPAR 1 DS Group Capture (hot-standby) catalog D-VIPA D-VIPA
  • 16. © 2013 IBM Corporation16 03/20/13 Replication Tuning  Replication on the target system produces DELETE statements with predicates on the unique columns (index or constraint) of the source table – Can use “clustered base tables” for more efficient location of rows to be deleted – Caveat: may conflict with tuning objectives (e.g. table already clustered on time columns)  If multiple unique constraints are available, we automatically select the “best” set of columns – The set with the minimal number of columns (partially) matching existing clustering columns  If tables are not clustered yet, the system suggests to cluster on source table columns with unique index or unique constraint
  • 17. © 2013 IBM Corporation17 03/20/13 Evaluation
  • 18. © 2013 IBM Corporation18 03/20/13 Impact on Concurrently Running Queries  Validated that incremental update has only minor impact on query response time – “No” workload: • 10x parallel queries: 5 streaming, 5 aggregation / group by – “Medium” workload: • 10x parallel queries: 5 streaming, 5 aggregation / group by • Replication from 1 subsystem: 300.000 rows/minute / 5.000 rows/s – “Full” workload • 10x parallel queries: 5 streaming, 5 aggregation / group by • Replication from 2 subsystems: 2.0 mio rows/minute, 33.333 rows/s
  • 19. © 2013 IBM Corporation19 03/20/13 Table Refresh “Best Practices”
  • 20. © 2013 IBM Corporation20 03/20/13