SlideShare una empresa de Scribd logo
1 de 48
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/1
Outline
• Introduction
• Background
• Distributed Database Design
• Database Integration
• Semantic Data Control
• Distributed Query Processing
• Multidatabase Query Processing
• Distributed Transaction Management
• Data Replication
• Parallel Database Systems
➡ Data placement and query processing
➡ Load balancing
➡ Database clusters
• Distributed Object DBMS
• Peer-to-Peer Data Management
• Web Data Management
• Current Issues
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/2
The Database Problem
• Large volume of data use disk and large main memory
• I/O bottleneck (or memory access bottleneck)
➡ Speed(disk) << speed(RAM) << speed(microprocessor)
• Predictions
➡ Moore’s law: processor speed growth (with multicore): 50 % per year
➡ DRAM capacity growth : 4 × every three years
➡ Disk throughput : 2 × in the last ten years
• Conclusion : the I/O bottleneck worsens
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/3
The Solution
• Increase the I/O bandwidth
➡ Data partitioning
➡ Parallel data access
• Origins (1980's): database machines
➡ Hardware-oriented bad cost-performance failure
➡ Notable exception : ICL's CAFS Intelligent Search Processor
• 1990's: same solution but using standard hardware components
integrated in a multiprocessor
➡ Software-oriented
➡ Standard essential to exploit continuing technology improvements
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/4
Multiprocessor Objectives
• High-performance with better cost-performance than mainframe or vector
supercomputer
• Use many nodes, each with good cost-performance, communicating
through network
➡ Good cost via high-volume components
➡ Good performance via bandwidth
• Trends
➡ Microprocessor and memory (DRAM): off-the-shelf
➡ Network (multiprocessor edge): custom
• The real chalenge is to parallelize applications to run with good load
balancing
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/5
Data Server Architecture
client interface
query parsing
data server interface
communication channel
Application
server
Data
server
database
application server interface
database functions
Client
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/6
Objectives of Data Servers
• Avoid the shortcomings of the traditional DBMS approach
➡ Centralization of data and application management
➡ General-purpose OS (not DB-oriented)
• By separating the functions between
➡ Application server (or host computer)
➡ Data server (or database computer or back-end computer)
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/7
Data Server Approach:
Assessment
• Advantages
➡ Integrated data control by the server (black box)
➡ Increased performance by dedicated system
➡ Can better exploit parallelism
➡ Fits well in distributed environments
• Potential problems
➡ Communication overhead between application and data server
✦ High-level interface
➡ High cost with mainframe servers
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/8
Parallel Data Processing
• Three ways of exploiting high-performance multiprocessor systems:
 Automatically detect parallelism in sequential programs (e.g., Fortran, OPS5)
 Augment an existing language with parallel constructs (e.g., C*, Fortran90)
 Offer a new language in which parallelism can be expressed or automatically
inferred
• Critique
 Hard to develop parallelizing compilers, limited resulting speed-up
 Enables the programmer to express parallel computations but too low-level
 Can combine the advantages of both (1) and (2)
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/9
Data-based Parallelism
• Inter-operation
➡ p operations of the same query in parallel
op.3
op.1 op.2
op.
R
op.
R1
op.
R2
op.
R3
op.
R4
• Intra-operation
➡ The same op in parallel
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/10
Parallel DBMS
• Loose definition: a DBMS implemented on a tighly coupled
multiprocessor
• Alternative extremes
➡ Straighforward porting of relational DBMS (the software vendor edge)
➡ New hardware/software combination (the computer manufacturer edge)
• Naturally extends to distributed databases with one server per site
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/11
Parallel DBMS - Objectives
• Much better cost / performance than mainframe solution
• High-performance through parallelism
➡ High throughput with inter-query parallelism
➡ Low response time with intra-operation parallelism
• High availability and reliability by exploiting data replication
• Extensibility with the ideal goals
➡ Linear speed-up
➡ Linear scale-up
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/12
Linear Speed-up
Linear increase in performance for a constant DB size and proportional
increase of the system components (processor, memory, disk)
new perf.
old perf.
ideal
components
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/13
Linear Scale-up
Sustained performance for a linear increase of database size and
proportional increase of the system components.
components + database size
new perf.
old perf.
ideal
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/14
Barriers to Parallelism
• Startup
➡ The time needed to start a parallel operation may dominate the actual
computation time
• Interference
➡ When accessing shared resources, each new process slows down the others
(hot spot problem)
• Skew
➡ The response time of a set of parallel processes is the time of the slowest one
• Parallel data management techniques intend to overcome these barriers
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/15
Parallel DBMS – Functional
Architecture
RM
task n
DM
task 12
DM
task n2
DM
task n1
Data Mgr
DM
task 11
Request MgrRM
task 1
Session Mgr
User
task 1
User
task n
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/16
Parallel DBMS Functions
• Session manager
➡ Host interface
➡ Transaction monitoring for OLTP
• Request manager
➡ Compilation and optimization
➡ Data directory management
➡ Semantic data control
➡ Execution control
• Data manager
➡ Execution of DB operations
➡ Transaction management support
➡ Data management
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/17
Parallel System Architectures
• Multiprocessor architecture alternatives
➡ Shared memory (SM)
➡ Shared disk (SD)
➡ Shared nothing (SN)
• Hybrid architectures
➡ Non-Uniform Memory Architecture (NUMA)
➡ Cluster
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/18
Shared-Memory
DBMS on symmetric multiprocessors (SMP)
Prototypes: XPRS, Volcano, DBS3
+ Simplicity, load balancing, fast communication
- Network cost, low extensibility
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/19
Shared-Disk
Origins : DEC's VAXcluster, IBM's IMS/VS Data Sharing
Used first by Oracle with its Distributed Lock Manager
Now used by most DBMS vendors
+ network cost, extensibility, migration from uniprocessor
- complexity, potential performance problem for cache coherency
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/20
Shared-Nothing
Used by Teradata, IBM, Sybase, Microsoft for OLAP
Prototypes: Gamma, Bubba, Grace, Prisma, EDS
+ Extensibility, availability
- Complexity, difficult load balancing
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/21
Hybrid Architectures
• Various possible combinations of the three basic architectures are possible
to obtain different trade-offs between cost, performance, extensibility,
availability, etc.
• Hybrid architectures try to obtain the advantages of different architectures:
➡ efficiency and simplicity of shared-memory
➡ extensibility and cost of either shared disk or shared nothing
• 2 main kinds: NUMA and cluster
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/22
NUMA
• Shared-Memory vs. Distributed Memory
➡ Mixes two different aspects : addressing and memory
✦ Addressing: single address space vs multiple address spaces
✦ Physical memory: central vs distributed
• NUMA = single address space on distributed physical memory
➡ Eases application portability
➡ Extensibility
• The most successful NUMA is Cache Coherent NUMA (CC-NUMA)
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/23
CC-NUMA
• Principle
➡ Main memory distributed as with shared-nothing
➡ However, any processor has access to all other processors’ memories
• Similar to shared-disk, different processors can access the same data in a
conflicting update mode, so global cache consistency protocols are needed.
➡ Cache consistency done in hardware through a special consistent cache
interconnect
✦ Remote memory access very efficient, only a few times (typically between 2 and
3 times) the cost of local access
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/24
Cluster
• Combines good load balancing of SM with extensibility of SN
• Server nodes: off-the-shelf components
➡ From simple PC components to more powerful SMP
➡ Yields the best cost/performance ratio
➡ In its cheapest form,
• Fast standard interconnect (e.g., Myrinet and Infiniband) with high
bandwidth (Gigabits/sec) and low latency
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/25
SN cluster vs SD cluster
• SN cluster can yield best cost/performance and extensibility
➡ But adding or replacing cluster nodes requires disk and data reorganization
• SD cluster avoids such reorganization but requires disks to be globally
accessible by the cluster nodes
➡ Network-attached storage (NAS)
✦ distributed file system protocol such as NFS, relatively slow and not appropriate
for database management
➡ Storage-area network (SAN)
✦ Block-based protocol thus making it easier to manage cache consistency, efficient,
but costlier
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/26
Discussion
• For a small configuration (e.g., 8 processors), SM can provide the highest
performance because of better load balancing
• Some years ago, SN was the only choice for high-end systems. But SAN
makes SN a viable alternative with the main advantage of simplicity (for
transaction management)
➡ SD is now the preferred architecture for OLTP
➡ But for OLAP databases that are typically very large and mostly read-only,
SN is used
• Hybrid architectures, such as NUMA and cluster, can combine the
efficiency and simplicity of SM and the extensibility and cost of either SD
or SN
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/27
Parallel DBMS Techniques
• Data placement
➡ Physical placement of the DB onto multiple nodes
➡ Static vs. Dynamic
• Parallel data processing
➡ Select is easy
➡ Join (and all other non-select operations) is more difficult
• Parallel query optimization
➡ Choice of the best parallel execution plans
➡ Automatic parallelization of the queries and load balancing
• Transaction management
➡ Similar to distributed transaction management
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/28
Data Partitioning
• Each relation is divided in n partitions (subrelations), where n is a
function of relation size and access frequency
• Implementation
➡ Round-robin
✦ Maps i-th element to node i mod n
✦ Simple but only exact-match queries
➡ B-tree index
✦ Supports range queries but large index
➡ Hash function
✦ Only exact-match queries but small index
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/29
Partitioning Schemes
Round-Robin Hashing
Interval
••• •••
•••
•••
•••
•••
•••a-g h-m u-z
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/30
Replicated Data Partitioning
• High-availability requires data replication
➡ simple solution is mirrored disks
✦ hurts load balancing when one node fails
➡ more elaborate solutions achieve load balancing
✦ interleaved partitioning (Teradata)
✦ chained partitioning (Gamma)
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/31
Interleaved Partitioning
Node
Primary copy R1 R2 R3 R4
Backup copy r1.1 r1.2 r1.3
r2.3 r2.1 r2.2
r3.2 r3.2 r3.1
1 2 3 4
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/32
Chained Partitioning
Node
Primary copy R1 R2 R3 R4
Backup copy r4 r1 r2 r3
1 2 3 4
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/33
Placement Directory
• Performs two functions
➡ F1 (relname, placement attval) = lognode-id
➡ F2 (lognode-id) = phynode-id
• In either case, the data structure for f1 and f2 should be available when
needed at each node
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/34
Join Processing
• Three basic algorithms for intra-operator parallelism
➡ Parallel nested loop join: no special assumption
➡ Parallel associative join: one relation is declustered on join attribute and
equi-join
➡ Parallel hash join: equi-join
• They also apply to other complex operators such as duplicate elimination,
union, intersection, etc. with minor adaptation
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/35
Parallel Nested Loop Join
send
partition
node 3 node 4
node 1 node 2
R1:
S1 S2
R2:
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/36
Parallel Associative Join
node 1
node 3 node 4
node 2
R1: R2:
S1
S2
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/37
Parallel Hash Join
node node node node
node 1 node 2
R1: R2: S1: S2:
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/38
Parallel Query Optimization
• The objective is to select the “best” parallel execution plan for a query
using the following components
• Search space
➡ Models alternative execution plans as operator trees
➡ Left-deep vs. Right-deep vs. Bushy trees
• Search strategy
➡ Dynamic programming for small search space
➡ Randomized for large search space
• Cost model (abstraction of execution system)
➡ Physical schema info. (partitioning, indexes, etc.)
➡ Statistics and cost functions
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/39
Execution Plans as Operator
Trees
R2R1
R4
Result
j2
j3
Left-deep
Right-deep
j1 R3
R2R1
R4
Result
j5
j6
j4R3
R2R1
R3j7
R4
Result
j9
Zig-zag
Bushyj8
Result
j10
j12
j11
R2R1 R4R3
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/40
Equivalent Hash-Join Trees with
Different Scheduling
R3
Probe3Build3
R4
Temp2
Temp1
Build3
R4
Temp2
Probe3Build3
Probe2Build2
Probe1Build1
R2R1
R3
Temp1
Probe2Build2
Probe1Build1
R2R1
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/41
Load Balancing
• Problems arise for intra-operator parallelism with skewed data distributions
➡ attribute data skew (AVS)
➡ tuple placement skew (TPS)
➡ selectivity skew (SS)
➡ redistribution skew (RS)
➡ join product skew (JPS)
• Solutions
➡ sophisticated parallel algorithms that deal with skew
➡ dynamic processor allocation (at execution time)
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/42
Data Skew Example
Join1
Res1 Res2
Join2
AVS/TPS
AVS/TPS
AVS/TPS
AVS/TPS
JPS
JPS
RS/SS RS/SS
Scan1
S2
R2
S1
R1
Scan2
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/43
Load Balancing in a DB Cluster
• Choose the node to execute Q
➡ round robin
➡ The least loaded
✦ Need to get load information
• Fail over
➡ In case a node N fails, N’s queries are
taken over by another node
✦ Requires a copy of N’s data or SD
• In case of interference
➡ Data of an overloaded node are
replicated to another node
Q1 Q2
Load balancing
Q3 Q4
Q4
Q3
Q2
Q1
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/44
Oracle Transparent Application
Failover
Client
Node 1
connect1
Node 2Ping
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/45
Client PCs
Enterprise network
Microsoft Failover Cluster
Topology
Internal network
Fibre Channel
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/46
Main Products
Vendor Product Architecture Platforms
IBM DB2 Pure Scale
DB2 Database Partitioning
Feature (DPF)
SD
SN
AIX on SP
Linux on cluster
Microsoft SQL Server
SQL Server 2008 R2
Parallel Data Warehouse
SD
SN
Windows on SMP and
cluster
Oracle Real Application Cluster
Exadata Database Machine
SD Windows, Unix, Linux
on SMP and cluster
NCR Teradata SN
Bynet network
NCR Unix and
Windows
Oracle MySQL SN Linux Cluster
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/47
The Exadata Database Machine
• New machine from Oracle with Sun
• Objectives
➡ OLTP, OLAP, mixed workloads
• Oracle Real Application Cluster
➡ 8+ servers bi-pro Xeon, 72 GB RAM
• Exadata storage server : intelligent cache
➡ 14+ cells, each with
✦ 2 processors, 24 Go RAM
✦ 385 GB of Flash memory (read is 10* faster than disk)
✦ 12+ SATA disks of 2 To or 12 SAS disks of 600 GB
Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/48
Exadata Architecture
Real Application Cluster
Infiniband Switches
Storage cells

Más contenido relacionado

La actualidad más candente

Multiprocessor structures
Multiprocessor structuresMultiprocessor structures
Multiprocessor structuresShareb Ismaeel
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query OptimizationAli Usman
 
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUESDISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUESAAKANKSHA JAIN
 
Load Balancing in Parallel and Distributed Database
Load Balancing in Parallel and Distributed DatabaseLoad Balancing in Parallel and Distributed Database
Load Balancing in Parallel and Distributed DatabaseMd. Shamsur Rahim
 
Database 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvanDatabase 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvanIftikhar Ahmad
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platformsSyed Zaid Irshad
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management systemPooja Dixit
 
Multivector and multiprocessor
Multivector and multiprocessorMultivector and multiprocessor
Multivector and multiprocessorKishan Panara
 
Data cube computation
Data cube computationData cube computation
Data cube computationRashmi Sheikh
 
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS20. Parallel Databases in DBMS
20. Parallel Databases in DBMSkoolkampus
 
Query processing and optimization (updated)
Query processing and optimization (updated)Query processing and optimization (updated)
Query processing and optimization (updated)Ravinder Kamboj
 
Memory consistency models
Memory consistency modelsMemory consistency models
Memory consistency modelspalani kumar
 

La actualidad más candente (20)

Multiprocessor structures
Multiprocessor structuresMultiprocessor structures
Multiprocessor structures
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
 
Semi join
Semi joinSemi join
Semi join
 
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUESDISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
DISTRIBUTED DATABASE WITH RECOVERY TECHNIQUES
 
Data cubes
Data cubesData cubes
Data cubes
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
Distributed DBMS - Unit 6 - Query Processing
Distributed DBMS - Unit 6 - Query ProcessingDistributed DBMS - Unit 6 - Query Processing
Distributed DBMS - Unit 6 - Query Processing
 
Load Balancing in Parallel and Distributed Database
Load Balancing in Parallel and Distributed DatabaseLoad Balancing in Parallel and Distributed Database
Load Balancing in Parallel and Distributed Database
 
Database 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvanDatabase 2 ddbms,homogeneous & heterognus adv & disadvan
Database 2 ddbms,homogeneous & heterognus adv & disadvan
 
Dichotomy of parallel computing platforms
Dichotomy of parallel computing platformsDichotomy of parallel computing platforms
Dichotomy of parallel computing platforms
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management system
 
Multivector and multiprocessor
Multivector and multiprocessorMultivector and multiprocessor
Multivector and multiprocessor
 
Data cube computation
Data cube computationData cube computation
Data cube computation
 
20. Parallel Databases in DBMS
20. Parallel Databases in DBMS20. Parallel Databases in DBMS
20. Parallel Databases in DBMS
 
Query processing and optimization (updated)
Query processing and optimization (updated)Query processing and optimization (updated)
Query processing and optimization (updated)
 
Memory consistency models
Memory consistency modelsMemory consistency models
Memory consistency models
 
Parallel computing persentation
Parallel computing persentationParallel computing persentation
Parallel computing persentation
 
Ordbms
OrdbmsOrdbms
Ordbms
 
NUMA overview
NUMA overviewNUMA overview
NUMA overview
 

Destacado

CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduceCS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduceJ Singh
 
The DSP/BIOS Bridge - OMAP3
The DSP/BIOS Bridge - OMAP3The DSP/BIOS Bridge - OMAP3
The DSP/BIOS Bridge - OMAP3vjaquez
 
Database , 5 Semantic
Database , 5 SemanticDatabase , 5 Semantic
Database , 5 SemanticAli Usman
 
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQL
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQLPL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQL
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQLReactive.IO
 
Introduction to Parallel Processing Algorithms in Shared Nothing Databases
Introduction to Parallel Processing Algorithms in Shared Nothing DatabasesIntroduction to Parallel Processing Algorithms in Shared Nothing Databases
Introduction to Parallel Processing Algorithms in Shared Nothing DatabasesOfir Manor
 
MySQL InnoDB 源码实现分析(一)
MySQL InnoDB 源码实现分析(一)MySQL InnoDB 源码实现分析(一)
MySQL InnoDB 源码实现分析(一)frogd
 
Database ,16 P2P
Database ,16 P2P Database ,16 P2P
Database ,16 P2P Ali Usman
 
Database , 12 Reliability
Database , 12 ReliabilityDatabase , 12 Reliability
Database , 12 ReliabilityAli Usman
 
Database ,10 Transactions
Database ,10 TransactionsDatabase ,10 Transactions
Database ,10 TransactionsAli Usman
 
Postgres MVCC - A Developer Centric View of Multi Version Concurrency Control
Postgres MVCC - A Developer Centric View of Multi Version Concurrency ControlPostgres MVCC - A Developer Centric View of Multi Version Concurrency Control
Postgres MVCC - A Developer Centric View of Multi Version Concurrency ControlReactive.IO
 
InnoDB Internal
InnoDB InternalInnoDB Internal
InnoDB Internalmysqlops
 
Database , 13 Replication
Database , 13 ReplicationDatabase , 13 Replication
Database , 13 ReplicationAli Usman
 
Database ,7 query localization
Database ,7 query localizationDatabase ,7 query localization
Database ,7 query localizationAli Usman
 
Log based and Recovery with concurrent transaction
Log based and Recovery with concurrent transactionLog based and Recovery with concurrent transaction
Log based and Recovery with concurrent transactionnikunjandy
 
Database, 3 Distribution Design
Database, 3 Distribution DesignDatabase, 3 Distribution Design
Database, 3 Distribution DesignAli Usman
 
Database ,11 Concurrency Control
Database ,11 Concurrency ControlDatabase ,11 Concurrency Control
Database ,11 Concurrency ControlAli Usman
 
Oracle rac资源管理算法与cache fusion实现浅析
Oracle rac资源管理算法与cache fusion实现浅析Oracle rac资源管理算法与cache fusion实现浅析
Oracle rac资源管理算法与cache fusion实现浅析frogd
 
Discrete Structures lecture 2
 Discrete Structures lecture 2 Discrete Structures lecture 2
Discrete Structures lecture 2Ali Usman
 
Database , 15 Object DBMS
Database , 15 Object DBMSDatabase , 15 Object DBMS
Database , 15 Object DBMSAli Usman
 

Destacado (20)

Parallel Database
Parallel DatabaseParallel Database
Parallel Database
 
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduceCS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
 
The DSP/BIOS Bridge - OMAP3
The DSP/BIOS Bridge - OMAP3The DSP/BIOS Bridge - OMAP3
The DSP/BIOS Bridge - OMAP3
 
Database , 5 Semantic
Database , 5 SemanticDatabase , 5 Semantic
Database , 5 Semantic
 
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQL
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQLPL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQL
PL/pgSQL - An Introduction on Using Imperative Programming in PostgreSQL
 
Introduction to Parallel Processing Algorithms in Shared Nothing Databases
Introduction to Parallel Processing Algorithms in Shared Nothing DatabasesIntroduction to Parallel Processing Algorithms in Shared Nothing Databases
Introduction to Parallel Processing Algorithms in Shared Nothing Databases
 
MySQL InnoDB 源码实现分析(一)
MySQL InnoDB 源码实现分析(一)MySQL InnoDB 源码实现分析(一)
MySQL InnoDB 源码实现分析(一)
 
Database ,16 P2P
Database ,16 P2P Database ,16 P2P
Database ,16 P2P
 
Database , 12 Reliability
Database , 12 ReliabilityDatabase , 12 Reliability
Database , 12 Reliability
 
Database ,10 Transactions
Database ,10 TransactionsDatabase ,10 Transactions
Database ,10 Transactions
 
Postgres MVCC - A Developer Centric View of Multi Version Concurrency Control
Postgres MVCC - A Developer Centric View of Multi Version Concurrency ControlPostgres MVCC - A Developer Centric View of Multi Version Concurrency Control
Postgres MVCC - A Developer Centric View of Multi Version Concurrency Control
 
InnoDB Internal
InnoDB InternalInnoDB Internal
InnoDB Internal
 
Database , 13 Replication
Database , 13 ReplicationDatabase , 13 Replication
Database , 13 Replication
 
Database ,7 query localization
Database ,7 query localizationDatabase ,7 query localization
Database ,7 query localization
 
Log based and Recovery with concurrent transaction
Log based and Recovery with concurrent transactionLog based and Recovery with concurrent transaction
Log based and Recovery with concurrent transaction
 
Database, 3 Distribution Design
Database, 3 Distribution DesignDatabase, 3 Distribution Design
Database, 3 Distribution Design
 
Database ,11 Concurrency Control
Database ,11 Concurrency ControlDatabase ,11 Concurrency Control
Database ,11 Concurrency Control
 
Oracle rac资源管理算法与cache fusion实现浅析
Oracle rac资源管理算法与cache fusion实现浅析Oracle rac资源管理算法与cache fusion实现浅析
Oracle rac资源管理算法与cache fusion实现浅析
 
Discrete Structures lecture 2
 Discrete Structures lecture 2 Discrete Structures lecture 2
Discrete Structures lecture 2
 
Database , 15 Object DBMS
Database , 15 Object DBMSDatabase , 15 Object DBMS
Database , 15 Object DBMS
 

Similar a Database ,14 Parallel DBMS

Database , 1 Introduction
 Database , 1 Introduction Database , 1 Introduction
Database , 1 IntroductionAli Usman
 
1 introduction DDBS
1 introduction DDBS1 introduction DDBS
1 introduction DDBSnaimanighat
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep DivesRush Shah
 
1 introduction ddbms
1 introduction ddbms1 introduction ddbms
1 introduction ddbmsamna izzat
 
1 introduction
1 introduction1 introduction
1 introductionAmrit Kaur
 
Accel - EMC - Data Domain Series
Accel - EMC - Data Domain SeriesAccel - EMC - Data Domain Series
Accel - EMC - Data Domain Seriesaccelfb
 
Database Administration & Management - 01
Database Administration & Management - 01Database Administration & Management - 01
Database Administration & Management - 01FaisalMashood
 
DBAM-01.pdf
DBAM-01.pdfDBAM-01.pdf
DBAM-01.pdfhania80
 
Database , 6 Query Introduction
Database , 6 Query Introduction Database , 6 Query Introduction
Database , 6 Query Introduction Ali Usman
 
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Lars Marowsky-Brée
 
HDFS_architecture.ppt
HDFS_architecture.pptHDFS_architecture.ppt
HDFS_architecture.pptvijayapraba1
 
Performance Tuning
Performance TuningPerformance Tuning
Performance TuningJannet Peetz
 
Persistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerPersistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerNeil Raden
 
Database-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable TablespacesDatabase-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable TablespacesMarkus Flechtner
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarKognitio
 
Distributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databasesDistributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databasesPapitha Velumani
 

Similar a Database ,14 Parallel DBMS (20)

Database , 1 Introduction
 Database , 1 Introduction Database , 1 Introduction
Database , 1 Introduction
 
1 introduction DDBS
1 introduction DDBS1 introduction DDBS
1 introduction DDBS
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep Dives
 
1 introduction ddbms
1 introduction ddbms1 introduction ddbms
1 introduction ddbms
 
1 introduction
1 introduction1 introduction
1 introduction
 
Ch07.ppt
Ch07.pptCh07.ppt
Ch07.ppt
 
Accel - EMC - Data Domain Series
Accel - EMC - Data Domain SeriesAccel - EMC - Data Domain Series
Accel - EMC - Data Domain Series
 
Database Administration & Management - 01
Database Administration & Management - 01Database Administration & Management - 01
Database Administration & Management - 01
 
DBAM-01.pdf
DBAM-01.pdfDBAM-01.pdf
DBAM-01.pdf
 
Database , 6 Query Introduction
Database , 6 Query Introduction Database , 6 Query Introduction
Database , 6 Query Introduction
 
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
Modeling, estimating, and predicting Ceph (Linux Foundation - Vault 2015)
 
6-Query_Intro (5).pdf
6-Query_Intro (5).pdf6-Query_Intro (5).pdf
6-Query_Intro (5).pdf
 
HDFS_architecture.ppt
HDFS_architecture.pptHDFS_architecture.ppt
HDFS_architecture.ppt
 
Performance Tuning
Performance TuningPerformance Tuning
Performance Tuning
 
NoSQL Evolution
NoSQL EvolutionNoSQL Evolution
NoSQL Evolution
 
Chapter2.pdf
Chapter2.pdfChapter2.pdf
Chapter2.pdf
 
Persistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerPersistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the Answer
 
Database-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable TablespacesDatabase-Migration and -Upgrade with Transportable Tablespaces
Database-Migration and -Upgrade with Transportable Tablespaces
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Distributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databasesDistributed, concurrent, and independent access to encrypted cloud databases
Distributed, concurrent, and independent access to encrypted cloud databases
 

Más de Ali Usman

Cisco Packet Tracer Overview
Cisco Packet Tracer OverviewCisco Packet Tracer Overview
Cisco Packet Tracer OverviewAli Usman
 
Islamic Arts and Architecture
Islamic Arts and  ArchitectureIslamic Arts and  Architecture
Islamic Arts and ArchitectureAli Usman
 
Database ,18 Current Issues
Database ,18 Current IssuesDatabase ,18 Current Issues
Database ,18 Current IssuesAli Usman
 
Database , 17 Web
Database , 17 WebDatabase , 17 Web
Database , 17 WebAli Usman
 
Database , 4 Data Integration
Database , 4 Data IntegrationDatabase , 4 Data Integration
Database , 4 Data IntegrationAli Usman
 
Database ,2 Background
 Database ,2 Background Database ,2 Background
Database ,2 BackgroundAli Usman
 
Processor Specifications
Processor SpecificationsProcessor Specifications
Processor SpecificationsAli Usman
 
Fifty Year Of Microprocessor
Fifty Year Of MicroprocessorFifty Year Of Microprocessor
Fifty Year Of MicroprocessorAli Usman
 
Discrete Structures. Lecture 1
 Discrete Structures. Lecture 1  Discrete Structures. Lecture 1
Discrete Structures. Lecture 1 Ali Usman
 
Muslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-AstronomyMuslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-AstronomyAli Usman
 
Muslim Contributions in Geography
Muslim Contributions in GeographyMuslim Contributions in Geography
Muslim Contributions in GeographyAli Usman
 
Muslim Contributions in Astronomy
Muslim Contributions in AstronomyMuslim Contributions in Astronomy
Muslim Contributions in AstronomyAli Usman
 
Processor Specifications
Processor SpecificationsProcessor Specifications
Processor SpecificationsAli Usman
 
Ptcl modem (user manual)
Ptcl modem (user manual)Ptcl modem (user manual)
Ptcl modem (user manual)Ali Usman
 
Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali Ali Usman
 
Muslim Contributions in Mathematics
Muslim Contributions in MathematicsMuslim Contributions in Mathematics
Muslim Contributions in MathematicsAli Usman
 
Osi protocols
Osi protocolsOsi protocols
Osi protocolsAli Usman
 

Más de Ali Usman (17)

Cisco Packet Tracer Overview
Cisco Packet Tracer OverviewCisco Packet Tracer Overview
Cisco Packet Tracer Overview
 
Islamic Arts and Architecture
Islamic Arts and  ArchitectureIslamic Arts and  Architecture
Islamic Arts and Architecture
 
Database ,18 Current Issues
Database ,18 Current IssuesDatabase ,18 Current Issues
Database ,18 Current Issues
 
Database , 17 Web
Database , 17 WebDatabase , 17 Web
Database , 17 Web
 
Database , 4 Data Integration
Database , 4 Data IntegrationDatabase , 4 Data Integration
Database , 4 Data Integration
 
Database ,2 Background
 Database ,2 Background Database ,2 Background
Database ,2 Background
 
Processor Specifications
Processor SpecificationsProcessor Specifications
Processor Specifications
 
Fifty Year Of Microprocessor
Fifty Year Of MicroprocessorFifty Year Of Microprocessor
Fifty Year Of Microprocessor
 
Discrete Structures. Lecture 1
 Discrete Structures. Lecture 1  Discrete Structures. Lecture 1
Discrete Structures. Lecture 1
 
Muslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-AstronomyMuslim Contributions in Medicine-Geography-Astronomy
Muslim Contributions in Medicine-Geography-Astronomy
 
Muslim Contributions in Geography
Muslim Contributions in GeographyMuslim Contributions in Geography
Muslim Contributions in Geography
 
Muslim Contributions in Astronomy
Muslim Contributions in AstronomyMuslim Contributions in Astronomy
Muslim Contributions in Astronomy
 
Processor Specifications
Processor SpecificationsProcessor Specifications
Processor Specifications
 
Ptcl modem (user manual)
Ptcl modem (user manual)Ptcl modem (user manual)
Ptcl modem (user manual)
 
Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali Nimat-ul-ALLAH shah wali
Nimat-ul-ALLAH shah wali
 
Muslim Contributions in Mathematics
Muslim Contributions in MathematicsMuslim Contributions in Mathematics
Muslim Contributions in Mathematics
 
Osi protocols
Osi protocolsOsi protocols
Osi protocols
 

Último

Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 

Último (20)

Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 

Database ,14 Parallel DBMS

  • 1. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/1 Outline • Introduction • Background • Distributed Database Design • Database Integration • Semantic Data Control • Distributed Query Processing • Multidatabase Query Processing • Distributed Transaction Management • Data Replication • Parallel Database Systems ➡ Data placement and query processing ➡ Load balancing ➡ Database clusters • Distributed Object DBMS • Peer-to-Peer Data Management • Web Data Management • Current Issues
  • 2. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/2 The Database Problem • Large volume of data use disk and large main memory • I/O bottleneck (or memory access bottleneck) ➡ Speed(disk) << speed(RAM) << speed(microprocessor) • Predictions ➡ Moore’s law: processor speed growth (with multicore): 50 % per year ➡ DRAM capacity growth : 4 × every three years ➡ Disk throughput : 2 × in the last ten years • Conclusion : the I/O bottleneck worsens
  • 3. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/3 The Solution • Increase the I/O bandwidth ➡ Data partitioning ➡ Parallel data access • Origins (1980's): database machines ➡ Hardware-oriented bad cost-performance failure ➡ Notable exception : ICL's CAFS Intelligent Search Processor • 1990's: same solution but using standard hardware components integrated in a multiprocessor ➡ Software-oriented ➡ Standard essential to exploit continuing technology improvements
  • 4. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/4 Multiprocessor Objectives • High-performance with better cost-performance than mainframe or vector supercomputer • Use many nodes, each with good cost-performance, communicating through network ➡ Good cost via high-volume components ➡ Good performance via bandwidth • Trends ➡ Microprocessor and memory (DRAM): off-the-shelf ➡ Network (multiprocessor edge): custom • The real chalenge is to parallelize applications to run with good load balancing
  • 5. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/5 Data Server Architecture client interface query parsing data server interface communication channel Application server Data server database application server interface database functions Client
  • 6. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/6 Objectives of Data Servers • Avoid the shortcomings of the traditional DBMS approach ➡ Centralization of data and application management ➡ General-purpose OS (not DB-oriented) • By separating the functions between ➡ Application server (or host computer) ➡ Data server (or database computer or back-end computer)
  • 7. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/7 Data Server Approach: Assessment • Advantages ➡ Integrated data control by the server (black box) ➡ Increased performance by dedicated system ➡ Can better exploit parallelism ➡ Fits well in distributed environments • Potential problems ➡ Communication overhead between application and data server ✦ High-level interface ➡ High cost with mainframe servers
  • 8. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/8 Parallel Data Processing • Three ways of exploiting high-performance multiprocessor systems:  Automatically detect parallelism in sequential programs (e.g., Fortran, OPS5)  Augment an existing language with parallel constructs (e.g., C*, Fortran90)  Offer a new language in which parallelism can be expressed or automatically inferred • Critique  Hard to develop parallelizing compilers, limited resulting speed-up  Enables the programmer to express parallel computations but too low-level  Can combine the advantages of both (1) and (2)
  • 9. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/9 Data-based Parallelism • Inter-operation ➡ p operations of the same query in parallel op.3 op.1 op.2 op. R op. R1 op. R2 op. R3 op. R4 • Intra-operation ➡ The same op in parallel
  • 10. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/10 Parallel DBMS • Loose definition: a DBMS implemented on a tighly coupled multiprocessor • Alternative extremes ➡ Straighforward porting of relational DBMS (the software vendor edge) ➡ New hardware/software combination (the computer manufacturer edge) • Naturally extends to distributed databases with one server per site
  • 11. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/11 Parallel DBMS - Objectives • Much better cost / performance than mainframe solution • High-performance through parallelism ➡ High throughput with inter-query parallelism ➡ Low response time with intra-operation parallelism • High availability and reliability by exploiting data replication • Extensibility with the ideal goals ➡ Linear speed-up ➡ Linear scale-up
  • 12. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/12 Linear Speed-up Linear increase in performance for a constant DB size and proportional increase of the system components (processor, memory, disk) new perf. old perf. ideal components
  • 13. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/13 Linear Scale-up Sustained performance for a linear increase of database size and proportional increase of the system components. components + database size new perf. old perf. ideal
  • 14. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/14 Barriers to Parallelism • Startup ➡ The time needed to start a parallel operation may dominate the actual computation time • Interference ➡ When accessing shared resources, each new process slows down the others (hot spot problem) • Skew ➡ The response time of a set of parallel processes is the time of the slowest one • Parallel data management techniques intend to overcome these barriers
  • 15. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/15 Parallel DBMS – Functional Architecture RM task n DM task 12 DM task n2 DM task n1 Data Mgr DM task 11 Request MgrRM task 1 Session Mgr User task 1 User task n
  • 16. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/16 Parallel DBMS Functions • Session manager ➡ Host interface ➡ Transaction monitoring for OLTP • Request manager ➡ Compilation and optimization ➡ Data directory management ➡ Semantic data control ➡ Execution control • Data manager ➡ Execution of DB operations ➡ Transaction management support ➡ Data management
  • 17. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/17 Parallel System Architectures • Multiprocessor architecture alternatives ➡ Shared memory (SM) ➡ Shared disk (SD) ➡ Shared nothing (SN) • Hybrid architectures ➡ Non-Uniform Memory Architecture (NUMA) ➡ Cluster
  • 18. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/18 Shared-Memory DBMS on symmetric multiprocessors (SMP) Prototypes: XPRS, Volcano, DBS3 + Simplicity, load balancing, fast communication - Network cost, low extensibility
  • 19. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/19 Shared-Disk Origins : DEC's VAXcluster, IBM's IMS/VS Data Sharing Used first by Oracle with its Distributed Lock Manager Now used by most DBMS vendors + network cost, extensibility, migration from uniprocessor - complexity, potential performance problem for cache coherency
  • 20. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/20 Shared-Nothing Used by Teradata, IBM, Sybase, Microsoft for OLAP Prototypes: Gamma, Bubba, Grace, Prisma, EDS + Extensibility, availability - Complexity, difficult load balancing
  • 21. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/21 Hybrid Architectures • Various possible combinations of the three basic architectures are possible to obtain different trade-offs between cost, performance, extensibility, availability, etc. • Hybrid architectures try to obtain the advantages of different architectures: ➡ efficiency and simplicity of shared-memory ➡ extensibility and cost of either shared disk or shared nothing • 2 main kinds: NUMA and cluster
  • 22. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/22 NUMA • Shared-Memory vs. Distributed Memory ➡ Mixes two different aspects : addressing and memory ✦ Addressing: single address space vs multiple address spaces ✦ Physical memory: central vs distributed • NUMA = single address space on distributed physical memory ➡ Eases application portability ➡ Extensibility • The most successful NUMA is Cache Coherent NUMA (CC-NUMA)
  • 23. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/23 CC-NUMA • Principle ➡ Main memory distributed as with shared-nothing ➡ However, any processor has access to all other processors’ memories • Similar to shared-disk, different processors can access the same data in a conflicting update mode, so global cache consistency protocols are needed. ➡ Cache consistency done in hardware through a special consistent cache interconnect ✦ Remote memory access very efficient, only a few times (typically between 2 and 3 times) the cost of local access
  • 24. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/24 Cluster • Combines good load balancing of SM with extensibility of SN • Server nodes: off-the-shelf components ➡ From simple PC components to more powerful SMP ➡ Yields the best cost/performance ratio ➡ In its cheapest form, • Fast standard interconnect (e.g., Myrinet and Infiniband) with high bandwidth (Gigabits/sec) and low latency
  • 25. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/25 SN cluster vs SD cluster • SN cluster can yield best cost/performance and extensibility ➡ But adding or replacing cluster nodes requires disk and data reorganization • SD cluster avoids such reorganization but requires disks to be globally accessible by the cluster nodes ➡ Network-attached storage (NAS) ✦ distributed file system protocol such as NFS, relatively slow and not appropriate for database management ➡ Storage-area network (SAN) ✦ Block-based protocol thus making it easier to manage cache consistency, efficient, but costlier
  • 26. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/26 Discussion • For a small configuration (e.g., 8 processors), SM can provide the highest performance because of better load balancing • Some years ago, SN was the only choice for high-end systems. But SAN makes SN a viable alternative with the main advantage of simplicity (for transaction management) ➡ SD is now the preferred architecture for OLTP ➡ But for OLAP databases that are typically very large and mostly read-only, SN is used • Hybrid architectures, such as NUMA and cluster, can combine the efficiency and simplicity of SM and the extensibility and cost of either SD or SN
  • 27. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/27 Parallel DBMS Techniques • Data placement ➡ Physical placement of the DB onto multiple nodes ➡ Static vs. Dynamic • Parallel data processing ➡ Select is easy ➡ Join (and all other non-select operations) is more difficult • Parallel query optimization ➡ Choice of the best parallel execution plans ➡ Automatic parallelization of the queries and load balancing • Transaction management ➡ Similar to distributed transaction management
  • 28. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/28 Data Partitioning • Each relation is divided in n partitions (subrelations), where n is a function of relation size and access frequency • Implementation ➡ Round-robin ✦ Maps i-th element to node i mod n ✦ Simple but only exact-match queries ➡ B-tree index ✦ Supports range queries but large index ➡ Hash function ✦ Only exact-match queries but small index
  • 29. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/29 Partitioning Schemes Round-Robin Hashing Interval ••• ••• ••• ••• ••• ••• •••a-g h-m u-z
  • 30. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/30 Replicated Data Partitioning • High-availability requires data replication ➡ simple solution is mirrored disks ✦ hurts load balancing when one node fails ➡ more elaborate solutions achieve load balancing ✦ interleaved partitioning (Teradata) ✦ chained partitioning (Gamma)
  • 31. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/31 Interleaved Partitioning Node Primary copy R1 R2 R3 R4 Backup copy r1.1 r1.2 r1.3 r2.3 r2.1 r2.2 r3.2 r3.2 r3.1 1 2 3 4
  • 32. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/32 Chained Partitioning Node Primary copy R1 R2 R3 R4 Backup copy r4 r1 r2 r3 1 2 3 4
  • 33. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/33 Placement Directory • Performs two functions ➡ F1 (relname, placement attval) = lognode-id ➡ F2 (lognode-id) = phynode-id • In either case, the data structure for f1 and f2 should be available when needed at each node
  • 34. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/34 Join Processing • Three basic algorithms for intra-operator parallelism ➡ Parallel nested loop join: no special assumption ➡ Parallel associative join: one relation is declustered on join attribute and equi-join ➡ Parallel hash join: equi-join • They also apply to other complex operators such as duplicate elimination, union, intersection, etc. with minor adaptation
  • 35. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/35 Parallel Nested Loop Join send partition node 3 node 4 node 1 node 2 R1: S1 S2 R2:
  • 36. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/36 Parallel Associative Join node 1 node 3 node 4 node 2 R1: R2: S1 S2
  • 37. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/37 Parallel Hash Join node node node node node 1 node 2 R1: R2: S1: S2:
  • 38. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/38 Parallel Query Optimization • The objective is to select the “best” parallel execution plan for a query using the following components • Search space ➡ Models alternative execution plans as operator trees ➡ Left-deep vs. Right-deep vs. Bushy trees • Search strategy ➡ Dynamic programming for small search space ➡ Randomized for large search space • Cost model (abstraction of execution system) ➡ Physical schema info. (partitioning, indexes, etc.) ➡ Statistics and cost functions
  • 39. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/39 Execution Plans as Operator Trees R2R1 R4 Result j2 j3 Left-deep Right-deep j1 R3 R2R1 R4 Result j5 j6 j4R3 R2R1 R3j7 R4 Result j9 Zig-zag Bushyj8 Result j10 j12 j11 R2R1 R4R3
  • 40. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/40 Equivalent Hash-Join Trees with Different Scheduling R3 Probe3Build3 R4 Temp2 Temp1 Build3 R4 Temp2 Probe3Build3 Probe2Build2 Probe1Build1 R2R1 R3 Temp1 Probe2Build2 Probe1Build1 R2R1
  • 41. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/41 Load Balancing • Problems arise for intra-operator parallelism with skewed data distributions ➡ attribute data skew (AVS) ➡ tuple placement skew (TPS) ➡ selectivity skew (SS) ➡ redistribution skew (RS) ➡ join product skew (JPS) • Solutions ➡ sophisticated parallel algorithms that deal with skew ➡ dynamic processor allocation (at execution time)
  • 42. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/42 Data Skew Example Join1 Res1 Res2 Join2 AVS/TPS AVS/TPS AVS/TPS AVS/TPS JPS JPS RS/SS RS/SS Scan1 S2 R2 S1 R1 Scan2
  • 43. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/43 Load Balancing in a DB Cluster • Choose the node to execute Q ➡ round robin ➡ The least loaded ✦ Need to get load information • Fail over ➡ In case a node N fails, N’s queries are taken over by another node ✦ Requires a copy of N’s data or SD • In case of interference ➡ Data of an overloaded node are replicated to another node Q1 Q2 Load balancing Q3 Q4 Q4 Q3 Q2 Q1
  • 44. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/44 Oracle Transparent Application Failover Client Node 1 connect1 Node 2Ping
  • 45. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/45 Client PCs Enterprise network Microsoft Failover Cluster Topology Internal network Fibre Channel
  • 46. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/46 Main Products Vendor Product Architecture Platforms IBM DB2 Pure Scale DB2 Database Partitioning Feature (DPF) SD SN AIX on SP Linux on cluster Microsoft SQL Server SQL Server 2008 R2 Parallel Data Warehouse SD SN Windows on SMP and cluster Oracle Real Application Cluster Exadata Database Machine SD Windows, Unix, Linux on SMP and cluster NCR Teradata SN Bynet network NCR Unix and Windows Oracle MySQL SN Linux Cluster
  • 47. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/47 The Exadata Database Machine • New machine from Oracle with Sun • Objectives ➡ OLTP, OLAP, mixed workloads • Oracle Real Application Cluster ➡ 8+ servers bi-pro Xeon, 72 GB RAM • Exadata storage server : intelligent cache ➡ 14+ cells, each with ✦ 2 processors, 24 Go RAM ✦ 385 GB of Flash memory (read is 10* faster than disk) ✦ 12+ SATA disks of 2 To or 12 SAS disks of 600 GB
  • 48. Distributed DBMS ©M. T. Özsu & P. Valduriez Ch.14/48 Exadata Architecture Real Application Cluster Infiniband Switches Storage cells