1. A Case for Redundant Arrays of Inexpensive
Disk(RAID)
Ankita jadhao
2. Need of Level 5?
• Achieve parallelism in write operation.
• The check disk is the bottleneck.
3. RAID Level-5
• Block-interleaved Distributed parity
• Spreads data and parity among all N+1
disks, rather than storing data in N disks
and parity in 1 disk
• Optimized for multi-thread access
4. RAID Level-5
Level-4 Level-5
1 2 3 4 5
data disks
check
disk
data and check disks
1 2 3 4 5
S0
S1
S2
S3
S4
S5
S0
S1
S2
S3
S4
S5
5. RAID Level 5
• Wastage is small: same as in Raid 4
• Parity update traffic is distributed across
disks
D0,0
D1,0
D2,0
P3
D0,1
D1,1
P2
D3,1
D0,2
P1
D2,2
D3,2
P0
D1,3
D2,3
D3,3
D0,0 D0,1 D0,2 = P0
6. RAID 5 Actions
D D D P
Fault-free Read
D D D P
1
2
3
4
Fault-free Write
D D D P
Degraded Read
D D D P
Degraded Write
7. 7
1. Read old
data.
Old New
2. Write
new data
Old P.
3. XOR old and
new data to create
“Partial Product”.
4. Read old
parity data.
5. Xor old parity with
partial product, writing
out result as new parity.
P. P.}
XOR
}
XOR
New P.
Chunk 1 Chunk 2 Chunk 3
Drive
5
Chunk 0 Parity
Drive
4
Drive
3
Drive
2
Drive
1
RAID 5 - Write Operation
8. Key points of RAID Level-5
• Level-5 stripes file data and check data over
all the disks
– no longer a single check disk
– no more write bottleneck
• Drastically improves the performance of
multiple writes
– they can now be done in parallel
• Slightly improves reads
– one more disk to use for reading
9. Advantages
• Best cost/ performance for transaction oriented
n/w.
• Very high data protection
• Support multiple reads & writes
• Can also be optimized for large, sequential
request.
• Also processing in limited storage capacity
• Used in supercomputer application & transaction
processing
11. Discussion
Hardware & software solution??
Software would give the best performance as well as least cost.
It is not even clear if synchronizing the disk in a group improves
RAID performance
Each level improving
• Data rate: supercomputer application (sequential data)
(small no of request per sec)
• The I/O rate: transaction processing (random data)
(large no of read modify writes)
• Or usable storage capacity
or possible all three.
12. Comparison of all levels on the basis of
data rate & I/O rate
RAID1 RAID2 RAID3 RAID4 RAID5
Random read Better
than single
disk
fair good very good very good
Random write good poor poor poor fair
Sequential read fair very good very good good good
Sequential write Same as
single disk
good Fair to
good
good good
13. Continue…
Which level is Best ?
• Highest performance per disk comes from
either Level 1 or Level 5
• If storage is used less than 50% in case of
transaction processing then level 1 best
• If storage is used greater than 50% or in
supercomputer application or for combine
level 5 is best.
14. Why RAID 5 is used most?
• If a disk gets an error or starts to fail, data is
recreated from this distributed data and parity
block
• It allows many NAS (Network Administrator
Specialist) and server drives to be "hot-
swappable“.
• It's a great solution for fault tolerance.