In this dissertation, we present pipelined prefetching mechanisms that use application-disclosed access patterns to prefetch hinted blocks in multi-level storage systems. The fundamental concept in our approach is to split an informed prefetching process into a set of independent prefetching steps among multiple storage levels (e.g., main memory, solid state disks, and hard disk drives). In the first part of this study, we show that a prefetching pipeline across multiple storage levels is a viable and effective technique for allocating file buffers across multi-level storage devices. Our approaches (iPipe and IPO) extend previous ideas of informed prefetching in two ways: (1) they reduce applications' I/O stalls by keeping hinted data in caches residing in main memory, solid state disks, and hard drives; (2) they introduce a pipelined prefetching scheme in which multiple informed prefetching mechanisms work semi-independently to fetch blocks from low-level (slow) to high-level (fast) storage devices. Our iPipe and IPO strategies, integrated with the pipelining mechanism, significantly reduce overall I/O access time in multi-level storage systems. Next, we propose a third prefetching scheme, called IPODS, that aims to maximize the benefit of informed prefetches as well as to hide network latencies in a distributed storage system. Finally, we develop a simulator to evaluate the performance of the proposed informed prefetching schemes in the context of multi-level storage systems, and implement a prototype to validate the simulator's accuracy. Our results show that iPipe improves system performance by 56% in most informed prefetching cases, while IPO and IPODS improve system performance by 56% and 6%, respectively, in informed prefetching critical cases across a wide range of real-world I/O traces.
Informed prefetching in distributed multi-level storage systems
1. Informed Prefetching in Distributed Multi-Level Storage Systems
Maen M. Al Assaf
Advisor: Xiao Qin
Department of Computer Science and Software Engineering,
Auburn University, Auburn, AL
Download the dissertation at:
http://www.eng.auburn.edu/~xqin/theses/PhD-Al-Assaf-Infomed-Prefetching.pdf
The abstract of this dissertation can be found at:
http://etd.auburn.edu/etd/handle/10415/2935
3. Informed Caching and Prefetching
• My Research’s Core Reference:
(TIP) R. H. Patterson, G. Gibson, D. Stodolsky, and J. Zelenka. Informed Prefetching and Caching. In Proceedings of the 15th ACM Symposium on Operating Systems Principles, pages 79–95, CO, USA, 1995.
4. Informed Caching and Prefetching
• Applications can disclose hints about their
future accesses.
• Exploit the parallelism of the storage system.
• Make the application more CPU-bound than I/O-bound.
– Reducing I/O stalls.
– Reducing the application's elapsed time.
• Tdisk: disk read latency (prefetching time).
5. Informed Caching and Prefetching
• Cost-benefit model to determine the number of prefetching buffers (TIP).
R. H. Patterson, G. Gibson, D. Stodolsky, and J. Zelenka. Informed Prefetching and Caching. In Proceedings of the 15th ACM Symposium on Operating Systems Principles, pages 79–95, CO, USA, 1995.
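One simplified reading of the TIP cost-benefit model, consistent with the example numbers used later in this deck (a sketch, not the model's full derivation): if the application consumes one buffer every Tcpu+Thit+Tdriver seconds and a fetch takes Tdisk seconds, then prefetching x blocks ahead leaves a stall of at most Tdisk − x·Tconsume once every x accesses, and stalls vanish at the prefetch horizon Tdisk/Tconsume.

```python
import math

def stall_per_group(t_fetch, t_consume, x):
    """Stall time incurred once every x accesses when prefetching
    x blocks ahead (simplified steady-state model)."""
    return max(0, t_fetch - x * t_consume)

def prefetch_horizon(t_fetch, t_consume):
    """Smallest prefetch depth at which the stall term reaches zero."""
    return math.ceil(t_fetch / t_consume)

print(stall_per_group(5, 1, 3))   # 2: with Tdisk = 5, Tconsume = 1, depth 3
print(prefetch_horizon(5, 1))     # 5: depth at which stalls disappear
```

At depths beyond the horizon, extra buffers are better spent on caching, which is the trade-off the TIP cost-benefit model arbitrates.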
7. Parallel (Multi-Level) Storage Systems
• High I/O performance, bandwidth, scalability, and reliability.
• Disk arrays (with sufficient or limited bandwidth).
8. Distributed Parallel (Multi-Level) Storage Systems
• High I/O performance, bandwidth, scalability, and reliability.
T. Madhyastha, G. Gibson, and C. Faloutsos. Informed Prefetching of Collective Input/Output Requests. In Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, Portland, Oregon, 1999.
9. Research Motivations
• The growing need for multi-level storage systems,
• The I/O access hints offered by applications,
and
• The possibility of multiple prefetching
mechanisms to work in parallel.
10. Research Objectives
• Minimizing the prefetching time (I/O delays).
• Reducing the application's stalls and elapsed time.
• Reducing the prefetching time in distributed storage systems.
11. My Solutions
• iPipe: Informed Prefetching Pipelining for Multi-Level Storage Systems.
• IPO: Informed Prefetching Optimization in Multi-Level Storage Systems (optimizes iPipe).
• IPODS: Informed Prefetching in Distributed Multi-Level Storage Systems.
12. Research Tasks
• iPipe: Informed Prefetching Pipelining for Multi-Level Storage Systems.
• IPO: Informed Prefetching Optimization in Multi-Level Storage Systems.
• IPODS: Informed Prefetching in Distributed Multi-Level Storage Systems.
• Prototyping: prototyping results for the proposed solutions.
14. IPODS Architecture
• Network and server latency. Data are striped across storage nodes.
T. Madhyastha, G. Gibson, and C. Faloutsos. Informed Prefetching of Collective Input/Output Requests. In Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, Portland, Oregon, 1999.
15. Assumptions
• Pipelined data are copies: (1) only a small portion is pipelined; (2) data are not moved back and forth.
• Data are initially placed on the HDD: (1) this is the worst case; (2) HDDs are larger.
• Writes and data consistency.
16. Validations and Prototyping Test Bed
• The following are our lab's test bed devices:
– Memory: Samsung 3 GB RAM main memory.
– HDD: Western Digital 500 GB SATA, 16 MB cache, WD5000AAKS.
– SSD: Intel 2 Gb/s SATA SSD, 80 GB (sV 1A).
– Network switch: Dell PowerConnect 2824.
17. iPipe and IPO Parameters
Parameter | Description | Value
Block size | Block size in MB | 10 MB
Tcpu+Thit+Tdriver | Time to consume a buffer | 0.00192 s
Thdd-cache | Read time from HDD to cache | 0.12 s
Tss-cache | Read time from SSD to cache | 0.052 s
Thdd-ss | Read time from HDD to SSD | 0.122 s
MaxBW | Maximum number of concurrent read requests | 15
Xcache | Number of prefetching buffers | 1–63 / 1–15
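Assuming the simplified stall model, the measured times above imply a prefetch horizon (the depth at which stalls vanish) of 0.12 / 0.00192 = 62.5, i.e. 63 buffers for the HDD path, which may explain why Xcache ranges up to 63 in the iPipe experiments; the corresponding SSD figure is about 28. A small sketch:

```python
import math

# Per-block times from the table above (seconds).
T_CONSUME = 0.00192     # Tcpu + Thit + Tdriver
T_HDD_CACHE = 0.12      # read time, HDD -> main-memory cache
T_SSD_CACHE = 0.052     # read time, SSD -> main-memory cache

def prefetch_horizon(t_fetch, t_consume=T_CONSUME):
    # Depth beyond which deeper prefetching no longer reduces stalls.
    return math.ceil(t_fetch / t_consume)

print(prefetch_horizon(T_HDD_CACHE))   # 63 buffers for the HDD path
print(prefetch_horizon(T_SSD_CACHE))   # 28 buffers for the SSD path
```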
18. IPODS Parameters
Parameter | Description | Value
Block size | Block size in MB | 200 MB
Tcpu+Thit+Tdriver | Time to consume a buffer | 0.037 s
Thdd-network-cache | Read time from HDD over the network to cache | 4.43 s
Tss-network-cache | Read time from SSD over the network to cache | 4.158 s
Thdd-ss | Read time from HDD to SSD | 4.5 s
MaxBW | Maximum number of concurrent read requests | 15
Xcache | Number of prefetching buffers | 1–15
19. iPipe
• Parallel multi-level storage system (SSD and HDD).
• Pipelines data to the uppermost level to reduce Tdisk.
• Reduces stalls and elapsed time.
• Assumes sufficient bandwidth and scalability.
• iPipe pipelines the informed prefetching process through the SSD.
– Prefetching pipelining start (Pstart).
– Prefetching pipelining depth (Pdepth).
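The roles of Pstart and Pdepth can be illustrated with a toy issue schedule. This is a hypothetical sketch of the bookkeeping (the two-stage split, the block indices, and the p_start semantics are assumptions for illustration; the dissertation's exact algorithm may differ):

```python
def ipipe_schedule(n_blocks, x_cache, p_start, p_depth):
    """Illustrative two-stage prefetch issue order (hypothetical).
    Blocks before p_start are fetched straight from the HDD by ordinary
    informed prefetching; from p_start onward, blocks are first staged
    HDD -> SSD (p_depth ahead) and later promoted SSD -> cache,
    x_cache ahead of the consumption point."""
    schedule = []
    for i in range(n_blocks):                  # application consumes block i
        cache_target = i + x_cache             # block to bring into the cache
        stage_target = cache_target + p_depth  # block to stage into the SSD
        if cache_target < n_blocks:
            src = "ssd" if cache_target >= p_start else "hdd"
            schedule.append((src + "->cache", cache_target))
        if p_start <= stage_target < n_blocks:
            schedule.append(("hdd->ssd", stage_target))
    return schedule
```

With 10 hinted blocks, x_cache = 3, p_start = 4, and p_depth = 2, block 3 still comes from the HDD, while block 5 is staged into the SSD early and block 4 is later served from it, which is how fetches migrate to the faster level as the pipeline fills.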
20. iPipe: Example Parameters
• Assume:
• Tcpu+Thit+Tdriver = 1 Time Unit
• Thdd-cache = 5 Time Units
• Tss-cache = 4 Time Units
• Thdd-ss = 8 Time Units
• Xcache = 3
• Tstall-hdd = 2 time units every 3 accesses.
• Tstall-ss = 1 time unit every 3 accesses.
23. iPipe: Informed Prefetching
Stall time = 16 time units; elapsed time = 46 time units.
(Tcpu+Thit+Tdriver = 1, Thdd-cache = 5, Xcache = 3; Tstall-hdd = 2 every 3 accesses.)
25. iPipe: Pipelining Algorithm
Stall time = 9 time units; elapsed time = 39 time units < 46 time units.
(Tcpu+Thit+Tdriver = 1, Thdd-cache = 5, Tss-cache = 4, Thdd-ss = 8, Xcache = 3; Tstall-hdd = 2 every 3 accesses, Tstall-ss = 1 every 3 accesses.)
26. iPipe: Performance Improvement - Elapsed Time 1
[Chart: total simulation elapsed time when using 1 to 9 prefetching buffers; ~56% improvement in most cases on both LASR1 and LASR2.]
27. iPipe: Performance Improvement - Elapsed Time 2
[Chart: total simulation elapsed time when using 11 to 30 prefetching buffers; ~56% improvement.]
28. iPipe: Performance Improvement - Elapsed Time 3
[Chart: total simulation elapsed time when using 35 to 63 prefetching buffers.]
29. Research Tasks
• iPipe: Informed Prefetching Pipelining for Multi-Level Storage Systems.
• IPO: Informed Prefetching Optimization in Multi-Level Storage Systems.
• IPODS: Informed Prefetching in Distributed Multi-Level Storage Systems.
• Prototyping: prototyping results for the proposed solutions.
30. IPO
• Parallel multi-level storage system (SSD and HDD).
• Pipelines data to the uppermost level to reduce Tdisk.
• Reduces stalls and elapsed time.
• Assumes:
– Limited bandwidth and scalability.
– MaxBW = 15.
– Pipelining depth = MaxBW − Xcache.
• IPO pipelines the informed prefetching process through the SSD.
– Prefetching pipelining start (Pstart).
– Next to prefetch (Pnext).
31. IPO: Example Parameters
• Assume:
• Tcpu+Thit+Tdriver = 1 Time Unit
• Thdd-cache = 5 Time Units
• Tss-cache = 4 Time Units
• Thdd-ss = 8 Time Units
• Xcache = 2
• Tstall-hdd = 3 time units every 2 accesses.
• Tstall-ss = 2 time units every 2 accesses.
• MaxBW = 5 concurrent reading requests.
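Under the same simplified stall model (an assumption for illustration, not the dissertation's exact formulation), the stalls quoted in this example and IPO's bandwidth-bounded pipelining depth can be checked in a few lines:

```python
def stall_per_group(t_fetch, t_consume, x):
    # Stall incurred once every x accesses at prefetch depth x
    # (simplified steady-state model).
    return max(0, t_fetch - x * t_consume)

def pipelining_depth(max_bw, x_cache):
    # IPO bounds pipelined HDD -> SSD prefetches by the spare bandwidth.
    return max_bw - x_cache

print(stall_per_group(5, 1, 2))   # 3: Tstall-hdd, every 2 accesses
print(stall_per_group(4, 1, 2))   # 2: Tstall-ss, every 2 accesses
print(pipelining_depth(5, 2))     # 3 pipelined requests with MaxBW = 5
```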
35. IPO: Informed Prefetching
Stall time = 45 time units; elapsed time = 81 time units.
(Tcpu+Thit+Tdriver = 1, Thdd-cache = 5, Xcache = 2; Tstall-hdd = 3 every 2 accesses.)
39. IPO: Pipelining Algorithm
Stall time = 40 time units; elapsed time = 76 time units < 81 time units.
(Tcpu+Thit+Tdriver = 1, Thdd-cache = 5, Tss-cache = 4, Thdd-ss = 8, Xcache = 2; Tstall-hdd = 3 every 2 accesses, Tstall-ss = 2 every 2 accesses.)
40. IPO: Performance Improvement - Elapsed Time
[Chart: total simulation elapsed time when using 1 to 15 prefetching buffers (MaxBW = 15); up to 56% improvement in critical cases (LASR2: 56% and 28%; LASR1: 0%).]
41. IPODS
• Parallel multi-level storage systems can be implemented on a distributed system.
• Several storage nodes; data are striped across them.
• Tnetwork and Tserver are added to Tdisk.
• IPO-style pipelining is used.
• Assumes:
– Limited bandwidth and scalability (depends on the number of nodes).
– MaxBW = 15.
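Because Tnetwork and Tserver inflate every fetch, the prefetch depth needed to hide the remote-HDD latency (using the IPODS parameter values from slide 18, and assuming the same simplified stall model) far exceeds MaxBW, so the pipeline is bandwidth-bound rather than latency-bound:

```python
import math

T_CONSUME = 0.037        # Tcpu + Thit + Tdriver (s), IPODS parameters
T_HDD_NET_CACHE = 4.43   # read time, HDD -> network -> cache (s)
MAX_BW = 15              # maximum concurrent read requests

# Prefetch depth that would hide the remote-HDD latency entirely:
horizon = math.ceil(T_HDD_NET_CACHE / T_CONSUME)
print(horizon)           # 120 buffers
print(horizon > MAX_BW)  # True: depth is capped by bandwidth, not latency
```

This is consistent with the smaller (6%) improvement reported for IPODS: with only 15 concurrent requests available, the pipeline cannot reach the depth that would eliminate stalls.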
42. IPODS Architecture
• Network and server latency. Data are striped across storage nodes.
T. Madhyastha, G. Gibson, and C. Faloutsos. Informed Prefetching of Collective Input/Output Requests. In Proceedings of the 1999 ACM/IEEE Conference on Supercomputing, Portland, Oregon, 1999.
43. IPODS: Performance Improvement - Elapsed Time
[Chart: total simulation elapsed time when using 1 to 15 prefetching buffers (MaxBW = 15); up to 6% improvement in critical cases (LASR2: 6% and 4%; LASR1: 2% and 0%).]
44. Prototyping Development
• We aim to validate our simulation results.
• 1000 I/O read requests were sufficient to show the pattern.
• We project the results the simulated applications would yield if run in the prototype.
• The simulators used the LASR traces:
– LASR1: 11,686 I/O read requests.
– LASR2: 51,206 I/O read requests.
• We compare the variations between simulation and prototype.
49. Conclusion & Future Work
• Conclusion:
– Informed prefetching performs better when Tdisk is low.
– Reading data from upper storage levels is faster.
– Three solutions (iPipe, IPO, IPODS).
– Elapsed-time improvements of about 56% (iPipe, IPO) and 6% (IPODS).
– Many contributions.
• Future work:
– Data migration.
– More experimental work.
Editor's notes
Transition sentence: this is a transition slide for the background section – literature review, related work.