Performance and Availability Tradeoffs in Replicated File Systems

Peter Honeyman
Center for Information Technology Integration
University of Michigan, Ann Arbor

Acknowledgements

• Joint work with Dr. Jiaying Zhang
  • Now at Google
  • This was a chapter of her dissertation
• Partially supported by
  • NSF/NMI GridNFS
  • DOE/SciDAC Petascale Data Storage Institute
  • NetApp
  • IBM ARC

Storage replication

• Advantages ☺
• Scalability
• Reliability
• Read performance

Storage replication

• Disadvantages ☹
• Complex synchronization protocols
• Concurrency
• Durability
• Write performance

Durability

• If we weaken the durability guarantee, we may lose data ...
• And be forced to restart the computation
• But it might be worth it

Utilization tradeoffs

• Adding replication servers enhances durability
  • Reduces the risk that computation must be restarted
  • Increases utilization ☺
• Replication increases run time
  • Reduces utilization ☹

Placement tradeoffs

• Nearby replication servers reduce the replication penalty
  • Increases utilization ☺
• Nearby replication servers are vulnerable to correlated failure
  • Reduces utilization ☹

Run-time model

[Figure: run-time state diagram; labels: start, run, end, fail, recover, ok]

Parameters

• Failure-free, single-server run time
  • Can be estimated or measured
  • Our focus is on 1 to 10 days

Parameters

• Replication overhead
  • Penalty associated with replication to backup servers
  • Proportional to RTT
  • Ratio can be measured by running with a backup server a few msec away
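
As a rough illustration of how the measured ratio might be used, the sketch below calibrates a per-millisecond penalty from one run with a nearby backup and extrapolates linearly to other RTTs. The function names and sample numbers are hypothetical; only the proportionality to RTT comes from the slide.

```python
def penalty_per_ms(single_server_time, replicated_time, rtt_ms):
    """Fractional slowdown per millisecond of RTT, from one calibration run."""
    return (replicated_time - single_server_time) / (single_server_time * rtt_ms)


def replicated_run_time(single_server_time, rtt_ms, per_ms):
    """Failure-free run time with a backup rtt_ms away, assuming a linear penalty."""
    return single_server_time * (1.0 + per_ms * rtt_ms)


# Hypothetical calibration: a 100-hour job takes 101 hours with a backup 5 ms away.
per_ms = penalty_per_ms(100.0, 101.0, 5.0)
# Extrapolate the same job to a backup 50 ms away.
estimate_50ms = replicated_run_time(100.0, 50.0, per_ms)
```

With these made-up numbers the penalty is 0.2% per millisecond, so a 50 ms backup would stretch the job by about 10%.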

Parameters

• Recovery time
  • Time to detect failure of the primary server and switch to a backup server
  • Not a sensitive parameter

Parameters

• Probability distribution functions
• Server failure
• Successful recovery

Server failure

• Estimated by analyzing PlanetLab ping data
• 716 nodes, 349 sites, 25 countries
• All-pairs, 15 minute interval, 1/04 to 6/05
• 692 nodes were alive throughout
• We ascribe missing pings to node failure and network partition

PlanetLab failure

[Figure: cumulative failure, log-linear scale]

Correlated failures

P(n nodes down | 1 node down)

failed nodes (n)    nodes per site
                    2       3       4       5
2                   0.526   0.593   0.552   0.561
3                           0.546   0.440   0.538
4                                   0.378   0.488
5                                           0.488
number of sites     259     65      21      11
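
The slide does not spell out how the conditional probabilities were computed. One plausible reading, sketched below, counts the ping intervals in which at least n nodes at a site are down among the intervals in which at least one is down. The "at least n" interpretation, the function name, and the sample counts are all assumptions made for illustration.

```python
def conditional_failure(samples, n):
    """Estimate P(at least n nodes down | at least 1 node down).

    samples: per-interval counts of down nodes at a single site.
    """
    some_down = [k for k in samples if k >= 1]
    if not some_down:
        return None  # no failures observed, so the condition never holds
    return sum(1 for k in some_down if k >= n) / len(some_down)


# Hypothetical down-node counts for a 3-node site over eight ping intervals.
counts = [0, 0, 1, 2, 0, 3, 2, 0]
p2 = conditional_failure(counts, 2)
p3 = conditional_failure(counts, 3)
```

Under this reading the estimate cannot increase as n grows for a fixed site size, which matches the way the values run down each column of the table.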

Correlated failures

[Figure: average failure correlations (y-axis, 0 to 0.25) versus RTT in ms (x-axis, 25 to 175)]

nodes   slope           y-intercept
2       -2.4 x 10^-4    0.195
3       -2.3 x 10^-4    0.155
4       -2.3 x 10^-4    0.134
5       -2.4 x 10^-4    0.119
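
To make the table concrete, the sketch below evaluates the fitted lines at a given RTT. The coefficients are copied from the table. Treating the slope as "per millisecond of RTT" and clamping the result at zero are assumptions, and the function name is hypothetical.

```python
# (slope, y-intercept) for each group size, copied from the table above.
# The slope is assumed to be per millisecond, since the plot's x-axis is RTT (ms).
FITS = {
    2: (-2.4e-4, 0.195),
    3: (-2.3e-4, 0.155),
    4: (-2.3e-4, 0.134),
    5: (-2.4e-4, 0.119),
}


def failure_correlation(nodes, rtt_ms):
    """Average failure correlation predicted by the linear fit for `nodes` nodes."""
    slope, intercept = FITS[nodes]
    # Clamp at zero: a straight line extrapolated far enough would go negative.
    return max(0.0, slope * rtt_ms + intercept)


near = failure_correlation(2, 25.0)   # backup 25 ms away
far = failure_correlation(2, 175.0)   # backup 175 ms away
```

For two nodes the fit falls from roughly 0.19 at 25 ms to roughly 0.15 at 175 ms, so distance lowers correlated failure only gradually.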

Run-time model
• Discrete event simulation for expected run time and utilization

[Figure: run-time state diagram; labels: start, run, end, fail, recover, ok]
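
A minimal Monte Carlo sketch of such a model follows; it is not the simulator used in the study. It assumes that a failure during the run leads to recovery, that a successful recovery ("ok") resumes the run on a backup with no work lost, and that a failed recovery restarts the job from the beginning. It also assumes exponentially distributed failures in place of the PlanetLab-derived distribution, and it takes utilization to be the failure-free single-server run time divided by the mean simulated run time. All names and parameters are hypothetical.

```python
import math
import random


def simulate_run(run_time, overhead, recovery_time, mtbf, p_recover, rng):
    """Wall-clock time for one job under the assumed state machine."""
    work = run_time * (1.0 + overhead)  # failure-free time with replication
    elapsed = 0.0
    remaining = work
    while True:
        # Time to the next primary failure (exponential is an assumption).
        ttf = math.inf if math.isinf(mtbf) else rng.expovariate(1.0 / mtbf)
        if ttf >= remaining:
            return elapsed + remaining  # job reaches "end" before failing
        elapsed += ttf + recovery_time  # run until failure, then try to recover
        if rng.random() < p_recover:
            remaining -= ttf            # "ok": backup takes over, work preserved
        else:
            remaining = work            # recovery failed: restart from "start"


def expected_utilization(run_time, overhead, recovery_time, mtbf, p_recover,
                         trials=10_000, seed=0):
    """Failure-free single-server run time divided by mean simulated run time."""
    rng = random.Random(seed)
    total = sum(
        simulate_run(run_time, overhead, recovery_time, mtbf, p_recover, rng)
        for _ in range(trials)
    )
    return run_time / (total / trials)
```

Setting `p_recover` to 0 and `overhead` to 0 approximates the no-replication case, where every failure forces a restart.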

Simulation results
one hour, no replication: utilization = .995

[Figure: two panels, "One backup" and "Four backups", each plotted against RTT with a y-axis from 0.6 to 1.0; one curve per write intensity (0.0001, 0.001, 0.01, 0.1)]

Simulation results
one day, no replication: utilization = .934

[Figure: two panels, each plotted against RTT with a y-axis from 0.6 to 1.0; one curve per write intensity (0.0001, 0.001, 0.01, 0.1)]

Simulation results
ten days, no replication: utilization = .668

[Figure: two panels, each plotted against RTT with a y-axis from 0.50 to 1.00]

Simulation discussion

• Replication improves utilization for long-running jobs
• Multiple backup servers do not improve utilization (due to low PlanetLab failure rates)

Simulation discussion

• Distant backup servers improve utilization for light writers
• Distant backup servers do not improve utilization for heavy writers
• Implications for checkpoint interval …

Checkpoint interval

• Calculated on the back of a napkin

[Figure panels: one day, 20% checkpoint overhead; 10 day, 2% checkpoint overhead, one backup server; 10 day, 2% checkpoint overhead, four backup servers]
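
The slides do not say how the napkin calculation was done. For readers who want a comparable first-order estimate, Young's approximation is a common choice: the checkpoint interval that minimizes expected lost time is roughly the square root of twice the checkpoint cost times the mean time between failures. The sketch below is that approximation, not the author's method, and the input numbers are hypothetical.

```python
import math


def young_interval(checkpoint_cost, mtbf):
    """Young's first-order estimate of the optimal checkpoint interval.

    checkpoint_cost and mtbf must be in the same time unit.
    """
    return math.sqrt(2.0 * checkpoint_cost * mtbf)


# Hypothetical inputs: 0.1-hour checkpoints, 200-hour mean time between failures.
interval_hours = young_interval(0.1, 200.0)
```

With these inputs the estimate is a checkpoint roughly every 6.3 hours. Cheaper checkpoints or more frequent failures both shorten the interval.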

Work in progress
• Realistic failure data
• Storage and processor failure
• PDSI failure data repository
• Realistic checkpoint costs — help!
• Realistic replication overhead
• Depends on amount of computation
• Less than 10% for NAS Grid Benchmarks

Conclusions

• Conventional wisdom holds that consistent mutable replication in large-scale distributed systems is too expensive to consider
• Our study suggests otherwise

Conclusions
• Consistent replication in large-scale distributed storage systems is feasible and practical
  • Superior performance
  • Rigorous adherence to conventional file system semantics
  • Improved utilization

Thank you for your attention!
www.citi.umich.edu

Questions?
