Sierra: a power-proportional, distributed storage system
Eno Thereska
Microsoft Research
SWS Colloquium
I've broad interests in systems. Currently I've been focusing on
file systems and storage technologies and high-performance data centers.
I also have great interest in applying machine learning and queuing
analysis to help simplify and automate system management. Since
September 2007 I have been a Researcher at Microsoft Research in
Cambridge, UK. I received my PhD/MS/BS from Carnegie Mellon University.
I'll present the design, implementation, and evaluation of
Sierra: a power-proportional, distributed storage system. I/O workloads
in data centers show significant diurnal variation, with peak and trough
periods. Sierra powers down storage servers during the troughs. The
challenge is to ensure that data is available for reads and writes at
all times, including power-down periods. Consistency and fault-tolerance
of the data, as well as good performance, must also be maintained.
Sierra achieves all these through a set of techniques including
power-aware layout, predictive gear scheduling, and a replicated
short-term versioned store. Replaying live server traces from a large
e-mail service (Hotmail) shows power savings of at least 23%, and
analysis of load from a small enterprise shows that power savings of up
to 60% are possible.