Motivation

1
Motivation Energy costs are rising An increasing concern for servers No longer limited to laptops Energy consumption of disk drives 24% of the power usage in web servers 77% of the power usage in proxy servers 27% of operating costs for data centers PARAID Benefits Conventional RAID: The “One-size-fits-all” approach of convention raid creates a suboptimal strategy for spinning down drives. load time drives load time drives PARAID: The number of disks can be adjusted to meet load with out degrading performance. Performance: Based on the replay of a web server trace, our 2-Gear PARAID-0 prototype consumes 15% less power on average than conventional RAID-0 as shown in the Total Power Consumption graph. The peak PARAID-0 performance is comparable to RAID-0 under the PostMark benchmark. Initial Findings Cyclic fluctuation of loads RAIDs are configured for peak performance Uniform striping over-provisions resources for light loads Over-provision of storage capacity Avoid frequent upgrades Limited power cycles for server-class drives Cannot power-cycle drives too frequently Observations PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle - Mathew Oldham - An-I A. Wang - Florida State University RuGang Xu - Peter Reiher – University of California, Los Angeles Geoffrey H. Kuenning – Harvey Mudd College PARAID Design PARAID creates tiers of disks within the RAID that can be gear- shifted so that the least number of disks are used to meet system load while not degrading performance. Skewed Striping: By assigning data blocks in a skewed pattern, the number of powered-on disks can vary according to the demands placed on the system. To be able to create the gears, PARAID uses spare storage on the RAID device for soft state replication. Preserving Peak Performance: A RAID-5 layout is used for the primary copy of the data blocks which allows for maximum parallelism between the drives when the load on the system demands it Reliability: Since a RAID-5 layout is used for the primary copy of the data blocks, the distributed parity of RAID-5 will tolerate a single disk failure. PARAID creates a bimodal usage pattern so that busy drives can stay powered on most of the time and idle drives can stay powered down most of the time, leaving only middle range disks to be powered cycled more often, reducing the total number of times a drive is powered cycled. Power: The same file can be delivered with either the lower two-drive gear or the higher four-drive gear settings. Under high loads, all four drives will be powered to meet the performance demand. Under light loads, two drives are sufficient to deliver the data. Tension between energy and performance – Performance • Maximize parallelism • RAIDs with uniform striping – Energy • Minimize powered components Challenges Abstract: The energy consumption of disks is a growing concern. It now contributes 27% to the operating costs for data centers. Unfortunately, existing RAIDs are not designed to conserve energy. This is due to uniform striping in which even light loads generate activities on all drives. Power-Aware RAID (PARAID) introduces a skewed striping pattern that can be gear shifted to a minimum striping level so the performance matches the load. Unused disks are powered down. Based on our “2-gear” prototype, PARAID consumes 15% less power than a traditional RAID, while not compromising performance load time utilizat ion threshol d gear 2 gear 1 Peak HourTotal C om pletion Tim e (C D F) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 10 100 1000 10000 Tim e (m s) File Percentage RAID-0 1g PAR AID -0 2g PAR AID -0 P ostM ark B enchm ark:TotalC om pletion Tim e 5.6 52.2 109.6 4.4 53.8 115.6 0 20 40 60 80 100 120 140 1k Files,50k Transactions 20k Files,50k Transactions 20k Files, 100k Transactions Tim e (s) RAID-0 2g P AR AID-0 Total P ow erC onsum ption 0 5 10 15 20 25 30 35 40 0 1 2 3 4 5 6 7 8 9 10 Tim e (1000s) Power(W ) R AID -0 1g PAR AID -0 2g PAR AID -0 Peak HourLatency (C D F) 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 10 100 1000 Tim e (m s) File Percentage RAID-0 1g PAR AID -0 2g PAR AID -0 gear 1 Soft- State Block Replicati on gear 2 disk 1 disk 2 disk 3 disk 4 gear 3 RAID-5 Layout

description

disk 1. disk 2. disk 3. disk 4. load. load. Soft-State Block Replication. RAID-5 Layout. drives. drives. time. time. gear 1. gear 2. gear 3. Challenges. Tension between energy and performance Performance Maximize parallelism RAIDs with uniform striping Energy - PowerPoint PPT Presentation

Transcript of Motivation

Page 1: Motivation

Motivation• Energy costs are rising

– An increasing concern for servers– No longer limited to laptops

• Energy consumption of disk drives– 24% of the power usage in web servers– 77% of the power usage in proxy servers– 27% of operating costs for data centers

PARAID Benefits

Conventional RAID: The “One-size-fits-all” approach of convention raid creates a suboptimal strategy for spinning down drives.

load

time

drives

load

time

drives

PARAID: The number of disks can be adjusted to meet load with out degrading performance.

Performance: Based on the replay of a web server trace, our 2-Gear PARAID-0 prototype consumes 15% less power on average than conventional RAID-0 as shown in the Total Power Consumption graph. The peak PARAID-0 performance is comparable to RAID-0 under the PostMark benchmark.

Initial Findings

• Cyclic fluctuation of loads– RAIDs are configured for peak performance– Uniform striping over-provisions resources for light loads

• Over-provision of storage capacity– Avoid frequent upgrades

• Limited power cycles for server-class drives– Cannot power-cycle drives too frequently

Observations

PARAID: The Gear-Shifting Power-Aware RAID Charles Weddle - Mathew Oldham - An-I A. Wang - Florida State University

RuGang Xu - Peter Reiher – University of California, Los AngelesGeoffrey H. Kuenning – Harvey Mudd College

PARAID Design

PARAID creates tiers of disks within the RAID that can be gear-shifted so that the least number of disks are used to meet system load while not degrading performance.

Skewed Striping:By assigning data blocks in a skewed pattern, the number of powered-on disks can vary according to the demands placed on the system.To be able to create the gears, PARAID uses spare storage on the RAID device for soft state replication.

Preserving Peak Performance:A RAID-5 layout is used for the primary copy of the data blocks which allows for maximum parallelism between the drives when the load on the system demands it

Reliability:Since a RAID-5 layout is used for the primary copy of the data blocks, the distributed parity of RAID-5 will tolerate a single disk failure.

PARAID creates a bimodal usage pattern so that busy drives can stay powered on most of the time and idle drives can stay powered down most of the time, leaving only middle range disks to be powered cycled more often, reducing the total number of times a drive is powered cycled.

Power:The same file can be delivered with either the lower two-drive gear or the higher four-drive gear settings. Under high loads, all four drives will be powered to meet the performance demand. Under light loads, two drives are sufficient to deliver the data.

• Tension between energy and performance– Performance

• Maximize parallelism• RAIDs with uniform striping

– Energy• Minimize powered components

Challenges

Abstract: The energy consumption of disks is a growing concern. It now contributes 27% to the operating costs for data centers. Unfortunately, existing RAIDs are not designed to conserve energy. This is due to uniform striping in which even light loads generate activities on all drives.

Power-Aware RAID (PARAID) introduces a skewed striping pattern that can be gear shifted to a minimum striping level so the performance matches the load. Unused disks are powered down.

Based on our “2-gear” prototype, PARAID consumes 15% less power than a traditional RAID, while not compromising performance

load

time

utilization threshold

gear 2

gear 1

Peak Hour Total Completion Time (CDF)

00.10.20.30.40.50.60.70.80.9

1

1 10 100 1000 10000

Time (ms)

FilePercentage

RAID-0

1g PARAID-0

2g PARAID-0

PostMark Benchmark: Total Completion Time

5.6

52.2

109.6

4.4

53.8

115.6

0

20

40

60

80

100

120

140

1k Files, 50kTransactions

20k Files, 50kTransactions

20k Files,100k

Transactions

Time (s)RAID-0

2g PARAID-0

Total Power Consumption

05

10152025303540

0 1 2 3 4 5 6 7 8 9 10

Time (1000s)

Power (W)RAID-0

1g PARAID-0

2g PARAID-0

Peak Hour Latency (CDF)

00.10.20.30.40.50.60.70.80.9

1

1 10 100 1000

Time (ms)

FilePercentage

RAID-0

1g PARAID-0

2g PARAID-0

gear 1

Soft-State Block

Replication

gear 2

disk 1 disk 2 disk 3 disk 4

gear 3

RAID-5 Layout