Algorithms For Load Balancing In Electricity Markets And Data
Centers
A Dissertation Presented
by
Bochao Shen
to
The College of Computer and Information Science
in partial fulfillment of the requirements
for the degree of
Doctor of Philosophy
in
Computer Science
Northeastern University
Boston, Massachusetts
April 2018
To my family.
Contents
List of Figures
List of Tables
Acknowledgments
Abstract of the Dissertation

1 Introduction
  1.1 Motivation
    1.1.1 Temporal load balancing for electricity market
    1.1.2 Fault-tolerant spatial load balancing for data centers
    1.1.3 Load balancing for multidimensional resources
  1.2 Contributions
  1.3 Outline

2 Temporal Load Balancing for Electricity Market
  2.1 Introduction
  2.2 Background
  2.3 Approach
  2.4 Related work
  2.5 Market model description
  2.6 SmartShift
  2.7 Simulations
  2.8 Summary

3 Fault-tolerant Spatial Load Balancing in Data Centers
  3.1 Introduction
  3.2 Related work
  3.3 Approaches
  3.4 Problem statement
    3.4.1 VMPP and VMPP-AC
    3.4.2 Exists?k-HA and Is?k-HA
  3.5 k-HA is NP-complete
    3.5.1 Exists?k-HA is NP-hard
  3.6 IID-IK and best heuristic
  3.7 Analysis of heuristics
    3.7.1 MTHM revisited
    3.7.2 Gold-dust world: when VM sizes are small
    3.7.3 Doubling world: VM sizes in the form of 2^i
    3.7.4 Performance evaluation
    3.7.5 Water-filling packs best
  3.8 Summary

4 Load Balancing for Multidimensional Resources
  4.1 Introduction
    4.1.1 Motivation and Model
    4.1.2 Our results
    4.1.3 Related Work
  4.2 VITA(F) for linear F
  4.3 VITA(min)
    4.3.1 Unconstrained, Bounded - exact
    4.3.2 Constrained, Bounded - strongly NP-hard
    4.3.3 Unconstrained, Unbounded - inapproximable
    4.3.4 Constrained, Unbounded - O(log n, log n) bicriteria
  4.4 VITA(max)
    4.4.1 Unconstrained, Unbounded - exact
    4.4.2 Constrained, Bounded - strongly NP-hard
    4.4.3 Constrained, Unbounded - Θ(log n) approximation
  4.5 VITA(2ndmax)
    4.5.1 Bounded, Unconstrained - weakly NP-hard
    4.5.2 Unweighted, Unconstrained, with number of buckets exceeding number of dimensions - O(log n) approximation
  4.6 Experiments
    4.6.1 Polynomial time heuristics for VITA
    4.6.2 Performance of VITA when vectors have a constant number of dimensions
    4.6.3 Performance of VITA when vectors have an unbounded number of dimensions
  4.7 Summary

5 Conclusion

Bibliography
List of Figures
2.1 Normalized revenue vs price volatility
2.2 Loss probability vs consumer tolerance
2.3 Probability density function of profit
2.4 Normalized social welfare vs price volatility
2.5 Normalized social welfare and revenue of SmartShift, flat-rate pricing and real-time pricing vs Pareto distribution parameter
3.1 Distribution of memory usage by running VMs
3.2 Distribution of host memory capacity
3.3 Distribution of VM memory size
3.4 Distribution of VM memory size after rounding
4.1 VITA(min). The simplest unbounded case is inapproximable, and we give a bicriteria guarantee for the hardest case.
4.2 VITA(max) and VITA(max−min). The unconstrained cases are exactly solvable and we have tight logarithmic guarantees for the constrained unbounded case.
4.3 Histogram of # of VCPUs requested
4.4 Histogram of memory size requested
4.5 Histogram of storage size requested
4.6 Objective value of VITA(max) and three heuristics for minimizing bottleneck usage
4.7 Objective value of VITA(min) and three heuristics for minimizing maintenance downtime
4.8 # of used buckets vs # of given buckets for minimizing maintenance downtime
4.9 Objective value with same increased number of buckets for minimizing maintenance downtime
4.10 Objective value of VITA(max) with unbounded # of dimensions
4.11 Objective value of VITA(min) with unbounded # of dimensions
4.12 # of used buckets vs # of given buckets with unbounded # of dimensions
4.13 Objective value with same increased number of buckets with unbounded # of dimensions
List of Tables
3.1 One small host/VM size distribution
3.2 Asymptotic metric PE for the small distribution
3.3 Asymptotic metric PE for original host/VM size distribution from Nutanix
3.4 Asymptotic metric PE for power-of-2
Acknowledgments
First and foremost, I would like to thank my PhD advisor, Ravi Sundaram, for supporting and guiding me through my entire PhD study. Ravi's insightful mind can always capture the essence of problems and enlighten me during our discussions. Working with Ravi and having him as my PhD advisor has been a truly amazing experience.

I would also like to thank my thesis committee members: Javed Aslam, Narayanaswamy Balakrishnan, and Rajmohan Rajaraman. I have been fortunate enough to work with each of them on different projects. Their comments and feedback are valuable assets for my future career and exploration.

I would like to thank all my lab mates during these years at Northeastern University. There are so many names that I cannot list them all here, but the days we spent together are deeply kept in my heart. They are vivid, warm and golden.

I would like to thank Megan Barry and Bryan Lackaye for being so patient in answering my questions about administrative processes in our college, and for giving me so much help.

Finally, I would like to thank my parents for everything. I would also like to thank my wife, Die Sun, for supporting me all the time. I would never have reached the moment of typing this line without her love.
Abstract of the Dissertation
Algorithms For Load Balancing In Electricity Markets And Data Centers
by
Bochao Shen
Doctor of Philosophy in Computer Science
Northeastern University, April 2018
Dr. Ravi Sundaram, Adviser
Electricity and computers are two cornerstones of this information era. Energy and computation are critical resources in perennially short supply because our consumption continues to grow day by day. Increasing supply in a substantial way requires fundamental advances in energy generation and computing technology, but such advances are few and far between. Load balancing is an important technique for mitigating the scarcity of these resources. We broadly interpret load balancing to include both the optimization of resource distribution and the management of end-user demand. In this thesis, we study algorithms for load balancing in electricity markets and data centers.
First, we study the temporal load balancing problem in electricity markets, where peak demand and supply-demand imbalance are major problems. It is often suggested that exposing consumers to real-time pricing will incentivize them to change their usage and mitigate the problem. However, we show that risk-averse electricity consumers react to price fluctuations by scaling back their total demand, leading to the unintended consequence of an overall decrease in production/consumption and reduced economic efficiency. Compared with the relatively fixed production mode of electric power (the supply), the consumption pattern of end users (the demand) is more variable and potentially changeable. This makes it possible to temporally shift consumers' electricity load. We propose SmartShift, a new scheme that allows households to move their demand away from peak hours in exchange for greater electricity consumption in non-peak hours. We show that our scheme not only enables increased consumption and consumer welfare but also allows the distribution company to increase its profits.
We next consider the fault-tolerant spatial load balancing problem in data centers, where computational loads are balanced by being assigned to different locations (machines, i.e., computational resources). k-HA (high availability), a fault-tolerance property of virtual machine (VM) placement in clouds, represents the ability to tolerate up to k host failures by relocating VMs from failed hosts without disrupting other VMs. It has long been assumed [15] that deciding the existence of a k-HA placement is Σ₃ᴾ-complete. We show that k-HA reduces to multiple knapsack and hence is NP-complete. We propose a stochastic model for multiple knapsack that not only captures real-world workloads but also provides a uniform basis for comparing the efficiencies of different polynomial-time heuristics. We prove, using the central limit theorem and linear programming, that in an important special case there exists a best polynomial-time heuristic. We turn to industry practice and discuss the drawbacks of commonly used heuristics: First-fit, Best-fit, Worst-fit, MTHM and CSP. Based on a large real-world dataset of cluster workloads from industry, we show that the natural load-balancing heuristic, Water-filling, has several excellent properties. We compare and contrast Water-filling with MTHM using our stochastic model and find that Water-filling is a heuristic of choice.

Finally, we extend our study of load balancing from the single-dimensional case to multidimensional resources. This is inspired by several questions from both electricity markets and data centers. For example, what is the best way to assign VMs to data centers to minimize the disruption caused when data centers are powered down for maintenance? How should companies distribute load to hybrid clouds? These related problems can be modeled as the Uncapacitated Multidimensional Load Assignment problem, where items (load demands) and buckets (supply capacities) are characterized by d-dimensional vectors. Additional affinity constraints may restrict the subset of buckets a specific item may be placed in. The cost of a bucket is obtained by aggregating the assigned items according to some metric, with different metrics (max, min, second-max, etc.) representing different application scenarios for either electricity markets or data centers. The goal is to minimize the total cost across all buckets. The temporal load balancing in electricity markets and the spatial load balancing in data centers become two applied scenarios of this problem. We provide hardness results and approximation algorithms for this problem in a variety of settings.
Chapter 1
Introduction
1.1 Motivation
Information technologies have had a great impact on our daily lives ever since the computer was invented. Nowadays, not only computers but also phones, tablets, and even watches can run impressive applications and computations. We can even speak to smart home assistants to control the home temperature, play music, or switch lights on and off. The two invisible pillars that support today's prosperity of the information era are, undoubtedly, electricity and computation.
In fact, electricity and computation are the two critical resources on which today's worldwide information infrastructure relies. As with other natural resources, our growing consumption keeps energy and computation in perennially short supply. Admittedly, technologies have kept advancing to produce more efficient and cleaner energy and smaller, faster chips; but increasing the supply of these two resources in a substantial way requires fundamental advances in energy generation and computing technology.

Load balancing, a technique that aims to optimize the use of resources and improve the distribution of workloads, becomes useful when we must consider how to consume a resource efficiently, given that its supply is limited. In this thesis, we first study two kinds of load balancing techniques applied to two different scenarios: temporal load balancing for the electricity market and spatial load balancing for data centers. We then extend our study to load balancing problems with multidimensional resources.
1.1.1 Temporal load balancing for electricity market
Electric power utilities face at least two major challenges. The first is Peak Demand: a period in which the demand for power is significantly higher than average. In order to satisfy a large peak demand, utilities (generation/distribution companies) have to make large capital investments, including new 'peaking' generation stations, larger-capacity lines and transformers, and incur operational expenditures, including expensive purchases of electricity on the "spot market" [38]. For example, it is estimated that a 5% lowering of demand would have resulted in a 50% price reduction during the peak hours of the California electricity crisis in 2000-2001 [44]. Because resistive losses grow quadratically with transmitted current, peaks also lead to substantial energy wastage. The second challenge faced by power utilities is that of Supply-Demand imbalance: [16] states that "the difficulties that have appeared in California and elsewhere are intrinsic to the design of current electricity markets, in which demand exhibits virtually no price responsiveness and supply faces strict production constraints."
Without allowing consumption load to be temporally shifted, economists and electricity companies have traditionally focused on pricing as a mechanism to solve these two problems in a systematic way. However, homes and small businesses have an inherent and systemic requirement for stable electricity costs [18]. It is understood that "consumers generally shy away from markets when products are complicated, supply is uncertain, prices are volatile, and information is lacking" [19]. We demonstrate that exposing risk-averse consumers to the actual real-time costs of electricity production will result in a reduction in aggregate demand, leading to reduced revenue for the generators and distributors, with potential knock-on effects for the economy at large. At the same time, the volatility of real-time pricing can have strong effects on grid stability [70]. A second concern with real-time pricing is that consumer prices may increase [8] or net electricity consumption may decrease [9], depending on the model.
We observe that the consumption pattern of end users is more variable and potentially changeable than the relatively fixed production mode of the power supply. We initiate the study of a new model of incentives and utility-customer interaction. To reduce volatility, while still accounting for end-user constraints (e.g., the washer must be run only during the day), we introduce an inter-temporal characteristic to the customer 'bidding' language. More importantly, we ensure that both the customer and the utility will be no worse off under our scheme than under flat-rate pricing. Our temporal load balancing mechanism, SmartShift, does this by rewarding consumers who shift consumption with increased allocations for the same cost. Consumers are paid in kind and not in cash,
benefiting both the consumer and the producer. For real-world adoption it is critical to devise mutually beneficial schemes, such as SmartShift, that increase the economic pie within the world of electrical power.
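The "mutually beneficial" property can be illustrated with a toy back-of-the-envelope model (all prices and the bonus factor here are hypothetical and not from the thesis; the actual mechanism is developed in Chapter 2): shifting a kilowatt-hour off-peak saves the utility the peak/off-peak cost difference, while rewarding the consumer with extra off-peak kWh costs the utility only the off-peak rate.

```python
def mutually_beneficial(c_peak, c_offpeak, bonus_kwh):
    """Toy check (hypothetical parameters, not the thesis's actual mechanism):
    shifting 1 kWh from peak to off-peak saves the utility c_peak - c_offpeak
    in generation cost; granting the consumer bonus_kwh extra off-peak kWh
    costs the utility bonus_kwh * c_offpeak. Both sides gain when the bonus
    is positive but its cost stays below the saving."""
    saving = c_peak - c_offpeak
    reward_cost = bonus_kwh * c_offpeak
    return 0 < reward_cost < saving

# With a peak cost of $0.30/kWh and off-peak $0.10/kWh, any bonus below
# 2 extra kWh per shifted kWh leaves both the utility and consumer better off.
```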
1.1.2 Fault-tolerant spatial load balancing for data centers
Cloud computing has established itself as a mainstay of modern computing infrastructures.
Clouds enable the efficient utilization of resources on an as-needed basis by dynamically configur-
ing these resources to accommodate varying workload needs. The core technology at the heart of
public cloud data-centers and private clusters is virtualization. Virtualization is the use of shared re-
sources to create and operate virtual machines (VMs) [61]; a VM is an operating system or software
that emulates the behavior of a computing system with a specified set of resource characteristics,
such as CPU and memory capacity. Virtualization allows for the execution of an application onto
heterogeneous systems as well as multiple applications in parallel. Virtualization also enables live
migration or the movement of running VMs between hosts. The enormous flexibility afforded by
virtualization enables the placement (mapping of VMs to physical hosts) and re-balancing of VMs
for a variety of reasons including performance and cost-efficient resource utilization [64].
In this thesis, we study spatial load balancing, in which computational loads (VMs) are distributed to a set of host machines (locations for VMs). Our work differs in context from prior work on spatial load balancing [68, 53], which considers a larger spatial/geographical range. Instead of studying the inter-data-center load balancing problem, we focus on intra-data-center load balancing with fault-tolerance concerns.
An important aspect of fault tolerance in a cluster is High Availability (HA) [54]. In general, HA is the property of ensuring continuity of services (applications) despite the failures of hosts, VMs or the applications themselves. In the context of this thesis we interpret HA narrowly as the ability to tolerate host failures by restarting the VMs (from the failed hosts) on backup hosts without affecting any other running VMs. We define k-HA to be the property of a placement to tolerate the failure of up to k hosts (sequentially or in parallel). In practice, the system's response to one or more failures could involve restarting the failed VMs on other hosts; on platforms with HwPFA (Hardware Predicted Failure Analysis) alerts, it could also involve live migration of the VMs in advance of an impending failure. In either case the key algorithmic requirement is ensuring the sufficiency of resources on the backup hosts to support the new VMs. In general, resources are multi-dimensional (CPU, memory, I/O, etc.), but in this part of the work we focus on a single
resource - memory.
HA has been studied extensively in both the algorithms and the systems communities. Unfortunately, even the special case of 0-HA with a single resource is the bin packing problem and hence already NP-complete [34]. Partly as a reaction to the hardness of HA, the systems community has turned to polynomial-time heuristics on the one hand, or AI (artificial intelligence) and CSP (Constraint Satisfaction Programming) on the other. Heuristics, the choice of industry, are typically quick to return some solution but fail to provide guarantees on the quality of that solution. AI and CSP, areas of academic research, allow much greater expressiveness, involving additional constraints such as affinity and co-location requirements as well as multiple resource dimensions. However, both of these techniques are very slow, particularly CSP, which solves SAT (satisfiability, an NP-complete problem [34]) at its core.
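The classical greedy heuristics named above differ only in which feasible host they pick for the next VM. A minimal single-resource (memory) sketch, with purely illustrative sizes, assuming VMs are placed one at a time in arrival order:

```python
def place(vm_sizes, capacities, rule):
    """Greedily place VMs on hosts; returns the host index chosen for each VM
    (None if no host has room). `rule` selects the heuristic."""
    free = list(capacities)
    chosen = []
    for vm in vm_sizes:
        fits = [j for j, f in enumerate(free) if f >= vm]
        if not fits:
            chosen.append(None)
            continue
        if rule == "first-fit":    # first host with room
            j = fits[0]
        elif rule == "best-fit":   # host left with the least slack
            j = min(fits, key=lambda h: free[h] - vm)
        else:                      # "worst-fit": host left with the most slack
            j = max(fits, key=lambda h: free[h] - vm)
        free[j] -= vm
        chosen.append(j)
    return chosen
```

For VMs of sizes [5, 4, 3] on hosts of capacities [8, 6], First-fit and Worst-fit both produce the assignment [0, 1, 0] while Best-fit produces [1, 0, 0]: the rules can disagree even on tiny instances, which is one reason a principled basis for comparing them is needed.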
In spite of the attention that HA has received, the fundamental theoretical questions remain unanswered. Given VMs (with sizes), hosts (with capacities), and k, how easy or hard is it to decide whether there exists a feasible k-HA placement? Given a placement and k, how easy or hard is it to decide whether the given placement is k-HA? And, on the practical side, the fundamental questions are: given that typical inputs encountered in practice are not adversarially chosen (worst-case), how do we meaningfully compare the quality of different heuristics? And given that load balancing is a basic customer requirement, what is the best heuristic to use? In this thesis we answer these questions with our theoretical study and experimental results.
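The Is?k-HA question can at least be stated precisely as code. A brute-force checker for toy instances (exponential time, purely illustrative; host names and sizes are made up, and it models simultaneous failure of exactly k hosts): a placement is k-HA if, for every set of k failed hosts, the orphaned VMs fit into the surviving hosts' spare memory.

```python
from itertools import combinations, product

def is_k_ha(placement, capacity, k):
    """placement: host -> list of VM memory sizes; capacity: host -> memory.
    True iff every failure of k hosts can be absorbed by restarting the
    failed VMs on surviving hosts without moving any other VM."""
    spare = {h: capacity[h] - sum(vms) for h, vms in placement.items()}
    for failed in combinations(placement, k):
        survivors = [h for h in placement if h not in failed]
        orphans = [v for h in failed for v in placement[h]]
        # try every assignment of orphaned VMs to surviving hosts
        for assign in product(survivors, repeat=len(orphans)):
            load = dict.fromkeys(survivors, 0)
            for vm, h in zip(orphans, assign):
                load[h] += vm
            if all(load[h] <= spare[h] for h in survivors):
                break  # this failure set can be repacked
        else:
            return False  # some k-failure cannot be repacked
    return True
```

The inner loop is itself a multiple-knapsack feasibility check, which is why even verifying a given placement is nontrivial in general.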
1.1.3 Load balancing for multidimensional resources
The study of the load balancing problem with multidimensional resources in this thesis is
inspired by the following industrial scenarios.
Scenario 1. Minimizing bottleneck usage: Consider an enterprise customer that has a choice
of several different cloud providers at which to host their VMs (virtual machines). The requirements
of each VM can be characterized along several different resource dimensions such as compute
(CPU), network (latency, bandwidth), storage (memory, disk) and energy. When different virtual
machines are placed in the same elastic resource pool (cloud), their loads along each dimension accrue additively (though, of course, the different dimensions can be scaled suitably to make them comparable). However, the various resources that cloud providers are equipped with are not unlimited; they are capped in various settings. The question then arises: what is the optimal way
for the enterprise customer to distribute the load amongst the different cloud providers so as to
minimize bottleneck usage?
Scenario 2. Minimizing maintenance downtime: Hosts, and even data centers need to be
powered down every so often for maintenance purposes, e.g. installing a new HVAC system in a
data center. Given this reality, how should the application (collection of virtual machines and/or
containers collectively performing a task or service), be allocated to the different data centers so
as to minimize the aggregate disruption? This scenario also applies to industrial machines, where different factories (or floors of a factory) need to be shut down for periodic maintenance work.
Scenario 3. Preserving privacy: Consider a set of end-users each with its own (hourly)
traffic profile accessing an application. We wish to partition the application components across a
set of clouds such that by observing the (hourly volume of) traffic flow of any single cloud it is
not possible to infer which components are colocated there. This leads to the following question -
how should we distribute load across clouds in order to minimize the maximum hourly variation in
aggregate traffic? As an analogy, the situation here is similar to the problem of grouping households so that the variation in a group's energy usage is minimized, making it difficult for thieves to infer who has gone on vacation.
Scenario 4. Burstable billing: Most Tier 1 Internet Service Providers (ISPs) use burstable
billing for measuring bandwidth based on peak usage. The typical practice is to measure bandwidth
usage at regular intervals (say 5 minutes) and then use the 95th percentile as a measure of the
sustained flow for which to charge. The 95th percentile method more closely reflects the needed
capacity of the link in question than tracking by other methods such as mean or maximum rate.
The bytes that make up the packets themselves do not actually cost money, but the link and the
infrastructure on either end of the link cost money to set up and support. The top 5% of samples
are ignored as they are considered to represent transient bursts. Burstable billing is commonly used
in peering arrangements between corporate networks. What is the optimal way to distribute load
among a collection of clouds, public and private, so as to minimize the aggregate bandwidth bill?
The above scenarios constitute representative situations captured by the uncapacitated multi-
dimensional load assignment problem framework - VITA. A host of related problems from a variety
of contexts can be abstracted and modeled as VITA(F): the input consists of n, d-dimensional load
vectors V = {Vi|1 i n} and m cloud buckets B = {Bj |1 j m} with associated weights
wj and assignment constraints represented by a bipartite graph G = (V [ B,E ✓ V ⇥ B) that
restricts load Vi to be assigned only to those buckets Bj with which it shares an edge. Here, F can
be any (projection) operator mapping a vector to a scalar, such as max, min, etc. Then the goal is
to partition the vectors among the buckets, respecting the assignment constraints, so as to minimize

    Σ_j w_j · F( Σ_{V_i ∈ B_j} V_i )

where, in a slight abuse of notation, we let B_j also denote the subset of vectors assigned to bucket B_j. VITA stands for Vectors-In-Total Assignment, capturing the essence of the problem: the vectors assigned to each bucket are totaled. Unless otherwise specified we use i to index the load vectors, j to index the cloud buckets, and k to index the dimension. We let V_i(k) denote the value in the k-th position of the vector V_i.
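The objective is mechanical to evaluate once an assignment is fixed. A small sketch with illustrative data (the operator F is passed in, so the same code covers VITA(max), VITA(min), and VITA(2ndmax)):

```python
def vita_cost(vectors, weights, assignment, F):
    """Sum over buckets j of w_j * F(total of the vectors assigned to j).
    vectors: n lists of length d; assignment[i]: bucket index of vector i."""
    d = len(vectors[0])
    totals = [[0] * d for _ in weights]   # one d-dimensional total per bucket
    for vec, j in zip(vectors, assignment):
        for k, x in enumerate(vec):
            totals[j][k] += x
    return sum(w * F(t) for w, t in zip(weights, totals))

def second_max(t):
    """The F corresponding to burstable billing, VITA(2ndmax)."""
    return sorted(t)[-2]
```

With vectors [1, 2] and [3, 1] in separate unit-weight buckets, VITA(max) costs max(1, 2) + max(3, 1) = 5 and VITA(min) costs 1 + 1 = 2; putting both vectors in one bucket gives the total [4, 3], so the VITA(max) cost drops to 4.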
We now explain how VITA(F) captures the aforementioned scenarios. In general, dimensions will represent either categorical entities such as resources (e.g., CPU, I/O, storage) or time periods (e.g., hours of the day or 5-minute intervals). We remind the reader that in each of the scenarios the elasticity of the clouds is a critical ingredient, so that contention between vectors is not the issue. The scenarios we present are but a small sample to showcase the versatility and wide applicability of the VITA framework.
Scenario 1 is captured by having a vector for each VM, with each dimension representing a resource requirement, constraints representing placement or affinity requirements [41], and weights w_j representing the rates at the different cloud providers. Then minimizing the sum of peak resource usage at each cloud is exactly the problem VITA(max).
In Scenario 2 each dimension represents the resource (say, CPU utilization) consumed by the application in a given time period; e.g., the vector for an application could have 24 dimensions, one for each hour of the day. Once the application is assigned to a data center (or cloud or cluster), it is clear that disruption is minimized if the maintenance downtime is scheduled in the hour where total resource utilization is minimum. Minimizing the aggregate disruption is then captured by the problem VITA(min).
The dimensions in Scenario 3 are the hours of the day and the resource in question is traffic. To prevent leakage of privacy through traffic analysis, the goal is to distribute the application components across clouds so that the range between the peak and trough of traffic is minimized. This problem is exactly represented as VITA(max−min).
In Scenario 4, we have a vector for each application with 20 dimensions, one for each ventile (5th-percentile bucket) [67, 66] of the billing period. Then minimizing the aggregate bandwidth bill under the burstable billing method is VITA(2ndmax).
1.2 Contributions
This thesis first proposes an incentive mechanism for the temporal load balancing problem in the electricity market. For data centers, it studies highly available spatial load balancing with a given expected fault-tolerance level. Further, this thesis proposes a general framework for load balancing problems in which multidimensional resources are considered, where the temporal load balancing in the electricity market and the spatial load balancing in data centers are naturally two applied scenarios. These three parts are described in more detail as follows.
• Incentive-mechanism-based temporal load balancing for the electricity market: Peak demand and supply-demand imbalance are the two major problems in electricity markets. In order to mitigate the peak demand problem, real-time pricing is often advocated to incentivize customers to change their usage patterns. However, reacting to real-time pricing, risk-averse electricity consumers will scale back their electricity demand, leading to an overall decrease in production and consumption and reduced economic efficiency. We propose SmartShift, an incentive mechanism that motivates consumers to move their demand away from peak hours in exchange for expanded electricity consumption in non-peak hours. We show that SmartShift increases consumption and consumer welfare, while also increasing the profits of distribution companies.
• Highly available spatial load balancing for data centers: k-HA (high availability), a fault-tolerance property of virtual machine placement in clouds, characterizes the ability to tolerate up to k host machine failures by relocating VMs from failed hosts to still-running ones without disturbing other VMs. We show that k-HA is NP-complete. We propose a stochastic model for multiple knapsack for comparing the efficiencies of different polynomial-time heuristics, and we prove that there exists a best polynomial-time heuristic. Based on an industrial cluster workload dataset, we show that Water-filling performs the best among a list of common heuristics favored in industry.
• Load balancing for multidimensional resources: we study load balancing for multidimensional
resources. We first present the Uncapacitated Multidimensional Load Assignment problem,
and then propose VITA(F) (Vectors-In-Total Assignment): the input consists of $n$ $d$-dimensional
load vectors $V = \{V_i \mid 1 \le i \le n\}$ and $m$ cloud buckets $B = \{B_j \mid 1 \le j \le m\}$
with associated weights $w_j$, together with assignment constraints represented by a bipartite graph
$G = (V \cup B, E \subseteq V \times B)$ that restricts load $V_i$ to be assigned only to those buckets $B_j$
with which it shares an edge. Here, $F$ can be any operator mapping a vector to a scalar, such
as max, min, etc. The goal is to partition the vectors among the buckets, respecting the
assignment constraints, so as to minimize $\sum_j w_j \cdot F(\sum_{V_i \in B_j} V_i)$, where, in a slight abuse of
notation, we let $B_j$ also denote the subset of vectors assigned to bucket $B_j$. We also let $V_i(k)$
denote the value in the $k$-th position of the vector $V_i$.
We characterize the complexity of VITA, providing approximation algorithms and hardness
results. Our approach involves clever rounding of carefully crafted linear programs and may
be of independent technical interest.
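As an illustration of the objective, the following sketch (not from the thesis; the function name and values are hypothetical) evaluates the VITA(F) cost of one fixed feasible assignment, with F = max:

```python
# Illustrative sketch: evaluating the VITA(F) objective
# sum_j w_j * F(sum of vectors assigned to bucket B_j)
# for one fixed assignment, with F = max over coordinates.

def vita_objective(vectors, weights, assignment, F=max):
    """vectors: list of d-dimensional load vectors (tuples of floats)
    weights: weight w_j of each bucket
    assignment: assignment[i] = index j of the bucket receiving V_i
    F: operator mapping a vector to a scalar (e.g. max)"""
    m = len(weights)
    d = len(vectors[0])
    totals = [[0.0] * d for _ in range(m)]   # coordinate-wise sums per bucket
    for i, v in enumerate(vectors):
        j = assignment[i]
        for k in range(d):
            totals[j][k] += v[k]
    return sum(w * F(t) for w, t in zip(weights, totals))

# Two 2-dimensional loads, two unit-weight buckets:
# bucket 0 holds (3,1) -> max 3; bucket 1 holds (1,4) -> max 4; cost = 7
cost = vita_objective([(3, 1), (1, 4)], [1.0, 1.0], [0, 1])
```

Finding the assignment that minimizes this quantity, subject to the bipartite constraints, is the hard part that the approximation algorithms address.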
1.3 Outline
The rest of the thesis is organized as follows. Chapter 2 presents SmartShift, an incentive-
mechanism-based temporal load balancing scheme for the electricity market. Chapter 3 studies highly
available spatial load balancing for cloud computing in data centers. Chapter 4 presents load
balancing problems with multidimensional resources. Chapter 5 concludes the thesis.
Chapter 2
Temporal Load Balancing for Electricity Market
Peak demand for electricity continues to surge around the world. The supply-demand imbalance
manifests itself in many forms, from rolling brownouts in California to power cuts in India.
It is often suggested that exposing consumers to real-time pricing will incentivize them to change
their usage and mitigate the problem - akin to increasing tolls at peak commute times. We show
that risk-averse consumers of electricity react to price fluctuations by scaling back on their total
demand, not just their peak demand, leading to the unintended consequence of an overall decrease
in production/consumption and reduced economic efficiency. We propose a new scheme that allows
homes to move their demands from peak hours in exchange for greater electricity consumption in
non-peak hours - akin to how airlines incentivize a passenger to move from an over-booked flight in
exchange for, say, two tickets in the future. We present a formal framework for the incentive model
that is applicable to different forms of the electricity market. We show that our scheme not only
enables increased consumption and consumer social welfare but also allows the distribution company
to increase profits. This is achieved by allowing load to be shifted while insulating consumers
from real-time price fluctuations. This win-win is important if these methods are to be embraced in
practice.
2.1 Introduction
Power utilities worldwide face at least two major challenges. The first is Peak Demand - a
period in which the demand for power is significantly higher than average. In order to satisfy a large
peak demand, utilities (generation/distribution companies) have to make large capital investments
including new ‘peaking’ generation stations, larger capacity lines, transformers, and operational
expenditures including expensive purchases of electricity on the “spot market” [38]. For example,
it is estimated that a 5% lowering of demand would have resulted in a 50% price reduction during
the peak hours of the California electricity crisis in 2000-2001 [44]. As a result of the quadratic
dependence between resistive losses and transmitted current, peaks also lead to substantial energy
wastage. The second challenge faced by power utilities is that of Supply-Demand imbalance. [16]
states that "the difficulties that have appeared in California and elsewhere are intrinsic to the design
of current electricity markets, in which demand exhibits virtually no price responsiveness and supply
faces strict production constraints".
Research at the intersection of computer science and economics has established that incentives
are a powerful way to allocate scarce resources [24]. Incentive mechanisms incorporating information
exchange require back-and-forth communication between the consumers and the producers.
Conventional electricity infrastructure is not designed to support such a dialog, instead relying on
unilateral actions by the producers such as price regulation and load shedding. Fortunately, the
introduction of new communication [74] and control [31] infrastructure will allow increased consumer
participation in the smart grid.
Traditionally, economists and electricity companies have focused on pricing as a mechanism
to solve these two problems in a systematic way. However, homes and small businesses have an
inherent and systemic requirement for stable electricity costs [18]. It is understood that "Consumers
generally shy away from markets when products are complicated, supply is uncertain, prices are
volatile, and information is lacking" [19]. We demonstrate that exposing risk-averse consumers to
the actual real-time costs of electricity production will result in a reduction in aggregate demand,
leading to reduced revenue for the generators and distributors, with potential knock-on effects for the
economy at large. At the same time, the volatility of real-time pricing can have strong effects on grid
stability [70]. A second concern with real-time pricing is that consumer prices may increase [8] or
net electricity consumption may decrease [9], depending on the model. We initiate the study of a
new model of incentives and utility-customer interaction. To reduce volatility, while accounting
for end-user constraints (e.g., a washer that must be run only during the day), we introduce an
inter-temporal characteristic to the customer 'bidding' language. More importantly, we ensure that the
customer and the utility will both be no worse off under our scheme than under flat-rate pricing. Our
mechanism, SmartShift, does this by rewarding consumers who shift consumption with increased
allocations for the same cost. Consumers are paid in kind and not cash, advantaging both the
consumer and the producer. For real-world adoption it is critical to devise mutually beneficial
schemes, such as SmartShift, that increase the economic pie within the world of electrical power.
We design consumer-supplier interaction protocols that respect the physical constraints and
characteristics of the grid. Such protocols must account for realistic models of consumer behavior
and requirements, while being computationally tractable to allow practical implementation in grids
with millions of customers. Our main contributions are:
• SmartShift - a novel incentive mechanism, consistent with utilities’ obligation to serve [19],
that allows risk-averse homes to move their demands from peak hours in exchange for greater
electricity consumption in non-peak hours (SmartShift section).
• demonstration of the practical benefits of SmartShift through extensive simulations (Simulations
section).
• characterization of the computational complexity of SmartShift (SmartShift section).
• formalization of the folklore that purely price-driven approaches have negative consequences
in a world with risk-averse consumers (SmartShift section).
2.2 Background
We study a market model with distribution companies (or utilities - the suppliers of electricity)
and customers (consumers of electricity), each of whom is exposed to different prices, with
associated risks and properties.
Prices seen by customers: Flat-rate pricing is the de facto standard for retail electricity, where
every unit costs the same fixed amount. Homes, for example, prefer this: unlike businesses, whose
revenues may grow in line with consumption, homes have a strong inbuilt preference for stable costs.
However, flat-rate pricing lacks any incentive for households to make rational usage decisions,
especially during peak hours. We show that under flat-rate pricing risk-averse consumers enjoy high
welfare due to the lack of volatility, but (1) it cannot eliminate the peak load problem, since there
is no slot-wise price differentiation, and as a result (2) the distribution company may face profit
fluctuations.
With real-time pricing, peak reduction is achieved by charging consumers higher rates when
the overall electricity load is high, encouraging them to reduce consumption [60]. However, as we
will discuss in more detail, the fluctuating nature of market prices will discourage consumption by
risk-averse consumers, while also being known to increase grid volatility [70]. Real-time pricing
will result in a decrease in the absolute total consumption of electricity [27] and, consequently, in
the total revenue of the distribution company (utility). Participation in most voluntary real-time
pricing (RTP) programs has decreased; in particular, "Between 2000 and 2003, half of all programs in
existence prior to 2000 lost 25% or more of their participants, while only two programs saw
participation increase" [12], likely due to price volatility.
Prices seen by the distribution company: The time varying market, procurement or generation
price of electricity can be modeled in different ways. The price can be modeled as endogenous
or exogenous. We use the term Endogenous Market Price to denote a price that depends on the
total electricity consumption at each time slot. This can be a useful model in situations where the
number of consumers is relatively large, with their total consumption affecting market price. The
resulting price is typically modeled as a quadratic function of the total electricity consumption; the
quadratic or piecewise quadratic nature of generation and distribution cost curves is usually due to
a multiplicity of fuel sources and generation modes. See [79, 38] for detailed justification of this
common model.
Exogenous Market Prices are independent of the actions of the distribution company, its customers,
or their consumption. This is usually the case when the number of responding consumers is relatively
small, or when a large background electricity load dominates any endogenous price effect.
In this case, we can just treat the real-time market price as a random variable
independent of the decisions made by customers. See [38, 19, 70] for some justification and analysis
of this stochastic pricing model.
Shortcomings of purely price-driven approaches: We prove, in the exogenous price model, both
that (i) risk-averse consumers have a lower expected social welfare and that (ii) the distribution
company gets lower expected revenue under real-time pricing when compared to flat-rate pricing.
On the other hand, we point out that the distribution company never makes a loss under real-time
pricing, but can incur losses with some probability under flat-rate pricing. This leads us to develop
a scheme with the advantages of both.
2.3 Approach
We propose SmartShift, an expanded load shifting mechanism, where consumers are incentivized
to shift their electricity load from peak time to non-peak time in exchange for an expanded
electricity load, while still incurring the same cost. Since consumers self-select, their social
welfare (of consuming electricity) is non-decreasing under this scheme. Observe that from a pragmatic
standpoint the unit of a home allows for the effective use of an in-kind incentive as a tool of
behavior change. A single appliance or user may not be able or willing to consume the incentive, but
a household in aggregate can consume the incentive more effectively via coordination among the
household members [26].
In addition to the development of a new in-kind incentive model, our work is distinguished
by a focus on computational issues. We prove that, in the endogenous price model, calculating the
optimal shifted load is NP-complete for both real-time and flat-rate pricing. In contrast, we show in
the exogenous price case that we can calculate the optimal shifted load in polynomial time.
We prove that employing SmartShift increases the expected profit of the distribution company
both under real-time pricing and flat-rate pricing. We employ numerical experiments to show that
it also decreases the probability of scenarios where the distribution company incurs losses, i.e. its
risk. Thus, in the exogenous price model the use of SmartShift with flat-rate pricing provides a
solution to the problems of Peak Load and Supply-Demand imbalance; the resulting electricity load
allocation simultaneously increases consumers’ social welfare, and distribution companies’ profits
and thus is a win-win solution.
2.4 Related work
Evidence from the use of sophisticated usage-based pricing schemes [24] indicates there is a
strong case to be made for managing a grid via incentive schemes. Price-based congestion control
schemes in the power systems literature (often based on successful protocols developed for Internet
congestion control [51]) that discourage consumption when the grid is loaded fall into four basic
categories: 1. day-ahead or time-of-use (TOU) pricing, 2. dynamic pricing, 3. back-off strategies,
and 4. tatonnement (or negotiation).
TOU pricing (particularly when users have significant storage, patience or flexibility) leads to
the herding problem, where large amounts of consumption are shifted to low-price regions, creating
new peaks [17]. Dynamic pricing, as we discuss, can lead to uncertainty and reduced consumption
by risk-averse customers. Randomized back-off approaches, where users back off on consumption
when the grid is loaded (motivated by CSMA protocols developed for wireless communication),
require trust that the agent will respond as expected to signals. This is not incentive compatible [14]
as the response strategies are unverifiable and it is in the agents’ interest to choose small back-off
windows which allow them to consume electricity with minimum inconvenience. In Tatonnement
[78], consumption agents and the distribution company exchange both price information and con-
sumption profile information until convergence. While they were not explicitly categorized as such,
tatonnement forms the basis of much work in this area including [59, 77, 52, 43]. Since typical
distribution grids could have millions of users and tens of millions of devices, computational
complexity is of paramount importance [46]. In this context, the use of in-kind incentives and attention
to computational complexity and risk-aversion makes our model and analysis novel.
2.5 Market model description
We now describe the overall electricity market model and explain the roles of the generation
companies, the distribution company and the end consumers. We describe the two pricing mechanisms
of the distribution company and then explain the two models for the market price of electricity:
(1) Endogenous Market Price; (2) Exogenous Market Price.
Generation companies generate electricity, which is purchased by distribution companies, which
in turn transfer it to the ($n$) end consumers and charge for this service. The distribution company
pays a price of $p_m^{(t)}$ per unit in time slot $t$, and must decide how to charge users - i.e.,
decide on a pricing mechanism.
We assume that both the distribution company and the end consumers know the statistical
properties of the market price $p_m^{(t)}$. For the Gaussian model we use, the mean $\mu_m$ and the
standard deviation $\sigma_m$ suffice. These can be learned, for example, from the long-run price
history. In the short term, i.e., over the period of a single billing cycle, say a day, only the
distribution company is assumed to know the market price $p_m^{(t)}$. A billing cycle is assumed to
consist of $k$ slots and, without loss of generality, we index $t$ in the next billing cycle as
$1 \le t \le k$. In this work, we study the market behavior (i.e., social welfare, revenue, etc.) in
the short term given the known long-term statistical information.
At the beginning of each (short-term) billing cycle, the distribution company will announce
the specific pricing mechanism. The price seen by the consumers, $p_c^{(t)}$, will then be
(1) real-time pricing: $p_c^{(t)} = p_m^{(t)}$; or (2) flat-rate pricing: $p_c^{(t)} = \mu_m$,
where $\mu_m$ is the mean of $p_m^{(t)}$.
Risk-averse consumers: utility, valuation: Each consumer $i$ has a valuation function
$V_i^{(t)} : x_i^{(t)} \to \mathbb{R}$ which reflects the value consumer $i$ receives when she
consumes $x_i^{(t)}$ units of electricity at time slot $t$. In general, the valuation function could
be any concave increasing function, and all the results in this chapter continue to hold at a
qualitative level. For the sake of concreteness, we assume that consumers' valuation function
$V_i^{(t)}(\cdot)$ has the form $V_i^{(t)} = \alpha_i^{(t)} \log x_i^{(t)}$.
Intuitively, each additional unit of electricity adds less value as consumption grows.
$\alpha_i^{(t)}$ can be seen as a parameter specific to consumer $i$ that pegs the logarithmically
increasing valuation curve to the value that consumer $i$ puts on consuming $x_i^{(t)}$ units of
electricity at time slot $t$.
Under flat-rate pricing, consumer $i$ estimates her net utility $U_i^{(t)}$ of consuming
$x_i^{(t)}$ units of electricity at time slot $t$ as $U_i^{(t)} = V_i^{(t)} - \mu_c x_i^{(t)}$.
As explained earlier, under uncertain real-time pricing, consumers tend to be risk-averse.
Risk-averse consumers will take the additional risk of price fluctuation into consideration when
they estimate their net valuation; i.e., the net utility function for risk-averse consumers should
be revised as $U_i^{(t)} = V_i^{(t)} - (\mu_c + \lambda_i \sigma_c) x_i^{(t)}$, where $\sigma_c$ is
the standard deviation of the price $p_c^{(t)}$ and $\mu_c$ is its average over the $k$ time slots.
$\sigma_c$ represents the degree of price fluctuation; $\lambda_i$ ($\lambda_i \ge 0$) quantifies
the level of risk-aversion of consumer $i$. Specifically, $\lambda_i = 0$ captures the risk-neutral
consumer, while $\lambda_i > 0$ models the risk-averse consumer ($\lambda_i < 0$ would model a
risk-seeking consumer, a type rarely seen in markets).
Electricity consumption, social welfare and profit: We apply our incentive mechanism to the two
different pricing mechanisms - real-time and flat-rate - of the distribution company and analyze the
resulting market behavior.
Real-time pricing: The distribution company charges the consumers a time-varying price of
$p_c^{(t)} = p_m^{(t)}$ over the time slots of the billing cycle. Clearly, $\mu_c = \mu_m$ and
$\sigma_c = \sigma_m$. With predictions (possibly gleaned from long-run observations) of the mean
and variance of the real-time electricity price, a rational consumer will consume $x_i^{(t)}$ units
of electricity so as to maximize the net utility $U_i^{(t)}$. By the first-order condition
$dU_i^{(t)}/dx_i^{(t)} = 0$, the net utility $U_i^{(t)}$ at time slot $t$ is maximized when
$x_i^{(t)*} = \alpha_i^{(t)}/(\mu_m + \lambda_i \sigma_m)$. Thus, risk-averse consumers will
consume less electricity when the real-time market price has a large standard deviation (large
fluctuations and uncertainty in price).
Flat-rate pricing: The distribution company charges the end consumers a fixed price
$p_c^{(t)} = \mu_m$ over the time slots of the billing cycle. Since the consumers know that they
will be charged a flat-rate price equal to the mean of the market price, they will infer that
$\mu_c = \mu_m$ and $\sigma_c = 0$, and will respond by consuming the amount
$x_i^{(t)*} = \alpha_i^{(t)}/\mu_m$ (even though they are risk-averse).
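The two best responses above can be checked numerically; the parameter values below are purely illustrative:

```python
# Sketch of the consumer best responses derived above. A consumer with
# valuation V = alpha * log(x) and perceived unit price mu + lam * sigma
# maximizes U = alpha*log(x) - (mu + lam*sigma)*x at x* = alpha/(mu + lam*sigma).

def best_response(alpha, mu, sigma, lam):
    return alpha / (mu + lam * sigma)

alpha, mu, sigma, lam = 10.0, 2.0, 0.5, 1.0

x_rt = best_response(alpha, mu, sigma, lam)   # real-time: faces full volatility
x_fr = best_response(alpha, mu, 0.0, lam)     # flat-rate: sigma_c = 0

# A risk-averse consumer scales back under volatile real-time prices:
assert x_rt < x_fr   # 4.0 < 5.0
```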
Social welfare of consumers: After each consumer is charged price $p_c^{(t)}$ at time slot $t$,
the actual net utility she receives is $U_i^{(t)\prime} = V_i^{(t)} - p_c^{(t)} x_i^{(t)}$, and the
social welfare $W$ of all the consumers is
\[ W = \sum_{t=1}^{k} \sum_{i=1}^{n} U_i^{(t)\prime}. \]
The profit $\eta$ that the distribution company gains is
\[ \eta = \sum_{t=1}^{k} \sum_{i=1}^{n} (p_c^{(t)} - p_m^{(t)}) x_i^{(t)}. \]
$\eta < 0$ means that the distribution company incurs losses with respect to its expected margins;
$\eta > 0$ means that the distribution company profits. Note that in general, in a monopoly, a
company can ensure profits by overcharging (increasing $p_c^{(t)}$ arbitrarily); in that case $\eta$
quantifies the difference from expected profits. In addition, in most markets, electricity companies
are at minimum semi-regulated and required to fix an operating margin. We avoid these complications
by setting $p_c^{(t)} = p_m^{(t)}$.
Recall that the market price $p_m^{(t)}$ can be either endogenous or exogenous.
Endogenous market price: When the total consumption of a group of end consumers is large
enough to dominate the overall electricity load of the grid, the cost of generating electricity
depends on those end consumers' total consumption, and so does the market price $p_m^{(t)}$.
Concretely, we assume $p_m^{(t)}$ has the form $p_m^{(t)} = Q(\sum_{i=1}^{n} x_i^{(t)})$, and in
particular that $Q(\cdot)$ has a quadratic form, i.e., $Q(x) = ax^2 + bx + c$. In this case, due to
the convex nature of the problem, the market equilibrium is the fixed point of the interaction
between the consumers and the supplier. It can hence be computed iteratively: starting with an
initial estimate of the mean $\mu_m$ and standard deviation $\sigma_m$, the end consumers respond
with their electricity demand $\{x_i^{(t)}\}$. Based on this response, the endogenous market prices
are generated by the quadratic function $Q(\sum_{i=1}^{n} x_i^{(t)})$. These prices are then used
as input for the next round of computation until convergence is achieved.
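The fixed-point iteration just described can be sketched as follows. This is an illustrative toy, not the thesis's implementation: the quadratic coefficients are hypothetical, and the consumer response $x = \alpha/(\mu + \lambda\sigma)$ is the one derived later in this section:

```python
import statistics

# Sketch of the fixed-point iteration for the endogenous price model:
# consumers respond to (mu, sigma) estimates, prices follow Q(total load),
# and the estimates are updated until they converge.

def Q(x, a=0.01, b=0.1, c=1.0):            # quadratic generation-cost price
    return a * x * x + b * x + c

def equilibrium(alpha, lam, k, tol=1e-9, max_iter=1000):
    """alpha[i][t]: valuation parameter of consumer i in slot t
    lam[i]: risk-aversion level of consumer i; k: slots per cycle"""
    mu, sigma = 1.0, 0.0                    # initial price estimates
    for _ in range(max_iter):
        # consumer best response x = alpha / (mu + lam * sigma)
        loads = [sum(alpha[i][t] / (mu + lam[i] * sigma)
                     for i in range(len(alpha))) for t in range(k)]
        prices = [Q(L) for L in loads]
        new_mu = statistics.mean(prices)
        new_sigma = statistics.pstdev(prices)
        if abs(new_mu - mu) < tol and abs(new_sigma - sigma) < tol:
            return new_mu, new_sigma, prices
        mu, sigma = new_mu, new_sigma
    return mu, sigma, prices
```

For mild quadratic coefficients the update map is a contraction and the loop converges quickly; as noted below, convergence is not guaranteed in general.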
Exogenous market price: When the total consumption of a group of end consumers is too small to
meaningfully affect the overall electricity load in the grid, the market price $p_m^{(t)}$ can be
assumed to be independent of consumers' consumption. Under the exogenous model, we assume the
$p_m^{(t)}$ ($1 \le t \le k$) are independent random variables drawn from a normal distribution
$p_m^{(t)} \sim N(\mu_m, \sigma_m^2)$.
2.6 SmartShift
SmartShift is an alternative, expanded-load-shifting incentive mechanism that addresses the
challenges of peak demand and supply-demand imbalance detailed in the Introduction. To guide the
design of our mechanism, we first ask two questions: (1) What incentivizes the consumers to shift
their electricity load? (2) What goal will the distribution company achieve after it offers the
incentives to the consumers? The answers to these two questions provide a win-win solution for
both the consumers and the distribution company. We then study how to achieve the optimal solution
under (1) endogenous and (2) exogenous market prices.
What is the incentive for consumers to shift load?: Each rational consumer $i$ will only agree to
shift her consumption $x_i^{(s)}$ from time slot $s$ to time slot $t$ if her valuation does not
decrease. This gives us the following result.

Theorem 1. With the same payment $p'$, consumer $i$ will have no decrease in utility if she shifts
$x_i^{(s)}$ from time slot $s$ to time slot $t$ with expansion, which results in an expanded load of
$m_i^{(s\to t)} x_i^{(s)}$ at time slot $t$. The necessary and sufficient condition, therefore, is
for the expansion ratio $m_i^{(s\to t)}$ to satisfy the following:
\[ m_i^{(s\to t)} \ge \max\left\{\exp\left(\frac{(\alpha_i^{(s)} - \alpha_i^{(t)}) \log x_i^{(s)}}{\alpha_i^{(t)}}\right),\; 1\right\}. \tag{2.1} \]
Proof. Since the payment does not change after consumer $i$ gets her expanded load at a new time
slot, we only need to guarantee that her net utility does not decrease after shifting. Assuming
that the original electricity consumption at different time slots is heterogeneous (the consumption
at different times cannot be added up as a whole), we then just need to make sure the following is
satisfied:
\[ \alpha_i^{(t)} \log\left(m_i^{(s\to t)} \cdot x_i^{(s)}\right) \ge \alpha_i^{(s)} \log x_i^{(s)} \;\Longleftrightarrow\; m_i^{(s\to t)} \ge \exp\left(\frac{(\alpha_i^{(s)} - \alpha_i^{(t)}) \log x_i^{(s)}}{\alpha_i^{(t)}}\right). \tag{2.2} \]
Meanwhile, $m_i^{(s\to t)}$ should be no less than 1, since the original electricity load demand
should be satisfied. Thus, we get Eq. (2.1).
This gives us a lower bound on the expansion ratio of a consumer. How close a user's required
expansion ratio is to her baseline is a measure of her tolerance or flexibility: the smaller the
required expansion ratio, the greater the tolerance; conversely, the larger the expansion ratio,
the more intolerant the consumer is of the slot change.
What is the benefit to the distribution company?: The distribution company will agree to shift
consumer $i$'s load $x_i^{(s)}$ from time slot $s$ to slot $t$ with an expansion ratio
$m_i^{(s\to t)}$ for the same payment only if it benefits the distribution company to do so. For
load $x_i^{(s)}$, the distribution company obtains a profit
$\eta = (p_c^{(s)} - p_m^{(s)}) x_i^{(s)}$. After the expanded load is shifted to time slot $t$,
the profit becomes $\eta' = p_c^{(s)} x_i^{(s)} - p_m^{(t)} m_i^{(s\to t)} x_i^{(s)}$. As long as
$p_m^{(s)} > p_m^{(t)} \cdot m_i^{(s\to t)}$, the distribution company stands to increase its profit
($\eta' - \eta > 0$) and hence will agree to the expanded load shifting.
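The consumer-side condition of Eq. (2.1) and the company-side condition $p_m^{(s)} > p_m^{(t)} \cdot m_i^{(s\to t)}$ can be combined in a small sketch; the numbers and helper names are illustrative, not from the thesis:

```python
import math

# Eq. (2.1) gives the minimum expansion ratio that keeps consumer i's
# utility from decreasing; a shift benefits the distribution company only
# when p_m^(s) > p_m^(t) * m_i^(s->t).

def min_expansion_ratio(alpha_s, alpha_t, x_s):
    """Lower bound of Eq. (2.1) for shifting load x_s from slot s to t."""
    return max(math.exp((alpha_s - alpha_t) * math.log(x_s) / alpha_t), 1.0)

def shift_is_win_win(alpha_s, alpha_t, x_s, pm_s, pm_t):
    m = min_expansion_ratio(alpha_s, alpha_t, x_s)
    return pm_s > pm_t * m          # company's profit strictly increases

# Shifting 4 units out of a slot the consumer values more (alpha_s=3,
# alpha_t=2) needs m >= exp((3-2)*ln 4 / 2) = 2, so it is worthwhile
# only if the peak price exceeds twice the off-peak price.
m = min_expansion_ratio(3.0, 2.0, 4.0)                       # -> 2.0
ok = shift_is_win_win(3.0, 2.0, 4.0, pm_s=5.0, pm_t=2.0)     # 5 > 2*2 -> True
```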
Protocol 1 The message exchange protocol for SmartShift
STEP 1: The distribution company and the end consumers exchange information, or observe common
historical data, to achieve consensus on the long-term mean $\mu_m$ and standard deviation
$\sigma_m$ of the market price.
STEP 2: The $n$ end consumers communicate their consumption
$\{x_i^{(t)} \mid 1 \le i \le n, 1 \le t \le k\}$ over the next period of $k$ time slots, together
with their expansion ratios (which must be satisfied if their loads are to be shifted)
$\{m_i^{(s\to t)} \mid 1 \le i \le n, 1 \le s \le k, 1 \le t \le k\}$, to the distribution company.
STEP 3: The distribution company computes the optimal load shifting and communicates the resulting
load allocation to the end consumers.
Our expanded-load-shifting incentive mechanism, SmartShift, meets both these requirements
and is supported by a protocol (Protocol 1) that exchanges messages between the distribution
company and the end consumers.
We now study how to achieve optimal load shifting in both cases - endogenous and exogenous
prices.
Endogenous market price: optimal load shifting
Theorem 2. Computing optimal load shifting under endogenous market price is (weakly) NP-
complete.
Proof. It is clear that the optimal shifting problem is in NP, so all that remains is to show that
it is NP-hard. The proof is by reduction from the Partition problem [33], which is (weakly)
NP-complete. In the Partition problem we are given a set of positive numbers
$\{a_1, a_2, \ldots, a_n\}$ summing to $2S$, and the decision question is whether there exists a
partition of the set into two subsets such that the numbers in each subset sum to $S$. To reduce a
given instance of the Partition problem to the endogenous price case we assume there are $n$
consumers and two time slots; the $i$th consumer wishes to consume $a_i$ units of electricity and is
indifferent as to which slot she consumes it in; the price of electricity in a slot is just the
load. Now it is easy to see that a partition in which each subset sums to $S$ exists if and only if
the minimum total revenue is $2S^2$; if such a partition does not exist then the total revenue will
strictly exceed $2S^2$.
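This reduction is easy to check by brute force on tiny instances (exponential in $n$, for intuition only): with two slots and slot price equal to slot load, total revenue is $L_1^2 + L_2^2$, which is minimized at $2S^2$ exactly when an equal partition exists.

```python
from itertools import product

# Brute-force check of the Partition reduction: enumerate all 2^n ways to
# split the loads between the two slots and take the minimum of L1^2 + L2^2.

def min_revenue(a):
    return min(sum(L * L for L in
                   (sum(x for x, b in zip(a, bits) if b),
                    sum(x for x, b in zip(a, bits) if not b)))
               for bits in product([0, 1], repeat=len(a)))

a = [1, 2, 3, 4]                     # sums to 10, S = 5; split {1,4} vs {2,3}
assert min_revenue(a) == 2 * 5 * 5   # partition exists -> revenue 2*S^2

b = [1, 1, 3]                        # sums to 5 (odd), no equal partition
assert min_revenue(b) > 2 * 2.5 * 2.5
```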
In the endogenous case, both real-time pricing and flat-rate pricing require solving the following
quadratic integer programming problem, which is NP-complete [39].
Let $s_i^{(t'\to t)} \in \{0,1\}$ indicate whether consumer $i$ is assigned to shift her load from
$t'$ to $t$: $s_i^{(t'\to t)} = 1$ if she is assigned, and $s_i^{(t'\to t)} = 0$ otherwise. Then
the total load $L^{(t)}$ at slot $t$ is
\[ L^{(t)} = \sum_{i=1}^{n} \sum_{t'=1}^{k} m_i^{(t'\to t)} x_i^{(t')} s_i^{(t'\to t)} \tag{2.3} \]
\[
\begin{aligned}
\underset{\{s_i^{(t'\to t)}\}}{\text{minimize}} \quad & \sum_{t=1}^{k} a^{(t)} \left(L^{(t)}\right)^2 + b^{(t)} L^{(t)} + c^{(t)} \\
\text{subject to} \quad & s_i^{(t'\to t)} \in \{0,1\}, \quad \forall i, t, t', \\
& \sum_{t} s_i^{(t'\to t)} = 1, \quad \forall i, t', \\
& 1 \le i \le n, \quad 1 \le t, t' \le k, \quad i, t, t' \in \mathbb{Z},
\end{aligned} \tag{2.4}
\]
where $a^{(t)}$, $b^{(t)}$, $c^{(t)}$ are the coefficients of the function $Q(\cdot)$ that
determines the endogenous market price.
With the optimal solution $\{s_i^{(t'\to t)}\}$ of Eq. (2.4), we obtain the real-time loads
$\{L^{(t)}\}$ via Eq. (2.3). Applying the function $Q(\cdot)$ to $\{L^{(t)}\}$ yields the real-time
prices $\{p_m^{(t)}\}$. Then, under real-time pricing, the customers are charged at the rate of
$p_m^{(t)}$ at time slot $t$; under flat-rate pricing they are charged at the rate of $\mu_m$.
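For intuition, Eq. (2.4) can be solved by brute force on tiny instances. This sketch is illustrative only (exponential time; slot-independent coefficients $a, b, c$ are assumed for simplicity, and the helper name is hypothetical):

```python
from itertools import product

# Tiny brute-force solver for Eq. (2.4): choose, for each (consumer, source
# slot), one destination slot so that sum_t a*L_t^2 + b*L_t + c is minimized,
# with L_t computed as in Eq. (2.3).

def brute_force_shift(x, m, a=1.0, b=0.0, c=0.0):
    """x[i][t]: original load; m[i][t][t2]: expansion ratio for t -> t2."""
    n, k = len(x), len(x[0])
    best_cost, best = float("inf"), None
    # one destination per (i, t) pair, i.e. sum_t2 s_i^(t->t2) = 1
    for dest in product(range(k), repeat=n * k):
        L = [0.0] * k
        for idx, t2 in enumerate(dest):
            i, t = divmod(idx, k)
            L[t2] += m[i][t][t2] * x[i][t]
        cost = sum(a * Lt * Lt + b * Lt + c for Lt in L)
        if cost < best_cost:
            best_cost, best = cost, dest
    return best_cost, best
```

With a single quadratic term the optimum spreads load evenly, mirroring the Partition reduction above.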
The iterative process of generating the market equilibrium when employing SmartShift under
endogenous market prices can be described in Alg. 2.
We make a further comment for the special setting where the consumption of electricity is discrete,
i.e., users may shift entire appliances or units but not arbitrary amounts. In this setting it is
not hard to create examples where an equilibrium (in terms of prices and load distribution) may not
exist. Suppose utilities and prices are such that, if a user doesn't consume a unit of electricity,
then the resulting lower price makes it favorable for her to in fact consume the unit; but, in doing
so, the load is driven up and along with it the price (which depends quadratically on the load),
thus making it unfavorable to consume the additional unit. Thus, determining endogenous prices with
all-or-nothing or discrete consumption is an inherently hard problem, where an equilibrium may not
exist.

Algorithm 2 Optimal Load Shifting under Endogenous Market Price
Input: $\{\alpha_i^{(t)} \mid 1 \le i \le n, 1 \le t \le k\}$, $\{m_i^{(s\to t)} \mid 1 \le i \le n, 1 \le s \le k, 1 \le t \le k\}$, $\mu_m$, $\sigma_m$.
Output: $\{x_i^{(t)} \mid 1 \le i \le n, 1 \le t \le k\}$, $\{s_i^{(s\to t)} \mid 1 \le i \le n, 1 \le s \le k, 1 \le t \le k\}$.
1: repeat
2:   for each consumer $i$ do
3:     for each time slot $t$ do
4:       calculate $x_i^{(t)}$ by real-time pricing or flat-rate pricing;
5:     end for
6:   end for
7:   solve the nonlinear integer optimization problem in Eq. (2.4) to obtain the optimal solution $\{s_i^{(s\to t)}\}$;
8:   calculate $\{L^{(t)}\}$ by Eq. (2.3);
9:   apply the function $Q(\cdot)$ to $\{L^{(t)}\}$ to obtain the endogenous market prices $\{p_m^{(t)}\}$;
10:  update $\mu_m$, $\sigma_m$;
11: until $\mu_m$ and $\sigma_m$ converge
Exogenous market price: optimal load shifting: In the exogenous market price case, load shifting
never affects the market prices $\{p_m^{(t)}\}$. To achieve the optimal load shifting, each
consumer $i$'s load $x_i^{(s)}$ should be shifted to the time slot $t^* = z_i^{(s)}$ at which
$p_m^{(t^*)} \cdot m_i^{(s\to t^*)}$ is minimal over all time slots. Computing the optimal load
shifting under exogenous market prices, as in Alg. 3, takes $O(nk^2)$ time.
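Since exogenous prices are unaffected by shifting, the slot-wise argmin of Alg. 3 can be written directly; a minimal sketch (hypothetical function name):

```python
# Sketch of the O(n k^2) procedure for exogenous prices: each load x_i^(t)
# moves to the slot t' minimizing m_i^(t->t') * p_m^(t'), because shifting
# cannot change exogenous prices.

def optimal_shift(m, p):
    """m[i][t][t2]: expansion ratio for consumer i shifting slot t -> t2
    p[t]: exogenous market price in slot t
    returns z[i][t]: destination slot for consumer i's load in slot t"""
    n, k = len(m), len(p)
    return [[min(range(k), key=lambda t2: m[i][t][t2] * p[t2])
             for t in range(k)]
            for i in range(n)]

# One consumer, two slots: peak price 4, off-peak price 1; moving the
# peak-slot load off-peak costs 2.0 * 1 = 2 < 4, so both loads land in slot 1.
m = [[[1.0, 2.0], [2.0, 1.0]]]
z = optimal_shift(m, [4.0, 1.0])
# z == [[1, 1]]
```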
Exogenous market price: real-time vs flat-rate: We now study how the two pricing mechanisms
(real-time pricing and flat-rate pricing) perform differently with respect to social welfare,
revenue and profit in the exogenous market price model. Specifically, we have the following
theorems.

Theorem 3. Given exogenous market prices, the expected social welfare $W$ under real-time pricing
is less than the expected social welfare $W'$ under flat-rate pricing.

Proof. We only need to show that, at each time slot $t$, each consumer $i$'s expected utility
$E(U_i^{(t)})$ under real-time pricing is less than her expected utility $E(U_i^{(t)\prime})$ under
flat-rate pricing. Then, since the expected social welfare is just the sum of each consumer's
expected utility over all time slots, the conclusion follows.
20
CHAPTER 2. TEMPORAL LOAD BALANCING FOR ELECTRICITY MARKET
Algorithm 3 Optimal load shifting under exogenous market price
Input: $\{x_i^{(t)} \mid 1 \le i \le n,\ 1 \le t \le k\}$, $\{m_i^{(s\to t)} \mid 1 \le i \le n,\ 1 \le s \le k,\ 1 \le t \le k\}$.
Output: $\{z_i^{(t)} \mid 1 \le i \le n,\ 1 \le t \le k\}$.
1: for each consumer i do
2:   for each time slot t do
3:     $z_i^{(t)} \leftarrow \arg\min_{1 \le t' \le k} \{\, m_i^{(t\to t')} \cdot p_m^{(t')} \,\}$;
4:   end for
5: end for
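Algorithm 3 is straightforward to implement. The following minimal Python sketch (variable names `prices` and `expand` are ours) represents the exogenous prices $p_m^{(t)}$ as a list and the expansion ratios $m_i^{(s\to t)}$ as one $k \times k$ table per consumer:

```python
def optimal_shift(prices, expand):
    """Algorithm 3: for every consumer i and source slot s, choose the
    destination slot t minimizing expand[i][s][t] * prices[t].

    prices: list of k exogenous market prices p_m^(t)
    expand: expand[i][s][t] = expansion ratio m_i^(s->t) (with the
            convention expand[i][s][s] = 1: no shift, no expansion)
    Returns z with z[i][s] = argmin_t expand[i][s][t] * prices[t].
    """
    k = len(prices)
    z = []
    for m_i in expand:  # one k-by-k expansion table per consumer: O(n*k^2) total
        z.append([min(range(k), key=lambda t: m_i[s][t] * prices[t])
                  for s in range(k)])
    return z
```

The nested scan over source and destination slots gives the $O(nk^2)$ running time noted above.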
Under the real-time pricing mechanism, each consumer $i$ responds with the consumption $x_i^{(t)*} = \alpha_i^{(t)} / (\mu_m + \beta_i \sigma_m)$. The expected utility for consuming $x_i^{(t)*}$ electricity is

$$E(U_i^{(t)}) = E\big(\alpha_i^{(t)} \log x_i^{(t)*} - p_m^{(t)} x_i^{(t)*}\big) = \alpha_i^{(t)} \log x_i^{(t)*} - E(p_m^{(t)})\, x_i^{(t)*} = \alpha_i^{(t)} \log x_i^{(t)*} - \mu_m x_i^{(t)*}. \quad (2.5)$$
Under the flat-rate pricing mechanism, each consumer $i$ responds with the consumption $x_i^{(t)*\prime} = \alpha_i^{(t)} / \mu_m$. In this case, the expected utility is

$$E(U_i^{(t)\prime}) = \alpha_i^{(t)} \log x_i^{(t)*\prime} - \mu_m x_i^{(t)*\prime}. \quad (2.6)$$

The function $f(z) = \alpha \log z - \mu z$ achieves its maximum at $z = \alpha/\mu$, which is exactly the flat-rate response. Thus we can conclude that $E(U_i^{(t)}) < E(U_i^{(t)\prime})$.
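The inequality can be checked numerically; this small sketch uses illustrative values of our own choosing for $\alpha$, $\mu_m$, $\beta_i$ and $\sigma_m$:

```python
import math

def expected_utility(alpha, mu, x):
    """E[U] = alpha*log(x) - mu*x: the common form of Eqs. (2.5)-(2.6)
    once E(p_m) = mu is substituted."""
    return alpha * math.log(x) - mu * x

alpha, mu, beta, sigma = 500.0, 50.0, 1.0, 5.0
x_rt = alpha / (mu + beta * sigma)  # real-time response of a risk-averse consumer
x_fr = alpha / mu                   # flat-rate response; maximizes alpha*log(z) - mu*z
u_rt = expected_utility(alpha, mu, x_rt)
u_fr = expected_utility(alpha, mu, x_fr)
```

Since $f(z) = \alpha \log z - \mu z$ peaks exactly at the flat-rate response $z = \alpha/\mu$, any $\beta_i \sigma_m > 0$ makes `u_rt` strictly smaller than `u_fr`.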
Theorem 4. Given exogenous market prices, the expected revenue $R_{RT}$ in real-time pricing is less than the expected revenue $R_{FR}$ in flat-rate pricing.
Proof. We continue using the notation from the previous proof. The revenues $R_{RT}$ and $R_{FR}$ under the two pricing mechanisms satisfy

$$E(R_{RT}) = E\Big(\sum_{i=1}^{n} \sum_{t=1}^{k} p_m^{(t)} x_i^{(t)*}\Big) = \sum_{i=1}^{n} \sum_{t=1}^{k} \mu_m x_i^{(t)*} < \sum_{i=1}^{n} \sum_{t=1}^{k} \mu_m x_i^{(t)*\prime} = E(R_{FR}). \quad (2.7)$$
Theorem 5. Given exogenous market prices, real-time pricing always leads to zero profit ($\eta_{RT} = 0$), while flat-rate pricing leads to zero profit in expectation ($E(\eta_{FR}) = 0$).
(Recall that profit and loss refer to money gained or lost beyond the expected or pre-set margins.)
Proof. In real-time pricing, the charging price $p_c^{(t)}$ always equals $p_m^{(t)}$; thus $\eta_{RT}$ is exactly zero. In flat-rate pricing, the charging price is $p_c^{(t)\prime} = \mu_m$. Then the expectation of $\eta_{FR}$ is

$$E(\eta_{FR}) = E\Big(\sum_i \sum_t (\mu_m - p_m^{(t)})\, x_i^{(t)}\Big) = \sum_i \sum_t \big(\mu_m - E(p_m^{(t)})\big)\, x_i^{(t)} = 0. \quad (2.8)$$
Theorem 5 shows that under real-time pricing the distribution company always achieves the anticipated profit per electricity unit, whereas under flat-rate pricing it achieves the anticipated profit per unit only on average (in expectation). In other words, under flat-rate pricing the distribution company will sometimes incur a loss; this happens roughly 50% of the time if the exogenous market price is drawn from a normal distribution. In summary, the flat-rate pricing mechanism has the advantage of encouraging consumers to consume the electricity they need (larger social welfare) without being constrained by considerations of price risk, but this risk is shifted to the distribution company, which can incur losses, i.e. $\eta < 0$, in some cases. However, we show, using simulations, that employing our expanded load-shifting incentive mechanism, SmartShift, with flat-rate pricing reduces the probability of such loss-incurring scenarios.
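The roughly-50% figure is easy to reproduce with a quick Monte Carlo sketch (parameter values and function names are ours): with symmetric normally distributed prices, the flat-rate profit $\sum_t (\mu_m - p_m^{(t)})\, x^{(t)}$ has mean zero and is negative about half the time.

```python
import random

def loss_probability(mu, sigma, loads, trials=20000, seed=7):
    """Estimate P(eta_FR < 0): the fraction of trials in which the
    flat-rate profit sum_t (mu - p_t) * x_t is negative, where the
    prices p_t are i.i.d. draws from N(mu, sigma^2)."""
    rng = random.Random(seed)
    losses = sum(
        1 for _ in range(trials)
        if sum((mu - rng.gauss(mu, sigma)) * x for x in loads) < 0
    )
    return losses / trials
```

With 24 hourly slots and unit loads, the estimate comes out close to one half, as Theorem 5's discussion predicts.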
2.7 Simulations
We evaluate SmartShift using a real-time electricity load data set from Smart* [13]. Through numerical experiments, we then study how our expanded load-shifting mechanism can improve performance over flat-rate pricing while providing a win-win solution for both the distribution company and the consumer.
We obtain the real-time electricity loads $\{x_i^{(t)}\}$ from the Microgrid Data Set [13], which includes average real power usage (kilowatts) at a one-minute sampling rate from 400 homes over 24 hours ($k = 24$). We average the per-minute samples within each hour to get the hourly averaged electricity usage, and filter out households with zero power usage, resulting in $n = 395$ profiles.
We simulate the real-time market price $\{p_m^{(t)}\}$ using a normal distribution $N(\mu_m, \sigma_m^2)$, motivated by the model of [69]; we obtained similar results for other distributions. We fix $\mu_m$ and vary $\sigma_m$ to study how price fluctuation affects the market. We generate the valuation coefficients $\{\alpha_i\}$ as follows: we assume that the electricity load profiles are generated under a flat-rate pricing model with mean price $\mu_m = 50$, so that $\alpha_i^{(t)} = x_i^{(t)} \cdot \mu_m$. Similar results were obtained with other settings. We draw the risk-aversion parameters $\{\beta_i\}$ from a Pareto distribution as follows:
$$f_{pdf}(\beta_i) = \frac{\gamma_R \cdot \beta_{min}^{\gamma_R}}{\beta_i^{\gamma_R + 1}} \quad (2.9)$$
where $f_{pdf}$ gives the probability density function of the Pareto distribution; $\beta_{min}$ sets the minimum value of $\beta_i$; and $\gamma_R$ controls how much of the probability mass is close to $\beta_{min}$: the larger $\gamma_R$ is, the more of the probability mass of $\beta_i$ lies close to $\beta_{min}$. To generate the expansion ratios $\{m_i^{(s\to t)}\}$,
we first use Eq. (2.1) to calculate the minimum value $m_{i,min}^{(s\to t)}$. Then, we draw $m_i^{(s\to t)}$ from a Pareto distribution:
$$f_{pdf}\big(m_i^{(s\to t)}\big) = \frac{\gamma_G \cdot \big(m_{i,min}^{(s\to t)}\big)^{\gamma_G}}{\big(m_i^{(s\to t)}\big)^{\gamma_G + 1}} \quad (2.10)$$
As before, the larger $\gamma_G$ is, the larger the probability that $m_i^{(s\to t)}$ is close to the minimum expansion ratio $m_{i,min}^{(s\to t)}$. Thus $\gamma_G$ controls the dispersion in the tolerance of risk-averse consumers: the larger $\gamma_G$ is, the more tightly the population is concentrated around a high tolerance level.
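Both Pareto draws can be implemented by inverse-transform sampling, since the CDF of the density in Eqs. (2.9) and (2.10) inverts in closed form; a small sketch (ours):

```python
import random

def sample_pareto(x_min, gamma, rng):
    """Inverse-transform draw from the Pareto density
    f(x) = gamma * x_min**gamma / x**(gamma + 1), x >= x_min.
    The CDF F(x) = 1 - (x_min / x)**gamma inverts to x = x_min / u**(1/gamma)."""
    u = 1.0 - rng.random()  # uniform in (0, 1]; avoids division by zero
    return x_min / u ** (1.0 / gamma)

rng = random.Random(0)
betas = [sample_pareto(1.0, 1.0, rng) for _ in range(5)]   # risk-aversion beta_i
ratios = [sample_pareto(1.2, 2.0, rng) for _ in range(5)]  # expansion ratios m_i
```

Every draw lies at or above `x_min`, and larger `gamma` concentrates the samples near it, matching the role of $\gamma_R$ and $\gamma_G$ described above.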
Figure 2.1: Normalized revenue vs price volatility. [Plot: expected revenue (normalized) vs standard deviation of the market price, for SmartShift, flat-rate and real-time pricing.]
Risk-aversion and pricing mechanism: In all our simulations, each point is averaged over 100 repeated experiments, and 95% confidence intervals are shown. In Figure 2.1, we draw the risk-aversion parameters $\{\beta_i\}$ from the Pareto distribution of Eq. (2.9) with $\gamma_R = 1$ and $\beta_{min} = 1$; these are fixed for subsequent repetitions. Figure 2.1 shows that, while real-time pricing induces risk-averse consumers to contract usage as volatility increases (Theorem 4), SmartShift increases total revenue by simultaneously buffering consumers from price shocks and exploiting the increased volatility to shift load more optimally.
In Figure 2.2, we fix the price fluctuation by setting $\mu_m = 50$, $\sigma_m = 5$, but vary the tolerance of the consumers by varying $\gamma_G$ from 0.5 to 5 in steps of 0.5. For each $\gamma_G$, we sample a set of prices from the normal distribution $N(\mu_m, \sigma_m^2)$, then sample the $\{m_i^{(s\to t)}\}$ from Eq. (2.10) with the fixed $\{m_{i,min}^{(s\to t)}\}$ and the varying $\gamma_G$. We show that the larger $\gamma_G$ is (i.e., the more tightly consumers are clustered around a high tolerance level), the smaller the probability of the distribution company incurring a loss $\eta < 0$. This relationship follows from the fact that when the desired expansion ratio $m_i^{(s\to t)}$ is small, the expanded load shift is more likely to be satisfied.
In Figure 2.3, we fix the price fluctuation, risk aversion and tolerance of the consumers. We sample prices from $N(\mu_m, \sigma_m^2)$ with $\mu_m = 50$, $\sigma_m = 5$, and risk aversion from Eq. (2.9) where
Figure 2.2: Loss probability vs consumer tolerance. [Plot: probability of loss vs Pareto distribution parameter $\gamma_G$, for flat-rate pricing and SmartShift.]
$\gamma_R = 1$, $\beta_{min} = 1$, and tolerance from Eq. (2.10) where $\gamma_G = 1$ and $m_{i,min}^{(s\to t)}$ is computed by Eq. (2.1). By sampling 1000 sets of prices, we compute the distribution of the profit $\eta$. Figure 2.3 shows that the probability mass moves to the right after applying SmartShift. This again demonstrates that SmartShift reduces the probability of the distribution company incurring a loss ($\eta < 0$).
Fig. 2.4 shows that, whereas real-time pricing induces risk-averse consumers to contract usage as volatility increases (Theorem 3), SmartShift increases total social welfare by simultaneously buffering consumers from price shocks and exploiting the increased volatility to shift load more optimally. We see that, for a fixed set of $\{\beta_i\}$, the larger the standard deviation (fluctuation) of the market price, the smaller the total social welfare of consumers under real-time pricing. Under flat-rate pricing, real-time changes in the market price do not affect the social welfare.
In Fig. 2.5 we fix the extent of market price fluctuation. We draw the 24 slots of market prices from the normal distribution $N(\mu_m, \sigma_m^2)$ with $\mu_m = 50$, $\sigma_m = 5$, and keep the prices fixed for subsequent repetitions. We vary $\gamma_R$ from 0.5 to 5 in steps of 0.5. For each $\gamma_R$ we draw a set of $\{\beta_i\}$, then compute the social welfare and revenue under both flat-rate and real-time pricing. Every point is averaged over 100 repeated experiments, and 95% confidence intervals are shown in the results.
Figure 2.3: Probability density function of profit. [Plot: probability vs profit $\eta$, for flat-rate pricing and SmartShift.]

Fig. 2.5(a) shows that the larger $\gamma_R$ is, the closer the expected social welfare under real-time pricing is to the expected social welfare under flat-rate pricing. This is because the larger $\gamma_R$ is, the more concentrated $\beta_i$ is around $\beta_{min}$ (the maximal tolerance level), which makes the risk-averse consumers use more electricity and leads to larger social welfare. Under SmartShift the same consumers use even more electricity because of the optimal shifting of load from peak to non-peak slots, leading to an even larger social welfare. Fig. 2.5(b) shows the same trend for the expected revenue under both pricing models.
2.8 Summary
We have presented a general incentive-based mechanism, SmartShift, for reducing the load on the electricity grid. Our scheme grants users increased consumption in exchange for reducing their usage in peak periods. We have shown analytically that SmartShift under flat-rate pricing is a win-win for both consumers (increased social welfare) and producers (enhanced profits). SmartShift has elements of algorithms for iterative price setting [59, 46], with the added features of in-kind incentives and a slot-pairwise bidding language. A separate study of each of these effects is of interest.
Figure 2.4: Normalized social welfare vs price volatility. [Plot: expected social welfare (normalized) vs standard deviation of the market price, for SmartShift, flat-rate and real-time pricing.]
Figure 2.5: Normalized social welfare and revenue of SmartShift, flat-rate pricing and real-time pricing vs Pareto distribution parameter $\gamma_R$. [(a) Expected normalized social welfare; (b) expected normalized revenue.]
Chapter 3
Fault-tolerant Spatial Load Balancing in Data Centers
$k$-HA (High Availability) is an important fault-tolerance property of VM placement in clouds and clusters: it is the ability to tolerate up to $k$ host failures by relocating VMs from failed hosts without disrupting other VMs. It has long been assumed [15] that deciding the existence of a $k$-HA placement is $\Sigma_3^P$-hard. In a surprising yet simple result we show that $k$-HA reduces to multiple knapsack and hence is in $NP = \Sigma_1^P$.
We propose a stochastic model for multiple knapsack that not only captures real-world workloads but also provides a uniform basis for comparing the efficiencies of different polynomial-time heuristics. We prove, using the central limit theorem and linear programming, that there exists a best polynomial-time heuristic, albeit one that is impractical to implement.

We then turn to industry practice and discuss the drawbacks of commonly used heuristics: First-fit, Best-fit, Worst-fit, MTHM and CSP. Load balancing is a fundamental customer requirement in industry. Based on a large real-world dataset of cluster workloads (from industry leader Nutanix), we show that the natural load-balancing heuristic, Water-filling, has several excellent properties. We compare and contrast Water-filling with MTHM using our stochastic model and find that Water-filling is a heuristic of choice.
3.1 Introduction
Cloud computing has established itself as a mainstay of modern computing infrastructure. Clouds enable the efficient utilization of resources on an as-needed basis by dynamically configuring these resources to accommodate varying workload needs. The core technology at the heart of public cloud data centers and private clusters is virtualization. Virtualization is the use of shared resources to create and operate virtual machines (VMs) [61]; a VM is an operating system or software that emulates the behavior of a computing system with a specified set of resource characteristics, such as CPU and memory capacity. Virtualization allows an application to execute on heterogeneous systems, and multiple applications to run in parallel. Virtualization also enables live migration, the movement of running VMs between hosts. The enormous flexibility afforded by virtualization enables the placement (mapping of VMs to physical hosts) and rebalancing of VMs for a variety of reasons, including performance and cost-efficient resource utilization [64]. In this chapter we are primarily concerned with fault tolerance, i.e. robustness to failures.
An important aspect of fault tolerance in a cluster is High Availability (HA) [54]. In general, HA is the property of ensuring continuity of services (applications) despite the failures of hosts, VMs or the applications themselves. In the context of this chapter we interpret HA narrowly as the ability to tolerate host failures by restarting the VMs from the failed hosts on backup hosts without affecting any other running VMs. We define $k$-HA to be the property of a placement that it can tolerate the failure of up to $k$ hosts (sequentially or in parallel). In practice, the system's response to one or more failures could involve restarting the failed VMs on other hosts; in platforms with HwPFA (Hardware Predicted Failure Analysis) alerts, it could also involve live migration of VMs in advance of an impending failure. In either case the key algorithmic requirement is ensuring the sufficiency of resources on the backup hosts to support the new VMs. In general, resources are multi-dimensional (CPU, memory, IO, etc.), but in this chapter we focus on a single resource, namely memory.
HA has been studied extensively in both the algorithms and the systems communities. Unfortunately, even the special case of 0-HA with a single resource is the bin packing problem and hence already NP-complete [34]. Partly in reaction to the hardness of HA, the systems community has turned to polynomial-time heuristics on the one hand, and to AI (artificial intelligence) and CSP (Constraint Satisfaction Programming) on the other. Heuristics, the choice of industry, are typically quick to return some solution but fail to provide guarantees on its quality. AI and CSP, areas of academic research, allow much greater expressiveness, involving additional constraints such as affinity and co-location requirements as well as multiple resource dimensions. However, both of these techniques are very slow, particularly CSP, which solves SAT (satisfiability, an NP-complete problem [34]) at its core.
In spite of the attention that HA has received, the fundamental theoretical questions remain unanswered: Given VMs (with sizes), hosts (with capacities) and $k$, how easy or hard is it to decide whether there exists a feasible $k$-HA placement? Given a placement and $k$, how easy or hard is it to decide whether the given placement is $k$-HA? On the practical side, the fundamental questions are: Given that typical inputs encountered in practice are not adversarially chosen (worst-case), how do we meaningfully compare the quality of different heuristics? Given that load balancing is a basic customer requirement, what is the best heuristic to use? In this chapter we make substantial progress towards answering these questions.
3.2 Related work
[54] is a comprehensive source on a variety of techniques leveraging virtualization for HA, including replication patterns such as active/active, active/passive and cold standby. Our work derives inspiration from [15], which formally defines the $k$-HA problem (they call it $k$-resilient HA). They observe that $k$-HA lends itself naturally to expression as a statement in Second-Order Logic, and present a transformation into a statement in First-Order Logic, thus allowing the constraints to be solved by a generic CSP solver. However, this transformation is not exact, in that there may be solutions to the original problem even though the transformed input to the CSP solver has none. We note that they extend the notion of $k$-HA to VMs: a $k$-HA VM must tolerate the failure of up to $k$ hosts through relocation. Our result can be seen as an exact representation of $k$-HA in First-Order Logic, though it does not apply to the notion of $k$-HA extended to VMs. We state our results in the language of complexity theory rather than in terms of logic systems, which more accurately captures the computational issues. [40] describes BtrPlace, a CSP solver customized for handling VM placements that allows the user considerable expressiveness in specifying custom requirements. Though BtrPlace is generally fast and scalable, the CSP solver is slow in checking the feasibility of HA requirements. [80] considers the related problem of replica VM placement constrained by bounded communication delays between the replicas; they prove the NP-completeness of their problem and compare a variety of heuristics. Replica placement is different from the strategy, considered in this chapter, of relocating VMs from failed hosts to other hosts.
From a systems perspective, related works include [47], which describes an HA-aware scheduler based on OpenStack; [45], which considers the issue of hardware redundancy; [71], which proposes an availability-aware scheduler that dynamically manages computing resources based on user-specified requirements; and [81], which describes a method for related VMs to share checkpoint images.
3.3 Approaches
The main technical contributions are as follows:
• k-HA is NP-complete. Counter to intuition, we prove that deciding the existence of a $k$-HA placement is NP-complete by a simple reduction to MKP (the Multiple Knapsack Problem) [57]. We note two things here: (1) the clever part of the proof is in showing that $k$-HA is in NP, not that it is NP-hard, which is why the reduction is to (and not from) MKP; and (2) we do, however, believe that the problem of deciding whether a given placement is $k$-HA is $\Pi_2^P$-complete.
• IID-IK and the existence of a best heuristic. We propose a simple stochastic model, IID-IK (Independently and Identically Distributed Items and Knapsacks), for MKP that captures the essence of real-world workloads as well as synthetic generators. We define an asymptotic metric, PE, for comparing the quality of different heuristics, and prove that the quality of the best heuristic is computable.
• Water-filling is the best. We discuss the shortcomings of some heuristics used in practice: First-fit, Best-fit, Worst-fit, CSP and the industry leader MTHM. Based on a dataset of cluster workloads from Nutanix, we posit two natural subcases: the doubling world, where VM sizes are powers of 2 (this is indeed the case for Amazon's AWS, where VM sizes are restricted to powers of 2), and the gold-dust world, where VM sizes are much smaller than host capacities. We consider Water-filling, which smooths out the load by definition, and show that it is optimal for the doubling world and near-optimal for the gold-dust world. We also show that Water-filling provides a simple, easily computed criterion for determining the largest $k$ for which a given placement is $k$-HA. Finally, we compare Water-filling and MTHM using IID-IK with distributions instantiated from a large real-world dataset of cluster workloads provided by Nutanix Inc., and conclude that Water-filling is the heuristic of choice among the heuristics in consideration.
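As an illustration, here is a minimal sketch of a Water-filling-style placement (ours, not the dissertation's exact implementation): each VM, largest first, goes to the host with the most free capacity.

```python
def water_fill(vm_sizes, capacities):
    """Water-filling sketch: place VMs largest-first, each on the host
    with the most free capacity at that moment. Returns (placement, free)
    where placement[j] is the host index of VM j (None if it does not fit)
    and free is the remaining capacity per host."""
    free = list(capacities)
    placement = {}
    for j in sorted(range(len(vm_sizes)), key=lambda j: -vm_sizes[j]):
        i = max(range(len(free)), key=lambda i: free[i])  # most free space
        if free[i] >= vm_sizes[j]:
            free[i] -= vm_sizes[j]
            placement[j] = i
        else:
            placement[j] = None
    return placement, free
```

The final `free` vector directly exposes how balanced the placement is; as discussed later in the chapter, the slack Water-filling leaves on each host is what yields a simple criterion for the largest $k$ for which a placement is $k$-HA.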
On a conceptual level, we have shown that deciding the existence of a $k$-HA placement is easier than checking whether a given placement is $k$-HA. We have presented a simple stochastic model for workload generation that is both tractable and useful for comparing heuristics. We have shown, through a variety of arguments, that Water-filling satisfies the customer requirement of load balancing while possessing useful $k$-HA properties.
In this chapter we show that the problem of VM placement reduces to MKP (the Multiple Knapsack Problem), an NP-complete problem [34]. [48] presents an EPTAS for MKP. Our stochastic model IID-IK is inspired by the model for bin packing investigated in [23], where items are drawn from a continuous distribution. Subsequent to [23] there has been much work on distributions with both continuous and discrete supports [72, 50]; [37] is a recent work with several related references. Our model IID-IK assumes a discrete support, which we also refer to as a finite support, since the two coincide on a bounded domain. MTHM is a polynomial-time heuristic for MKP presented in [57] and deployed by some commercial providers of cloud infrastructure.
In Section 3.4 we state the $k$-HA VM placement problem. Section 3.5 proves that $k$-HA placement is NP-complete. We propose our stochastic model IID-IK for multiple knapsack in Section 3.6. Section 3.7 studies the performance of different heuristics and points out that Water-filling is the heuristic of choice. Section 3.8 summarizes the chapter.
3.4 Problem statement
3.4.1 VMPP and VMPP-AC
We start by defining the most basic VM placement problem (VMPP). Let $V$ denote the set of all $n$ VMs and $H$ the set of all $m$ hosts. Each VM $v_j$ requires $s_j$ GB of memory to run. Each host $h_i$ has a total memory capacity of $c_i$ GB configured for VMs. To each VM $v_j$ is attached a profit $p_j$, which reflects the value or reward for placing it. The goal is to place VMs on hosts so as to maximize the sum of the profits of the placed VMs, while satisfying the constraint that for every host the sum of the memory sizes of the VMs placed on it must not exceed its capacity, i.e., $\forall i: \sum_{v_j \in h_i} s_j \le c_i$.
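The capacity constraint is easy to check for a candidate placement; a minimal sketch (names ours):

```python
def is_valid_placement(sizes, capacities, assign):
    """Check the VMPP constraint: for every host i, the total memory of
    the VMs assigned to it must not exceed its capacity c_i.
    assign[j] = host index of VM j, or None if VM j is left unplaced."""
    used = [0] * len(capacities)
    for j, host in enumerate(assign):
        if host is not None:
            used[host] += sizes[j]
    return all(u <= c for u, c in zip(used, capacities))
```

For instance, two VMs of 2 GB and 3 GB on separate 4 GB hosts are valid, but both on the same host are not.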
It is clear that VMPP is just MKP [48, 34] and therefore NP-complete. For the rest of this chapter we will assume that all profits $p_j$ are equal, so the goal of VMPP is to place the maximum number of VMs on the hosts. This special version of VMPP (or MKP) remains NP-complete, in fact strongly NP-complete, since bin packing reduces to it.
At this point the reader might wonder why we connected VMPP to MKP rather than to bin packing. The reason is that in the bin packing problem the bins have uniform capacity, whereas in VMPP hosts can have different capacities, just like knapsacks in MKP. As decision problems, bin packing and MKP with uniform profits have the same complexity, but as optimization problems they differ: in bin packing the items are given and the goal is to minimize the number of bins needed to place all of the items, whereas in MKP both the items and the knapsacks are given and the goal is to maximize the number of placed items without violating the capacity constraint of any knapsack. Thus MKP is in concordance with VMPP, where, too, the VMs and hosts are given.
In the rest of this subsection we consider VMPP-AC (the VM placement problem with affinity constraints). This problem is a digression, in that we do not use VMPP-AC in the rest of the chapter; nevertheless, the placement of uniformly sized VMs with affinity constraints was mentioned in [15] as a problem of undetermined complexity. We resolve its complexity in this subsection.
Three kinds of affinity constraints are mentioned in [15]: (1) VM-VM co-location constraints, e.g. $v_i$ and $v_j$ must always be placed on the same host (in order to minimize communication cost, etc.); (2) VM-VM anti-co-location constraints, e.g. $v_i$ and $v_j$ must always be placed on different hosts (so as to be separate from each other for privacy or security reasons); (3) VM-host affinity constraints, e.g. $v_i$ can only be placed on a specific subset of hosts.
Theorem 6. VMPP-AC with uniform VM sizes is NP-complete.
Proof. We give a reduction from 3DM (3-dimensional matching), which is NP-hard [34].

The input to 3DM is a tripartite graph with each partition having $n$ vertices. The question is whether the edges of the graph can be partitioned into exactly $n$ triangles.

We transform a given instance of 3DM into an instance of VMPP-AC as follows. Let the tripartite graph be $G(V_1, V_2, V_3, E)$, where $V_1, V_2, V_3$ denote the three partitions of vertices and $E$ the edges crossing them. We interpret the three partitions as follows: the set of VMs is $V = V_1 \cup V_2$ and the set of hosts is $H = V_3$. We interpret an edge $(v_s, v_t)$ between $v_s \in V_1$ and $v_t \in V_2$ to mean that $v_s$ and $v_t$ can be placed together. (Note that these constraints are easily expressed using VM-VM anti-co-location constraints.) An edge $(v_s, h_i)$ between $v_s \in V_1 \cup V_2$ and $h_i \in V_3$ means that $v_s$ can be placed on $h_i$. (Note that these constraints are easily expressed using VM-host affinity constraints.) We set the size of each VM to 1 and the capacity of each host to 2. It is easy to see that there exists a partition of the edges of the tripartite graph into $n$ triangles iff all the VMs can be packed.
3.4.2 Exists?k-HA and Is?k-HA
The HA property of a VM placement guarantees that the system can tolerate some level of host failure. Basically, HA should satisfy the following two conditions: (1) VMs on failed hosts should be re-instantiated quickly on other running hosts; (2) VMs originally on those running hosts should not be affected at all, i.e. suspended, migrated, etc.
This adds additional complexity to the VM placement problem. When VMs are initially placed, one has to consider whether there will be enough space on the still-running hosts for the VMs assigned to a failed host.
The $k$-HA property of a VM placement quantifies such fault tolerance. Specifically, a placement is said to have the $k$-HA property if, for all sequences of up to $k$ host failures, there is enough memory space on the running hosts for the VMs from the failed hosts to be re-instantiated (on the running hosts).
We define two decision problems: Exists?k-HA and Is?k-HA. Exists?k-HA is the problem of
deciding whether a given instance of VMPP has any placement with the k-HA property. Is?k-HA
is the problem of deciding whether a given placement has the k-HA property.
Note that VMPP is the same as Exists?0-HA. Thus, we can say that deciding the existence of
a k-HA placement is at least as hard as the original VM placement problem.
3.5 k-HA is NP-complete
In a counter-intuitive result we show that the HA problem can be reduced to multiple knapsack
and hence is NP-complete.
3.5.1 Exists?k-HA is NP-hard
Naively, one would expect Exists?k-HA to lie in the third level of the polynomial hierarchy, $\Sigma_3^P$ [63], because it would seem to require three alternating quantifiers in the following form: there exists a placement such that for all sequences of up to $k$ host failures there exists a rebalancing such that the original placement and the rebalancing strategy are valid. However, we show that it in fact lies in $NP = \Sigma_1^P$.
Theorem 7. For all k, Exists?k-HA reduces to MKP.
Proof. Given $m$ knapsacks and $n$ items, pack the items into the smallest $m - k$ knapsacks, i.e. set aside the $k$ largest-capacity knapsacks and pack the items into the rest. The given system of knapsacks and items is $k$-HA iff this packing is achievable. Clearly, if the system is $k$-HA then it must be packable into the smallest $m - k$ knapsacks, in order to be resilient to the failure scenario in which the $k$ largest knapsacks fail. Conversely, the packing into the $m - k$ smallest knapsacks can tolerate the failure of any $k$ knapsacks: if one of the smaller knapsacks fails, move its items into one of the larger empty knapsacks; there will always be at least one larger empty knapsack, since we started out with $k$ empty knapsacks.
The key point to note is that MKP (or VMPP) is equivalent to Exists?0-HA, hence it directly follows that Exists?k-HA is at least NP-hard; the clever part of the above theorem is in showing that Exists?k-HA is no harder, by reducing it to MKP.
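The reduction in Theorem 7 is constructive: to decide Exists?k-HA, set aside the $k$ largest hosts and try to pack all VMs into the rest. The packing subproblem is itself NP-complete, so the sketch below (ours) decides it by brute force and is meant only for tiny illustrative instances:

```python
from itertools import product

def exists_k_ha(sizes, capacities, k):
    """Theorem 7: a k-HA placement exists iff all VMs pack into the
    smallest m - k hosts. The packing is decided here by exhaustive
    search over all assignments (exponential time), so this sketch is
    only suitable for tiny instances."""
    small = sorted(capacities)[:len(capacities) - k]  # set aside k largest
    for assign in product(range(len(small)), repeat=len(sizes)):
        used = [0] * len(small)
        for j, i in enumerate(assign):
            used[i] += sizes[j]
        if all(u <= c for u, c in zip(used, small)):
            return True
    return False
```

For example, three 2 GB VMs on three 4 GB hosts are 1-HA (everything fits on two hosts), while three 3 GB VMs on the same hosts are 0-HA but not 1-HA.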
Corollary 1. Exists?k-HA is NP-complete.
Note that even though Exists?k-HA suddenly seems to have become easy, we still believe that Is?k-HA remains hard, i.e. $\Pi_2^P$-complete, though we are unable to prove it yet.
3.6 IID-IK and best heuristic
We propose a stochastic model for multiple knapsack that not only captures real-world workloads but also provides a uniform basis for comparing the performance of different polynomial-time heuristics. We prove that, in a fairly general sub-case, there is a best heuristic in our model, i.e. a heuristic with optimal performance.
Having shown that HA reduces to the multiple knapsack problem, which is NP-complete, we understand that we are limited to using polynomial-time heuristics. The question we face is which heuristic to use, i.e. how to measure the performance or efficiency of heuristics. One standard way is to look at their quality of approximation; however, approximation quality is a worst-case measure. We argue that workloads encountered in practice have much more regularity. In fact, based on a large dataset of client workloads drawn from the customer base of one of the largest providers of private cloud infrastructure, we find that workloads can be characterized as independent draws from distributions for knapsacks and items. Inspired by this finding, as well as by an early analysis of bin packing [23], we propose the following natural stochastic model of multiple knapsack:
Items are drawn independently from an item distribution, which specifies both the size and the value of each draw; knapsacks are drawn independently from a knapsack distribution, which specifies the size of each knapsack. We assume that both distributions have finite support, i.e. there are only finitely many different types of items and knapsacks. For simplicity we assume that all items have value 1; our definitions and results easily generalize to the case of arbitrary values.
This model gives us a solid mathematical basis for comparing different heuristics. We now describe the measure $PE(H)$, which captures the asymptotic packing efficiency of a heuristic $H$. Draw $n$ samples from the item distribution; then keep drawing samples one by one from the knapsack distribution; after each knapsack draw, check whether $H$ packs all the items into the drawn collection of knapsacks; let $min_H(n)$ denote the number of drawn knapsacks when $H$ first succeeds in packing all items. Define the finite packing efficiency for $n$:

$$PE(H, n) \triangleq \frac{n}{min_H(n)}$$

Define the packing efficiency of $H$ to be:

$$PE(H) = \liminf_{n \to \infty} PE(H, n)$$

$PE(H)$ is well-defined, since the $\liminf$ exists for every bounded sequence; it is the expected number of items the heuristic packs per knapsack in the asymptotic limit. We also note that the definition of $PE$ is robust: instead of drawing knapsacks until we can pack the $n$ items, we could equivalently define it in terms of the process where we draw items until they no longer fit inside $m$ knapsacks.
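$PE(H, n)$ can be estimated empirically by running exactly the process in the definition; the sketch below (ours) uses first-fit decreasing as the heuristic $H$, with finite-support distributions given as lists of (size, probability) pairs:

```python
import random

def fits_first_fit(items, capacities):
    """Try to pack all items into the given knapsacks with first-fit
    decreasing; return True on success."""
    free = sorted(capacities, reverse=True)
    for s in sorted(items, reverse=True):
        for i, f in enumerate(free):
            if f >= s:
                free[i] = f - s
                break
        else:
            return False
    return True

def packing_efficiency(n, item_dist, knap_dist, rng):
    """Estimate PE(H, n) = n / min_H(n) for H = first-fit decreasing.
    item_dist and knap_dist are finite-support distributions given as
    lists of (size, probability) pairs, as in the IID-IK model."""
    items = rng.choices([s for s, _ in item_dist],
                        [p for _, p in item_dist], k=n)
    knapsacks = []
    while not fits_first_fit(items, knapsacks):  # draw knapsacks one by one
        knapsacks.append(rng.choices([s for s, _ in knap_dist],
                                     [p for _, p in knap_dist], k=1)[0])
    return n / len(knapsacks)
```

As a sanity check, unit-size items with capacity-2 knapsacks give a packing efficiency of exactly 2 items per knapsack.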
Through simulations on simple distributions, we are able to verify the existence of packing efficiencies for Water-filling and MTHM. We are led to the natural question of whether, given the item and knapsack distributions, there exists a best heuristic; surprisingly, we can show that the answer is yes for distributions with finite support, which captures all real-world situations. Informally, the theorem below states that in the stochastic model of multiple knapsack where the distributions have finite support, we can find the best heuristic and its (optimal) packing efficiency.
Theorem 8. There exists a heuristic H that achieves the maximum possible PE in the IID-IK model;
both H and PE(H) are explicitly computable.
Proof. We first give the intuition behind the proof to enable the reader to follow the formal argument
easily.
That the distributions have finite support means there are a finite number of different types of
item sizes and knapsack capacities. Further, by the Central Limit Theorem, as the number of items and knapsacks goes to infinity we get close to the expected proportion of each size (for items) and capacity (for knapsacks). Now consider all possible ways of packing each of these
different knapsacks with different combinations of items - there are only a finite number of different
packed knapsack configurations. (This includes all such configurations, including for example the
empty knapsack.) With one variable per packed knapsack configuration representing its proportion in the final solution, we can write a linear program that maximizes packing efficiency while constraining the proportions of item sizes and knapsack capacities to match the proportions dictated by the Central Limit Theorem. Thus the maximum packing efficiency can be calculated in constant time, since the support sizes are finite.
Formally, let ι and κ index the finite support sets of items and knapsacks respectively. Let the ι'th item be generated in proportion P_ι and let the κ'th knapsack be generated in proportion P_κ. Let w(κ) denote the number of distinct ways that the κ'th knapsack can be packed with items, and let ω be the corresponding index. Let N_{ωκ}(ι) denote the number of copies of item ι in the ω'th packing of the κ'th knapsack. Variable p_{ωκ} is the proportion of the ω'th packing of the κ'th knapsack in the optimal packing. Variable pe is the packing efficiency that we are trying to maximize. Then the solution to the following LP gives the maximum possible PE in the IID-IK model:
    maximize    pe
    subject to  Σ_{ω,κ} p_{ωκ} ( Σ_ι N_{ωκ}(ι) ) = pe
                ∀ι:  Σ_{ω,κ} p_{ωκ} N_{ωκ}(ι) = P_ι · pe
                ∀κ:  Σ_ω p_{ωκ} = P_κ
Since the support set is finite, the above LP calculates the optimal PE in constant time. The optimal heuristic H is also derivable in a straightforward fashion from the values of the p_{ωκ} variables in the optimal solution: given any input, the optimal heuristic is to utilize the ω'th packing of the κ'th knapsack to the extent of (n/PE(H)) · p_{ωκ} copies; any items left over can be given their own individual knapsacks. By Chernoff bounds we are guaranteed that at most O(n^{1/2}) additional knapsacks are used, so that asymptotically the packing efficiency limits to PE, since packing efficiency is a ratio with min_H(n) = Ω(n) in the denominator.
Note that PE(H) is the best packing efficiency achievable when both items and knapsacks
are generated using the stochastic model. However, given an arbitrary collection of items, we can
potentially achieve better packing efficiencies if we are allowed to select the knapsacks. In fact,
using an LP similar to the one above and the fact that fixed dimension integer programming is in P
[73] it is possible to compute the best packing in time polynomial in the number of items.
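The LP above can be assembled and solved mechanically for small finite supports. The sketch below (function names hypothetical; assuming SciPy is available for the solve) enumerates the packed-knapsack configurations and maximizes pe. The LP's first constraint is omitted since it is implied by summing the per-item constraints.

```python
from scipy.optimize import linprog

def enumerate_packings(cap, item_sizes):
    """All ways, as per-type copy counts, to fill capacity `cap` with item types."""
    def rec(i, remaining):
        if i == len(item_sizes):
            yield ()
            return
        for c in range(int(remaining // item_sizes[i]) + 1):
            for rest in rec(i + 1, remaining - c * item_sizes[i]):
                yield (c,) + rest
    return list(rec(0, cap))

def optimal_pe(item_dist, bin_dist):
    """Solve the finite-support LP for the optimal packing efficiency pe.
    item_dist: list of (size, proportion); bin_dist: list of (capacity, proportion)."""
    sizes = [s for s, _ in item_dist]
    packings = []                      # (knapsack type kappa, item counts)
    for kappa, (cap, _) in enumerate(bin_dist):
        for counts in enumerate_packings(cap, sizes):
            packings.append((kappa, counts))
    n_p = len(packings)
    A_eq, b_eq = [], []
    # per item type iota: sum_{omega,kappa} p * N(iota) - P_iota * pe = 0
    for iota, (_, p_iota) in enumerate(item_dist):
        A_eq.append([counts[iota] for _, counts in packings] + [-p_iota])
        b_eq.append(0.0)
    # per knapsack type kappa: sum_omega p = P_kappa
    for kappa, (_, p_kappa) in enumerate(bin_dist):
        A_eq.append([1.0 if k == kappa else 0.0 for k, _ in packings] + [0.0])
        b_eq.append(p_kappa)
    c = [0.0] * n_p + [-1.0]           # maximize pe
    res = linprog(c, A_eq=A_eq, b_eq=b_eq, bounds=[(0, None)] * (n_p + 1))
    return res.x[-1]

pe = optimal_pe([(2, 1.0)], [(4, 1.0)])   # one item type (size 2), one bin type (cap 4)
```

For the single-type example, two items of size 2 fill each bin, so the LP returns pe = 2. If SciPy is unavailable, any LP solver over the same matrices works.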
3.7 Analysis of heuristics
Given the reduction of k-HA to the Multiple Knapsack Problem, k-HA is NP-complete; unless P = NP, it is impossible to compute a k-HA placement in polynomial time. Thus, to solve a placement problem, we consider CSP and the following five polynomial-time heuristics commonly used in industrial practice: First-fit, Best-fit, Worst-fit, MTHM and Water-filling. Each of these five heuristics is a different variant of the greedy algorithm.
In our scenario of VM placement, the hosts are sorted in increasing order of size before being
input into these heuristics. Given a sequence of VMs, First-fit and Best-fit will pack VMs one by
one until all the hosts can hold no more. At each step, First-fit will put the VM into the smallest
indexed host that has enough space, while Best-fit will put the VM into the tightest host that has
enough space. Worst-fit puts VMs one by one into the emptiest host until no additional VM can fit into any of the hosts.
MTHM is more complicated and we will give a more detailed description in the next subsection.
Water-filling first sorts the VMs in decreasing order of size, then puts each VM into the most lightly loaded host that also has enough space for that VM.
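The greedy variants just described can be sketched as follows (function names hypothetical); Worst-fit is essentially Water-filling without the initial sort:

```python
def first_fit(vms, hosts):
    """Place each VM into the lowest-indexed host with room (hosts pre-sorted by size)."""
    residual = list(hosts)
    placed = []
    for v in vms:
        for j, r in enumerate(residual):
            if r >= v:
                residual[j] -= v
                placed.append((v, j))
                break
    return placed

def best_fit(vms, hosts):
    """Place each VM into the tightest host (least residual space) that still fits it."""
    residual = list(hosts)
    placed = []
    for v in vms:
        fits = [j for j, r in enumerate(residual) if r >= v]
        if fits:
            j = min(fits, key=lambda j: residual[j])
            residual[j] -= v
            placed.append((v, j))
    return placed

def water_filling(vms, hosts):
    """Sort VMs by decreasing size; place each into the least-loaded host with room."""
    residual = list(hosts)
    loads = [0] * len(hosts)
    placed = []
    for v in sorted(vms, reverse=True):
        fits = [j for j, r in enumerate(residual) if r >= v]
        if fits:
            j = min(fits, key=lambda j: loads[j])
            residual[j] -= v
            loads[j] += v
            placed.append((v, j))
    return placed
```

On four size-2 VMs and two size-10 hosts, First-fit piles everything onto host 0, while Water-filling alternates hosts and keeps the loads balanced, illustrating the load-balancing contrast discussed next.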
Load-balancing, i.e., ensuring the load is evenly spread among the hosts, is a critical property
desired by cloud clients. It is also a research topic that attracts much attention [55]. However, First-
fit, Best-fit and Worst-fit will overload some hosts with many VMs while underloading other hosts
with too few VMs.
Also, the time to compute a placement is critical, both for the initial placement of a set of VMs and for contingency placement of VMs from a failed host, since cloud providers are contracted to provide service with interruption/delay below some small threshold.
This makes a CSP solver impractical for industrial use: the duration of a placement computation for 32 hosts can be on the order of minutes [15], while industrial requirements demand that the computation take at most on the order of milliseconds.
This leaves us with just two methods - Water-filling and MTHM - to consider. In the following
subsections, we first describe how MTHM works and use a simple example to show the limitation
of MTHM. Then we show that, in the Gold-dust World, Water-filling performs very close to the
optimal. We also develop simple bounds for the level of HA achievable in the Gold-dust World -
these bounds apply to all greedy variants though we state and prove them only for Water-filling. We
further show that in the Doubling World Water-filling actually achieves optimality.
At the end of this section, we evaluate the relative performance of Water-filling and
MTHM and conclude that Water-filling is the heuristic of choice.
3.7.1 MTHM revisited
We review a heuristic for the Multiple Knapsack Problem, MTHM, by Martello and Toth [57]. We first go over the heuristic at a high level, then use a small example to show the limits of MTHM.
3.7.1.1 A high level overview of MTHM
MTHM takes as input (a) a list of items sorted in decreasing order of unit profit (value over weight) and (b) a list of knapsacks sorted in increasing order of size. It outputs an approximately maximum total value of packed items, together with the knapsack into which each of them is packed.
In the case of our VM placement, we assign all items the uniform value 1. This reduces the goal to maximizing the number of VMs that get placed. We then just need to input the VM list sorted in order of increasing size.
The overall procedure of MTHM is:
i) (Initialization): For each knapsack, run the greedy step in Algorithm 4, i.e. place as many items from the sorted list of unassigned items as possible. Label packed items as "assigned", the rest as "unassigned".
ii) (Rearrangement): Clear all items from the knapsacks but retain the "assigned"/"unassigned" labels. Run the cyclic allocation in Algorithm 5 over the assigned items only, relabeling items as necessary. Finally, run the greedy step in Algorithm 4 again over all the knapsacks with the remaining "unassigned" items.
iii) (Swap and fit): For each pair of assigned items, check whether swapping them creates enough space for more items to fit in.
iv) (Delete and fit): For each assigned item, check whether removing it from its knapsack would create space for more items to fit in.
The VMs are considered "packable" if and only if the maximized packed value equals the total value.
Algorithm 4 Greedy packing on one bin in MTHM
Given one knapsack, and a sorted list of unassigned items
1: for each item in the sorted list do
2:   if the size of this item is no greater than the bin's remaining capacity then
3:     Assign this item to the bin.
4:     Remove this item from the unassigned list.
5:     Subtract the size of this item from the bin's capacity.
6:   end if
7: end for
3.7.1.2 One simple counter example
We use the following simple counterexample to show that for certain lists of items, MTHM will return "unpackable", even though the items are truly packable into the given knapsacks.
Consider 2 knapsacks of capacity 100 and a list of items sized [11, 12, 13, 14, 27, 44, 71]. There is a packing solution: items [11, 13, 71] in the first knapsack and items [12, 14, 27, 44] in the second. But MTHM will not find this solution.
Let us trace each step of MTHM.
1. The initialization step labels all the items except 71 as "assigned". The greedy result is: [11, 12, 13, 14, 27] in the first knapsack, [44] in the second knapsack.
2. In the rearrangement step, the cyclic allocation process produces the following: the first knapsack has [44, 14, 12], and the second knapsack has [27, 13, 11]. Then 71 cannot fit into either of the two in the final greedy step.
3. In the swap-and-fit step, the largest possible space increase for the first bin comes from swapping out 44 and swapping in 11. This only increases the space from 30 to 63, which is not enough for 71. Similarly, for the second knapsack, the largest possible space increase comes from swapping out 27 and swapping in 12; this only increases the space from 49 to 64, which is still not enough for 71.
4. The delete-and-fit step simply does not contribute at all towards a packing solution.
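The packability claim behind the counterexample can be double-checked by brute force; the throwaway sketch below (names hypothetical) searches all two-bin splits:

```python
from itertools import combinations

def packable_two_bins(items, cap):
    """Exhaustively test whether `items` can be split into two bins of capacity
    `cap`; returns one feasible first-bin subset, or None."""
    total = sum(items)
    for r in range(len(items) + 1):
        for subset in combinations(items, r):
            if sum(subset) <= cap and total - sum(subset) <= cap:
                return sorted(subset)
    return None

items = [11, 12, 13, 14, 27, 44, 71]
first_bin = packable_two_bins(items, 100)   # some feasible split exists
```

The search may return a different feasible split than the one named in the text (several exist); what matters is that a split exists while MTHM reports failure.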
Algorithm 5 Cyclic allocation in MTHM
Given a sorted list of assigned items
1: s ← 0
2: for each item in the sorted list of assigned items do
3:   Let l be the first knapsack index in {s, s+1, ..., m−1, 0, ..., s−1} such that knapsack l's remaining capacity is no less than this item's size.
4:   if there is no such l then
5:     Relabel this item as "unassigned".
6:   else
7:     Assign this item to knapsack l.
8:     Subtract the size of this item from knapsack l's capacity.
9:     if l + 1 == m then
10:      s ← 0
11:    else
12:      s ← l + 1
13:    end if
14:  end if
15: end for
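A transcription of Algorithm 5 (function name hypothetical; the assigned items are assumed sorted in decreasing size, matching the walkthrough in Section 3.7.1.2):

```python
def cyclic_allocation(assigned_items, capacities):
    """Algorithm 5: round-robin reassignment of previously 'assigned' items.
    Returns (per-bin item lists, items relabeled unassigned)."""
    m = len(capacities)
    residual = list(capacities)
    bins = [[] for _ in range(m)]
    unassigned = []
    s = 0
    for size in assigned_items:        # sorted list of assigned items
        # first bin index in s, s+1, ..., m-1, 0, ..., s-1 with enough room
        l = next((j % m for j in range(s, s + m) if residual[j % m] >= size), None)
        if l is None:
            unassigned.append(size)
        else:
            bins[l].append(size)
            residual[l] -= size
            s = (l + 1) % m
    return bins, unassigned
```

Fed the assigned items of the counterexample, [44, 27, 14, 13, 12, 11], with two bins of capacity 100, it reproduces the allocation [44, 14, 12] / [27, 13, 11] from the walkthrough.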
3.7.2 Gold-dust world: when VM sizes are small
We study the performance of Water-filling when the maximum VM size s_max is much smaller than the minimum host capacity c_min. In particular, we show that the number of items Water-filling allocates is close to the number achieved by OPT, the optimal algorithm.
We show the following two results.
i) For packing, when s_max ≪ c_min, the number of VMs Water-filling can pack is close to the number packed by OPT.
ii) For k-HA, when s_max ≪ c_min, instead of keeping k standby hosts, we can simply keep assigning VMs to hosts by Water-filling until the total remaining space reaches some threshold. We state the corresponding claim and quantify the threshold in Theorem 10.
3.7.2.1 Bounds for packing when using Water-filling
Let there be m knapsacks, the smallest of capacity c_min, and let the largest of the items have size s_max with s_max ≪ c_min. Let the total capacity of all knapsacks be C, so C ≥ m · c_min.
Theorem 9. If N_OPT items can be packed into the bins by OPT, then Water-filling will pack at least (1 − s_max/c_min) · N_OPT items.
Proof. We first show that Water-filling always packs at least N_waterfill ≥ N_OPT − m items. Consider the items left over by Water-filling: the smallest leftover item is bigger than the largest residual space, else greedy would have packed it. But the total volume of the residual space is at least the total volume of the leftover items, because OPT is able to pack everything. So the number of items left over by greedy is at most m (otherwise, the total volume of the residual space would be less than the total volume of the leftover items).
So N_waterfill ≥ N_OPT − m, i.e. N_waterfill/N_OPT ≥ 1 − m/N_OPT, and Water-filling achieves a ratio of at least 1 − m/N_OPT. Now N_OPT ≥ C/s_max. Therefore 1 − m/N_OPT ≥ 1 − m · s_max/C ≥ 1 − m · s_max/(m · c_min) = 1 − s_max/c_min.
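Theorem 9's guarantee can be checked empirically on instances where OPT packs everything by construction (a sketch; names and instance parameters are illustrative):

```python
import random

def water_filling_count(items, caps):
    """Number of items Water-filling places (decreasing size, least-loaded fit)."""
    loads = [0] * len(caps)
    packed = 0
    for s in sorted(items, reverse=True):
        fits = [j for j in range(len(caps)) if caps[j] - loads[j] >= s]
        if fits:
            loads[min(fits, key=lambda j: loads[j])] += s
            packed += 1
    return packed

rng = random.Random(1)
m, c_min, s_max = 20, 1000, 10
caps = [c_min] * m
# build an instance that OPT packs completely: fill each bin exactly with small items
items = []
for _ in range(m):
    left = c_min
    while left > 0:
        s = rng.randint(1, min(s_max, left))
        items.append(s)
        left -= s
n_opt = len(items)                     # OPT packs everything by construction
n_wf = water_filling_count(items, caps)
```

Both the intermediate bound N_waterfill ≥ N_OPT − m and the final ratio (1 − s_max/c_min) hold on such instances.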
3.7.2.2 Bounds for k-HA when using Water-filling
Now, given a set of VMs that is packable into m hosts, what HA level does such a packing achieve? We answer this question with the following mechanism.
Let S denote the total size of all the VMs, and let C_k denote the total capacity of all the hosts except the largest k hosts.
Theorem 10. k-HA is guaranteed if k is the maximum number such that the inequality in Eq. (3.1) holds.

    C_k − S > (m − k)(s_max − 1)    (3.1)
Proof. Given a k, C_k − S is the residual capacity of all the hosts except the largest k hosts when all the VMs are packed into these m − k hosts. Now, if C_k − S > (m − k)(s_max − 1), then by the Pigeonhole Principle there is at least one host whose residual capacity is strictly greater than s_max − 1 (i.e., at least s_max for integer sizes), so one more VM can fit into the residual space.
Now, given a set of m hosts and a sequence of incoming VMs, we can keep allocating VMs to hosts by Water-filling. At each assignment we update all m inequalities; the number of such inequalities that hold gives the HA level.
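The mechanism amounts to a direct evaluation of the m inequalities (function name hypothetical; integer sizes assumed, as in the proof):

```python
def ha_level(host_caps, total_vm_size, s_max):
    """Largest k such that C_k - S > (m - k) * (s_max - 1), where C_k is the
    total capacity excluding the k largest hosts (Theorem 10's sufficient test)."""
    caps = sorted(host_caps)           # ascending: the k largest are at the end
    m = len(caps)
    best = 0
    for k in range(m):
        c_k = sum(caps[: m - k])       # capacity of all but the k largest hosts
        if c_k - total_vm_size > (m - k) * (s_max - 1):
            best = k
    return best
```

For example, three hosts of capacity 10 with total VM load 8 and s_max = 2 satisfy the inequality up to k = 2, guaranteeing 2-HA; with load 28 no inequality holds and no HA level is guaranteed.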
3.7.3 Doubling world: VM sizes of the form 2^i
We show that when the VM sizes are of the form 2^i (i = 0, 1, 2, ...), VMP can be solved exactly in polynomial time. In particular, Water-filling not only finds a feasible packing if one exists but also achieves load balancing.
We first give the intuition for why packability is decidable in polynomial time in the doubling world. We are given a list of n VMs in decreasing order of size, the VM sizes being of the form 2^i (i = 0, 1, 2, ...), e.g. 256GB, 512GB, etc., and a fixed number of k hosts of arbitrary capacity (the capacities need not be of any special form). Consider the first VM in the list; one of the following two cases must happen.
i) No host has space to accommodate this VM. Then clearly this set of VMs cannot be packed/repacked.
ii) One or more hosts can accommodate this VM. Then it does not matter which host we place this VM on, because the remaining VMs can always fully utilize the empty space into which the first could have been put.
We thus claim that, given a set of k hosts, Water-filling exactly decides the packability of a list of VMs whose sizes are powers of 2.
We develop the following definitions and theorems to support the above argument.
Definition 1 (General packing). Any packing that just satisfies the capacity constraint is a general
packing.
Definition 2 (Canonical packing). The packing solution returned by Water-filling is called canonical
packing.
Definition 3 (Packing level of each VM). Let j denote the host to which VM i is assigned by Water-filling, and let l_ij denote the total size of the VMs packed on host j before VM i is placed; l_ij is the packing level of VM i.
Canonical packing has the following necessary and sufficient condition:
Property 1. A general packing is a canonical packing, if and only if for any packed item on any
host, there is no strictly smaller item on another host that has a strictly smaller packing level.
Theorem 11. There is a general packing, if and only if there is a canonical packing.
Proof. "⇐": This direction is trivial, since any canonical packing is a general packing.
"⇒": Order the VMs in each host from largest at the bottom to smallest at the top. Then, across hosts, for any two different hosts s, t, if a larger VM on host s is at a higher packing level than a smaller VM on host t, swapping the two still gives a valid packing. Perform such swaps until no more can be done; the result is a canonical packing.
Corollary 2. If Water-filling decides no canonical packing exists, then there is no general packing.
This guarantees no false negatives when Water-filling returns a non-packable decision.
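A minimal sketch (function name hypothetical) of Water-filling used as an exact packability test in the Doubling World:

```python
def packable_doubling(vm_sizes, host_caps):
    """Water-filling as an exact packability test when all VM sizes are powers
    of two (Doubling World): place VMs in decreasing order of size into the
    least-loaded host with room; any failure means no packing exists."""
    loads = [0] * len(host_caps)
    for s in sorted(vm_sizes, reverse=True):
        fits = [j for j in range(len(host_caps)) if host_caps[j] - loads[j] >= s]
        if not fits:
            return False
        loads[min(fits, key=lambda j: loads[j])] += s
    return True
```

For example, VMs [4, 4, 2, 2, 1] fit into two hosts of capacity 7, while [4, 4, 4] do not (two 4s exceed one host), and Water-filling decides both correctly.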
3.7.4 Performance evaluation
We evaluate Water-filling vs. MTHM under our IID-IK model. Given a host capacity distribution and a VM size distribution, we repeat the random sampling process described in Section 3.6 500 times to obtain the asymptotic metric PE in each setting.
3.7.4.1 A small dataset
First, we show how much worse MTHM performs while Water-filling achieves optimality. We compute PE for Water-filling and MTHM given the host/VM size distribution in Table 3.1.
Table 3.1: One small host/VM size distribution

Host size (GB)  Frequency (%)    VM size (GB)  Frequency (%)
100             100              2             16.66
                                 4             16.66
                                 8             16.66
                                 16            16.66
                                 32            16.66
                                 64            16.66
Table 3.2 shows that MTHM performs ~21% worse than Water-filling, which achieves optimality on this dataset with a small support for the distributions.
3.7.4.2 Nutanix workload & asymptotic metric PE results
In Nutanix’s data center, host-nodes (hosts) are deployed inside clusters. Each cluster contains
a number of hosts. VMs share physical resources on each host.
Table 3.2: Asymptotic metric PE for the small distribution

Number of hosts  PE for Water-filling  PE for MTHM
10               4.84                  3.98
20               4.72                  3.68
30               4.77                  3.73
40               4.74                  3.65
50               4.74                  3.65
60               4.77                  3.68
70               4.77                  3.69
80               4.78                  3.68
90               4.75                  3.69
100              4.76                  3.67
In this snapshot of a typical workload at Nutanix, there are 5499 clusters; 2687 of them have 4 host nodes, and the remaining 2872 have 3 host nodes.
Fig. 3.1 shows the percentage of memory consumed by running VMs on their host nodes. We can see that the majority of memory consumption on those host nodes is under 40%.
Fig. 3.2 shows the host capacity distribution. Fig. 3.3 shows the VM size distribution.
Table 3.3 shows the asymptotic metric PE for the host/VM size distribution from the unmodified Nutanix workload. As Figs. 3.2 and 3.3 show, the VM sizes are much smaller than the host sizes. This makes the asymptotic metric PE of Water-filling slightly better than MTHM's, which also supports our earlier conclusion about the Gold-dust World in Section 3.7.2.
We round VM sizes up to the closest 2^i GB and merge the distribution entries with the same rounded-up sizes, obtaining the rounded-up memory size distribution for VMs in Fig. 3.4.
Table 3.4 presents the asymptotic metric for this setting. Water-filling achieves optimality here by the conclusion about the Doubling World in Section 3.7.3, and it still performs slightly better than MTHM due to the large gap in size between hosts and VMs.
3.7.5 Water-filling packs best
To summarize, Water-filling has the following nice properties.
• Water-filling places VMs into the emptiest host at each step, which leads to the most balanced load distribution among hosts.
• Given lists of VMs and hosts sorted by size, Water-filling runs in Θ(mn) time, while MTHM takes Θ(mn + n^2). In industrial practice, the number of hosts is often a constant.
Table 3.3: Asymptotic metric PE for the original host/VM size distribution from Nutanix

Number of hosts  PE for Water-filling  PE for MTHM
10               23.253                22.905
20               23.996                23.732
30               23.136                22.898
40               23.101                22.855
50               23.798                23.591
60               23.670                23.453
70               23.668                23.462
80               23.484                23.275
90               23.265                23.060
100              23.666                23.454
Table 3.4: Asymptotic metric PE for the power-of-2 distribution

Number of hosts  PE for Water-filling  PE for MTHM
10               21.831                21.831
20               21.730                21.730
30               21.881                21.878
40               21.399                21.398
50               21.485                21.483
60               21.141                21.138
70               21.973                21.971
80               21.407                21.407
90               21.464                21.457
100              21.555                21.551
[Figure 3.1: Distribution of memory usage by running VMs. Histogram; x-axis: memory usage percentage of running VMs (10-point bins, 0-100%); y-axis: frequency (%).]
Thus, Water-filling satisfies the stringent real-time requirements of commercial applications.
• Water-filling is proved to perform very close to optimal in the Gold-dust World, where VM sizes are much smaller than host sizes.
• Also in the Gold-dust World, we develop simple inequalities (Theorem 10) to check the HA level achieved by Water-filling, which makes the computation of a k-HA placement even faster.
• Simulation on an industrial workload from Nutanix shows that Water-filling has a better asymptotic metric than MTHM in our IID-IK model.
In other words, Water-filling is the best.
3.8 Summary
In this chapter, we study the HA problem in VM placement and prove novel complexity-theoretic results, in particular the surprising result that deciding the existence of a k-HA placement is in NP. We leave open the exact complexity of deciding whether a given placement
[Figure 3.2: Distribution of host memory capacity. Histogram; x-axis: host memory capacity (GB, 0-1600); y-axis: frequency (%).]
is k-HA or not. We propose a natural stochastic model - IID-IK (Independently and Identically
Distributed Items and Knapsacks) - which provides a uniform basis for comparing heuristics. We
also show that interestingly there exists a best heuristic in this model whose packing efficiency is
computable. We utilize this model and analyze a variety of heuristics used in practice; we also
consider natural special cases of input distributions that occur in the real world and conclude that
Water-filling is the best.
[Figure 3.3: Distribution of VM memory size. Histogram; x-axis: VM memory size on a log scale (2^i GB, i = 0..9); y-axis: frequency (%).]
[Figure 3.4: Distribution of VM memory size after rounding. Histogram; x-axis: VM memory size on a log scale (2^i GB); y-axis: frequency (%).]
Chapter 4
Load Balancing for Multidimensional Resources
The enterprise customer today has to decide how to distribute her load among multiple clouds
- between on-premise private clouds and public clouds (such as AWS, Azure and Google Cloud).
What is the best way to assign load to different clouds so as to minimize the disruption caused by
routine maintenance? How should services be distributed to take advantage of different network
infrastructure, or to minimize bandwidth or energy costs, in different clouds? How should load be
placed so that the resulting aggregate traffic profile to/from a cloud protects the privacy of end-users?
A defining feature of clouds is their elasticity, or ability to scale with load. Motivated by the aforementioned use cases, we consider the uncapacitated multidimensional load assignment problem VITA(F) (Vectors-In-Total Assignment): the input consists of n d-dimensional load vectors V = {V_i | 1 ≤ i ≤ n} and m cloud buckets B = {B_j | 1 ≤ j ≤ m} with associated weights w_j, and assignment constraints represented by a bipartite graph G = (V ∪ B, E ⊆ V × B) that restricts load V_i to be assigned only to those buckets B_j with which it shares an edge. Here, F can be any operator mapping a vector to a scalar, such as max, min, etc. The goal is to partition the vectors among the buckets, respecting the assignment constraints, so as to minimize

    Σ_j w_j · F( Σ_{V_i ∈ B_j} V_i )

where, in a slight abuse of notation, we let B_j also denote the subset of vectors assigned to bucket B_j. We also let V_i(k) denote the value in the k'th position of the vector V_i.
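As a concrete reading of the objective, the following sketch (names hypothetical) evaluates the VITA(F) cost of a given assignment for any scalarizing operator F:

```python
def vita_cost(vectors, assignment, weights, F):
    """Total VITA(F) cost: sum_j w_j * F(sum of vectors assigned to bucket j).
    `assignment[i]` gives the bucket index of vector i."""
    m, d = len(weights), len(vectors[0])
    totals = [[0.0] * d for _ in range(m)]
    for i, v in enumerate(vectors):
        j = assignment[i]
        for k in range(d):
            totals[j][k] += v[k]
    return sum(w * F(t) for w, t in zip(weights, totals))

vectors = [(3, 1), (1, 4), (2, 2)]
weights = [1.0, 1.0]
cost_max = vita_cost(vectors, [0, 0, 1], weights, max)   # VITA(max)
cost_min = vita_cost(vectors, [0, 0, 1], weights, min)   # VITA(min)
```

With this assignment, bucket 0 totals (4, 5) and bucket 1 totals (2, 2), so VITA(max) costs 5 + 2 = 7 and VITA(min) costs 4 + 2 = 6; operators like max − min or 2nd-max are just different choices of F.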
In this chapter we characterize the complexity of VITA(min) and VITA(max) - providing
approximation algorithms and hardness results. Our approach involves clever rounding of carefully
crafted linear programs and may be of independent technical interest. We also consider VITA(F)
for some additional F such as max−min and 2ndmax.
4.1 Introduction
The launch of EC2 in 2006 by AWS [1] heralded the explosive growth of cloud computing. Cloud computing is an umbrella term for computing as a utility. It enables 24x7 Internet-based access to shared pools of configurable system resources and real-time provisionable higher-level services. Public clouds enable organizations to focus on their core businesses instead of spending time and money on IT infrastructure and maintenance. One of the major benefits of clouds is that they are elastic, i.e. effectively uncapacitated. This allows enterprises to get their applications up and running more quickly, and to rapidly adjust resources to meet fluctuating and unpredictable business demand.
Today, in addition to AWS, Microsoft’s Azure [5] and the Google Cloud [3] are the other
major public cloud platforms. But the advent of multiple clouds means that enterprises are faced
with new questions: How much of their load should they keep on-premise and how much should
they move to public clouds? How should they mix and match the various options to save money
without sacrificing customer satisfaction? A number of enterprise software companies such as HPE [4] and startups such as Turbonomic [7], Datadog [2] and RightScale [6] are beginning to provide software and service solutions to these problems.
At the same time this is also a fertile area for new problems with the potential for clever
theoretical solutions to have practical impact. In this chapter we provide a framework - VITA :
Vectors-In-Total Assignment - that captures a variety of interesting problems in the area of hybrid
clouds with interesting theoretical challenges. We answer some of these challenges with novel
algorithmic solutions and leave others as open questions. In the subsection that follows we list a
few typical use cases captured by the VITA framework.
4.1.1 Motivation and Model
Scenario 1. Minimizing peak pricing: Consider an enterprise customer that has a choice of
several different cloud providers at which to host their VMs (virtual machines). The requirements of
each VM can be characterized along several different resource dimensions such as compute (CPU),
network (latency, bandwidth), storage (memory, disk) and energy. When different virtual machines
are placed in the same elastic resource pool (cloud), their load across each dimension is accrued
additively (though, of course the different dimensions can be scaled suitably to make them compa-
rable). A typical pricing contract will charge based on the most bottle-necked dimension since peak
provisioning is the biggest and most expensive challenge for the resource provider. And different
providers may have different rates based on differing infrastructure and their cost for installation and
maintenance. The natural question then arises - what is the optimal way for the enterprise customer
to distribute the load amongst the different cloud providers so as to minimize total cost?
Scenario 2. Minimizing maintenance downtime: Hosts, and even data centers need to be
powered down every so often for maintenance purposes, e.g. installing a new HVAC system in a
data center. Given this reality, how should the application (collection of virtual machines and/or
containers collectively performing a task or service), be allocated to the different data centers so
as to minimize the aggregate disruption? This scenario also applies to industrial machines where
different factories (or floors of a factory) need to be shut down for periodical maintenance work.
Scenario 3. Preserving privacy: Consider a set of end-users each with its own (hourly)
traffic profile accessing an application. We wish to partition the application components across a
set of clouds such that by observing the (hourly volume of) traffic flow of any single cloud it is
not possible to infer which components are colocated there. This leads to the following question -
how should we distribute load across clouds in order to minimize the maximum hourly variation in
aggregate traffic? As an analogy, the situation here is similar to the problem of grouping households
such that the variation of energy usage of a group is minimized making it difficult for thieves to
infer who has gone on vacation.
Scenario 4. Burstable billing: Most Tier 1 Internet Service Providers (ISPs) use burstable
billing for measuring bandwidth based on peak usage. The typical practice is to measure bandwidth
usage at regular intervals (say 5 minutes) and then use the 95th percentile as a measure of the
sustained flow for which to charge. The 95th percentile method more closely reflects the needed
capacity of the link in question than tracking by other methods such as mean or maximum rate.
The bytes that make up the packets themselves do not actually cost money, but the link and the
infrastructure on either end of the link cost money to set up and support. The top 5% of samples
are ignored as they are considered to represent transient bursts. Burstable billing is commonly used
in peering arrangements between corporate networks. What is the optimal way to distribute load
among a collection of clouds, public and private, so as to minimize the aggregate bandwidth bill?
The above scenarios constitute representative situations captured by the uncapacitated multidimensional load assignment problem framework, VITA. A host of related problems from a variety of contexts can be abstracted and modeled as VITA(F): the input consists of n d-dimensional load vectors V = {V_i | 1 ≤ i ≤ n} and m cloud buckets B = {B_j | 1 ≤ j ≤ m} with associated weights w_j, and assignment constraints represented by a bipartite graph G = (V ∪ B, E ⊆ V × B) that restricts load V_i to be assigned only to those buckets B_j with which it shares an edge. Here, F can be any (projection) operator mapping a vector to a scalar, such as max, min, etc. The goal is to partition the vectors among the buckets, respecting the assignment constraints, so as to minimize

    Σ_j w_j · F( Σ_{V_i ∈ B_j} V_i )

where, in a slight abuse of notation, we let B_j also denote the subset of vectors assigned to bucket B_j. VITA stands for Vectors-In-Total Assignment, capturing the problem essence: vectors assigned to each bucket are totaled. Unless otherwise specified, we use i to index the load vectors, j to index the cloud buckets and k to index the dimension. We let V_i(k) denote the value in the k'th position of the vector V_i.
We now explain how VITA(F) captures the aforementioned scenarios. In general, dimensions represent either categorical entities such as resources (e.g., CPU, I/O, storage) or time periods (e.g., hours of the day or 5-minute intervals). We remind the reader that in each of the scenarios the elasticity of the clouds is a critical ingredient, so that contention between vectors is not the issue. The scenarios we present are but a small sample showcasing the versatility and wide applicability of the VITA framework.
Scenario 1 is captured by having a vector for each VM, with each dimension representing its resource requirement¹; constraints representing placement or affinity requirements [41]; and weights w_j representing the rates at different cloud providers. Then minimizing the sum of prices paid for peak resource usage at each cloud is exactly the problem VITA(max).
In Scenario 2, each dimension represents the resource (say, CPU utilization) consumed by the application in a given time period; e.g., the vector for an application could have 24 dimensions, one for each hour of the day. Once the application is assigned to a data center (or cloud or cluster), disruption is clearly minimized if the maintenance downtime is scheduled in the hour where total resource utilization is minimum. Minimizing the aggregate disruption is then captured by the problem VITA(min).
¹ In general, these requirements vary over time, and if we wish to avoid migration, then we can model the problem by having a dimension for each resource, for each time period, which has the effect of further increasing the dimensionality of the problem.
CHAPTER 4. LOAD BALANCING FOR MULTIDIMENSIONAL RESOURCES
The dimensions in Scenario 3 are the hours of the day and the resource in question is the
traffic. To prevent leakage of privacy through traffic analysis, the goal is to distribute the application
components across clouds so that the range between the peak and the trough of the traffic is minimized. This
problem is exactly represented as VITA(max−min).
In Scenario 4, we have a vector for each application with 20 dimensions, one for each 5th
percentile [67, 66], or ventile, of the billing period.^2 Then minimizing the aggregate bandwidth bill
under the burstable billing method is VITA(2ndmax).
4.1.2 Our results
All the problems we consider are in NP [35]. For VITA(min) and VITA(max) we present our
results as a lattice - see Figs 4.1 and 4.2. For any given F, VITA(F) can be partitioned into a lattice of
4 different problem spaces based on the following 2 criteria: 1. constraints, and 2. dimensionality.
The 4 different problem spaces arise from the Cartesian product: {unconstrained, constrained} ×
{bounded, unbounded}. Unconstrained refers to the situation where there is no bipartite graph
representing constraints, i.e. any load vector may be placed in any bucket. And, Bounded refers to
the situation where each load vector has a fixed dimension (independent of n). It should be clear
that the simplest of the 4 spaces is unconstrained, bounded VITA(F) and the most general is the
constrained, unbounded version of VITA(F). We present our results, algorithms and hardness, for
the different F, in the form of a lattice. In each of the figures, the algorithmic results are displayed
only at the highest possible node in the lattice, since it automatically applies to all nodes in the
downward-closure; similarly, hardness results are presented at the lowest possible node since they
apply to all nodes in the upward-closure. Further, our hardness results use only uniform weights
whereas our algorithmic results work for general weights.
Our results are as follows:
• VITA(F) for F linear. We show that when F is linear then the problem is solvable exactly in
polynomial-time. In particular VITA(avg) is in P.
• VITA(min). Our results are summarized in Fig. 4.1. We show that VITA(min) is inapproximable
when the dimensions are unbounded, i.e. it cannot be approximated to any finite factor.
Since it is inapproximable, we counter-balance this result by providing an O(log n, log n)-bicriteria
approximation algorithm [56]. Our bicriteria algorithm produces an assignment of cost within
O(log n) of the optimal while using no more than O(log n) copies of each bucket. The bicriteria
result, which is based on rounding an LP (linear program) [65], is the technical center-piece and
contains the central ideas used in the other LP-based results in this chapter.

^2 This is a modeling approximation and does not exactly capture 5-minute samples.
• VITA(max). Our results are summarized in Fig. 4.2. Our results for VITA(max) also apply
to VITA(max−min).
• VITA(2ndmax). 2ndmax turns out to be a particularly difficult problem from the standpoint
of characterizing its computational complexity. We consider the unweighted (or uniform
weights) case and the requirement that the number of buckets exceeds the number of
dimensions. With these restrictions we are able to demonstrate an LP-based approximation
algorithm that achieves a logarithmic approximation factor in the constrained case. We
also show that unconstrained, bounded VITA(2ndmax) is weakly NP-hard [35].
Figure 4.1: VITA(min). Unconstrained/Bounded: exact; Constrained/Bounded: NP-hard (strong); Unconstrained/Unbounded: inapproximable; Constrained/Unbounded: O(log n, log n) (bicriteria). The simplest unbounded case is inapproximable, and we give a bicriteria guarantee for the hardest case.
Figure 4.2: VITA(max) and VITA(max−min). Unconstrained cases: exact; Constrained/Bounded: NP-hard (strong); Constrained/Unbounded: Θ(log n). The unconstrained cases are exactly solvable and we have tight logarithmic guarantees for the constrained, unbounded case.
4.1.3 Related Work
There is extensive literature on multidimensional versions of scheduling and packing problems.
[20] is an informative survey that provides a variety of new results for multidimensional generalizations
of three classical packing problems: multiprocessor scheduling, bin packing, and the knapsack
problem. The vector scheduling problem seeks to schedule n d-dimensional tasks on m machines
such that the maximum load over all dimensions and all machines is minimized. [20] provides a
PTAS for the bounded dimensionality case and poly-logarithmic approximations for the unbounded
case, improving upon [42]. For the vector bin packing problem (which seeks to minimize the number
of bins needed to schedule all n tasks such that the maximum load on any dimension across all
bins is bounded by a fixed quantity, say 1), they provide a logarithmic guarantee for the bounded
dimensionality case, improving upon [25]. This result was subsequently further improved by [10]. A
PTAS was provided for the multidimensional knapsack problem in the bounded dimension case by
[32]. The key distinction between the vector scheduling problem of [20] and our framework is that
they seek to minimize the maximum over the buckets and the dimensions whereas (in VITA(max))
we seek to minimize the weighted sum over buckets of the maximum dimension in each bucket. The
multidimensional bin packing and knapsack problems are capacitated whereas this chapter deals with
uncapacitated versions. There has also been a lot of work on geometric multidimensional packing
where each vector is taken to represent a cuboid [22, 11].
There is a fair bit of recent literature substantiating the motivating scenarios we provide in the
introduction (4.1.1) to this chapter. Peak provisioning is an area of active research in the systems
community [21, 75]. Fairness in provisioning multi-dimensional resources is studied in [36]. The
use of CSP (Constraint Satisfaction Programming) in placement has been investigated [41]. Energy
considerations in placement have also been explored [29, 30] [28, 67, 66]. Building scalable systems
that provide some guarantee against traffic analysis is an area of ongoing active research [76, 49, 58].
To the best of our knowledge our formulation is novel - our attempts to find prior literature on
the exact same problems have been futile. This seems surprising since the VITA(F) framework is
simple and natural. One reason could be that the 1-D version is trivial and so may have forestalled
investigation into higher dimensions.
We present our results in the sections that follow. Section 4.2 presents results for linear F. Sec-
tion 4.3 presents our results for VITA(min) while Section 4.4 contains our results for VITA(max).
VITA(2ndmax) results are presented in Section 4.5.
4.2 VITA(F) for linear F
By linear F we mean one of the following two situations:

• F is a vector and F(V) = F · V (where we abuse notation slightly and use F both as a function
and as a vector).

• F is a matrix and the weights w_j are vectors, with · representing an inner product so that w_j · F(V)
is a scalar.
Lemma 1. VITA(F) can be solved exactly in polynomial time for linear F.
Proof. Using the linearity of F, the value of the objective function can be simplified thus:

Σ_j w_j · F( Σ_{V_i ∈ B_j} V_i ) = Σ_j Σ_{V_i ∈ B_j} w_j · F(V_i)

Hence minimizing the value of the objective function is simply a matter of finding, for each load
vector V_i, the feasible bucket j that minimizes w_j · F(V_i).
Corollary 3. VITA(avg) can be computed exactly in polynomial time.
Proof. Set F = ⟨1/d, 1/d, ..., 1/d⟩ where d is the dimension. It is straightforward to see that
F · V = (Σ_k V(k)) / d, the average of V's entries.
Note that many real-world pricing situations are captured by linear F, such as charging separately
for the usage of each resource (dimension).
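The per-vector decomposition in Lemma 1 makes the exact algorithm for linear F a one-pass assignment. A minimal sketch (function names are our own) under the assumption that affinity constraints are given as sets of allowed buckets:

```python
# By linearity, the objective decomposes per vector, so each V_i can be
# assigned independently to the feasible bucket j minimizing w_j * F(V_i).

def solve_linear_vita(vectors, weights, feasible, F):
    """feasible[i]: set of bucket indices vector i may use (affinity)."""
    assignment = []
    for v in vectors:
        j_best = min(feasible[len(assignment)], key=lambda j: weights[j] * F(v))
        assignment.append(j_best)
    return assignment

avg = lambda v: sum(v) / len(v)       # VITA(avg): F = <1/d, ..., 1/d>
vecs = [[2, 4], [6, 0]]
w = [3.0, 1.0]
feas = [{0, 1}, {0}]                  # vector 1 is constrained to bucket 0
print(solve_linear_vita(vecs, w, feas, avg))  # -> [1, 0]
```

Each vector is placed greedily and independently, which is optimal here precisely because F is linear; for non-linear F such as max or min this independence breaks down.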
4.3 VITA(min)
4.3.1 Unconstrained, Bounded - exact
First, we prove two lemmas about the optimal solution which will help us constrain the search
space for our exact algorithm.
Without loss of generality assume that the bucket index j is sorted in order of increasing weight
wj .
Lemma 2. There exists an optimal solution which uses only the first b buckets, for some b ≤ d. Further,
let min(j) be the dimension with the minimum value in bucket j; then, the set {min(j) | 1 ≤ j ≤ b}
has all distinct elements.

Proof. It is clear that if in a solution two buckets have the same dimension with the minimum value
then the bucket with the larger weight can be emptied into the one with the smaller weight without
increasing the value of the objective function. Thus the dimensions with the minimum value must be
distinct across buckets and therefore the optimal solution need have at most d buckets. It is also clear
that if the optimal solution does not involve a bucket j but does involve a bucket j′ > j then all the
items in bucket j′ can be moved to bucket j without increasing the value of the objective function.
Thus the optimal solution may consist only of the first b buckets, for some b ≤ d.
We remind the reader that Vi(k) denotes the value in the k’th position of the vector Vi.
Lemma 3. There exists an optimal solution in which each item i is placed in the bucket j, amongst
the first d buckets, for which w_j · V_i(min(j)) is minimized.

Proof. Suppose not. Let item i be placed in some other bucket j′. If we move it to bucket j then the
value of the objective function changes by −w_{j′} · V_i(min(j′)) + w_j · V_i(min(j)), which by
definition is non-positive. Contradiction, and hence proved.
Algorithm 6 Exact Algorithm for Unconstrained, Bounded VITA(min)
1: for each permutation Π of the first d buckets do
2:   for each load vector V_i do
3:     Place load vector V_i in the bucket j which minimizes w_Π(j) · V_i(min(Π(j)))
4:   end for
5:   Compute the value of the objective function for this permutation
6: end for
7: Output the best value over all permutations and the corresponding assignment

The above two lemmas give rise to a straightforward search, Algorithm 6.
Theorem 12. Unconstrained, Bounded VITA(min) can be computed exactly in time O(m · n · d!).

Proof. The correctness of Algorithm 6 follows from the prior two lemmas. The running time
follows from the fact that the algorithm searches over d! permutations and, for each permutation, it
takes O(m) time to assign each of the n load vectors.
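A runnable sketch of this permutation search follows (names are our own; it assumes, per Lemma 2, that the weights are pre-sorted in increasing order, and it guesses which dimension attains the minimum in each used bucket):

```python
from itertools import permutations

def exact_vita_min(vectors, weights):
    """Exact unconstrained, bounded VITA(min); weights sorted ascending."""
    d = len(vectors[0])
    b = min(d, len(weights))            # at most d buckets needed (Lemma 2)
    best_val, best_assign = float('inf'), None
    # guess which dimension attains the minimum in each of the first b buckets
    for dims in permutations(range(d), b):
        # Lemma 3: each vector goes to the bucket minimizing w_j * V_i(min-dim)
        assign = [min(range(b), key=lambda j: weights[j] * v[dims[j]])
                  for v in vectors]
        # evaluate the true objective for this placement
        val = 0.0
        for j in range(b):
            members = [v for v, a in zip(vectors, assign) if a == j]
            if members:
                totals = [sum(col) for col in zip(*members)]
                val += weights[j] * min(totals)
        if val < best_val:
            best_val, best_assign = val, assign
    return best_val, best_assign

print(exact_vita_min([[5, 1], [1, 5], [2, 2]], [1.0, 2.0]))
```

The d! factor in the running time comes from the outer loop over dimension guesses, so this is practical only for small, fixed d, matching the "bounded" assumption.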
4.3.2 Constrained, Bounded - strongly NP-hard
Theorem 13. Constrained, Bounded VITA(min) is strongly NP-hard.
Proof. The proof is by reduction from Bin Packing [35], which is strongly NP-hard. In an instance
of Bin Packing we are given m bins of the same (constant) size S and a collection of n items a_i
such that Σ_i a_i = m · S, and we need to decide if these n items can be packed into the m bins.

Given the instance of Bin Packing we create m buckets and m + n load vectors of dimension
2. m of the load vectors are of the form ⟨S, 0⟩, and these vectors are matched up with the buckets
so that each such vector is necessarily assigned to its corresponding bucket. Then for each item a_i
there is a load vector ⟨0, a_i⟩; these vectors are unconstrained and can be assigned to any
bucket. All weights are set to 1. Now, it is easy to see that the given instance of Bin Packing is
feasible if and only if the value of the objective function of VITA(min) is m · S.
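The construction in this reduction can be sketched mechanically (the helper name is our own):

```python
# Build the Constrained, Bounded VITA(min) instance from a Bin Packing
# instance (m bins of size S, item sizes `items`), following the proof above.

def bin_packing_to_vita_min(m, S, items):
    vectors, feasible = [], []
    for j in range(m):                    # one pinned vector <S, 0> per bucket
        vectors.append((S, 0))
        feasible.append({j})              # constrained to its own bucket
    for a in items:                       # one free vector <0, a_i> per item
        vectors.append((0, a))
        feasible.append(set(range(m)))    # may go to any bucket
    weights = [1.0] * m
    return vectors, feasible, weights

vectors, feasible, weights = bin_packing_to_vita_min(2, 10, [4, 6, 10])
# The Bin Packing instance is feasible iff the optimal VITA(min) value is m*S.
```

The pinned ⟨S, 0⟩ vectors force every bucket's first dimension to S, so the bucket minimum is governed by how the item vectors fill the second dimension.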
4.3.3 Unconstrained, Unbounded - inapproximable
Theorem 14. Unconstrained, Unbounded VITA(min) is inapproximable unless P = NP .
Proof. The proof is by reduction from Set Cover [35].
In Set Cover we are given a collection of m sets over a universe of n elements and a number
C and we need to decide whether there exists a subcollection of size C that covers all the elements.
We reduce the given instance of Set Cover to Unconstrained, Unbounded VITA(min) as follows:
we let m be the dimension size as well as the number of buckets, one for each set. And, for
each element i, we have an m-dimensional load vector:

V_i(j) = 1 if element i ∈ set j, and ∞ otherwise.

We set the weights of C of the buckets to be 1 and the weights of the remaining buckets to be ∞.
It is easy to see that the value of the objective function for Unconstrained, Unbounded VITA(min)
is C if and only if there exist C sets covering all the elements; otherwise the value of the objective
function is ∞. Thus, Unconstrained, Unbounded VITA(min) cannot be approximated to any
factor.
4.3.4 Constrained, Unbounded - O(log n, log n) bicriteria
Given that the problem is inapproximable (unless P=NP) we relax our expectations and settle
for the next best kind of approximation - a bicriteria approximation, [56] where we relax not just
the objective function but also the constraints. In this particular situation we will find a solution that
uses at most O(log n) copies of each bucket while obtaining an assignment whose value is no worse
than an O(log n) factor worse than the optimal solution which uses at most 1 copy of each bucket.
Consider the following LP (Linear Program). Let y_jk denote the fraction bucket j gives to
dimension k, and x_ijk denote the weight vector i gives to dimension k of bucket j. G_ij (= 1 or 0)
denotes the affinity constraint.

min  Σ_j w_j Σ_i Σ_k x_ijk · v_ik        (min-LP)
s.t. Σ_k y_jk = 1                         ∀ j
     x_ijk ≤ y_jk · G_ij                  ∀ i, j, k
     Σ_j Σ_k x_ijk ≥ 1                    ∀ i
     x_ijk ≥ 0                            ∀ i, j, k
     y_jk ≥ 0                             ∀ j, k
Lemma 4. The above LP is a valid relaxation of Constrained, Unbounded VITA(min).
Proof. Suppose we have a solution of the original problem. Let min(j) be the minimum dimension
of bucket j, and σ(i) be the bucket assigned to load vector i. The value of the objective function
for this solution is Σ_j w_j Σ_{i: σ(i)=j} V_i(min(j)). Now construct the corresponding integer
solution of the LP. Let

y_jk = 1 if k = min(j), and 0 otherwise;
x_ijk = 1 if j = σ(i) and k = min(j), and 0 otherwise.

Because each bucket has only one minimum dimension, the first constraint is satisfied. And
each vector is assigned to one bucket, so the second and third constraints are satisfied also. On the
other hand, given an integer solution of the LP, we can set min(j) = k and σ(i) = j to obtain a valid
solution of the original problem. So there is a one-to-one correspondence between the integer solutions of
the LP and the solutions of the original problem. Furthermore, the objective function of the LP is
the same as the objective function of the original problem. So the optimal integer LP solution must
map to the optimal solution of the original problem, and vice versa.
Let x*_ijk and y*_jk be the optimal solution of the LP. The algorithm is as follows.

Theorem 15. Algorithm 7 is an O(log n, log n) bicriteria approximation algorithm for Constrained,
Unbounded VITA(min).
Proof. Notice that in our algorithm we assume that x*_ijk = y*_jk or x*_ijk = 0. This is not hard to
achieve. Each item orders its favorite bucket-dimension pairs by their y*_jk values, and maximizes
the corresponding x*_ijk values in that order, so at most one x*_ijk value is equal to neither y*_jk
nor 0. If this x*_ijk value is greater than or equal to (1/2) y*_jk, we can round it up to y*_jk; our new
objective value is within twice the LP value. If not, we round it to 0 and double all the previous
non-zero x*_ijk values; then our value is still within twice the LP value. Even if we don't double the
previous x*_ijk values, we still have Σ_{j,k} x*_ijk ≥ 1/2, which we can use to bound the value
output by our algorithm.

The expected value of the solution obtained by the (above randomized) Algorithm 7 is exactly
the same as the optimum value of the LP. The expected number of copies of each bucket we make
Algorithm 7 Bicriteria Approximation for Constrained, Unbounded VITA(min)
1: for each vector i do
2:   Order its bucket-dimension pairs by y*_jk w_j v_ik G_ij values, and maximize the corresponding
     x*_ijk values in that order, so that only one x*_ijk value is neither equal to y*_jk nor 0.
3:   if this x*_ijk value is greater than or equal to (1/2) y*_jk then
4:     round it up to y*_jk
5:   else
6:     round it to 0, and double all the previous non-zero x*_ijk values
7:   end if
8: end for
9: for ln(n/ε) times do
10:   for each dimension k in each bucket j do
11:     With probability y*_jk make a copy of bucket j for dimension k, and assign all the vectors
        with x*_ijk = y*_jk to this bucket.
12:   end for
13: end for
is Σ_k y*_jk = 1. And the probability that vector i is not assigned to any of the buckets is (where
s = m · d):

Π_{j,k} (1 − x*_ijk) ≤ (1 − (Σ_{j,k} x*_ijk) / s)^s = (1 − 1/s)^s ≤ e^{−1}

So, if we repeat for t = ln(n/ε) rounds, then

Pr[some vector is not assigned] ≤ Σ_i Pr[vector i is not assigned] ≤ n / e^t = ε

The expected value of the solution is OPT_LP · ln(n/ε). The expected number of copies of a bucket is
ln(n/ε). Thus Algorithm 7 gives a (log n, log n)-approximation to Constrained, Unbounded VITA(min).
4.4 VITA(max)
Max−Min and Max are very similar, in that for the lower bound we can use the same log-hardness
result (since the min is 0), and for the upper bound we can set the y variable to be greater than
the difference of the two dimensions for every pair of dimensions.
4.4.1 Unconstrained, Unbounded - exact
Theorem 16. Unconstrained, Unbounded VITA(max) can be computed exactly in O(m + n) time
by placing all items into the bucket with the smallest weight.

Proof. We first show that the bucket with the smallest weight is always used in the optimal
solution. If the bucket with the smallest weight is not used, we can move all the items of some used
bucket with a larger weight into the bucket with the smallest weight without worsening the solution.

Now, we show that if we move all the items in the buckets with non-smallest weight to the
bucket with the smallest weight, the objective value of this new solution does not increase. To see
this, let B_0 be the bucket with the smallest weight w_0, and let V_0 be the aggregated vector in B_0.
Let B_i be a bucket with a non-smallest weight w_i in the solution, and let V_i be the aggregated
vector in B_i. It is easy to see that

w_0 · max(V_0 + V_i) ≤ w_0 · (max(V_0) + max(V_i)) ≤ w_0 · max(V_0) + w_i · max(V_i).

Thus, moving all items from B_i to B_0 does not increase the objective value of the current
solution. Hence moving all items to the smallest-weight bucket is optimal.
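Theorem 16's algorithm is a two-line computation; a minimal sketch (function name is our own):

```python
# Unconstrained, unbounded VITA(max): put every vector into the
# smallest-weight bucket; the objective is that weight times the maximum
# coordinate of the aggregate vector.

def vita_max_unconstrained(vectors, weights):
    j0 = min(range(len(weights)), key=lambda j: weights[j])  # smallest weight
    d = len(vectors[0])
    totals = [sum(v[k] for v in vectors) for k in range(d)]  # aggregate vector
    return j0, weights[j0] * max(totals)                     # objective value

j, val = vita_max_unconstrained([[1, 4], [3, 2]], [5.0, 2.0])
print(j, val)  # -> 1 12.0  (bucket 1, objective 2.0 * max(<4, 6>))
```

Summing n vectors of dimension d and scanning m weights gives the stated O(m + n) bound (treating d as a constant).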
4.4.2 Constrained, Bounded - strongly NP-hard
Theorem 17. Constrained, Bounded VITA(max) is strongly NP-complete even when the number of
dimensions equals 2.

Proof. We prove this by reduction from Bin Packing. For k bins with capacity c, we correspondingly
create k buckets. As part of the input vectors, we have k 2-dimensional vectors ⟨c, 0⟩, each
strictly constrained to its own bucket. Then for each item i with size s_i in the Bin Packing
instance, we create a 2-dimensional vector ⟨0, s_i⟩ which can be put into any bucket. We further
let each bucket have a uniform weight of 1. Then there exist k bins that can hold all the items
in the Bin Packing instance if and only if the objective value kc is achievable for this VITA(max)
instance.
4.4.3 Constrained, Unbounded - Θ(log n) approximation
Lemma 5. Constrained, Unbounded VITA(max) is strongly NP-complete, and cannot be approximated
to a factor better than Ω(log n).

Proof. We prove this by reduction from Set Cover. First, we let the number of dimensions of the
input vectors in VITA(max) be the number of elements in the Set Cover instance. For each element
s_i (i = 1, ..., n), we let the vector V_i have value one in dimension i and value zero in
all the other dimensions. Thus, no two element vectors have the value 1 in the same
dimension.

Each subset S_j maps to a bucket B_j. If element s_i ∈ S_j, then V_i can be placed in bucket B_j.
Thus, there exist k subsets that cover all the elements if and only if the objective value k is
achievable for this VITA(max) instance.
Lemma 6. Constrained, Unbounded VITA(max) is O(log n) approximable.

Proof. Consider the following LP. Let x_ij be the fraction of item i assigned to bucket j.

min  Σ_{j=1}^m w_j · y_j                  (max-LP)
s.t. y_j ≥ Σ_{i=1}^n x_ij · v_ik          ∀ j, k
     x_ij ≤ G_ij                          ∀ i, j
     Σ_{j=1}^m x_ij ≥ 1                   ∀ i

It is easy to see that this max-LP is a valid relaxation of Constrained, Unbounded VITA(max).
Then we repeat rounding {x_ij} O(log n) times to make sure that all items are placed in
some bucket with high probability. The proof is similar to that for min-LP.
Directly from Lemmas 5 and 6, we get the following.

Corollary 4. Constrained, Unbounded VITA(max) is Θ(log n) approximable.
4.5 VITA(2ndmax)
We found VITA(2ndmax) to be a qualitatively harder problem and thus were forced to consider
the restricted version where the weights are uniform and the number of buckets exceeds the
(bounded) number of dimensions.
4.5.1 Bounded, Unconstrained - weakly NP-hard
Theorem 18. Bounded, Unconstrained VITA(2ndmax) is weakly NP-hard.
Proof. The proof is by reduction from Partition [35]. In Partition we are given an array of numbers
a_1, a_2, ..., a_n such that Σ_{i=1}^n a_i = 2B, and we are required to decide whether there exists a partition
of these numbers into two subsets such that the sum of the numbers in either subset is B.

Our reduction uses 3 dimensions. For each number we construct a load vector ⟨0, 0, a_i⟩.
We add another two vectors, ⟨L, B, 0⟩ and ⟨B, L, 0⟩ (L ≫ B), to the collection of
vectors. There are two buckets in total; each bucket has 3 dimensions and their weights are the same
(say, c). From our construction, it is easy to see that there is a partition of the array of numbers if
and only if the minimum objective value (measured by the second-maximum dimension value of each
bucket) is 2Bc. Because L ≫ B, we cannot afford to put the vectors ⟨L, B, 0⟩ and ⟨B, L, 0⟩ in the
same bucket, so they are separated into the two buckets. If there is a partition, then the two bucket
totals have the shapes ⟨L, B, B⟩ and ⟨B, L, B⟩. If there is not, then one of the buckets has
a second-maximum dimension value larger than B, which means the objective value must be larger than
2Bc.
4.5.2 Unweighted, Unconstrained, with number of buckets exceeding number of dimensions - O(log n) approximation
Consider the following LP. Let x_ij be the fraction of vector i assigned to bucket j.

min  Σ_{j=1}^m y_j                        (2ndmax-LP)
s.t. y_j ≥ Σ_{i=1}^n x_ij · v_ik          ∀ j, k (j ≠ k)
     Σ_{j=1}^m x_ij ≥ 1                   ∀ i
Lemma 7. The above 2ndmax-LP is a valid relaxation of unconstrained VITA(2ndmax) where
the number of buckets exceeds the number of dimensions.

Proof. First, let us explain the constraints. For each bucket B_j, we call dimension j of B_j its
"taken-out" dimension. The value of y_j for B_j will take the maximum dimension value
of the vector sum of B_j over all of B_j's dimensions except the "taken-out" dimension.

Now, it is easy to see that the value of y_j can only be the maximum dimension value or the 2nd-max
dimension value of B_j's vector sum. Suppose y_j represents the maximum dimension value of
B_j, and let d_j ≠ j denote the dimension of the maximum of B_j, which is different from
the "taken-out" dimension. Then one of the two following cases can occur:

1. Bucket B_{d_j}'s maximum dimension is the same as its "taken-out" dimension d_j. In
this case, simply by moving all the vectors in B_j to B_{d_j}, we achieve a solution
with objective no greater than the previous one; moreover, y_{d_j} now represents the 2nd-max
dimension value of bucket B_{d_j}.

2. Bucket B_{d_j}'s maximum dimension is different from its "taken-out" dimension d_j. In this
case, we claim we can always find a subset of buckets, for each of which the maximum
dimension differs from the "taken-out" dimension, and reassign the "taken-out" dimensions
within this subset. The resulting solution, in which y_j represents the 2nd-max
dimension of each B_j, also achieves an objective value no greater than the original one.

Thus, minimizing over the feasible solutions yields the minimum vector placement based on
the 2nd-max dimension of each bucket.
Theorem 19. Unweighted, Unconstrained VITA(2ndmax) with the number of buckets exceeding the
number of dimensions can be approximated to a factor of O(log n).
Proof. As with the algorithm and proof for min-LP, we need to repeat rounding {xij} O(log n)
times to make sure that all vectors are placed in some bucket with high probability.
4.6 Experiments
In this section, we run numerical experiments to test the performance of VITA(max) and
VITA(min) against three heuristics, using industrial trace data from Nutanix as well as synthesized
data. We first introduce the three heuristics for VITA: (1) conservative vector placement;
(2) greedy-based vector placement; and (3) local-search vector placement.

We set up data sets and test the two following cases:

• When the number of vector dimensions is a constant: in this case, VITA(max) and VITA(min)
each correspond to a real-world application. VITA(max) is useful in minimizing
bottleneck usage; VITA(min) is useful in minimizing downtime.

• When the number of vector dimensions is unbounded: in this case, the performance of VITA
is tested to verify our theoretical results.
4.6.1 Polynomial time heuristics for VITA
4.6.1.1 Heuristic 1 - conservative vector placement
Each vector V_i is placed in the bucket j such that the product F(V_i) · w_j
is the smallest among all the affinity-constrained buckets, where F is max(·) for the max
case, but avg(·) for the min case.
4.6.1.2 Heuristic 2 - greedy-based vector placement
Greedy-based heuristics are often very efficient and also lead to quite good performance [62].
The greedy heuristic for VITA is given in Algorithm 9.
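The greedy placement of Algorithm 9 can be sketched as runnable code (helper names are our own; F is max(·) or min(·) depending on the VITA variant):

```python
import random

def greedy_place(vectors, weights, feasible, F):
    order = list(range(len(vectors)))
    random.shuffle(order)                      # step 1: shuffle the vectors
    d = len(vectors[0])
    totals = [[0.0] * d for _ in weights]
    used = [False] * len(weights)

    def objective():
        return sum(w * F(t) for w, t, u in zip(weights, totals, used) if u)

    assignment = {}
    for i in order:
        best_j, best_val = None, None
        for j in feasible[i]:                  # try each feasible bucket
            old_t, old_u = totals[j][:], used[j]
            for k in range(d):
                totals[j][k] += vectors[i][k]
            used[j] = True
            val = objective()                  # objective if i goes to j
            totals[j], used[j] = old_t, old_u  # undo the trial placement
            if best_val is None or val < best_val:
                best_j, best_val = j, val
        assignment[i] = best_j                 # commit the smallest increase
        for k in range(d):
            totals[best_j][k] += vectors[i][k]
        used[best_j] = True
    return assignment
```

Each vector is committed to the feasible bucket that raises the current objective the least, matching the greedy rule of Algorithm 9.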
4.6.1.3 Heuristic 3 - local search vector placement
Local-search-based vector placement (Algorithm 10) starts from a random feasible placement
and keeps swapping vectors between two different buckets to decrease the total cost. Since the size of the search
Algorithm 8 Heuristic 1 - Conservative
1: for each vector do
2:   Place the vector in the bucket for which the product of the maximum (average) dimension value
     and the bucket weight is smallest, in the max (min) case.
3: end for
Algorithm 9 Heuristic 2 - Greedy
1: Shuffle the order of the vectors;
2: for each vector do
3:   Place the vector in the bucket that raises the current objective value the least;
4: end for
space is exponential in the number of vectors (the size of the input), we only perform such swaps
poly(n) times.
4.6.2 Performance of VITA when vectors have a constant number of dimensions

In this subsection, we evaluate VITA's performance when the input vectors have a constant
number of dimensions. In this setting, we show two real-world cloud-industry applications for
VITA(max) and VITA(min) respectively: (1) minimizing bottleneck usage; (2) minimizing
maintenance downtime.
4.6.2.1 Experiment for minimizing bottleneck usage
We obtained one subset of trace data from Nutanix. It consists of 12,371 time-stamped job requests
over a month. Each request asks for CPU, memory and storage resources, and can be represented as a
3-dimensional vector.

Fig. 4.3, Fig. 4.4 and Fig. 4.5 show the request histograms of VCPU, memory and storage.

As we can see, different dimensions have different units of measure. The industrial practice
in cloud data centers is to equip each physical machine's parts in a balanced way; for
example, there is no reason to pair many CPUs with a small memory, or vice versa. Thus, in order
to make values comparable across dimensions, we scale each resource request value by a factor
of 100 over the maximum possible value of its dimension, obtaining a normalized value between 0
and 100.
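This per-dimension normalization can be sketched as follows (our own helper; here the maximum observed value stands in for the maximum possible value of each dimension):

```python
# Scale each request value to [0, 100] by dividing by the per-dimension
# maximum and multiplying by 100, making dimensions comparable.

def normalize(vectors):
    d = len(vectors[0])
    maxes = [max(v[k] for v in vectors) for k in range(d)]
    return [[100.0 * v[k] / maxes[k] if maxes[k] else 0.0 for k in range(d)]
            for v in vectors]

print(normalize([[2, 50, 400], [4, 25, 100]]))
# -> [[50.0, 100.0, 100.0], [100.0, 50.0, 25.0]]
```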
Algorithm 10 Heuristic 3 - Local search
1: for each vector do
2:   Randomly assign it to a bucket that is feasible under the affinity constraints;
3: end for
4: for 1 to poly(n) steps do
5:   for every pair of buckets do
6:     Swap any pair of vectors if the swap reduces the objective value;
7:   end for
8: end for
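A runnable sketch of this local search (names are our own; it samples random candidate swaps rather than enumerating every bucket pair, one simple way to cap the work at poly(n) steps):

```python
import random

def local_search(vectors, weights, feasible, F, steps=None):
    n = len(vectors)
    steps = steps if steps is not None else n * n      # poly(n) swap attempts
    assign = [random.choice(sorted(feasible[i])) for i in range(n)]

    def objective(a):
        total = 0.0
        for j in range(len(weights)):
            members = [vectors[i] for i in range(n) if a[i] == j]
            if members:
                sums = [sum(col) for col in zip(*members)]
                total += weights[j] * F(sums)
        return total

    cur = objective(assign)
    for _ in range(steps):
        i1, i2 = random.randrange(n), random.randrange(n)
        # a swap is feasible only if each vector may enter the other's bucket
        if assign[i1] != assign[i2] and assign[i2] in feasible[i1] \
                and assign[i1] in feasible[i2]:
            assign[i1], assign[i2] = assign[i2], assign[i1]
            new = objective(assign)
            if new < cur:
                cur = new                               # keep improving swap
            else:
                assign[i1], assign[i2] = assign[i2], assign[i1]  # undo
    return assign, cur
```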
We let the number of buckets range from 2 to 20. For each bucket number k, we sample 5k
vectors from the Nutanix request data and try to place them into these buckets at the minimum
objective value. We randomly generate affinity constraints between the 5k vectors and the k buckets.
Each bucket is randomly assigned an integral weight between 1 and 5.

For each vector count from 10 to 100, we run VITA(max) and the three heuristics on
such input 100 times and report the average objective value. Fig. 4.6 shows the result of this
experiment. For this scenario, VITA(max) performs similarly to greedy and slightly better
than conservative; local search performs worst.
4.6.2.2 Experiment for minimizing maintenance downtime
Instead of representing various kinds of resource requests, each vector in this data set represents
a load profile of one job continuously over time. We sample from 24 hours of request data, and
use the CPU dimension as evidence of activity at each hour. Each bucket represents one integral
physical host facility, i.e. one host cluster, that can be shut down in a certain time slot.

Following the same sampling procedure as before, we obtain Fig. 4.7 for the scenario of
minimizing maintenance downtime. Fig. 4.7 shows that VITA(min) performs the best and conservative
performs the worst. Since VITA(min) is a (log n, log n) bicriteria approximation algorithm, Fig.
4.8 shows the number of used buckets vs. the number of given buckets. Fig. 4.9 shows that, given
the same number of extra buckets, greedy and local search perform better than our bicriteria
VITA(min).
Figure 4.3: Histogram of # of VCPUs requested.
4.6.3 Performance of VITA when vectors have an unbounded number of dimensions

In order to evaluate our algorithms for the case when vectors can have a variable number of
dimensions, we synthesize the data sets for both the VITA(max) and VITA(min) experiments.

We let the number of buckets range from 2 to 20, set the number of dimensions of each
vector equal to the number of buckets, and let the number of vectors be 5 times the number
of buckets.

For each vector, we sample each dimension uniformly from 0 to 100. Then we test
VITA(max)/VITA(min) against the three heuristics. The experimental results are presented as
follows.
4.6.3.1 VITA(max) for vectors with unbounded number of dimensions
Fig. 4.10 shows that VITA(max) performs similarly to greedy and conservative; local search performs
worst.
Figure 4.4: Histogram of memory size requested (GB).
4.6.3.2 VITA(min) for vectors with unbounded number of dimensions
Fig. 4.11 shows how VITA(min) performs in terms of average objective value, at the cost of using
an extra number of buckets as shown in Fig. 4.12, while Fig. 4.13 shows that greedy and conservative
perform better than VITA(min) given the same number of extra buckets.
4.7 Summary
We have proposed a new framework VITA that captures several naturally occurring problems
in the context of hybrid clouds. We presented hardness results and approximation algorithms (using
LP rounding) for several problems. We also presented experimental results comparing our approxi-
mation algorithm to several natural heuristics.
Figure 4.5: Histogram of storage size requested (x-axis: storage size (GB); y-axis: # of requests)
Figure 4.6: Objective value of VITA(max) and three heuristics for minimizing bottleneck usage (x-axis: # of vectors; y-axis: average objective value)
Figure 4.7: Objective value of VITA(min) and three heuristics for minimizing maintenance downtime (x-axis: # of vectors; y-axis: average objective value)
Figure 4.8: # of used buckets vs. # of given buckets for minimizing maintenance downtime
Figure 4.9: Objective value with the same increased number of buckets for minimizing maintenance downtime (x-axis: # of vectors; y-axis: average objective value)
Figure 4.10: Objective value of VITA(max) with unbounded # of dimensions (x-axis: # of vectors; y-axis: average objective value)
Figure 4.11: Objective value of VITA(min) with unbounded # of dimensions (x-axis: # of vectors; y-axis: average objective value)
Figure 4.12: # of used buckets vs. # of given buckets with unbounded # of dimensions
Figure 4.13: Objective value with the same increased number of buckets with unbounded # of dimensions (x-axis: # of vectors; y-axis: average objective value)
Chapter 5
Conclusion
Electricity and computation are two critical resources of the information era. This thesis
studies load balancing problems that aim to use these resources efficiently. Specifically, for
temporal load balancing in the electricity market, fault-tolerant load balancing in data centers,
and load balancing with multidimensional resources, we have the following results.
• Temporal load balancing for electricity market: We have presented a general incentive-
based mechanism, SmartShift, for reducing the load on the electricity grid. Our scheme
grants users increased consumption in exchange for reducing their usage in peak periods. We
have shown analytically that SmartShift under flat-rate pricing is a win-win for both con-
sumers (increased social welfare) and producers (enhanced profits). SmartShift has elements
of algorithms for iterative price setting [59, 46] with the added features of in-kind incentives
and a slot-pairwise bidding language.
• Spatial load balancing for data centers: We study the HA problem in VM placement and
prove novel complexity-theoretic results, particularly the surprising result that the existence
of a k-HA placement is in NP. We leave open the problem of determining the exact complexity
of deciding whether a given placement is k-HA. We propose a natural stochastic model,
IID-IK (Independently and Identically Distributed Items and Knapsacks), which provides
a uniform basis for comparing heuristics. Interestingly, we show that there exists a best
heuristic in this model whose packing efficiency is computable. We use this model to
analyze a variety of heuristics used in practice; we also consider natural special cases of input
distributions that occur in the real world and conclude that Water-filling is the best.
• Load balancing with multidimensional resources: We have proposed a new framework
VITA that captures several naturally occurring problems in the context of hybrid clouds.
We presented hardness results and approximation algorithms (using LP rounding) for several
problems. We also presented experimental results comparing our approximation algorithm to
several natural heuristics.
Our work has a number of limitations, which can also be seen as opportunities for
future work. One natural direction for future investigation is devising distributed and
online algorithms. Our experimental work indicates that VITA(max) performs similarly to
Greedy and Conservative, while VITA(min) outperforms all three heuristics at the cost of
using extra buckets. Our theoretical work leaves some obvious open gaps, including
constrained bounded VITA(min) and VITA(max), and removing the restrictions from our
results for VITA(2ndmax).
Bibliography
[1] Amazon web services - cloud computing services. https://aws.amazon.com/.
[2] Datadog - modern monitoring and analytics. https://www.datadoghq.com/.
[3] Google cloud platform. https://cloud.google.com/.
[4] Hewlett Packard Enterprise - hybrid IT with cloud. https://www.hpe.com/us/en/home.html.
[5] Microsoft Azure cloud computing platform and services. https://azure.microsoft.com/en-us/.
[6] Rightscale. https://www.rightscale.com/.
[7] Turbonomic. https://turbonomic.com/.
[8] Hunt Allcott. Real time pricing and electricity markets. Harvard University, 2009.
[9] Hunt Allcott. Rethinking real-time electricity pricing. Resource and Energy Economics,
33(4):820–842, 2011.
[10] Nikhil Bansal, Alberto Caprara, and Maxim Sviridenko. A new approximation method for
set covering problems, with applications to multidimensional bin packing. SIAM J. Comput.,
39(4):1256–1278, 2009.
[11] Nikhil Bansal and Arindam Khan. Improved approximation algorithm for two-dimensional
bin packing. In Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete
Algorithms, SODA 2014, Portland, Oregon, USA, January 5-7, 2014, pages 13–25, 2014.
[12] Galen L. Barbose, Charles A. Goldman, and Bernie Neenan. A survey of utility experience
with real time pricing. page 127, December 2004.
[13] Sean Barker, Aditya Mishra, David Irwin, Emmanuel Cecchet, Prashant Shenoy, and Jeannie
Albrecht. Smart*: An open data set and tools for enabling research in sustainable homes. In
Proceedings of the 2012 Workshop on Data Mining Applications in Sustainability (SustKDD
2012), Beijing, China, August 12, 2012.
[14] Tamer Basar and Geert Jan Olsder. Dynamic noncooperative game theory, volume 200. SIAM,
1995.
[15] Eyal Bin, Ofer Biran, Odellia Boni, Erez Hadad, Eliot K Kolodner, Yosef Moatti, and Dean H
Lorenz. Guaranteeing high availability goals for virtual machine placement. In Distributed
Computing Systems (ICDCS), 2011 31st International Conference on, pages 700–709. IEEE,
2011.
[16] Severin Borenstein. The trouble with electricity markets: Understanding California’s restruc-
turing disaster. The Journal of Economic Perspectives, 16(1):191–211, 2002.
[17] T. Carpenter, S. Singla, P. Azimzadeh, and S. Keshav. The impact of electricity pricing
schemes on storage adoption in Ontario. In e-Energy, 2012.
[18] Hung-po Chao. Price-responsive demand management for a smart grid world. The Electricity
Journal, 23(1):7–20, 2010.
[19] Hung-po Chao. Competitive electricity markets with consumer subscription service in a smart
grid. Journal of Regulatory Economics, 41(1):155–180, 2012.
[20] C. Chekuri and S. Khanna. On multi-dimensional packing problems. In SODA: ACM-SIAM
Symposium on Discrete Algorithms (A Conference on Theoretical and Experimental Analysis
of Discrete Algorithms), 1999.
[21] Gong Chen, Wenbo He, Jie Liu, Suman Nath, Leonidas Rigas, Lin Xiao, and Feng Zhao.
Energy-aware server provisioning and load dispatching for connection-intensive internet ser-
vices. In Proceedings of the 5th USENIX Symposium on Networked Systems Design and Im-
plementation, NSDI’08, pages 337–350, 2008.
[22] Henrik I. Christensen, Arindam Khan, Sebastian Pokutta, and Prasad Tetali. Multidimensional
bin packing and other related problems: A survey. https://people.math.gatech.edu/~tetali/PUBLIS/CKPT.pdf.
[23] Edward G Coffman, Kimming So, Micha Hofri, and AC Yao. A stochastic model of bin-
packing. Information and Control, 44(2):105–115, 1980.
[24] C. Courcoubetis and R. Weber. Pricing Communication Networks: Economics, Technology
and Modeling. Wiley, 2003.
[25] W. Fernandez de la Vega and G. S. Lueker. Bin packing can be solved within 1 + ε in linear
time. Combinatorica, 1:349–355, 1981.
[26] J. de Vries. The Industrious Revolution: Consumer Behavior and the Household Economy,
1650 to the Present. Cambridge University Press, 2008.
[27] Valeria Di Cosmo, Sean Lyons, and Anne Nolan. Estimating the impact of time-of-use pricing
on Irish electricity demand. MPRA Paper 39971, University Library of Munich, Germany,
July 2012.
[28] Xenofontas A. Dimitropoulos, Paul Hurley, Andreas Kind, and Marc Ph. Stoecklin. On the
95-percentile billing method. In Passive and Active Network Measurement, 10th International
Conference, PAM 2009, Seoul, Korea, April 1-3, 2009. Proceedings, pages 207–216, 2009.
[29] Corentin Dupont, Fabien Hermenier, Thomas Schulze, Robert Basmadjian, Andrey Somov,
and Giovanni Giuliani. Plug4green: A flexible energy-aware VM manager to fit data centre
particularities. Ad Hoc Networks, 25:505–519, 2015.
[30] Corentin Dupont, Thomas Schulze, Giovanni Giuliani, Andrey Somov, and Fabien Hermenier.
An energy aware framework for virtual machine placement in cloud federated data centres. In
e-Energy, page 4. ACM, 2012.
[31] H. Farhangi. The path of the smart grid. IEEE Power and Energy Magazine, 8(1):18–28, 2010.
[32] A.M. Frieze and M. Clarke. Approximation algorithms for the m-dimensional 0-1 knapsack
problem: worst-case and probabilistic analyses. European J. Oper. Res., 15(1):100–109, 1984.
[33] M. R. Garey and D. S. Johnson. Computers and Intractability: A Guide to the Theory of
NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1979.
[34] Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory
of NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1979.
[35] Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory
of NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1979.
[36] Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, and Ion
Stoica. Dominant resource fairness: Fair allocation of multiple resource types. In Proceedings
of the 8th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2011,
Boston, MA, USA, March 30 - April 1, 2011, 2011.
[37] Varun Gupta and Ana Radovanovic. Online stochastic bin packing. CoRR, abs/1211.2687,
2012.
[38] Chris Harris. Electricity Markets: Pricing, Structures and Economics. Wiley, Sussex, England,
2006.
[39] R. Hemecke, M. Koeppe, J. Lee, and R. Weismantel. Nonlinear integer programming. In
M. Juenger, editor, 50 years of Integer Programming 1958-2008: The Early Years and State-
of-the-Art Surveys, pages 561–618. Springer-Verlag, Berlin, 2009.
[40] Fabien Hermenier, Julia Lawall, and Gilles Muller. Btrplace: A flexible consolidation manager
for highly available applications. IEEE Transactions on dependable and Secure Computing,
10(5):273–286, 2013.
[41] Fabien Hermenier, Julia L. Lawall, and Gilles Muller. Btrplace: A flexible consolidation
manager for highly available applications. IEEE Trans. Dependable Sec. Comput., 10(5):273–
286, 2013.
[42] D.S. Hochbaum and D.B. Shmoys. Using dual approximation algorithms for scheduling prob-
lems: theoretical and practical results. Journal of the ACM, 34:144–162, 1987.
[43] MD Ilic, L. Xie, and J.Y. Joo. Efficient coordination of wind power and price-responsive
demand Part I: Theoretical foundations. IEEE Transactions on Power Systems, 26(4):1875–
1884, 2011.
[44] International Energy Agency. The power to choose - enhancing demand response in liberalised
electricity markets findings of IEA demand response project. 2003.
[45] Azadeh Jahanbanifar, Ferhat Khendek, and Maria Toeroe. Providing hardware redundancy
for highly available services in virtualized environments. In Software Security and Reliability,
2014 Eighth International Conference on, pages 40–47. IEEE, 2014.
[46] Shweta Jain, Narayanaswamy Balakrishnan, Yadati Narahari, Saiful A Hussain, and
Nyuk Yoong Voo. Constrained tatonnement for fast and incentive compatible distributed de-
mand management in smart grids. In Proceedings of the fourth international conference on
Future energy systems, pages 125–136. ACM, 2013.
[47] Manar Jammal, Ali Kanso, and Abdallah Shami. Chase: Component high availability-aware
scheduler in cloud computing environment. In Cloud Computing (CLOUD), 2015 IEEE 8th
International Conference on, pages 477–484. IEEE, 2015.
[48] Klaus Jansen. A fast approximation scheme for the multiple knapsack problem. In Inter-
national Conference on Current Trends in Theory and Practice of Computer Science, pages
313–324. Springer, 2012.
[49] Rob Jansen, Kevin S. Bauer, Nicholas Hopper, and Roger Dingledine. Methodically modeling
the tor network. In 5th Workshop on Cyber Security Experimentation and Test, CSET ’12,
Bellevue, WA, USA, August 6, 2012, 2012.
[50] Edward G. Coffman Jr., Costas Courcoubetis, M. R. Garey, David S. Johnson, Lyle A. Mc-
Geoch, Peter W. Shor, Richard R. Weber, and Mihalis Yannakakis. Fundamental discrepancies
between average-case analyses under discrete and continuous distributions: A bin packing case
study. In Proceedings of the 23rd Annual ACM Symposium on Theory of Computing, May 5-8,
1991, New Orleans, Louisiana, USA, pages 230–240, 1991.
[51] S. Keshav and C. Rosenberg. How internet concepts and technologies can help green and
smarten the electrical grid. ACM SIGCOMM Computer Communication Review, 41(1):109–
114, 2011.
[52] N. Li, L. Chen, and S.H. Low. Optimal demand response based on utility maximization in
power networks. In IEEE Power and Energy Society General Meeting, pages 1–8, July 2011.
[53] Zhenhua Liu, Minghong Lin, Adam Wierman, Steven H Low, and Lachlan LH Andrew. Green-
ing geographical load balancing. In Proceedings of the ACM SIGMETRICS joint international
conference on Measurement and modeling of computer systems, pages 233–244. ACM, 2011.
[54] Scott Loveland, Eli M Dow, Frank LeFevre, Duane Beyer, and Phil F Chan. Leverag-
ing virtualization to optimize high-availability system configurations. IBM Systems Journal,
47(4):591–604, 2008.
[55] Siva Theja Maguluri, R Srikant, and Lei Ying. Stochastic models of load balancing and
scheduling in cloud computing clusters. In INFOCOM, 2012 Proceedings IEEE, pages 702–
710. IEEE, 2012.
[56] Madhav V. Marathe, R. Ravi, Ravi Sundaram, S. S. Ravi, Daniel J. Rosenkrantz, and Harry
B. Hunt III. Bicriteria network design problems. J. Algorithms, 28(1):142–171, 1998.
[57] Silvano Martello and Paolo Toth. Heuristic algorithms for the multiple knapsack problem.
Computing, 27(2):93–112, 1981.
[58] Nick Mathewson and Roger Dingledine. Practical traffic analysis: Extending and resisting
statistical disclosure. In Privacy Enhancing Technologies, 4th International Workshop, PET
2004, Toronto, Canada, May 26-28, 2004, Revised Selected Papers, pages 17–34, 2004.
[59] A. Mohsenian-Rad, V.W.S. Wong, J. Jatskevich, R. Schober, and A. Leon-Garcia. Au-
tonomous demand-side management based on game-theoretic energy consumption scheduling
for the future smart grid. IEEE Transactions on Smart Grid, 1(3), 2010.
[60] A-H Mohsenian-Rad and Alberto Leon-Garcia. Optimal residential load control with price
prediction in real-time electricity pricing environments. Smart Grid, IEEE Transactions on,
1(2):120–133, 2010.
[61] Susanta Nanda and Tzi cker Chiueh. A survey of virtualization technologies. Technical report,
StonyBrook University, 2005.
[62] Rina Panigrahy, Kunal Talwar, Lincoln Uyeda, and Udi Wieder. Heuristics for vector bin
packing. research.microsoft.com, 2011.
[63] Christos H. Papadimitriou. Computational complexity. Addison-Wesley, Reading,
Massachusetts, 1994.
[64] Ilia Pietri and Rizos Sakellariou. Mapping virtual machines onto physical machines in cloud
computing: A survey. ACM Comput. Surv., 49(3):49:1–49:30, October 2016.
[65] Prabhakar Raghavan and Clark D. Thompson. Randomized rounding: a technique for provably
good algorithms and algorithmic proofs. Combinatorica, 7(4):365–374, 1987.
[66] Vamseedhar Reddyvari Raja, Amogh Dhamdhere, Alessandra Scicchitano, Srinivas Shakkot-
tai, kc claffy, and Simon Leinen. Volume-based transit pricing: Is 95 the right percentile? In
Passive and Active Measurement - 15th International Conference, PAM 2014, Los Angeles,
CA, USA, March 10-11, 2014, Proceedings, pages 77–87, 2014.
[67] Vamseedhar Reddyvari Raja, Srinivas Shakkottai, Amogh Dhamdhere, and kc claffy. Fair,
flexible and feasible ISP billing. SIGMETRICS Performance Evaluation Review, 42(3):25–28,
2014.
[68] Lei Rao, Xue Liu, Le Xie, and Wenyu Liu. Minimizing electricity cost: Optimization of
distributed internet data centers in a multi-electricity-market environment. In INFOCOM,
2010 Proceedings IEEE, pages 1–9. IEEE, 2010.
[69] M. Roozbehani, Munther Dahleh, and S. Mitter. On the stability of wholesale electricity
markets under real-time pricing. In 49th IEEE Conference on Decision and Control (CDC),
2010, pages 1911–1918, Dec 2010.
[70] Mardavij Roozbehani, Munther A Dahleh, and Sanjoy K Mitter. Volatility of power grids
under real-time pricing. Power Systems, IEEE Transactions on, 27(4):1926–1940, 2012.
[71] Siqi Shen, Alexandru Iosup, Assaf Israel, Walfredo Cirne, Danny Raz, and Dick Epema. An
availability-on-demand mechanism for datacenters. In Cluster, Cloud and Grid Computing
(CCGrid), 2015 15th IEEE/ACM International Symposium on, pages 495–504. IEEE, 2015.
[72] Peter W. Shor. The average-case analysis of some on-line algorithms for bin packing.
Combinatorica, 6(2):179–200, 1986.
[73] Ionica Smeets, Arjen K. Lenstra, Hendrik Lenstra, Laszlo Lovasz, and Peter van Emde Boas.
The history of the LLL algorithm. In The LLL Algorithm - Survey and Applications, pages 1–17.
2010.
[74] VK Sood, D. Fischer, JM Eklund, and T. Brown. Developing a communication infrastructure
for the smart grid. In Electrical Power & Energy Conference (EPEC), 2009 IEEE, pages 1–7.
IEEE, 2009.
[75] Mark Stillwell, Frederic Vivien, and Henri Casanova. Virtual machine resource allocation for
service hosting on heterogeneous distributed platforms. In 26th IEEE International Parallel
and Distributed Processing Symposium, IPDPS 2012, Shanghai, China, May 21-25, 2012,
pages 786–797, 2012.
[76] Jelle van den Hooff, David Lazar, Matei Zaharia, and Nickolai Zeldovich. Vuvuzela: scalable
private messaging resistant to traffic analysis. In Proceedings of the 25th Symposium on Oper-
ating Systems Principles, SOSP 2015, Monterey, CA, USA, October 4-7, 2015, pages 137–152,
2015.
[77] P. Vytelingum, T.D. Voice, S.D. Ramchurn, A. Rogers, and N.R. Jennings. Agent-based micro-
storage management for the smart grid. In AAMAS, pages 39–46, 2010.
[78] D.A. Walker. Walras’s theories of tatonnement. The Journal of Political Economy, 95(4):758–
774, 1987.
[79] Allen J Wood and Bruce F Wollenberg. Power generation, operation, and control. John Wiley
& Sons, 2012.
[80] Song Yang, Philipp Wieder, and Ramin Yahyapour. Reliable virtual machine placement in dis-
tributed clouds. In Resilient Networks Design and Modeling (RNDM), 2016 8th International
Workshop on, pages 267–273. IEEE, 2016.
[81] Ao Zhou, Shangguang Wang, Zibin Zheng, Ching-Hsien Hsu, Michael R Lyu, and Fangchun
Yang. On cloud service reliability enhancement with optimal resource usage. IEEE Transac-
tions on Cloud Computing, 4(4):452–466, 2016.