Post on 05-Jan-2016
Load Distribution among Replicated Web Servers:
A QoS-based Approach
Marco Conti, Enrico Gregori, Fabio Panzieri
WISP99
2000.9.14
KAIST EECSD CALab
Hwang In-Chul
2/15
Contents
Introduction Load Distribution Strategies A QoS-Based Architecture Work in Progress Critique
3/15
Introduction(1/2)
A practical approach to the provision of web services– Replicate Web servers(WSs) at distinct sites
– Each client select the “most convenient” WS replica
The success of this approach– To bind dynamically a client to the most convenient
replica
– To maintain data consistency among the WS replicas
4/15
Introduction(2/2)
In this paper– Load distribution strategy
• Mirror-based strategy
• DNS-based strategy
• QoS-based strategy
– To minimize the URT(User Response Time)
5/15
Load Distribution Strategies
Mirror-based strategy– The user manually selects a replica
DNS-based strategy– “Ideal” round-robin assignment of clients to WS
replicas QoS-based strategy
– DNS : all addresses of replica WSs
– Browser selects a replica with satisfactory URT by sending probe
6/15
Load Distribution Strategies - Performance Comparison
Simulation scenario
Inter area network transfer delay
Intra area network transfer delay
Area 1
Area 4 Area 3
Area 2
Area 1Network
Delay
Area 4Network
Delay
Area 3Network
Delay
Area 2Network
Delay
Web Serverreplica 1
Web Serverreplica 2
Web Serverreplica 3
Web Serverreplica 4
Internet
Browsers
Access line to a Web Server
Simulation scenario
7/15
Load Distribution Strategies - Performance Comparison
Simulation environment– Network delay model
• Intra-area delays– The minimum area round trip time
– The queuing delays in the area router
– The packet transmission time
• Inter-area delays– Random variables
– Other factorsConsecutive query Independent and exponential distributed
Each query Access a geometrically distributed number of pages
Web page size Avg. 3000 bytes
Dummy req. 1000 bytes
Server capacity 200 request per second(FIFO queue)
8/15
Load Distribution Strategies - Performance Comparison Impact of intra-area network congestion
– Results• Utilization of each replica
– QoS-based strategy : (0.58, 0.91, 0.92, 0.92)
– Other strategies : uniformly 0.80
Area 1 Routers 0.98 Util.Other Areas Routers Max. 0.8 Util.
9/15
Load Distribution Strategies - Performance Comparison
A heavily loaded area
– Results
Area 1 User-Query Generation 0.98 of Server CapacityOther Areas 0.8 of ServerCapacity
10/15
Load Distribution Strategies - Performance Comparison
Symmetric case– All Areas
• The most congested router : 0.80 utilization
• The user-query generation rate : 0.80 of server capacity
– Results
11/15
Load Distribution Strategies - Performance Comparison
A realistic scenario– Four distinct areas
• USA, Europe, Asia, Australia
– Daily different loads in different periods of time
– Results
12/15
A QoS-Based Architecture
Do not require modification of any software Architecture
13/15
A QoS-Based Architecture
Drawback– URT estimation : Single measure
– Polling overhead
DNS
ReplicatedServer 1
ReplicatedServer 2
ReplicatedServer N
...
Browser
DNS Request All Replica’sIP Address
Probe Request
DNS
ReplicatedServer 1
ReplicatedServer 2
ReplicatedServer N
...
Browser
DNS Request One Replica’sIP Address
Probe Reply
Broadcast Poll Request
Poll Reply
Poll Request
Poll Reply
(All Replica’s IP Address)
14/15
Work in Progress
Load Distribution(LD) service– To overcome the main limitations
– Responsible for distributing the browsers’ requests
– Maintain for each WS replica the WS response time
– Continuous monitoring of the response time
15/15
Critiques
Contribution in this paper– QoS-based approach: Minimize URT
– Load distribution considering network delay Simulation with realistic workload Not Scalable More research on LD
– How to evaluate the accurate WS response time
16/15