DEXA 2013 presentation

Post on 02-Jul-2015

153 views 2 download

Transcript of DEXA 2013 presentation

Effectively Delivering XML Information in Periodic

Broadcast Environments

TU Kaiserslautern, Gottlieb-Daimler-Strasse, Kaiserslautern 67663, Germany

Muntazir Mehdi

1

Outline

• Data Broadcast Context

• Problem in This Work

• Our Approach

• Experimental Results

• Conclusions

2

Data Broadcast Context

• Rapid growth of wireless applications

– Wireless devices (smart phones, pads, etc.)

– Wireless networks

– Information Services(news, stock quotes, airline schedules, weather and traffic information)

Access Information Anywhere Anytime

3

Data Broadcast Context

• Information delivery methods

– Point-to-point access

• Logical channel/link between client and server

– Broadcast

• Data sent to all clients in broadcast area

• Clients select data that they need

4

Data Broadcast Context

5

Data Broadcast Context

• Why broadcast is attractive?

– Scalability: Single broadcast can satisfy all outstanding requests from clients

– Energy efficiency: Mobile clients can switch to doze mode when waiting for interesting data to be broadcasted

6

Data Broadcast Context

• Performance metric

– Access latency: the wait time.

– Energy consumption: the amount of data that clients need to download

7

Data Broadcast Context

• Main Research problems in Data Broadcast

– Scheduling

• To reduce access latency

– Indexing

• To reduce energy consumption

8

Problem in This Work

• How to effectively schedule semi-structured information such as XML data on wireless channels is still a challenge

• We mainly study the scheduling problem of XML data broadcast in periodic environments

9

Related Work

Traditional flat data broadcasts:

– Assume that we know clients' access patterns in advance

– face difficulty when generating data broadcast program based only on flat data itself

10

Our Approach

• Place XML documents on the broadcast channel based only on information at the server side

• Utilize Structural similarity to predict or approximate clients' access patterns

– path sets are used to calculate similarity

11

Our Approach

Path Set PS(d) – { /player/name, /player/position, /player/nationality,

/player/college, /player, /name, /position, /nationality, /college }

12

Our Approach

• Similarity measures

13

Our Approach

•Similarity Measure based on probability

14

Our Approach

•Similarity Measure based on probability suppose D = {d1 , d2 , . . . , dn } on the server, matched probability of any document d in D for a given query q is approximate to:

15

Our Approach

•Similarity Measure based on probability we define Cohesion C(di , dj) of XML documents di and dj as follows:

which can be normalised as

16

Our Approach

• Greedy Data Placement Algorithm (GDPA)

– Places XML documents with most structural sharing together first as an initial broadcast program.

– Progressively appends other XML documents to the broadcast program in a descendant order of structural sharing to the initial documents.

17

Experimental Results

• Workload parameters

18

Experimental Results

19

Experimental Results

20

Experimental Results

21

Conclusions

• We propose to take advantage of the structured characteristics of XML data to generate effective broadcast programs

• Our algorithm is based only on XML data on the server without any knowledge of the clients' access patterns

• Experiments show that our approach can place XML data on air effectively

22