101011605018

Top-k Selection Queries over Spatial Data: Mapping Strategies and

Performance Evaluation

Supervisor Presented By

Mrs. E.Baby Anitha M.E.,(Ph.D), N.Subhashini,

Assistant Professor. II M.E.(CSE).

1

Introduction

Spatial Mining

Spatial Database

Spatial Object

Spatial Preference Queries

2

Objective

To minimize the number of accesses to the input rankings , the object with the top-k aggregate scores have been identified and also reduces the I/O cost .

3

Literature ReviewYiu,M.L. Dai,X. Mamoulis,N. and Vaitis,M. (2007) ‘Top-k Spatial Preference Queries’, Proc. IEEE Int’l Conf. Data Eng. (ICDE

Object Ranking. Brance and Bound and Feature join algorithm.

Yiu,M.L. Dai,X. Mamoulis,N. and Vaitis,M. (2007) ‘Top-k Spatial Preference Queries’, Proc. IEEE Int’l Conf. Data Eng. (ICDE

Dynamic index structure. Searching and Updating.

4

Yiu,M.L. Dai,X. Mamoulis,N. and Vaitis,M. (2007) ‘Top-k Spatial Preference Queries’, Proc. IEEE Int’l Conf. Data Eng. (ICDE

Extract Geographic knowledge from the web. Query processing. Combination of text and spatial data processing .

Joao B Rocha-Junior;Akrivi Vlachou and Christos Doulkeridis,”Efficient Processing of Top-k Spatial Preference Queries: Proc VLDB Endowment.

Skyline Feature Algorithm. To improve the query processing. To reduce cost of maintenance..

5

Existing System

There is no efficient method for processing spatial

preference query.

Object ranking .

To support users in their exploratory search the

search engines are offering semantic search

suggestions.

Brute force approach.

6

Drawbacks

Ranking is not accurate.

Anyone can easily cheat this ranking system.

It does not consider the Search engine optimization

factors even to determine the ranking.

Brute force method is expensive for large input

data sets.

In spatial database ranking is often associated with

the nearest neighbor retrieval.

7

Proposed System Mapping of pairs of data and feature objects to

the distance score

space.

Minimal subset of pairs is sufficient to answer all

top-k spatial preference queries.

Top k-spatial preference queries, to find the

shortest path and shortest path distance.

Distance between two points defined by their

shortest path distance rather than Euclidean distance.

8

Modules

User interface design Preprocessing Classification Ranking and scoring Clustering results

9

Module DescriptionUser interface design:

The goal of user interface design is to make the user's

interaction as simple and efficient as possible, in terms of

accomplishing user goals.

Preprocessing:

Transform the data into the format that will be more easily

and effectively processed for the purpose of the user.

10

Ranking and score:

Spatial Ranking

Non-Spatial Ranking

Three types of scoring methods are used,

Range score

Influence score

Upper Bound Score

11

Range score and Influence score

12

Cont’d Feature points are labeled by quality values that can be obtained from the rating provider. The location of object dataset D and two feature datasets F1 and F2. The score r(p) is defined in terms of:

1)Maximum quality for each feature.

2)Aggregation of those quality. To find range score and influence score.

13

R-Tree

In order to handle spatial data effectively database

system needs the index mechanism that will help to

retrieve data item quickly.

R-Tree provides dynamic index structure

The object is represented by a Minimum Bounding

rectangle(MBR).

MBR is the smallest rectangle which completely

contains objects.14

R-Tree Example

15

Upper Bound Score

It provides maximum quality of entries. Two reasons for computing the score

1) Low I/O cost

2) Bounds are reasonably high in order to facilitate the effective pruning. Level 1 entries (non-leaf nodes) are used. Because

i)Fewer level1 entries than leaf entries

ii) High level entries cannot provide tight bounds

16

Clustering Results

A customer may want to rank the flats with respect to

the appropriateness of their location, defined after

aggregating the qualities of other features within their

spatial neighborhood.

The neighborhood concept can be specified by the user

via different functions.

It can be an explicit circular region within a given

distance from the flat.

17

Diagram

1818

System RequirementsHardware SpecificationPROCESSOR : Intel Pentium Core 2 Duo 1.8GHzHARD DISK : 160 GBRAM : 2 GB DDR IIMONITOR : 15” Color Monitor

Software Specification OPERATING SYSTEM : Windows XP BACK END : MySql LANGUAGE USED : Java WEB-SERVER : Apache Tomcat 5.0

19

ConclusionA top-k spatial preference queries, provide a novel type

of ranking for spatial objects.

The neighborhood of an object p is captured by scoring

function range score and influence score.

Branch and Bound derives upper bound scores for non-

leaf entries in the object tree.

It is scalable to large data sets.

20

Register New Place DetailsRegister New Place Details

21

Query SearchQuery Search

22

Search ResultSearch Result

23

REFERENCES

[1] Man Lung Yiu, Hua Lu, Member, IEEE, Nikos Mamoulis, and

Michail Vaitis“Ranking Spatial Data by Quality Preferences “,IEEE

transactions on Knowledge and Data Engineering Vol 23 NO 3

MARCH 2011.

[2] M.L. Yiu, X. Dai, N. Mamoulis, and M. Vaitis, “Top-k Spatial

Preference Queries,” Proc. IEEE Int’l Conf. Data Eng. (ICDE), 2007.

[3] N. Bruno, L. Gravano, and A. Marian, “Evaluating Top-k Queries

over Web-Accessible Databases,” Proc. IEEE Int’l Conf. Data Eng.

(ICDE), 2002.2007.

25

Cont’dCont’d

[4] A. Guttman, “R-Trees: A Dynamic Index Structure for Spatial Searching,” Proc. ACM SIGMOD, 1984.

[5] Yen Yu Chen, Suel T., and Markowetz A., “Efficient Query

Processing in Geographic Web Search Engines,” Proc. ACM

SIGMOD, 2006.

[6] Joao B Rocha-Junior;Akrivi Vlachou and Christos

Doulkeridis,”Efficient Processing of Top-k Spatial Preference

Queries: Proc VLDB Endowment.

26

THANK YOUTHANK YOU

27

101011605018

Documents

Transcript of 101011605018