1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different...
-
Upload
antonio-hahn -
Category
Documents
-
view
216 -
download
0
Transcript of 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different...
![Page 1: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/1.jpg)
1
Information Integration
![Page 2: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/2.jpg)
2
Information Resides on Heterogeneous Information Sources
• different interfaces• different data representations• redundant and conflicting information
WWW Excel PersonaldatabaseFlat File
![Page 3: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/3.jpg)
3
Modes of Information Integration
• Federated Databases: the sources are independent, but one source can call on others to supply information
• Data warehouse: copies of data from several sources are stored in a single database, called a data warehouse. The data stored at the warehouse is first processed in some way before storage; e.g. data may be filtered, and relations may be joined or aggregated. As the data is copied from the sources, it may need to be transformed in certain ways to make all data conform to the schema at the data warehouse
![Page 4: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/4.jpg)
4
Modes of Information Integration
• Mediation: a mediator is a software component that supports a virtual database, which the user may query as if it were materialized (physically constructed like a warehouse). The mediator store no data of its own. Rather, it translates the user’s query into one or more queries to its sources. The mediator then synthesizes the answer to the user’s query from the responses of those sources, and returns the answer to the user
![Page 5: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/5.jpg)
5
Problems of Information Integration
Example: The AAAI Automobile Co. has 1000 dealers each of
which maintains a database of their cars in stock. AAAI wants to create an integrated database containing the information of all 1000 sources. The integrated database will help dealers locate a particular model if they don’t have one in stock. It also can be used by corporate analysts to predict the market and adjust production to provide the model most likely to sell
![Page 6: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/6.jpg)
6
Problems of Information Integration
• The 1000 dealers do not all use the same database schema:
Cars (serialNo, model, color, autoTrans, cdPlayer, ...)
or Autos (serialNo, model, color)
Options (serialNo, option)
![Page 7: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/7.jpg)
7
Problems of Information Integration
• Schema difference • Different equivalent names
• Data type differences: numbers may be represented by character strings of varying length at one source and fixed length at another
• Value differences: the same concept may be represented by different constants at different sources (BLACK, BL, 100, etc)
• Semantic differences: Terms can be given different interpretations at different sources (Cars includes trucks or not)
• Missing values: a source may not record information of a type that all of the other sources provide
![Page 8: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/8.jpg)
8
Goal: System Providing Integrated View of Heterogeneous Data
Integration System
WWW Personaldatabase
• collects and combines information• provides integrated view, uniform user interface
ExcelFlat File
![Page 9: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/9.jpg)
9
The Data Warehousing Approach to Integration
Mediator
WrapperWrapper
Client
Excel Flat File
Stored Integrated
View
![Page 10: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/10.jpg)
10
The Data Warehousing Approach to Integration
• Data from several sources is extracted and combined into a global schema
• The data is stored at the warehouse which looks like an ordinary database
• There are three approaches to maintaining the data in the data warehouse:– off-line reconstruction of the whole data warehouse
– the data warehouse is updated periodically based on the changes made to the original data sources
– the data warehouse is updated immediately
![Page 11: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/11.jpg)
11
The Data Warehousing Approach to Integration
Example Suppose that there are two dealers in the system and
that they use the schemas:Cars (serialNo, model,color,autoTrans, cdPlayer, ...)
andAutos (serialNo,model,color)Options (serialNo,option)
Assume a data warehouse with the schema:
AutoWhse(serialNo,model,color,autoTrans, dealer)
![Page 12: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/12.jpg)
12
The Data Warehousing Approach to Integration
• The software to extracts data from the dealer’s databases and populates the global schema can be written as SQL-queries. The query for the first dealer:
insert into AutoWhse(serialNo,model,color,autoTrans, dealer)
select serialNo, model, color, autoTrans, ‘dealer1’from Cars
The code for the second dealer is more complex since we have to decide whether or not a given car has an automatic transmission.
![Page 13: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/13.jpg)
13
The Data Warehousing Approach to Integration
insert into AutoWhse(serialNo,model,color,autoTrans, dealer)
select serialNo, model, color, ‘yes’, ‘dealer2’from Autos, Optionswhere Autos.serialNo=Options.serialNo andoption=‘autoTrans’;insert into AutoWhse(serialNo,model,color,autoTrans,
dealer)select serialNo, model, color, ‘no’ ‘dealer2’from Autoswhere not exists ( select * from Optionswhere serialNo=Autos.serialNo andoption=‘autoTrans’);
![Page 14: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/14.jpg)
14
The Wrapper and Mediator Architecture
Mediator
WrapperWrapper
Client
business reports
portfolios for each company
stock market prices
Excel Flat File
CommonData Model
![Page 15: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/15.jpg)
15
The Wrapper and Mediator Architecture
• A mediator supports a virtual view, or collection of views, that integrates several sources in much the same way that the materialized relation(s) in a data warehouse integrate sources.
• The mediator doesn’t store any data < <<
Example:Let us consider the same scenario. The mediator
integrates the same two data sources into a view that is a single relation with the schema:
AutoMed(serialNo,model,color,autoTrans, dealer)
![Page 16: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/16.jpg)
16
The Wrapper and Mediator Architecture
Assume the user asks the mediator about the red cars:
select serialNo, model from AutosMedwhere color = ‘red’;
The mediator forward the same query to each of the two wrappers
(1) select serialNo, model from Cars where color=‘red’;(2) select serialNo, model from Autos where color=‘red’;
The mediator can take the union of answers and return the result to the user.
![Page 17: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/17.jpg)
17
The Lazy Integration Approach
Mediator
WrapperWrapper
Client
IBM portfolio
IBM price IBM related reports (in common model)
IBM related reports
Excel Flat File
Query Decomposition, Translation and Result Fusion
![Page 18: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/18.jpg)
18
Wrappers in Mediator-Based Systems
• In a data warehouse system, the source extractors consist of:– one or more queries built-in that are executed at the source to
produce data for the data warehouse
– communication mechanisms, so that wrapper can:• pass ad-hoc queries to the source
• receive responses from the source
• pass information to the warehouse
• Mediator systems require more complex wrappers - the wrapper must be able to accept a variety of queries from the mediator and translate any of them to the terms of the source.
![Page 19: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/19.jpg)
19
Wrappers in Mediator-Based Systems
• A systematic way to design a wrapper that connects a mediator to a source is to classify the possible queries that the mediator can ask into templates, which are queries with parameters that represent constants.
• The mediator can provide the constants, and the wrapper executes the query with the given constants.
• T S the template T is turned into the source query S
Example:
The source of dealer1:
Cars (serialNo, model,color,autoTrans, cdPlayer, ...)
![Page 20: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/20.jpg)
20
Wrappers in Mediator-Based Systems
Assume we use the mediator with schema:
AutoMed(serialNo,model,color,autoTrans, dealer)
How the mediator could ask the wrapper for cars of a given color>
The template:
select * from AutoMed where color= ‘$c’;
select serialNo, model color, autoTrans, ‘dealer1’
from Cars where color=‘$c’;
![Page 21: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/21.jpg)
21
Wrappers in Mediator-Based Systems
• The wrapper could have another template that specified the parameter $m representing a model
• there would be 2N templates for N attributes
• the number of templates could grow unreasonably large.
![Page 22: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/22.jpg)
22
Wrapper Generators
• The template defining a wrapper must be turned into code for the wrapper itself - the software that creates the wrapper is called a wrapper generator
• The wrapper generator creates a table that holds the various query patterns contained in the templates, and the source queries that are associated with each.
• A driver is used in each wrapper. The task of the driver is to :– accept a query from the mediator– search the table for a template that match the query– the source query is sent to the source using a communication
mechanism– the response is processed by the wrapper, if necessary, and
then returned to the mediator
![Page 23: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/23.jpg)
23
Mediator
Client
Wrapper
Wrappers & Mediators from High-Level Specifications
Mediator SpecificationInterpreter
WrapperGenerator
Wrapper
WrapperSpecification
MediatorSpecification
Source Source
![Page 24: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/24.jpg)
24
Filters
• Complex templateselect * from AutoMed where color= ‘$c’ and model = ‘$m’; select serialNo, model color, autoTrans, ‘dealer1’ from Cars where color=‘$c’ and model =‘$m’;• Wrapper filter approach - if the wrapper has a
template that returns a superset of what the query wants then it is possible to filter the result at the wrapper
• The decision whether a mediator asks for a subset of what the pattern of some wrapper template returns is a hard problem < < < <>>>>
![Page 25: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/25.jpg)
25
Filters
Example:
Given the templateselect * from AutoMed where color= ‘$c’;
The mediator needs to find blue Gobi model car:select * from AutoMed where color= ‘blue’ and
model=‘Gobi’;• use the template with $c=blue to find all blue cars• store the result in the temporary relation Temp• select from Temp the Gobi’s and return the result
![Page 26: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/26.jpg)
26
Other Wrapper Operations
• It is possible to transform the data at the wrapper in different ways
• The mediator is asked to find dealers and models such that the dealer has two red cars, of the same model, one with and one without automatic transmission. Suppose we have only one template as before.
Select A1.model A1.dealer
from AutoMed A1 AutoMed A2
where A1.model=A2.model and A1.color=‘red’ and
A2.color=‘red’ and A1.autoTrans=‘no’ and
A2.autoTrans=‘yes’;
![Page 27: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/27.jpg)
27
Other Wrapper Operations
• It is possible to answer the query by first obtaining from the Dealer’s 1 source a relation with all the red cars (use the original template) - RedAutos relation
• select distinct A1.model A1.dealer
from RedAutos A1, RedAutos A2
where A1.model=A2.model and
A1.autoTrans=‘no’ and
A2.autoTrans=‘yes’;
![Page 28: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/28.jpg)
28
Challenge: Sources Without a Well-Structured Schema
• semistructured– irregular– deeply nested– cross-referenced
• incomplete schema knowledge– autonomous– dynamic
• HTML pages• SGML documents• genome data• chemical structures• bibliographic
information• results of the
integration process
Examples
![Page 29: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/29.jpg)
29
Challenge: Different and Limited Source Capabilities
Client
Wrapper(A)
Wrapper(B)
Mediator(U = A + B)
retrieve IBM dataretrieve IBM data
retrieve IBM data
![Page 30: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/30.jpg)
30
Mediator has to Adapt to Query Capabilities of Sources
Client
Wrapper(A)
Wrapper(B)
Mediator(U = A + B)
retrieve everything
retrieve IBM data
retrieve IBM data
retrieve IBM data
(A) does notallow selection
![Page 31: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/31.jpg)
31
Part B
• Semistructured Data Representation
• Mediator Generation
• Wrapper Generation
• Capabilities-Based Rewriting
![Page 32: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/32.jpg)
32
Representation of Semistructured Information using OEM
semanticobject-id
label
Atomic Value
Set Value
structuralobject-id
<http://www/~doe, faculty, {&f1,&l1,&r1}> <&f1, first_name, “John”> <&l1, last_name, “Doe”> <&r1, rank, “professor”>
![Page 33: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/33.jpg)
33
Object Exchange Model - Goals
• Easy to read
• Easy to edit
• Easy to generate or parse by a program
• Consistency with Stanford’s other projects (developed with the TSIMMIS’)
• Possibility of extensions in the future
![Page 34: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/34.jpg)
34
Graph Representation of OEM Data
faculty first_name “John” last_name “Doe” rank “professor”
http://www/~doe
<http://www/~doe, faculty, {&f1,&l1,&r1}> <&f1, first_name, “John”> <&l1, last_name, “Doe”> <&r1, rank, “professor”>
![Page 35: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/35.jpg)
35
OEM Structures Represent Arbitrary Labeled Graphs
faculty first_name “John” last_name “Doe” rank “professor”
http://www/~doe
faculty name “Mary Smith” project “Air DB” paper
author name “John Doe”
author name “Mary Smith”
title “Thin Air DB”
http://www/~smith
![Page 36: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/36.jpg)
36
Reprezentacja danych semistrukturalnych
• Object Exchange Model
• ACeDB
• XML
• Mogą być wykorzystywane w warstwie pomiędzy mediatorem a wrapper’ami.
![Page 37: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/37.jpg)
37
Object Exchange Model
• Zdefiniowany przy okazji budowy systemu Tsimmis służącego do integracji heterogenicznych źródeł danych.
• Wykorzystywany przy projekcie Merlin (MQS) i Lorel (QL)
![Page 38: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/38.jpg)
38
Object Exchange Model (cd)
• Węzeł OEM skłąda się z czterech pól: – Object-ID – jest wykorzystywany do unikalnej
identyfikacji określonego węzła OEM– Label – jest ciągiem znaków który opisuje to co
węzeł OEM reprezentuje – Type – jest typem danych wartości węzła. (atomowy
lub kolekcja) – Value – może być albo wartością atomową albo
referencją do kolekcji węzłów OEM
• Jest zwykle reprezentowany jako: <Object-ID Label Type Value>
![Page 39: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/39.jpg)
39
Object Exchange Model – przykład
<&oid1 Notowanie Set {&oid11 &oid12}>
<&oid11 NrNotowania String „4004”>
<&oid12 Rezultaty Set {&oid121 &oid122}>
<&oid121 Miejsce1 Set {&oid1211}>
<&oid1211 Utwor String „metropolis”>
<&oid122 Miejsce2 Set {&oid1221}>
<&oid1221 Utwor String „money”>
<&oid2 Notowanie Set {&oid21 &oid22}>
<&oid21 NrNotowania String „4005”>
<&oid22 Rezultaty Set {&oid221 &oid222}>
<&oid221 Miejsce1 Set {&oid2211}>
<&oid2211 Utwor String „learning to fly”>
![Page 40: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/40.jpg)
40
Object Exchange Model – cechy
• Reprezentowany jako graf z obiektami na wierzchołkach i etykietami na krawędziach.
• Wszystkie wystąpienia są obiektami.
• Każdy obiekt ma swój unikalny identyfikator (oid).
• Rozróżniane są dwa typy obiektów: atomowe i złożone.
![Page 41: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/41.jpg)
41
Object Exchange Model – cechy (cd)
• W OEM występują tzw. nazwy (ang. names), które mogą być traktowane jako aliasy do obiektów wewnątrz bazy danych.
• Nazwa służy jako wskaźnik do bazy danych.
• Każdy obiekt w bazie danych powinien być osiągalny za pomocą nazwy.
![Page 42: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/42.jpg)
42
ACeDB
• ACeDB (A C. elegans Database) była rozwijana jako baza danych informacji genetycznej organizmów.
• Rozwijana od 1989.
• Posiada swój własny język zapytań AQL - Acedb Query Language
• http://www.acedb.org/
![Page 43: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/43.jpg)
43
ACeDB – cechy
• Schemat i dane mogą być traktowane jako drzewo z etykietowanymi krawędziami.
• Krawędzie mogą być etykietowane jakimkolwiek typem podstawowym. (int, Notowanie)np.: array Int unique Int
• Z określonego wierzchołka drzewa danych może wychodzić wiele gałęzi.
• ACeDB pozwala na to aby jakakolwiek etykieta różna od etykiety głównej była pominięta.
• Identyfikatory obiektów wprowadzane są przez użytkownika.
![Page 44: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/44.jpg)
44
ACeDB – cechy (cd)
• ACeDB wymaga schematu
• Mimo to, fakt, że dane mogą być pomijane oraz to, że etykietowane dane są traktowane jednolicie z innymi prostymi typami powoduje, że jest on bardzo bliski semistrukturalnemu modelowi danych.
![Page 45: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/45.jpg)
45
ACeDB – przykład
>Book title UNIQUE Textauthors Textchapters int UNIQUE Textlanguage UNIQUE english
frenchother
date UNIQUE month Intyear Int
&hock2 title ”Computer Simulation Using Particles”authors ”Hockney”
”Eastwood” chapters 1 ”Computer Experiments”
2 ”A One-Dimensional Model”...
language english
![Page 46: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/46.jpg)
46
XML
• Znany i lubiany
![Page 47: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/47.jpg)
47
Różnice pomiędzy XML a OEM
• XML jest uporządkowany. • Etykiety w OEM są wykorzystywane tylko jako
punkt odniesienia oraz do oznaczania zależności pomiędzy obiektami. W XML każdy element nie będący ciągiem tekstowym zawiera identyfikujący go znacznik – etykietę.
• XML nie wspiera bezpośrednio struktury grafu.
![Page 48: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/48.jpg)
48
Overview
• Semistructured Data Representation
• Mediator Generation• Example of mediator specification• Language expressiveness• Implementation and performance
• Wrapper Generation
• Capabilities-Based Rewriting
![Page 49: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/49.jpg)
49
Merge Information Relating to a Faculty
person name “John Doe” birthday “April 1”
s2faculty name “John Doe” rank “professor” papers ...
s1
faculty name “John Doe” rank “professor” birthday “April 1” papers ...
• Schema Integration• Info fusion
![Page 50: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/50.jpg)
50
Mediator Specification Example
person name “John Doe” birthday “April 1”
s2
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
faculty name “John Doe” rank “professor” papers ...
s1
faculty name “John Doe” rank “professor” birthday “April 1” papers ...
![Page 51: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/51.jpg)
51
Mediator Specification Example: Semantics of Rule Bodies
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
person name “John Doe” birthday “April 1”
s2
faculty name “John Doe” rank “professor” birthday “April 1” papers ...
faculty name “John Doe” rank “professor” papers ...
s1
![Page 52: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/52.jpg)
52
Mediator Specification Example: Semantics of Rule Heads
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
person name “John Doe” birthday “April 1”
s2
“John Doe”faculty name “John Doe” rank “professor” birthday “April 1” papers ...
faculty name “John Doe” rank “professor” papers ...
s1
![Page 53: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/53.jpg)
53
Incrementally Add to Semantically Identified Object
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
faculty name “John Doe” rank “professor” papers ...
s1person name “John Doe” birthday “April 1”
s2
“John Doe”faculty name “John Doe” rank “professor” birthday “April 1” papers ...
![Page 54: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/54.jpg)
54
Irregularities & Incomplete Schema Knowledge
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1faculty name “John Doe” rank “professor” papersfaculty name “Mary Smith” project “Air DB”
s1
person name “John Doe” birthday “April 1”
s2
faculty name “John Doe” rank “professor” birthday “April 1” papers faculty name “Mary Smith” project “Air DB”
“John Doe”
“Mary Smith”
![Page 55: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/55.jpg)
55
Second Rule Attaches More Subobjects to View Objects
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
faculty name “John Doe” rank “professor” papers ...
s1
“John Doe”faculty name “John Doe” rank “professor” birthday “April 1” papers ...
person name “John Doe” birthday “April 1”
s2
![Page 56: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/56.jpg)
56
The OEM object structure of the cs wrapper
<&e1, employee, set, {&f1,&l1,&t1,&rep1}> <&f1, first name, string, 'Joe'> <&l1, last name, string, 'Chung'> <&t1, title, string, 'professor'> <&rep1, reports to, string, 'John Hennessy'>
<&e2, employee, set, {&f2,&l2,&t2}> <&f2, first name, string, 'John'> <&l2, last name, string, 'Hennessy'> <&t2, title, string, 'chairman'>
. . .
<&s3, student, set, {&f3,&l3,&y3}> <&f3, first name, string, 'Pierre'> <&l3, last name, string, 'Huyn'> <&y3, year, integer, 3>
. . .
![Page 57: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/57.jpg)
57
The OEM object structure of whois
<&p1, person, set, {&n1,&d1,&rel1,&elm1}> <&n1, name, string, 'Joe Chung'> <&d1, dept, string, 'CS'> <&rel1, relation, string, 'employee'> <&elm1, e_mail, string, 'chung@cs'>
<&p2, person, set, {&n2,&d2,&rel2}> <&n2, name, string, 'Nick Naive'> <&d2, dept, string, 'CS'> <&rel2, relation, string, 'student'> <&y2, year, integer, 3>
...
![Page 58: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/58.jpg)
58
Object exported by med
<&cp1, cs_person, {&mn1,&mrel1,&t1,&rep1,&elm1}> <&mn1, name, string, 'Joe Chung'> <&mrel1, rel, string, 'employee'> <&t1, title, string, 'professor'> <&rep1, reports_to, string, 'John Hennessy'> <&elm1, e_mail, string, 'chung@cs'>
![Page 59: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/59.jpg)
59
Problemy występujące przy tworzeniu Specyfikacji Mediatora
• Schema-domain mismatch
• Schematic discrepancy
• Schema Evolution
• Structure Irregularities
![Page 60: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/60.jpg)
60
(MSL) Rules:
<cs_person {<name N> <rel R> Rest1 Rest2}> : <person {<name N> <dept 'CS'>
<relation R> | Rest1}>@whois AND decomp(N, LN, FN) AND <R {<first name FN>
<last name LN> | Rest2}>@cs
External: decomp(string,string,string)(bound,free,free)
impl by name_to_lnfn decomp(string,string,string)(free,bound,bound)
impl by lnfn_to_name
![Page 61: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/61.jpg)
61
(MSL) Rules:
<cs_person {<name N> <rel R>}> : <person {<name N> <dept 'CS'>
<relation R>}>@whois
<cs_person {<name N> <rel R>}>:- decomp(N, LN, FN)
AND <R {<first name FN> <last name LN>}>@cs
<cs_person {<name N> <rel R> <title T>}>:- ???
![Page 62: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/62.jpg)
62
(MSL) Rules:
<cs_person {<name N> <rel R> <e_mail E>}>
:- ???
??? :- ??? <R {<first_name FN>
<last_name LN> <title E>
<reports_to S>}>@cs
Rewriting:
<cs_person {<name N> <rel R> <e_mail E> <title E>
<reports_to S>}>
:- ???
![Page 63: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/63.jpg)
63
Language Expressiveness
• Information fusion problems solved by MSL– Irregularities– Incomplete knowledge of source structure– Transformation of cross-referenced structures– Inconsistent and redundant data– Use of arbitrary matching criteria
• Theoretical analysis of expressiveness– Consider the relational representation of OEM
graphs. Then MSL is equivalent to “SQL + special form of transitive closure”
![Page 64: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/64.jpg)
64
faculty name “John Doe” rank “associate”
Inconsistent and Redundant Information
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
AND NOT <faculty {<name N> <L V1>}>@s1
person name “John Doe” rank “assistant”
s1 s2
“John Doe”faculty
name “John Doe” rank “associate”
rank “assistant”
![Page 65: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/65.jpg)
65
Overview
• Semistructured Data Representation
• Mediator Generation• Example of mediator specification• Language expressiveness• Implementation and performance
• Wrapper Generation
• Capabilities-Based Rewriting
![Page 66: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/66.jpg)
66
Mediator Specification Interpreter Architecture
Query Rewriter
Cost-Based Optimizer
Datamerge Engine
MediatorSpecification
Query
logical datamergeprogram
plan
Result
Queries toWrappers
Results
![Page 67: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/67.jpg)
67
Query Rewriting When Known Origins of Information
• <N faculty {<salary S>}> :-:- <faculty {<name N> <salary S>}>@s1
• <N faculty {< rank R >}>
:- <person {<name N> <rank R>}>@s2• <well-paid {<name N> <salary X>}>
:- <N faculty {<salary X> <rank assistant>}> AND X>65000
![Page 68: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/68.jpg)
68
Query Rewriter Pushes Conditions to Sources
• <N faculty {<salary S>}> :- :- <faculty {<name N> <salary S>}>@s1 <N faculty {< rank R >}> :- <person {<name N> <rank R>}>@s2
• <well-paid {<name N> <salary X>}> :- <N faculty {<salary X> <rank assistant>}> AND X>65000
• logical datamerge program <well-paid {<name N> <salary X>}> :- (<faculty {<name N> <salary X>}> AND X>65000)@s1 AND <person {<name N> <rank assistant>}>@s2
![Page 69: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/69.jpg)
69
<name N> :- <person {<rank assistant>}>
Passing Bindings & Local Join Plans
Passing Bindings
Local Join
<salary X> :- <faculty {<name $N> <salary X>}> AND X>65000
<name N> :- <person {<rank assistant>}>
<a {<s X> <n N>}>:- <faculty {<name N> <salary X>}> AND X>65000
N
s1 s2
s1 s2
![Page 70: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/70.jpg)
70
Query Decomposition When Unknown Origins of Information
<X faculty {<S Y>}> :- <X faculty {<birthday “1/20”> <S Y>}>
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
![Page 71: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/71.jpg)
71
Plan Considers All Possible Sources of birthday
<X faculty {<S Y>}> :- <X faculty {<birthday “1/20”> <S Y>}>
<N faculty {<L V>}> :- <faculty {<name N> <L V>}>@s1<N faculty {<L V>}> :- <person {<name N> <L V>}>@s2
name
s2s1
name
birthday
birthday
![Page 72: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/72.jpg)
72
Overview
• Semistructured-Data Representation
• Mediator Generation
• Wrapper Generation
• Capabilities-Based Rewriting
![Page 73: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/73.jpg)
73
Query Translation in Wrappers
Source
SELECT * FROM personSELECT * FROM personWHERE name=“Smith”
find -allfind -n Smith
Query TranslatorResult
Translator
Wrapper
![Page 74: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/74.jpg)
74
Rapid Query Translation Using Templates and Actions
Source
SELECT * FROM personSELECT * FROM personWHERE name=“Smith”
find -allfind -n Smith
TemplateInterpreter
ResultTranslator
SELECT * FROM person {emit “find -all” }SELECT * FROM personWHERE name=$N {emit “find -n $N”}
![Page 75: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/75.jpg)
75
Description of Infinite Sets of Supported Queries
• uses recursive nonterminals
• Example:– job description contains word w1 and word w2
and ...– SELECT subset(person) FROM person
WHERE \CJob\CJob : job LIKE $W AND \CJob\CJob : TRUE
![Page 76: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/76.jpg)
76
Overview
• Semistructured-Data Representation
• Mediator Generation
• Wrapper Generation
• Capabilities-Based Rewriting
![Page 77: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/77.jpg)
77
Wrapper Supported Queries
Description
Capabilities-Based Rewriter in Mediator Architecture
Capabilities-Based
Rewriter
QueryRewriter
Cost-BasedOptimizer
DatamergeEngine
logical datamerge program
supportedplans
optimal plan
MediatorSpecification
Wrapper Supported Queries
Description
Query
![Page 78: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/78.jpg)
78
Capabilities-Based Rewriter Finds Supported Plans
Supported Queries
SELECT * FROM AWHERE salary>65000
SELECT * FROM A
![Page 79: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/79.jpg)
79
Capabilities-Based Rewriter Finds Most-Selective Supported Plans
Supported Queries
SELECT * FROM BWHERE salary>65000
SELECT * FROM BSELECT * FROM BWHERE salary >65000
![Page 80: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/80.jpg)
80
Capabilities-Based Rewriter Architecture
Component SubQueryDiscovery
Plan Construction
Plan Refinement
Query CapabilitiesDescription
Component SubQueries
Plans (not fully optimized)
Query
Algebraically optimal plans
![Page 81: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/81.jpg)
81
What TSIMMIS Achieved
• system for integration of heterogeneous sources
• challenges and solutions– semistructured data & incomplete schema
knowledge• appropriate specification language and query processing
algorithms
– limited and different query capabilities• query translation algorithm
• capabilities-based query rewriting algorithm
![Page 82: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/82.jpg)
82
Overview
• TSIMMIS’ goals, technical challenges, and solutions
• Insufficiencies of the TSIMMIS’ framework
• Going forward
![Page 83: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/83.jpg)
83
Insufficiencies of the TSIMMIS framework
• OEM was really unstructured data– some loose and partial schematic info may
pay off tremendously
• too “databasy” user/mediator/source interaction
![Page 84: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/84.jpg)
84
Overview
• TSIMMIS’ goals, technical challenges, and solutions
• Insufficiencies of the TSIMMIS’ framework
• Going forward
![Page 85: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/85.jpg)
85
Web emerges as a Distributed DB and XML as its Data Model
DataSource
Native XMLDatabase
XML ViewDocument(s)
XML ViewDocument(s)
XML ViewDocument(s)
Also export:1. Schemas & Metadata (XML-Data, RDF,…)2. Description of supported queries
Wrapper
LegacySource
XMAS QueryLanguage
![Page 86: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/86.jpg)
86
Definition of Integrated Views
DataSource
DataSource
DataSource
Mediator
XML ViewDocument(s)
Integrated XML View
XML ViewDocument(s)
XML ViewDocument(s)
View Definition inXMAS
![Page 87: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/87.jpg)
87
Non-Materialized Views in the MIX mediator system
Blended Browsing &Querying (BBQ) GUI
Application
DOM for Virtual XML Doc’s
MIX Mediator
XMAS query XML document
DTDInference
IntegratedView DTD
XML Source XML Source
QueryProcessor
View Definition inXMAS
Source DTD
![Page 88: 1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.](https://reader033.fdocuments.us/reader033/viewer/2022061305/55140784550346e7488b4bcd/html5/thumbnails/88.jpg)
88
RDB2XMLWrapper
DTDInference
Resolution
Simplification
Execution
Unfolded Query
Blended Browsing &Querying (BBQ) GUI
MIX MediatorXMAS MediatorView Definition
View DTD
Translation to Algebra
Optimization
XML DocumentFragments
XMAS Query
XMLSource 1
DTD
XMASQuery
XMLDocumentFragments
DOM (VXD) Client API
Application