Ontology Driven Data Mining

9
Ontology Driven Ontology Driven Data Mining Data Mining A.K. Sinha Dept. Of Geo Sciences Virginia Tech Satish Tadepalli Dept. Of Computer Science Virginia Tech

description

Ontology Driven Data Mining. A.K. Sinha Dept. Of Geo Sciences Virginia Tech. Satish Tadepalli Dept. Of Computer Science Virginia Tech. Ontology-Driven Data Mining. Data Mining: Analysis of observational data sets to find unsuspected relationships and to summarize the data in novel ways - PowerPoint PPT Presentation

Transcript of Ontology Driven Data Mining

Page 1: Ontology Driven Data Mining

Ontology Driven Data Ontology Driven Data MiningMining

A.K. SinhaDept. Of Geo SciencesVirginia Tech

Satish TadepalliDept. Of Computer ScienceVirginia Tech

Page 2: Ontology Driven Data Mining

Ontology-Driven Data Ontology-Driven Data MiningMining

Data Mining:Data Mining:– Analysis of observational data sets to find Analysis of observational data sets to find

unsuspected relationships and to summarize unsuspected relationships and to summarize the data in novel waysthe data in novel ways

OntologyOntology– Represents domain knowledgeRepresents domain knowledge– Relationships between concepts in a domainRelationships between concepts in a domain

Ontology-driven data miningOntology-driven data mining– Use the knowledge represented by ontologies Use the knowledge represented by ontologies

to create a hierarchical structure in the datato create a hierarchical structure in the data– Apply data mining techniques on the Apply data mining techniques on the

structured data setsstructured data sets

Page 3: Ontology Driven Data Mining

GeoROC DatabaseGeoROC Database(http://georoc.mpch-(http://georoc.mpch-

mainz.gwdg.de/)mainz.gwdg.de/) GeoROC Data and Present Tectonic

Setting

Page 4: Ontology Driven Data Mining

Broad tectonic classification of GeoROC Data set for applying Data mining Techniques

Classes· Convergent

Margins· Continental

Flood Basalts· Ocean Basin

Flood Basalts· Ocean Island

Groups· Ocean Island

Plateaus· Others

Subclasses(Location-based)· Tonga· New Zealand· Papua New

Guinea· Central America· Others

Attributes (Chemical/Isotope)· SiO2· Al2O3· MnO· Sr87/Sr86· Others

Page 5: Ontology Driven Data Mining

Structuring the data sets based on ontology

Page 6: Ontology Driven Data Mining

Correlation AnalysisCorrelation Analysis

Correlations in Continental Covergent Margins

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

Cascades Andean Both

Si-K

Si-Na2O

Si-Fe

Correlations in Oceanic Convergent Margins

-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

Tonga Mariana Both

Si-K

Si-Na2O

Si-Fe

Page 7: Ontology Driven Data Mining

Classification Using Neural Networks

Present day Plate Tectonic settings and associated data are the key to recognizing paleo-tectonic settings of rocks.

Page 8: Ontology Driven Data Mining

Ongoing ResearchOngoing Research

Data mining of spatial data sets Data mining of spatial data sets using Gaussian processesusing Gaussian processes

Sparse data miningSparse data mining

Page 9: Ontology Driven Data Mining

ConclusionConclusion

Ontology driven data mining Ontology driven data mining – Meaningful patterns at multiple levels of Meaningful patterns at multiple levels of

abstractionabstraction– Multiple views of same data setMultiple views of same data set– Ease in choosing the relevant data sets for Ease in choosing the relevant data sets for

comparisoncomparison