Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana...
-
Upload
debra-reeves -
Category
Documents
-
view
213 -
download
0
Transcript of Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana...
![Page 1: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/1.jpg)
Constructing Data Mining Applications based on Web Services Composition
Ali Shaikh Ali and Omer [email protected], [email protected]
Cardiff Universityhttp://www.cs.cf.ac.uk/
Welsh eScience Centerhttp://www.wesc.ac.uk/
![Page 2: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/2.jpg)
Welsh e-Science CentreAli Shaikh Ali
Agenda…
Software
Demo
Apps
Objectives
Q?
![Page 3: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/3.jpg)
Welsh e-Science CentreAli Shaikh Ali
Objectives Use of Web Services composition – with
distributed services Wrap third party services (Mathematica,
GNUPlot) WEKA Service template Triana Workflow
Services provided by third parties WSDL interfaces (avoid use of specialist
languages – unless really necessary) SOAP-based message exchange
Access to local and remote data sets Support for data streaming
![Page 4: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/4.jpg)
Welsh e-Science CentreAli Shaikh Ali
Origin: Gravitational Wave data analysis(GEO-LIGO efforts)
![Page 5: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/5.jpg)
Welsh e-Science CentreAli Shaikh Ali
http://www.GridLab.org/
GAT
GAP Interface
Gridlab ServicesJXTAServe P2PS WServe
JXTA Sockets Web Services OGSA + Services
![Page 6: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/6.jpg)
Welsh e-Science CentreAli Shaikh Ali
Software
trianacode.org
An open source Problem Solving Environment developed at Cardiff
Triana includes a large library of pre-written analysis tools and the ability for users to easily integrate their own tools.
Supports discovery of Web Services based on syntax (hardwired UDDI registries)
www.cs.waikato.ac.nz/ml/weka/ Collection of machine learning algorithms Contains tools for
data pre-processing, classification, regression, clustering, association rules
Accepts ARFF (Attribute-Relation File Format) file format -- an ASCII text file that describes a list of instances sharing a set of attributes.
Related work: Grid WEKA(University College Dublin)
![Page 7: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/7.jpg)
Welsh e-Science CentreAli Shaikh Ali
WEKA Algorithms
Classifiers Algorithms Bayes (8, eg. Naïve
Bayes) Functions (12, eg.
Neural Networks) Lazy (5) Meta (23, eg. Bagging,
Multiclass Classifer) Trees (10, eg. ID3) Rules (10, eg.
Conjunctive Rule) Misc (3)
Clustering Algorithms (5, e.g. K-means)
Association Rules (2, e.g Apriori)
Data Processing Filters
Attribute Selection Attribute Evaluator
(12, eg. Principle Components)
Attribute Search (8, eg. Genetic Algorithm)
![Page 8: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/8.jpg)
Welsh e-Science CentreAli Shaikh Ali
Provenance Issues
EU FP6 Provenance project (2004—2006) IBM Hursley (lead), SZTAKI, Southampton
University, DLR/German Aerospace, UPC
http://www.gridprovenance.org/
EPSRC Provenance (2004—2007) University of Southampton (lead)
http://www.pasoa.org/
![Page 9: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/9.jpg)
Welsh e-Science CentreAli Shaikh Ali
DEMO
![Page 10: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/10.jpg)
Welsh e-Science CentreAli Shaikh Ali
Inside the Data Mining Toolbox
Deals with Dataset filese.g. load datasets,
converts dataset formats
Deals with data that need manipulation
Visualize data
![Page 11: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/11.jpg)
Welsh e-Science CentreAli Shaikh Ali
Adding new Classifier Service Classifier Template
This Web service implements a complete list of classifiers, i.e. trees, rules, functions etc.
OperationsclassifyInstance()
classifyRemoteInstance()
getClassifiers( )
getOptions()
Input: DataHandler dataset String classifierName String options String attributeName
output: String result
Input: null
output: String listOfClassifiers
Input: String classifierName
output: String listOfApplicableOptions
Input: String datasetURL String classifierName String options String attributeName
output: String result
→
→→→
![Page 12: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/12.jpg)
Welsh e-Science CentreAli Shaikh Ali
Adding new Services … 2
1. Build your classifiermust implement the 4 required methods
2. Place it in the classifier’s lib.
3. Done!
![Page 13: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/13.jpg)
Welsh e-Science CentreAli Shaikh Ali
Where can you find us?
UDDI Browser
An open-source project that provides a friendly user interface allowing users to browse and manipulate content in UDDI registries. It is written in Java using the Swing libraries. Currently the browser only supports version 2.0 UDDI registries.
Cardiff UDDI:Inquiry: http://agents-comsc.grid.cf.ac.uk:8334/juddi/inquiryPublish: http://agents-comsc.grid.cf.ac.uk:8334/juddi/inquiry
![Page 14: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/14.jpg)
Welsh e-Science CentreAli Shaikh Ali
Download
Triana available at:
http://www.trianacode.org/
http://www.gridlab.org/
Data Mining Toolbox at:
http://users.cs.cf.ac.uk/Ali.Shaikhali/dipso/
![Page 15: Constructing Data Mining Applications based on Web Services Composition Ali Shaikh Ali and Omer Rana Ali.shaikhali@cs.cf.ac.ukAli.shaikhali@cs.cf.ac.uk,](https://reader036.fdocuments.us/reader036/viewer/2022071808/56649ef65503460f94c0a8c2/html5/thumbnails/15.jpg)
Welsh e-Science CentreAli Shaikh Ali
Questions
Who is part of the user community? elicit requirements
What is different with reference to e-Science? additional capability provided by the Grid additional types of requirements What additional benefit does it provide? Ability to undertake multiple runs (what-if scenarios)
Need to embed algorithms within some other program -- rather than have a stand-alone tool Can Web Services try to address this concern?
Which algorithm in what context?