NBQSA 2nd round Presentation

24

Transcript of NBQSA 2nd round Presentation

Page 1: NBQSA 2nd round Presentation
Page 2: NBQSA 2nd round Presentation

Few Statistics…Time and Savings Deposits held by the Public

2010 1,405,808

2011 1,753,896

2012 2,143,136

Crime Rate in Sri Lanka (Source: http://www.police.lk/index.php/crime-trends)

Health Expenditure in Sri Lanka(Source: http://www.who.int/gho/countries/lka.pdf)

“Lot of data waiting to be mined…”

(Source: http://www.cbsl.gov.lk/pics_n_docs/10_pub/_docs/statistics)

Page 3: NBQSA 2nd round Presentation

Introduction What is Weave-D?

Inspired by human brain Data Accumulating, Learning and Fusing

System

Video

Supports Multi-modal data

Incremental learning

Inspiration source

Page 4: NBQSA 2nd round Presentation

Why Weave-D?

Growth of information

Handle data

Prevent catastrophic

forgetting

Visualizing information Conceptualization

??

??Intuitive

Simple

Come as chunks

Heterogeneous Incremental learning

Generalization of acquired knowledge

Apply previous knowledge to acquire

new knowledge

Page 5: NBQSA 2nd round Presentation

Business Value Medical

What we can mine? New patient has a cancer or not? Effective medicine for certain diseases Diseases distribution in the country

E.g. Anuradhapura – more kidney diseases

Page 6: NBQSA 2nd round Presentation

Business Value Finance

Predict customers’ transactional behaviors, so banks can plan their strategies ahead

Forensics or Police Predict criminal behavior Identify crimes with similar evidence

And many more…

Page 7: NBQSA 2nd round Presentation

Similar Products IBM Watson

Developed by IBM to compete in Jeopardy A Question answering system Consumes “millions” of Wikipedia pages

and try to find answers from the knowledge acquired

Finance and health care domains

Page 8: NBQSA 2nd round Presentation

Uniqueness

RapidMiner IBM Watson Weave-D

Support heterogeneous data

x x Learn without forgetting past data

x x Support analyzing at different granularities

x x Visualization Fast response x x

Page 9: NBQSA 2nd round Presentation

What does Weave-D do?

Page 10: NBQSA 2nd round Presentation

Business LogicPersistenceHandlers PresentationPersistence

Weave-D architecture

Config filesXML

Raw Data Learning Component

Link Generators

XML Parsers

XML WritersUser

Interfaces

Feature Extractor Facade

Weave-D Facade

Data Models

XML Outputs

3D Visualization

Interface

Perception Model

Configuration Loaders

Logger

Feature Extractors

Page 11: NBQSA 2nd round Presentation

Knowledge Representation

Day 1 2 3

Input

Layer 1 (Day 1) Layer 2 (Day 2) Layer 3 (Day 3)

Page 12: NBQSA 2nd round Presentation

(None)

Dataset 1 Dataset 2 Dataset 3

City (Day view) City (Night view)

Forest (Autumn) Forest (Spring)Forest (Winter)

Child (4-8 years old)

Child (1-4 years old)

Child (8-12 years old)

Sunset view Sunset view

C1

C2

C3

C4

C5

Page 13: NBQSA 2nd round Presentation

Demonstration - Scenario Description

Sam is a sports enthusiast. He has a set of images belonging to following sports; Croquet, Polo, Rock-climbing, Sailing, Rowing, Badminton. Also he has a small description of the sport for each image. He needs to cluster these images and text by the sports category.

Constraints All the photos are not available to him at once. He

gets sets of images each day. (Incremental learning)

Page 14: NBQSA 2nd round Presentation

User’s Point of View Input

Query image

Expected outcomes Set of related images and documents explaining the sport

Tasks Setting up Weave-D Training Weave-D Querying from Weave-D

Sam doesn’t know what sport this is (Query image) Meaningless file names! Get documents explaining the sport denoted by image

Page 15: NBQSA 2nd round Presentation

What happens inside?

Query Image

Day 1 Day 2 Day 3

Imag

esTe

xt

Result Images

Result Text

Associative Links

Time Series Links

Page 16: NBQSA 2nd round Presentation

Bigger Picture!!! Medical domain Forensic domain

Page 17: NBQSA 2nd round Presentation

Methodology Standards Agile development – Scrum Documentation

Architecture documents Class diagrams

Git version controlling Tests

Page 18: NBQSA 2nd round Presentation

Class Diagrams

Github

Milestones

Architecture Document

Website

Page 19: NBQSA 2nd round Presentation

Implementation Standards Rich client platform Object Oriented Programming Design patterns

Factories Facades Command Objects

High decoupling XML Configuration

Page 20: NBQSA 2nd round Presentation

Monetization Plans? Promotions through Social Media

Facebook Google+

Advertising on Data Mining websites KDNuggets

Discussions ICTA Private Hospitals Private Investigation Agencies

Investments? Project group

Sri Lanka Police

National Hospital

Page 21: NBQSA 2nd round Presentation

Few years ahead in MoneyPath

January, 2015

January, 2016Today

1st Release 2nd Release

January, 2014

Part Time Full Time

Break evenAdvertising campaign

(Rs. 15,000)

Labor cost (4 members)

(Rs. 60,000)

Other (Rs. 25,000)

Initial Investment

(Rs.100,000)

Sell 5 units1 unit = 80K-100K

Sell 10 units 1 unit = 150K-200K

Profitable

Page 22: NBQSA 2nd round Presentation

Glimpse to the Future Support mining information at

different granularities Extend Weave-D Client-Server

architecture Support already existing standards

(e.g. PMML)

Page 23: NBQSA 2nd round Presentation

Further Resources Website:

http://weave-d.com/ Facebook Page: https://

www.facebook.com/treadlabz.weaved Google+ Page: https://

plus.google.com/102785205487583718859

Page 24: NBQSA 2nd round Presentation

Thank you