University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks...
-
date post
18-Dec-2015 -
Category
Documents
-
view
215 -
download
0
Transcript of University of Buffalo The State University of New York Spatiotemporal Data Mining on Networks...
University of Buffalo The State University of New York
Spatiotemporal Data Mining on Networks
Taehyong Kim
Computer Science and Engineering
State University of New York at Buffalo
University of Buffalo The State University of New York
Table of Contents
Studies Spreading and Defense
model in Networks Fixed-random network Spreading Model Defense Model
Avian Influenza Outbreaks Modeling Mining parameters
Introduction Overview Networks Data Mining Spatiotemporal Data
Mining Applications
Quality of Bone (osteoporosis) as a Network Dynamics
Amazon Deforestation
University of Buffalo The State University of New York
Overview
Most of real world relationships and communications could be represented on networks (graphs).
Understanding the behavior of such systems starts with understanding the topology of the corresponding network.
Yeast PPI network AT&T Web NetworkCollaboration network
University of Buffalo The State University of New York
Overview
Recent studies on various networks Social network
Author network, School relationship Network Technical network
Cell network, Internet, Electric power network Biological network
Protein network, Metabolic network, Disease Network
Focuses on network attributes Number of nodes and edges Weight on nodes and edges
University of Buffalo The State University of New York
Overview
Hub node
Bridge node
edge
node
nodes and edges
University of Buffalo The State University of New York
Networks Data Mining
Networks Data mining has been done Prediction of unknown protein functions in protein-
protein interaction networks Resilience test of networks against attacks Prediction of people relationships in social
networks Drug targeting on cell networks Etc.
University of Buffalo The State University of New York
Spatiotemporal Data Mining
Networks are changed as time goes by World wide web is evolving by itself Interactions among proteins are changed in PPI
networks Size of cities and inter-state free ways are
changed Structure of bone is changed
Information of location and time is also important factors for further understanding on any given networks
University of Buffalo The State University of New York
Spatiotemporal Data Mining
Spatiotemporal Data Mining: knowledge extraction from large spatiotemporal repositories in order to recognize behavioural trends and spatial patterns for prediction purposes What is the relationship between the spread of
epidemics and the number and location of houses and schools by time?
What is the connection between the size of Buffalo city and thruway traffics on I-90 by an year?
University of Buffalo The State University of New York
Spatiotemporal Data Mining
Normal Osteoporosis
Drugs
University of Buffalo The State University of New York
Amazon Deforestation 2003
Fonte: INPE PRODES Digital, 2004.Fonte: INPE PRODES Digital, 2004.
Deforestation 2002/2003Deforestation 2002/2003
Deforestation until 2002Deforestation until 2002
University of Buffalo The State University of New York
Modelling Complex Problems Application of interdisciplinary knowledge to
produce a model.
If (... ? ) then ...
Desforestation?
University of Buffalo The State University of New York
Table of Contents
Studies Spreading and Defense
model in Networks Fixed-random network Spreading Model Defense Model
Avian Influenza Outbreaks Modeling Mining parameters
Introduction Overview Networks Data Mining Spatiotemporal Data
Mining Applications
Quality of Bone (osteoporosis) as a Network Dynamics
Amazon Deforestation
University of Buffalo The State University of New York
Spreading and Defense model in Networks Fixed-radius random network
Cellular transmission tower Interstate free ways Epidemics on communities Sensor networks
How we can defend if there are attacks or breaks from the center of the networks?
University of Buffalo The State University of New York
Fixed Radius Random Network 400 random points on 1*1 square unit Calculating distance between each point If two points are in a certain radius, creating
an edge between points
University of Buffalo The State University of New York
Fixed Radius Random Network Fixed-radius of random
network (r = 0.01 ~ 0.14)
Fixed-Radius
400 nodes, 2366 edges
University of Buffalo The State University of New York
Simulation on network
Network dynamics are studied based on fixed-radius random network
Simple spreading model and defense model is implemented for simulation
Mining important parameters on this model of network dynamics
Mining optimal values of parameters on this model of network dynamics
University of Buffalo The State University of New York
Spreading Model
Simulating disease spreading or message spreading
Starting from center point (0.5*0.5) Affecting edges which are in a spreading
radius (ROI) from center Spreading radius grows or reduces based on
how many edges are damaged
University of Buffalo The State University of New York
Spreading Model
Region of radial distance of spreading model (ROIt=0 = 0.1)
Spreading starts from center (0.5, 0.5)
ROI
Center
University of Buffalo The State University of New York
Spreading Model
Probability of affecting rate of edges (Pa = 0.33)
11 edges are in ROI In this case, 4 out of 11
edges are affected (Spreading will affect edges about 33% probability)
ROI
University of Buffalo The State University of New York
Defense Model
Simulating defense system of disease spreading or message spreading
Signaling to neighbor nodes in order to inform (disease) spreading
Activated when the affection of spreading (# of signals from neighbor nodes) is over threshold
Removing edges which are in a radius () from activated neighbor nodes in order to stop spreading
University of Buffalo The State University of New York
Defense Model
Circular region of programming Cell Death 0.2~3.6)
When signals from neighbor nodes are over the Td, edges in the circular region are removed by defense process
Region of defense process
University of Buffalo The State University of New York
Defense Model
Probability of Programming Cell Death (Pp = 1)
If Pp is 1, all edges in circular regions are dead
University of Buffalo The State University of New York
Result (visualization)
Time: 0 Time: 10 Time: 50
Total Damage
Intermediate
Contained
Time
University of Buffalo The State University of New York
Result
0
0.2
0.4
0.6
0.8
1
0 0.06 0.12 0.18 0.24 0.3 0.36
Ave
rag
e F
ract
ion
al o
f E
dg
es D
amag
ed
0.3
0.4
0.5
A
0.2
University of Buffalo The State University of New York
Result
0
0.2
0.4
0.6
0.8
1
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35
Av
era
ge
Fra
cti
on
al
Da
ma
ge
Offense
Defense
Total
A
University of Buffalo The State University of New York
Summary
Containment strategy on epidemics and virus spreads Mining important parameters Mining optimal values of important parameters
Understanding dynamics on human tissues and bones Development of diseases (osteoporosis) Drug effects on cell networks
University of Buffalo The State University of New York
Table of Contents
Studies Spreading and Defense
model in Networks Fixed-random network Spreading Model Defense Model
Avian Influenza Outbreaks Modeling Mining parameters
Introduction Overview Networks Data Mining Spatiotemporal Data
Mining Applications
Quality of Bone (osteoporosis) as a Network Dynamics
Amazon Deforestation
University of Buffalo The State University of New York
Avian Influenza
AI outbreaks are frequently occurring around the world recently H5N1 type has high infection and mortality rate Chickens and ducks are main victims of AI Mortality rate of H5N1 could reach 90-100% within 48
hours
Threat from AI has greatly increased for human beings There are several reports showing human infection of AI People could get infected by contacting excretion of
contaminated birds
University of Buffalo The State University of New York
AI outbreaks
Outbreaks in South Korea 2008
4 days 12 days 20 days
28 days 36 days 44 days
University of Buffalo The State University of New York
Challenges
Strategies are needed for AI containment Early identification of the first cluster of cases Warning system from contaminated area to neighbor areas
are needed Effective quarantine plan should be existed
Containment model helps plan effective strategies Prediction of damage with certain environment parameters Mining important parameters to control outbreaks Measurement of effective values of important parameters
University of Buffalo The State University of New York
A group of chickens and ducks are nodes 2231 nodes for a group of chickens and 808 nodes for a
group of ducks 76 (1x1 square) units (1 unit = 37.5 Km)
Parameters A node can interact with other nodes in range A susceptible node become a infected node by infection
probability A Infected node become a activated node by incubation
period and Nodes are culled in quarantine radius
Modeling
University of Buffalo The State University of New York
Visualization
Visualization of simulations based on AI outbreaks in South Korea 2008
4 days 14 days 24 days 34 days 44 days
University of Buffalo The State University of New York
Important Parameters
Effect of Increased Quarantine Range Quarantine radius: 0.0 ~ 0.32 unit
Effects of Increased Incubation Period Incubation Period: 0 ~ 17 days
Effects of Increasing the Infection probability Infection probability: 0.0 ~ 1.0
University of Buffalo The State University of New York
Quarantine Radius
Effect of Increased Quarantine Radius Quarantine radius: 0.0 ~ 0.32 unit Infection probability: 0.1, 0.4, 0.7 and 1.0
Research on effective quarantine radius by Infection probability Optimal quarantine radius
Infection Probability
0.1 0.4 0.7 1.0
Optimal Radius
0.04 0.10 0.16 0.22
University of Buffalo The State University of New York
Incubation Period
Effects of Increased Incubation Period Incubation Period: 0 ~ 17
days Quarantine Range: 0.0,
0.04, 0.11 and 0.18 unit For mid level control, almost
89% of poultry farms are healthy when incubation period is one day whereas only 11% of poultry farms are healthy when incubation period is 17 days.
University of Buffalo The State University of New York
Infection probability
Effects of Increasing the Infection probability Infection probability: 0.0
~ 1.0 Quarantine Range: 0.0,
0.04, 0.11 and 0.18 unit The large numbers of
poultry farms eliminated by the aggressive culling procedure with max control
University of Buffalo The State University of New York
Summary
Modeling AI dynamics based on statistic data Modeling of AI outbreaks and spreads Modeling of defense strategies
Mining important parameters and values in order to contain AI outbreaks in early stage Quarantine radius, infection rate, incubation
period Damage predictions with important parameters Mining defense strategies for future outbreaks