How Dirty is your Data : The Duality between detecting Events and Faults


J. Gupchup, A. Terzis, R. Burns, A. Szalay

Department of Computer Science

Johns Hopkins University

Outline

Background Problem Statement Experiments Results Discussion

Application

Monitoring nesting conditions of the Maryland Box turtles

Science question: Do nesting conditions determine sex?

Important to correlate observations with environmental events (rain, snow, etc.)

Duality of Faults & Events

Data gathered from Sensor Networks contain faults

Delivering faulty data consumes resources and pollutes statistics

Need for fault detection techniques

Fault Detection methods detect readings that deviate from “normal” or “expected” values

Environmental Events:
– Scientifically interesting
– Deviate from the norm

Research Question(s)

Are “Events” misclassified as “Faults”?

What metrics could be used to quantify the misclassification?

How does the misclassification vary with:
– Type of fault
– Type of fault detection method
– Type of modality (moisture, temperature)

Is it possible to design a fault detection mechanism that minimizes the misclassification?

Know Thy Faults

Short Faults
– Sudden change in measurement

Noise Faults
– Larger variations in amplitude than expected
– Little or no variation in amplitude (unresponsive)

Fault Detection Methods

SHORT Rule
– If X_i − X_(i−1) > δ_SHORT, mark the current measurement as a fault (point method)
– δ_SHORT is established from domain knowledge
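A minimal sketch of the SHORT rule as stated above. The function name and the sample values are illustrative, and taking the absolute value of the difference (so that sudden drops are flagged as well as jumps) is an assumption, not something the slide specifies.

```python
def short_rule(readings, delta_short):
    """Return indices of readings flagged as SHORT faults (point method)."""
    flagged = []
    for i in range(1, len(readings)):
        # Assumption: use the magnitude of the change, so drops are flagged too.
        if abs(readings[i] - readings[i - 1]) > delta_short:
            flagged.append(i)
    return flagged


# Hypothetical soil moisture trace with one spike at index 3.
moisture = [0.20, 0.21, 0.20, 0.45, 0.21, 0.22]
print(short_rule(moisture, delta_short=0.1))
```

Under this reading of the rule, the sample that follows a spike is flagged as well, because the return to the normal level is itself a large change.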

NOISE Rule
– Take W successive samples
– If (σ_W ≤ σ_train − σ_allow) OR (σ_W ≥ σ_train + σ_allow), mark all W readings as faulty (block method)
– σ_train and σ_allow are established from training data
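The NOISE rule can be sketched as follows. The use of non-overlapping windows, the population standard deviation, and all names and numbers here are assumptions made for illustration; the slide only gives the threshold test.

```python
import statistics


def noise_rule(readings, window, sigma_train, sigma_allow):
    """Return indices of readings flagged as NOISE faults (block method)."""
    flagged = []
    # Assumption: windows do not overlap; a trailing partial window is skipped.
    for start in range(0, len(readings) - window + 1, window):
        block = readings[start:start + window]
        sigma_w = statistics.pstdev(block)
        too_quiet = sigma_w <= sigma_train - sigma_allow
        too_noisy = sigma_w >= sigma_train + sigma_allow
        if too_quiet or too_noisy:
            # Block method: every reading in the window is marked faulty.
            flagged.extend(range(start, start + window))
    return flagged


# Hypothetical trace: a normal window, a stuck window, then a noisy window.
trace = [10.0, 10.5, 9.5, 10.2,
         10.0, 10.0, 10.0, 10.0,
         10.0, 15.0, 5.0, 12.0]
print(noise_rule(trace, window=4, sigma_train=0.4, sigma_allow=0.3))
```

Because the rule marks whole windows, a rain event that raises the variance inside a window causes all W readings in it to be tagged.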

Linear Least-Squares Estimation (LLSE)
– Estimate the expected value of a sensor's reading from the readings of other sensors using LLSE
– If X_model − X_actual > δ_LLSE for k of the node's neighbors, mark the reading as faulty (point method)
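A rough sketch of the LLSE check, under several assumptions that the slide does not state: each neighbor gets its own one-variable linear model fitted on training data, the deviation is compared by magnitude, and "k of the node's neighbors" is read as "at least k". All names are hypothetical.

```python
def fit_line(x, y):
    """Least-squares fit of y ~ a*x + b; returns (a, b).

    Assumes x is not constant, otherwise the slope is undefined.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a = sxy / sxx
    return a, mean_y - a * mean_x


def llse_rule(models, neighbor_values, actual, delta_llse, k):
    """Flag a reading when at least k neighbor-based estimates disagree with it."""
    disagreements = 0
    for (a, b), x in zip(models, neighbor_values):
        estimate = a * x + b
        if abs(estimate - actual) > delta_llse:
            disagreements += 1
    return disagreements >= k


# Hypothetical training data: two neighbors (x) against the node itself (y).
models = [
    fit_line([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]),
    fit_line([1.0, 2.0, 3.0, 4.0], [3.0, 4.0, 5.0, 6.0]),
]
print(llse_rule(models, [5.0, 5.0], actual=20.0, delta_llse=1.0, k=2))
```

A larger k makes the check more conservative, since more neighbors must disagree before a reading is tagged.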

A. Sharma, L. Golubchik, and R. Govindan, “On the prevalence of sensor faults in real world deployments”, IEEE conference on Sensor, Mesh and Ad Hoc Communications and networks (SECON), 2007

Evaluation Metrics

Misclassification error (μ) for point faults: μ = (event readings tagged as faults) / (total event measurements)

Total misclassification (μ) = ∑_i D_i / ∑_i E_i

Misclassification error (μ) for block faults:

[Figure: timeline with labels "Misclassification" and "Event Period (E_i)"]

Fault detection evaluation metric: false negative ratio = fraction of faults that go undetected
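The point-fault misclassification error and the false negative ratio can be computed from per-reading boolean masks, as in this minimal sketch. The mask representation and the function names are assumptions made for illustration.

```python
def point_misclassification(is_event, is_flagged):
    """Event readings tagged as faults / total event measurements."""
    event_total = sum(1 for e in is_event if e)
    tagged = sum(1 for e, f in zip(is_event, is_flagged) if e and f)
    return tagged / event_total


def false_negative_ratio(is_fault, is_flagged):
    """Fraction of true (injected) faults that the detector missed."""
    fault_total = sum(1 for t in is_fault if t)
    missed = sum(1 for t, f in zip(is_fault, is_flagged) if t and not f)
    return missed / fault_total


# Hypothetical masks over eight readings.
is_event = [False, True, True, True, True, False, False, False]
is_fault = [False, False, False, False, False, False, True, True]
is_flagged = [False, True, False, False, True, False, True, False]

print(point_misclassification(is_event, is_flagged))
print(false_negative_ratio(is_fault, is_flagged))
```

Both functions assume at least one event reading and at least one fault, respectively; otherwise the ratio is undefined.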

Jug Bay Deployment Map

[Map labels: Turtle Nests; Weather Station; 38.784607, -76.700460. Courtesy: Google Maps]

Dataset

Sensor Data:
– Box temperature and soil moisture
– 3 motes from Jug Bay (previous slide)
– 5 months of data (sampled every 10 min.)
– Train Data Set (1 month), Test Data Set (4 months)

Event Ground Truth (Weather Data):
– Precipitation data collected from a weather station ~700 m away (sampled every 15 min.)
– 21 major events (i.e. rainfall) occurred
– Total rainfall: 158 hours

Faults Ground Truth

Start with a clean data set

Inject faults to establish ground truth
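One way to picture the injection step is the sketch below, which adds a spike at chosen positions of a clean trace and records where it did so. The additive-spike fault model, the function name, and the values are hypothetical; the slides do not describe how the faults were injected.

```python
def inject_short_faults(clean, fault_indices, spike):
    """Return (faulty_trace, truth_mask) with a spike added at each given index."""
    faulty = list(clean)           # copy, so the clean trace is left untouched
    truth = [False] * len(clean)   # ground-truth mask of injected faults
    for i in fault_indices:
        faulty[i] += spike         # assumption: a SHORT fault is an additive spike
        truth[i] = True
    return faulty, truth


clean = [20.0, 20.1, 20.2, 20.1, 20.0]
faulty, truth = inject_short_faults(clean, fault_indices=[2], spike=5.0)
print(faulty)
print(truth)
```

The truth mask is what a detector's output would later be compared against to count false negatives.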

Methodology

For Each Fault Detection Method & Each modality

Use 1st month’s data to Train

Obtain Model Parameters

Evaluate Method on Fault-Injected Test Data

Soil Moisture ‘SHORT RULE’

Reducing the number of misclassification errors increases false negatives

Misclassification: LLSE method

Modality           Misclassification error    False negatives
Box Temperature    0.3 %                      77.19 %
Soil Moisture      46.3 %                     50.03 %

Higher misclassification can occur due to :

Spatial & Temporal Heterogeneity of the soil

Lessons Learned

There exists a tension between detecting Events and Faults

Fault Detection Algorithms need to take this into consideration
– Events can be misclassified as faults

Need for novel Fault Detection methods that are robust in the presence of Events

Need for Pattern Recognition techniques

Acknowledgements

Abhishek Sharma, Dept. of Computer Science, University of Southern California

Chris Swarth, Jug Bay Wetlands Sanctuary

Life Under Your Feet team

Marcus Chang, University of Copenhagen

(Courtesy : Andreas Terzis)

Questions !!!!
