Telecom Network Fault Prediction

17
1 Telecom Network Fault Prediction H. K. Yuen Department of Management Sciences City University of Hong Kong

description

Telecom Network Fault Prediction. H. K. Yuen Department of Management Sciences City University of Hong Kong. Outline. Problem Formulation Variable Selection Model Development Model Implementation. Problem Formulation. Overview - PowerPoint PPT Presentation

Transcript of Telecom Network Fault Prediction

Page 1: Telecom Network Fault Prediction

1

Telecom Network Fault Prediction

H. K. Yuen

Department of Management Sciences

City University of Hong Kong

Page 2: Telecom Network Fault Prediction

2

Outline

• Problem Formulation

• Variable Selection

• Model Development

• Model Implementation

Page 3: Telecom Network Fault Prediction

3

Problem Formulation

• Overview– Messages about network performances are

generated from transmission stations– Messages are examined manually– Messages are classified as urgent fault or non-

urgent fault– To build a model to predict whether a received

signals an urgent fault or not

Page 4: Telecom Network Fault Prediction

4

Problem Formulation

• The Data– 5,924 past messages were collected– Each message contains 1,082 variables – Each message was examine manually– The decision "Urgent" or "Non-Urgent" was set

as the target variable• Urgent case = "True" Non-Urgent case = "Null"

Page 5: Telecom Network Fault Prediction

5

Problem Formulation

• Distribution of the Target Variable

Null True

Page 6: Telecom Network Fault Prediction

6

Problem Formulation

• Selection of Cases– Use the Sampling node of Enterprise Miner

(EM) to select a sample

Page 7: Telecom Network Fault Prediction

7

Variable Selection

• Using all of the variables in the model is not practical

• Impractical to examine the associations between the target variable and the other input variables manually

• The Tree node and the Variable Selection node of Enterprise Miner were employed

Page 8: Telecom Network Fault Prediction

8

Variable Selection• Process flow

Page 9: Telecom Network Fault Prediction

9

Variable Selection

• Some results from Tree1

• A total of 23 variables are selected as input

Page 10: Telecom Network Fault Prediction

10

Model Development• Data are partitioned into three parts

– Training (50%)– Validation (25%)– Testing (25%)

• Two possible model selection criteria: – The one that most accurately predicts the

response (either "True" or "Null")– The one that generates the highest expected profit

Page 11: Telecom Network Fault Prediction

11

Model Development• Modified Profit Vector

• Neural Network models with different setting were developed

• Model Output: Prob(Target variable="True")

Page 12: Telecom Network Fault Prediction

12

Model Development

• Process flow

• Model Manager

Page 13: Telecom Network Fault Prediction

13

Model Development

• How to choose a model with the most predictive power?– Sensitivity: # of predicted "True" / # actual "True"– Specificity: # of predicted "Null" / # actual "Null" – Cutoff point: Observations with predicted

probability of the target event greater than a cutoff point are classified as "True"

Page 14: Telecom Network Fault Prediction

14

Model Development

• Receiver Operating Characteristic Chart (ROC)

Sen

siti

vity

1-Specificity

Cutoff

Higher

Lower

All "True"

All "Null"

Winner

Page 15: Telecom Network Fault Prediction

15

Model Development• Correct Classification Chart

– Displays the prediction accuracy for each actual target level across a range of cutoff values

Cutoff

Page 16: Telecom Network Fault Prediction

16

Implementing the Model

An incoming signal

with predicted Prob(target variable = "True) = p

Class 1

p 0.5

Send technician

Class 2

else

Examine the signal manually

Class 3

p 0.15

Ignore the signal

Page 17: Telecom Network Fault Prediction

17

Implementing the Model• Results of classification

• Benefits:• Saving in manpower• Faster response time to problems

Class 1 Class 2 Class 3

True 50.38% 36.84% 12.78%Null 11.31% 25.68% 63.01%

Overall 12.19% 25.93% 61.88%

Actual