ITTM: Troubleshooting Skill Manual

76
IT Troubleshooting Management Troubleshooting Skill Manual

description

Skill profiling and modelling IT troubleshooting

Transcript of ITTM: Troubleshooting Skill Manual

Page 1: ITTM: Troubleshooting Skill Manual

IT

Troubleshooting Management

Troubleshooting Skill Manual

Page 2: ITTM: Troubleshooting Skill Manual

Analyst often get ask how problem got fixed not how the problem got identified.

NT

Page 3: ITTM: Troubleshooting Skill Manual

Chapter 1 [skill principles]

Chapter 3 [skill model]

Chapter 4 [skill exercise]

Chapter 2 [skill profile]

Page 4: ITTM: Troubleshooting Skill Manual

Identify the cause of the problem

TROUBLESHOOTING

MAIN GOAL:

Page 5: ITTM: Troubleshooting Skill Manual

Test every suspect component until the fault is identified

MAIN TASK:

Page 6: ITTM: Troubleshooting Skill Manual

What makes a problem more challenging to troubleshoot than others ?

Page 7: ITTM: Troubleshooting Skill Manual

It depends on the number of suspect components that needs to be tested.

2 components 100+ components

Page 8: ITTM: Troubleshooting Skill Manual

The higher number of suspect faults, the harder it is to troubleshoot.

20 components 100+ components

Page 9: ITTM: Troubleshooting Skill Manual

Locate the fault

First troubleshooting objective:

Page 10: ITTM: Troubleshooting Skill Manual

Identity of the fault

Second troubleshooting objective:

Page 11: ITTM: Troubleshooting Skill Manual

How do you troubleshoot?

Page 12: ITTM: Troubleshooting Skill Manual

Troubleshooting components with physical boundaries,

use your eyes.

Page 13: ITTM: Troubleshooting Skill Manual

Troubleshooting components with no physical boundaries,

use logic.

10101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101

Page 14: ITTM: Troubleshooting Skill Manual

Troubleshooting digital component inside another digital component,

10101010101010101010101010101010101010101 10101010101010101010101010101010101010101

101010101010101010101010101010101010101

101010101010101010101010101010101010101

101010101010101010101010101010101010101 10101010101010101010101010101010101010101 10101010101010101010101010101010101010101

use logical isolation.

Page 15: ITTM: Troubleshooting Skill Manual

Chapter 1 [skill principles]

Chapter 3 [skill model]

Chapter 4 [skill exercise]

Chapter 2 [skill profile]

Page 16: ITTM: Troubleshooting Skill Manual

Lets define troubleshooting

Page 17: ITTM: Troubleshooting Skill Manual

SPKG

Unknown and undocumented technical issue

Problem X

Page 18: ITTM: Troubleshooting Skill Manual

Replacing or reinstalling every component until problem X is resolved is NOT troubleshooting.

SPKG

Page 19: ITTM: Troubleshooting Skill Manual

Applying known popular fixes with the hope of fixing problem X is NOT troubleshooting.

SPKG

Page 20: ITTM: Troubleshooting Skill Manual

Troubleshooting is the use of technical deduction and logical isolation to identify the cause of problem X.

SPKG

Page 21: ITTM: Troubleshooting Skill Manual

Lets define technical deduction

Page 22: ITTM: Troubleshooting Skill Manual

technical knowledge + deductive logic =

technical deduction

Page 23: ITTM: Troubleshooting Skill Manual

take out technical deduction skill from the equation and troubleshooting will always start at:

Page 24: ITTM: Troubleshooting Skill Manual

“Is the power ON?”

Page 25: ITTM: Troubleshooting Skill Manual

subtract functional components from suspect components to minimize the number of components to isolate/test

technical deduction

objective:

Page 26: ITTM: Troubleshooting Skill Manual

Example

Web Server Web

Application

Web

Application

Account

Web

Application

Account

Password

problem X’s suspect components

Page 27: ITTM: Troubleshooting Skill Manual

Based from the problem analysis, user’s neighbor can login and work on the same web application.

Example

Web Server Web

Application

Web

Application

Account

Web

Application

Account

Password

Page 28: ITTM: Troubleshooting Skill Manual

Web Server Web

Application

Therefore, troubleshooting efforts should be limited to Web application account and password.

X X

Example

Web

Application

Account

Web

Application

Account

Password

Page 29: ITTM: Troubleshooting Skill Manual

Analyst with good deductive skill, can minimize the number of suspect components on any given problem.

Page 30: ITTM: Troubleshooting Skill Manual

Analyst with excellent deductive skill, can locate the fault on any given problem.

Page 31: ITTM: Troubleshooting Skill Manual

Technical deduction skill is a required skill for any front line analyst.

Page 32: ITTM: Troubleshooting Skill Manual

lets define logical isolation

Page 33: ITTM: Troubleshooting Skill Manual

isolation definition: to set one or a group of technical components apart from others

Page 34: ITTM: Troubleshooting Skill Manual

why isolate? to test the component’s functionality

Page 35: ITTM: Troubleshooting Skill Manual

Individual components with physical boundaries can be easily isolated and tested

A

Page 36: ITTM: Troubleshooting Skill Manual

Testing A by

INCLUSION

Testing A by

EXCLUSION

A A

Types of Isolation

Page 37: ITTM: Troubleshooting Skill Manual

But component with no physical boundaries needs to be isolated logically

A

Hardware Component

Software Component

Page 38: ITTM: Troubleshooting Skill Manual

The faster one can isolate and test any given suspect component, the faster one can identify the problem cause.

Page 39: ITTM: Troubleshooting Skill Manual

fault isolation eff ic iency

# faults isolated / time =

Page 40: ITTM: Troubleshooting Skill Manual

take out isolation skill from the equation and troubleshooting will always start at:

Page 41: ITTM: Troubleshooting Skill Manual

re-installing every component

Page 42: ITTM: Troubleshooting Skill Manual

lets define the methods in

logical isolation

Page 43: ITTM: Troubleshooting Skill Manual

4 methods in logically isolating (hardware and software) faults:

simpl i fy shorten

compare error for error [ ]

Page 44: ITTM: Troubleshooting Skill Manual

[simplify]

Process of excluding complex component and substituting it with a basic component. “Is there a substitute or alternative core component ?” Example: wireless router to LAN cable; production server to test server; setting application setting to plain vanilla mode

Page 45: ITTM: Troubleshooting Skill Manual

[shorten]

Process of minimizing the process or excluding unnecessary components without altering the intended goal. “Can this component be taken out without altering the overall process?” Example: remote application to local application; network printing to local printing

Page 46: ITTM: Troubleshooting Skill Manual

[compare]

Process of taking suspected component and comparing it with a known working component or environment. “Is there a working model to compare this with?” Example: side by side comparison; a tested component inserted in place of a suspected fault

Page 47: ITTM: Troubleshooting Skill Manual

[error for error ]

Process of injecting known error into the target fault. The sequence of the expected error compared to the original error will help determine the location or identity of the fault. “Is there documented and reversible error that can be applied to this fault?” Example: intentionally entering an incorrect password, which came first “wrong password “ or original error ?

Page 48: ITTM: Troubleshooting Skill Manual

ide nt i f y proble m caus e

logical isolation objective

Page 49: ITTM: Troubleshooting Skill Manual

analyst with good isolation skills, have the ability to isolate any given software sub-components

Page 50: ITTM: Troubleshooting Skill Manual

analyst with excellent isolation skill, have the ability to narrow down the number of suspect components by 50% on the first isolation task

Page 51: ITTM: Troubleshooting Skill Manual

Chapter 1 [skill principles]

Chapter 3 [skill model]

Chapter 4 [skill exercise]

Chapter 2 [skill profile]

Page 52: ITTM: Troubleshooting Skill Manual

troubleshooting skill model

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 53: ITTM: Troubleshooting Skill Manual

SPKG

Analyst encounters Problem X

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 54: ITTM: Troubleshooting Skill Manual

Analyst perform DUE DILIGENCE : Recreate issue, check policies and procedures; determine if it’s a procedural issue Check knowledge base or Google; determine if it’s a known issue If issue is unknown Analyst then proceeds to the next task

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 55: ITTM: Troubleshooting Skill Manual

Perform Root Cause Analysis based from:

Technical knowledge Awareness of technical components in play Hardware and software behavior

Analyst will theorize what caused Problem X

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 56: ITTM: Troubleshooting Skill Manual

Analyst makes a mental list of initial suspected faults “if the 1st component tested is not the fault, what’s the next potential fault?” Ideal number of faults = 2 or greater

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 57: ITTM: Troubleshooting Skill Manual

Sample Potential Faults

Fault

1

Fault

2

Fault

3

Fault

4

Fault

5

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 58: ITTM: Troubleshooting Skill Manual

Analyst will then gather more information to qualify or disqualify faults by asking questions. Using the sample Task Table below, asking if Task E can be performed is the most efficient.

Task Fault Tested

Task A Fault 1

Task B Fault 2

Task C Fault 3

Task D Fault 1 & 2

Task E Fault 1,2 & 3

Technical Deduction

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate

suspected faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 59: ITTM: Troubleshooting Skill Manual

Since Task E can be performed, suspected faults is down to 2.

Fault

1

Fault

2

Fault

3

Fault

4

Fault

5 X X X

Technical Deduction

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate

suspected faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 60: ITTM: Troubleshooting Skill Manual

Isolation Method

Fault Isolated

Notes

Method X Fault 4 Positive Result = fault 4 is functional

Method Y Fault 5 Positive Result = fault 5 is functional

Method Z Fault 4 & 5 Positive Result = fault 4 is functional

Negative Result =

fault 5 is functional

Next troubleshooting procedure is to isolate / test the 2 remaining suspected faults. Using the sample Isolation Table below executing Method Z is the most efficient.

Fault Isolation

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 61: ITTM: Troubleshooting Skill Manual

Applying method Z resulted in a negative result, which means problem cause is Fault 4.

Fault

4

Fault Isolation

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 62: ITTM: Troubleshooting Skill Manual

Apply fix to confirm fault is identified.

Problem X

Analyze Problem

List all potential faults

Apply technical deduction to eliminate suspected

faults

Apply Fix

Test / Isolate remaining faults until problem cause is

identified

Page 63: ITTM: Troubleshooting Skill Manual

Chapter 1 [skill principles]

Chapter 3 [skill model]

Chapter 4 [skill exercise]

Chapter 2 [skill profile]

Page 64: ITTM: Troubleshooting Skill Manual

fault consistency exercise

As a team, pick an error or symptom from your existing knowledge base without looking at the problem resolution. Write down all potential faults. Now compare your list with other team members. Faults listed should be more or less the same.

Objective: test analyst’s technical knowledge and component awareness; expose team’s knowledge

inconsistency

Page 65: ITTM: Troubleshooting Skill Manual

technical deduction exercise 1

1) Pick one component ( hardware or software ) and write down the task that can be performed if this component is functional.

2) Convert this statement in a form of a question. If analyst interface with a non technical customer, use a non- technical question (NTQ).

3) Repeat process.

Objective: test analyst technical knowledge and develop individual deductive skill

Page 66: ITTM: Troubleshooting Skill Manual

technical deduction exercise 1

Component: Database Application Statement: Ability to access all customer ‘s activity and history Question: Are you able to query past customer activity? NTQ: Are you able to see yesterday’s activity?

Example

Page 67: ITTM: Troubleshooting Skill Manual

technical deduction exercise 2

1) List 2 components and write down a common task shared by both components if they are functional.

2) Convert this statement in a form of a question. If Analyst interface with a non technical customer, use a non- technical question (NTQ).

3) Increase the number of components and repeat process

Objective: test analyst technical knowledge and develop efficient deductive skill by grouping

Page 68: ITTM: Troubleshooting Skill Manual

Components: file server and folder access Statement: ability to open folder and copy / store files Question: Are you able to browse the folder contents and manipulate it ? NTQ: Are you able to see all the files from that folder and copy files ?

Example

technical deduction exercise 2

Page 69: ITTM: Troubleshooting Skill Manual

technical deduction exercise 3

1) List 4 technical faults.

2) Use a minimum number of individual and group deduction questions combination to eliminate every fault. The less questions the better.

3) Increase the number of technical faults and repeat process

NOTE: start this as a written exercise, then get a partner to make it as a

verbal exercise

Objective: develop reflexive deductive skill

Page 70: ITTM: Troubleshooting Skill Manual

logical isolation exercise 1

1. List one technical component

2. Write down which isolation methods can be applied and how

3. Increase number of components and repeat process

Objective: test analyst individual isolation skill

Page 71: ITTM: Troubleshooting Skill Manual

logical isolation exercise 1

Component: Application Setting file Simply Method: Yes, change setting to a plain vanilla or generic mode Shorten Method: No Compare Method: Yes, copying a known working file in place of this file or vice versa Error for Error Method: Yes, changing the parameter X to Y will create Error 123

Example

Page 72: ITTM: Troubleshooting Skill Manual

logical isolation exercise 2

1. List 2 components

2. Write down how to isolate all components in 1 step. Using common task for

both components to isolate is also acceptable.

3. Increase the number components and repeat process

Objective: develop efficient group isolation skill

Page 73: ITTM: Troubleshooting Skill Manual

logical isolation exercise 2

Component: PDF File and Network Printer One Step: Print PDF document

Example

Page 74: ITTM: Troubleshooting Skill Manual

troubleshooting exercise 2

Verbal Exercise

1. Ask a partner to state 5 components

2. Use a minimum number of deductive question and isolation task combination to cover all components. The lower the number the better.

3. Increase number of components

Objective: develop efficient troubleshooting skill

Page 75: ITTM: Troubleshooting Skill Manual

Online IT Troubleshooting Manager’s course can be found at

https://www.udemy.com/ittm_manager

Click here to see introduction video