Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

25
Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer) Jinwook Seo and Ben Shneiderman Human-Computer Interaction Lab Department of Computer Science University of Maryland, College Park [email protected]

description

Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer). Jinwook Seo and Ben Shneiderman Human-Computer Interaction Lab Department of Computer Science University of Maryland, College Park [email protected]. - PowerPoint PPT Presentation

Transcript of Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Page 1: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Interactive Exploration of Hierarchical Clustering Results

HCE (Hierarchical Clustering Explorer)

Jinwook Seo and Ben ShneidermanHuman-Computer Interaction Lab

Department of Computer Science

University of Maryland, College Park

[email protected]

Page 2: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Cluster Analysis of Microarray Experiment Data

• About 100 ~ 20,000 gene samples• Under 2 ~ 80 experimental conditions• Identify similar gene samples

– startup point for studying unknown genes

• Identify similar experimental conditions– develop a better treatment for a special group

• Clustering algorithms– Hierarchical, K-means, etc.

Page 3: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dendrogram-3.64 4.87

Page 4: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dendrogram-3.64 4.87

Page 5: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dendrogram-3.64 4.87

Page 6: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Interactive Exploration Techniques

• Dynamic Query Controls– Number of clusters, Level of detail

• Coordinated Display– Bi-directional interaction with 2D scattergrams

• Overview of the entire dataset– Coupled with detail view

• Visual Comparison of Different Results– Different results by different methods

Page 8: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dynamic Query ControlsFilter out less similar genes

By pulling down the minimum similarity bar

Show only the clusters that satisfy the minimum similarity threshold

Help users determine the proper number of clusters

Easy to find the most similar genes

Page 9: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dynamic Query Controls

Adjust level of detail

By dragging up the detail cutoff bar

Show the representative pattern of each cluster

Hide detail below the bar

Easy to view global structure

Page 10: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Coordinated Displays

• Two experimental conditions for the x and y axes

• Two-dimensional scattergrams– limited to two variables at a time– readily understood by most users– users can concentrate on the data without

distraction

• Bi-directional interactions between displays

Page 11: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Overview in a limited screen space • What if there are more than 1,600 items to display?

• Compressed Overview : averaging adjacent leaves• Easy to locate interesting spots

Melanoma Microarray Experiment (3614 x 38)

Page 12: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Overview in a limited screen space • What if there are more than 1,600 items to display?

• Alternative Overview : changing bar width (2~10)• Show more detail, but need scrolling

Page 13: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Cluster Comparison

• There is no perfect clustering algorithm!• Different Distance Measures• Different Linkage Methods• Two dendrograms at the same time

– Show the mapping of each gene between the two dendrograms

– Busy screen with crossing lines – Easy to see anomalies

Page 14: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Cluster Comparison

Page 15: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Conclusion• Integrate four features to interactively

explore clustering results to gain a stronger understanding of the significance of the clusters– Overview, Dynamic Query, Coordination,

Cluster Comparison

• Powerful algorithms + Interactive tools • Bioinformatics Visualization

www.cs.umd.edu/hcil/multi-clusterJuly 2002 IEEE Computer Special Issue on BioInformatics

Page 16: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

A B C D

Dist A B C D

A 20 7 2

B 10 25

C 3

D

Distance MatrixInitial Data Items

Hierarchical Clustering

Page 17: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

A B C D

Dist A B C D

A 20 7 2

B 10 25

C 3

D

Distance MatrixInitial Data Items

Hierarchical Clustering

Page 18: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Current Clusters

Single Linkage

Hierarchical Clustering

Dist A B C D

A 20 7 2

B 10 25

C 3

D

Distance Matrix

A B CD

2

Page 19: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dist AD B C

AD 20 3

B 10

C

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

A B CD

Page 20: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

A B CD

Dist AD B C

AD 20 3

B 10

C

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

Page 21: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dist AD B C

AD 20 3

B 10

C

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

A BCD

3

Page 22: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dist ADC B

ADC

10

B

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

A BCD

Page 23: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

A BCD

Dist ADC B

ADC

10

B

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

Page 24: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

Dist ADC B

ADC

10

B

Distance MatrixCurrent Clusters

Single Linkage

Hierarchical Clustering

A BCD

10

Page 25: Interactive Exploration of Hierarchical Clustering Results HCE (Hierarchical Clustering Explorer)

A BCD

Dist ADCB

ADCB

Distance MatrixFinal Result

Single Linkage

Hierarchical Clustering