Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye...

23
Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science, Kent, OH and Xinlei Zhao Kent State University, Department of Finance, Kent, OH Office of the Comptroller of the Currency, Washington, USA

Transcript of Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye...

Page 1: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based parallel coordinates and its application in financial visualization

Jamal Alsakran, Ye Zhao

Kent State University, Department of Computer

Science, Kent, OH

and Xinlei Zhao

Kent State University, Department of Finance, Kent, OH

Office of the Comptroller of the Currency, Washington, USA

Page 2: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Motivation

Visual clutter usually weakens or even diminishes

parallel coordinates ability when the data size increases

Visualization interactivity allows users to gain wider

insight into the data

Financial data analysis is a significant application domain

for visual analytics

Page 3: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Background

Johansson et al (05,06) propose high textures to

represent the data, and first introduced an opacity

transfer function to reveal structures of the data

Zhou et al (08) propose energy minimization to perform

visual clustering, where they used transfer functions to

assign opacity and colors to different clusters

In financial data, Theme River, Growth Matrix, Pixel-

based …etc

Page 4: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

Parallel coordinates plotting area defines an image,

I(W,H), with width W and height H

Each data item q is projected as a polyline on the image,

I(W,H)

For each fragment I(x,y), where 0 ≤ x < W and 0 ≤ y < H,

we compute the number of lines intersecting with it,

denoting as D(x,y)

A polyline-intersection density image D(W,H) is

generated.

Page 5: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

Tile-based PC promotes the traditional pixel-based

perspective of plotting to a new stage, by defining each

fragment as a rectangular region of the image space with

a user-specified size

A classical PC plot can simply be achieved by assigning

each fragment for one pixel in the image space

Page 6: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

X Y

W

H

I(x,y)

Page 7: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

X Y

W

H

I(x,y)

Page 8: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

X Y

W

H

I(x,y)

Page 9: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Tile-based Parallel Coordinates

X Y

W

H

I(x,y)

Page 10: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Color and Opacity TFs

Transfer functions are employed to assign local optical

attributes according to the density values

For each fragment I(x,y), we define four transfer

functions TF to determine the three color elements, R, G,

B, and the opacity, O, from its density value D(x,y)

The histogram of the densities is plotted to facilitate the

manipulation of the transfer functions

Page 11: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Color and Opacity TFs

density

Occurrence

Histogram

Page 12: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Fast Computing of Line-Tile Intersection

Immediate visual feedback when users continuously

change the tile size is crucial to guarantee interactivity

A fast computing algorithm is employed (Bresenham

algorithm)

To fully utilize Bresenham algorithm, we perform a

coordinates transformation, which scales each tile to one

pixel

Page 13: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Fast Computing of Line-Tile Intersection

Page 14: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Example

Original plot # tiles = 450

# tiles = 20# tiles = 150

Page 15: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

U.S. stocks during years (2000 to 2007)477,074 data items

Page 16: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Case Study: Mutual Funds

Mutual fund allows a group of investors to pool their

money together and invest.

In our study, we have 5785 funds

Each data item represents one mutual fund, whose

characteristics are investigated to find its correlation with

the annual return

The study examines the most significant characteristics

including total net asset size, cash holdings, front-end

load, rear-end load, expense ratios, and turnovers

Page 17: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Front Load vs. Return

# of tiles = 100

# of tiles = 20

Page 18: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Turnover vs. Return with Outliers

It easily accommodate emphasized outliers together with the main trend

It emphasizes crucial data items while keeping the whole data as a background view

outliers are more easily to be compared with mainstream data

Page 19: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Analyzing Statistical Regression with Visualization

Tile-based PC is used to visually analyze the performance of a traditional statistical method widely used by financial analysts

The standard linear regression model that assumes a linear relation between the explanatory variables and the dependent variable

Estimated return = coef * characteristic + interp.

Comparison shows that our method is more informative

Page 20: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Analyzing Statistical Regression with Visualization

Real Data Regression Data

Page 21: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Multiple Clusters Visualization

Page 22: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Full Attributes Visualization with Outliers

The red polyline represents the best performer, Dreyfus Premier Greater China B (DPCBX), which produced 85% return for investors.

The purple polyline is the second-best mutual fund, Old Mutual Clay Finlay China C (OMNCX)

The best performers achievement in the year 2006 has no direct relation with their fund properties and managing activities

Page 23: Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,

Conclusion

A novel tile-based density and transfer functions to for

visual cluttering reduction

The tile-based parallel coordinates technique improves

the performance, yields more controllability and

promotes the visual understanding

Visual analytical results on financial data set of 2006 U.S

mutual funds illustrate the potential of using the method

in financial economics