Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging...

110
Krist Wongsuphasawat & Jimmy Lin @kristw Using visualizations to monitor changes and harvest insights from log data at Twitter @lintool

description

Slides from my talk at the IEEE Conference on Visual Analytics Science and Technology (VAST) 2014 in Paris, France. ABSTRACT Logging user activities is essential to data analysis for internet products and services. Twitter has built a unified logging infrastructure that captures user activities across all clients it owns, making it one of the largest datasets in the organization. This paper describes challenges and opportunities in applying information visualization to log analysis at this massive scale, and shows how various visualization techniques can be adapted to help data scientists extract insights. In particular, we focus on two scenarios:\ (1) monitoring and exploring a large collection of log events, and (2) performing visual funnel analysis on log data with tens of thousands of event types. Two interactive visualizations were developed for these purposes: we discuss design choices and the implementation of these systems, along with case studies of how they are being used in day-to-day operations at Twitter.

Transcript of Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging...

Page 1: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Krist Wongsuphasawat & Jimmy Lin@kristw

Using visualizations to monitor changes and harvest insights

from log data at Twitter

@lintool

Page 2: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Logging user activities & data analysis

Page 3: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUseTwitter

Page 4: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Product Managers

Curious

Twitter

Page 5: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop Write Twitter

Instrument

Product Managers

Page 6: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweetactivities

Page 7: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweet from home timeline on twitter.com tweet from search page on iPhone

activities

Page 8: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweet from home timeline on twitter.com tweet from search page on iPhone

sign up log in

retweet etc.

activities

Page 9: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Organize?

Page 10: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

log event a.k.a. “client event”

[Lee et al. 2012]

Page 11: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

log event a.k.a. “client event”

client : page : section : component : element : actionweb : home : timeline : tweet_box : button : tweet

1) User ID 2) Timestamp 3) Event name

4) Event detail

[Lee et al. 2012]

Page 12: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

Page 13: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop Twitter

Instrument

Write

Product Managers

bigger than Tweet data

Page 14: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Ask

Twitter

Instrument

Write

Product Managers

Page 15: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find

Ask

Twitter

Instrument

Write

Product Managers

Page 16: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

Page 17: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean

Ask

Twitter

Instrument

Write

Product Managers

Page 18: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean

Ask

Monitor

Twitter

Instrument

Write

Product Managers

Page 19: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean, Analyze

Ask

Monitor

Twitter

Instrument

Write

Product Managers

Page 20: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

EngineersData Scientists

Usersin Hadoop

Find, Clean, Analyze

Use

Monitor

Ask

Curious

1 2

Twitter

Instrument

Write

Product Managers

Page 21: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Part I Find & Monitor Client Events

Page 22: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Motivation

Page 23: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Engineers & Data Scientists

billions of rows

Page 24: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

10,000+ event types

date client page section comp. elem. action count

20141011 web home home - - impression 100

20141011 web home wtf - - click 20

Engineers & Data Scientists

Client event collection

Page 25: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

10,000+ event types

date client page section comp. elem. action count

20141011 web home home - - impression 100

20141011 web home wtf - - click 20

Engineers & Data Scientists

Client event collection

(Who-to-Follow)

Page 26: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

AggregateClient event collection

Engineers & Data Scientists

Page 27: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

Find

client page section component element action

Search

Client event collection

Engineers & Data Scientists

Page 28: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

Find

client page section component element action

Search

Client event collection

Engineers & Data Scientists

Page 29: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

section? component?

element?

Page 30: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Log datain Hadoop

Aggregate

web home * * impression*

Client event collection

Engineers & Data Scientists

Page 31: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

web home * * impression*

Client event collection

Engineers & Data Scientists

Page 32: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

search can be better

Client event collection

Engineers & Data Scientists

Page 33: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

10,000+ event types

search can be better

Client event collection

Engineers & Data Scientists

Page 34: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

search can be better

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 35: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

Aggregate

search can be better

one graph / event

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 36: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

Aggregate

search can be better

one graph / eventx 10,000

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 37: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

!

• Search for client events

• Explore client event collection

• Monitor changes

Goals

Page 38: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Session analysis

!

• Monitor network logs, not user activity logs

Related work

[Lam et al. 2007, Shen et al. 2013]

[Ghoniem et al. 2013]

Page 39: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Design

Page 40: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Client event collection

Engineers & Data Scientists

Page 41: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

Client event collection

Engineers & Data Scientists

Page 42: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

Interactions search box => filter

Client event collection

narrow down

Engineers & Data Scientists

Page 43: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

How to visualize?

narrow down

Client event collection

Engineers & Data Scientists

Interactions search box => filter

Page 44: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

How to visualize?

narrow down

Client event collection

Engineers & Data Scientists

client : page : section : component : element : actionInteractions search box => filter

Page 45: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Client event hierarchy

iphone home -

- - impression

tweet tweet click

iphone:home:-:-:-:impressioniphone:home:-:tweet:tweet:click

Page 46: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Detect changes

iphone home -

- - impression

tweet tweet click

iphone home -

- - impression

tweet tweet click

TODAY

7 DAYS AGO

compared to

Page 47: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Calculate changes

+5% +5% +5%

+10% +10% +10%

-5% -5% -5%

DIFF

Page 48: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Display changes

iphone home -

- - impression

tweet tweet click

Map of the Market [Wattenberg 1999], StemView [Guerra-Gomez et al. 2013]

Page 49: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Display changes

home -

- - impression

tweet tweet click

iphone

Page 50: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Demo Scribe Radar

Page 51: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Twitter for Banana

Page 52: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter
Page 53: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Since Dec 2013

• 500 unique users, 10 users / day

!

• No training

Deployment

Page 54: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Users: PMs, Data Scientists, Engineers

• Search

• Monitor

• See effects after major product launch

Use cases

read the paper :)

Page 55: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Part II Analysis

Page 56: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Count page visits

banana : home : - : - : - : impressionhome page

Page 57: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel

home page

profile page

Page 58: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression

1 jobhome page

profile page

1 hour

Page 59: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression banana : search : - : - : - : impression

home page

profile page search page

2 jobs2 hours

Page 60: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression banana : search : - : - : - : impression

home page

profile page search page

Specify all funnels manually!

n jobsn hours

Page 61: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Goal

banana : home : - : - : - : impression

… ……

1 job => all funnels, visualized

home page

Page 62: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Visualize an overview of event sequences

!

Related work

[Wongsuphasawat et al. 2011, Monroe et al. 2013, …]

Page 63: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Visualize an overview of event sequences

!

• Big data? eBay checkout sequences

!

One funnel at a time Checkout > Payment > Confirm > Success

Related work

[Wongsuphasawat et al. 2011, Monroe et al. 2013, …]

[Shen et al. 2013]

Page 64: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

LifeFlow [CHI2011]

!

(simplified)

Page 65: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

User sessionsSession#1

A

B

start

end

Session#4

start

end

A

Session#2

B

start

end

A

Session#3

C

start

end

A

Page 66: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate4 sessions

A

BB C

start

end endend

A A

end

A

Page 67: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

A

BB C

start

end endend

end

4 sessions

Page 68: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

end

A

B

4 sessions

Page 69: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

end

A

B

4 sessions

Page 70: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

A

B end

4 sessions

Page 71: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

endend

A

B end

4 sessions

Page 72: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

endend

A

B end

4 sessions

Page 73: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

start

endend

A

CB end

4 sessions

Page 74: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

4,000,000 sessions

endend

A

CB end

start

Page 75: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

try with sample data (~millions sessions, 10,000+ event types)

!

original paper (100,000 sessions, ~10 event types)

Page 76: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

not meaningful !

small slice of data but huge file

Page 77: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

How to make it work?

Page 78: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

# of unique sequences

Page 79: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

Page 80: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select

tweet sign up log out

Page 81: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select

tweet sign up log out

Page 82: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select merge

tweet from home timeline tweet from search page tweet …

= tweet

Page 83: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

Page 84: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

session

1000 events

Page 85: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

session

10 events after (window size & direction)

1000 events

visit home page (alignment)

Page 86: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

Ask users for input}

Page 87: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

3. More aggregation on Hadoop

Reduce # of unique sequences

Ask users for input}

Page 88: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Collapse eventsSequence ABBBCCCC ABBCC ABC ABCCCC ABCD ABCCCD ABCCE ABCDF ABCDG ABCDH

e.g. tweet, tweet, tweet, … = tweet

Page 89: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Sequence ABC ABC ABC ABC ABCD ABCD ABCE ABCDF ABCDG ABCDH

Collapse events

Page 90: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDF ABCDG ABCDH …

Count 2000 80 20 1 1 1 …

Page 91: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDF ABCDG ABCDH ABCDI ABCDJK ABCDJL

Count 2000 80 20 1 1 1 1 1 1

rare sequences (count < threshold)

Page 92: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

TruncateSequence ABC ABCD ABCE ABCDx ABCDx ABCDx ABCDx ABCDJx ABCDJx

Count 2000 80 20 1 1 1 1 1 1

Replace last event with x (…)

Page 93: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Sequence ABC ABCD ABCE ABCDx ABCDJx

Count 2000 80 20 4 2

Group & Count

Page 94: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Truncate moreSequence ABC ABCD ABCE ABCDx ABCDx

Count 2000 80 20 4 2

Page 95: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDx

Count 2000 80 20 6

Page 96: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Define set of events

2. Pick alignment, direction and window size

3. Run Hadoop job (with more aggregation)

4. Wait for it… (2+ hrs)

5. Visualize

Final process

~100,000 patterns (10MB)

gazillion patterns (TBs)

Page 97: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Demo Flying Sessions

Page 98: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Since Jan 2013

• Fewer users, but more in-depth ad-hoc analysis

• Initial meeting to provide support

Deployment

Page 99: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Case studies

Page 100: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Case studies

click on “sign up”

fill personal info

import address book

etc.

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Page 101: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Case studies

read the paper :)

Page 102: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

Conclusions & Future work

Page 103: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

Conclusions & Future work

Page 104: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Conclusions & Future workChallenge

big data

small data

visualize & interact

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

aggregate & sacrifice

Page 105: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

• Generalize to smaller systems

Conclusions & Future workChallenge

big data

small data

visualize & interact

aggregate & sacrifice

Page 106: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Data Scientists & Engineers @Twitter — Linus Lee, Chuang Liu

• Feedback from reviewers, Ben Shneiderman & Catherine Plaisant

Acknowledgement

Page 107: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Twitter is looking for data analyst.

(contractor)

Page 108: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

• Generalize to smaller systems

Conclusions & Future workChallenge

big data

small data

visualize & interact

[email protected] / @kristw

aggregate & sacrifice

Page 109: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Questions?

Page 110: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Thank you