Download - Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Transcript
Page 1: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Krist Wongsuphasawat & Jimmy Lin@kristw

Using visualizations to monitor changes and harvest insights

from log data at Twitter

@lintool

Page 2: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Logging user activities & data analysis

Page 3: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUseTwitter

Page 4: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Product Managers

Curious

Twitter

Page 5: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop Write Twitter

Instrument

Product Managers

Page 6: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweetactivities

Page 7: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweet from home timeline on twitter.com tweet from search page on iPhone

activities

Page 8: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

What are being logged?

tweet from home timeline on twitter.com tweet from search page on iPhone

sign up log in

retweet etc.

activities

Page 9: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Organize?

Page 10: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

log event a.k.a. “client event”

[Lee et al. 2012]

Page 11: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

log event a.k.a. “client event”

client : page : section : component : element : actionweb : home : timeline : tweet_box : button : tweet

1) User ID 2) Timestamp 3) Event name

4) Event detail

[Lee et al. 2012]

Page 12: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

Page 13: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop Twitter

Instrument

Write

Product Managers

bigger than Tweet data

Page 14: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Ask

Twitter

Instrument

Write

Product Managers

Page 15: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find

Ask

Twitter

Instrument

Write

Product Managers

Page 16: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

Page 17: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean

Ask

Twitter

Instrument

Write

Product Managers

Page 18: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean

Ask

Monitor

Twitter

Instrument

Write

Product Managers

Page 19: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

UsersUse

Curious

Engineers

Log datain Hadoop

Data Scientists

Find, Clean, Analyze

Ask

Monitor

Twitter

Instrument

Write

Product Managers

Page 20: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log data

EngineersData Scientists

Usersin Hadoop

Find, Clean, Analyze

Use

Monitor

Ask

Curious

1 2

Twitter

Instrument

Write

Product Managers

Page 21: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Part I Find & Monitor Client Events

Page 22: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Motivation

Page 23: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Engineers & Data Scientists

billions of rows

Page 24: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

10,000+ event types

date client page section comp. elem. action count

20141011 web home home - - impression 100

20141011 web home wtf - - click 20

Engineers & Data Scientists

Client event collection

Page 25: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

10,000+ event types

date client page section comp. elem. action count

20141011 web home home - - impression 100

20141011 web home wtf - - click 20

Engineers & Data Scientists

Client event collection

(Who-to-Follow)

Page 26: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

AggregateClient event collection

Engineers & Data Scientists

Page 27: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

Find

client page section component element action

Search

Client event collection

Engineers & Data Scientists

Page 28: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Log datain Hadoop

Aggregate

Find

client page section component element action

Search

Client event collection

Engineers & Data Scientists

Page 29: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

section? component?

element?

Page 30: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Log datain Hadoop

Aggregate

web home * * impression*

Client event collection

Engineers & Data Scientists

Page 31: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

web home * * impression*

Client event collection

Engineers & Data Scientists

Page 32: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

search can be better

Client event collection

Engineers & Data Scientists

Page 33: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

10,000+ event types

search can be better

Client event collection

Engineers & Data Scientists

Page 34: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

web : home : wtf : - : - : impression

Aggregate

search can be better

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 35: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

Aggregate

search can be better

one graph / event

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 36: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

client page section component element action

Search

Find

Query

Return

Log datain Hadoop

Resultsweb : home : home : - : - : impression

Aggregate

search can be better

one graph / eventx 10,000

10,000+ event types

not everybody knowsWhat are all sections under web:home?

Client event collection

Engineers & Data Scientists

Page 37: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

!

• Search for client events

• Explore client event collection

• Monitor changes

Goals

Page 38: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Session analysis

!

• Monitor network logs, not user activity logs

Related work

[Lam et al. 2007, Shen et al. 2013]

[Ghoniem et al. 2013]

Page 39: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Design

Page 40: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Client event collection

Engineers & Data Scientists

Page 41: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

Client event collection

Engineers & Data Scientists

Page 42: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

Interactions search box => filter

Client event collection

narrow down

Engineers & Data Scientists

Page 43: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

How to visualize?

narrow down

Client event collection

Engineers & Data Scientists

Interactions search box => filter

Page 44: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

See

How to visualize?

narrow down

Client event collection

Engineers & Data Scientists

client : page : section : component : element : actionInteractions search box => filter

Page 45: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Client event hierarchy

iphone home -

- - impression

tweet tweet click

iphone:home:-:-:-:impressioniphone:home:-:tweet:tweet:click

Page 46: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Detect changes

iphone home -

- - impression

tweet tweet click

iphone home -

- - impression

tweet tweet click

TODAY

7 DAYS AGO

compared to

Page 47: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Calculate changes

+5% +5% +5%

+10% +10% +10%

-5% -5% -5%

DIFF

Page 48: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Display changes

iphone home -

- - impression

tweet tweet click

Map of the Market [Wattenberg 1999], StemView [Guerra-Gomez et al. 2013]

Page 49: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Display changes

home -

- - impression

tweet tweet click

iphone

Page 50: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Demo Scribe Radar

Page 51: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Twitter for Banana

Page 52: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter
Page 53: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Since Dec 2013

• 500 unique users, 10 users / day

!

• No training

Deployment

Page 54: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Users: PMs, Data Scientists, Engineers

• Search

• Monitor

• See effects after major product launch

Use cases

read the paper :)

Page 55: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Part II Analysis

Page 56: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Count page visits

banana : home : - : - : - : impressionhome page

Page 57: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel

home page

profile page

Page 58: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression

1 jobhome page

profile page

1 hour

Page 59: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression banana : search : - : - : - : impression

home page

profile page search page

2 jobs2 hours

Page 60: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Funnel analysis

banana : home : - : - : - : impression

banana : profile : - : - : - : impression banana : search : - : - : - : impression

home page

profile page search page

Specify all funnels manually!

n jobsn hours

Page 61: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Goal

banana : home : - : - : - : impression

… ……

1 job => all funnels, visualized

home page

Page 62: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Visualize an overview of event sequences

!

Related work

[Wongsuphasawat et al. 2011, Monroe et al. 2013, …]

Page 63: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Visualize an overview of event sequences

!

• Big data? eBay checkout sequences

!

One funnel at a time Checkout > Payment > Confirm > Success

Related work

[Wongsuphasawat et al. 2011, Monroe et al. 2013, …]

[Shen et al. 2013]

Page 64: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

LifeFlow [CHI2011]

!

(simplified)

Page 65: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

User sessionsSession#1

A

B

start

end

Session#4

start

end

A

Session#2

B

start

end

A

Session#3

C

start

end

A

Page 66: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate4 sessions

A

BB C

start

end endend

A A

end

A

Page 67: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

A

BB C

start

end endend

end

4 sessions

Page 68: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

end

A

B

4 sessions

Page 69: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

end

A

B

4 sessions

Page 70: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

end endend

A

B end

4 sessions

Page 71: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

endend

A

B end

4 sessions

Page 72: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

C

start

endend

A

B end

4 sessions

Page 73: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

start

endend

A

CB end

4 sessions

Page 74: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Aggregate

4,000,000 sessions

endend

A

CB end

start

Page 75: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

try with sample data (~millions sessions, 10,000+ event types)

!

original paper (100,000 sessions, ~10 event types)

Page 76: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

not meaningful !

small slice of data but huge file

Page 77: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

How to make it work?

Page 78: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

# of unique sequences

Page 79: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

Page 80: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select

tweet sign up log out

Page 81: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select

tweet sign up log out

Page 82: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

Reduce # of unique sequences

10,000 types select merge

tweet from home timeline tweet from search page tweet …

= tweet

Page 83: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

Page 84: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

session

1000 events

Page 85: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

session

10 events after (window size & direction)

1000 events

visit home page (alignment)

Page 86: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

Reduce # of unique sequences

Ask users for input}

Page 87: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Reduce event types

2. Reduce sequence length

3. More aggregation on Hadoop

Reduce # of unique sequences

Ask users for input}

Page 88: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Collapse eventsSequence ABBBCCCC ABBCC ABC ABCCCC ABCD ABCCCD ABCCE ABCDF ABCDG ABCDH

e.g. tweet, tweet, tweet, … = tweet

Page 89: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Sequence ABC ABC ABC ABC ABCD ABCD ABCE ABCDF ABCDG ABCDH

Collapse events

Page 90: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDF ABCDG ABCDH …

Count 2000 80 20 1 1 1 …

Page 91: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDF ABCDG ABCDH ABCDI ABCDJK ABCDJL

Count 2000 80 20 1 1 1 1 1 1

rare sequences (count < threshold)

Page 92: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

TruncateSequence ABC ABCD ABCE ABCDx ABCDx ABCDx ABCDx ABCDJx ABCDJx

Count 2000 80 20 1 1 1 1 1 1

Replace last event with x (…)

Page 93: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Sequence ABC ABCD ABCE ABCDx ABCDJx

Count 2000 80 20 4 2

Group & Count

Page 94: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Truncate moreSequence ABC ABCD ABCE ABCDx ABCDx

Count 2000 80 20 4 2

Page 95: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Group & CountSequence ABC ABCD ABCE ABCDx

Count 2000 80 20 6

Page 96: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

1. Define set of events

2. Pick alignment, direction and window size

3. Run Hadoop job (with more aggregation)

4. Wait for it… (2+ hrs)

5. Visualize

Final process

~100,000 patterns (10MB)

gazillion patterns (TBs)

Page 97: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Demo Flying Sessions

Page 98: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Since Jan 2013

• Fewer users, but more in-depth ad-hoc analysis

• Initial meeting to provide support

Deployment

Page 99: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Case studies

Page 100: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Case studies

click on “sign up”

fill personal info

import address book

etc.

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Page 101: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• What did users do when they visit Twitter? (in demo)

• Where did users give up in the sign up process?

• more in the paper

Case studies

read the paper :)

Page 102: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

Conclusions & Future work

Page 103: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

Conclusions & Future work

Page 104: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Conclusions & Future workChallenge

big data

small data

visualize & interact

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

aggregate & sacrifice

Page 105: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

• Generalize to smaller systems

Conclusions & Future workChallenge

big data

small data

visualize & interact

aggregate & sacrifice

Page 106: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Data Scientists & Engineers @Twitter — Linus Lee, Chuang Liu

• Feedback from reviewers, Ben Shneiderman & Catherine Plaisant

Acknowledgement

Page 107: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Twitter is looking for data analyst.

(contractor)

Page 108: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

• Large-scale User Activity Logs + Visual Analytics

• Find, Monitor & Explore + Anomaly detection & automatic alert

• Funnel Analysis + More interactivity & data / reduce wait time / latency study?

• Used in day-to-day operations at Twitter

• Generalize to smaller systems

Conclusions & Future workChallenge

big data

small data

visualize & interact

[email protected] / @kristw

aggregate & sacrifice

Page 109: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Questions?

Page 110: Using Visualizations to Monitor Changes and Harvest Insights from a Global-scale Logging Infrastructure at Twitter

Thank you