Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu...

27
Twitter rank—finding topic-sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International Conference on Web Search and Data Mining (WSDM 2010)

Transcript of Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu...

Page 1: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Twitter rank—finding topic-sensitive influential twitters

Singapore Management University

Jianshu WENG

Ee Peng LIM

Jing JIANG

Qi He

ACM International Conference on Web Search and Data Mining (WSDM 2010)

Page 2: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Outline

• Problem

• Dataset

• Twitter rank

• Results

• Recommended paper

Page 3: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Problem

• Identifying influential users of micro-blogging services

How?

Page 4: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Frequently used algorithmsName Algorithm Measures the influence with Problems

InD In-degreenumber of followers does not accurately capture the

notion of influence

PR PageRank

link structure of the network ignores the interests of twitterers, which affects the way twitterers influence one another

TSPR Topic-sensitive pagerank

calculating PageRank vector for each topic.

propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities

TR Twitterrank Link structure + Topic similarity between users

Page 5: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Problem

• Identifying influential users of micro-blogging services

Topic similarity between users Link structure Twitter rank

Homophily

Page 6: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Why the problem important?

Page 7: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Why the problem important?

• Brings order to the real-time web in that it allows the search results to be sorted by the authority/influence of the contributing twitterers giving a timely update of the thoughts of influential twitterers

• A marketing platform. Targeting those influential users will increase the e ciency of the ffimarketing campaign . For example, a hand phone manufacturer can engage those twitterers influential in topics about IT gadgets to potentially influence more people.

• There are also applications that utilize Twitter to gather opinions and information on particular topics. Identifying influential twitterers for interesting topics can improve the quality of the opinions gathered.

……

Page 8: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Context _ twitter

• Friend tweet

• Follower following

friend follower Tweet(<140words)

following

Page 9: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Context _ dataset

Top-1000 Singapore

Page 10: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Context _ dataset

Resource Size

Top-1000 Core 996

Extended twitterers

All the followers and friends of each individual twitterer

6,748

Tweets All the tweets they had published so far 1,021,039

Page 11: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Context _ dataset

• Power-law distribution

5686 >1 tweetsMean=179.57

Among the 6745 twitterers, 957 have no friends, while 1782 have no followers

Page 12: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Context _ dataset• Reciprocity

• Too casual Strong indicator of the similarity among users

72.4% 80%

following

80.5% 80%

following

Homophily? How to prove?

Page 13: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Twitterrank

Homophily

Page 14: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Topic distillation

• LDA(Latent Dirichlet Allocation)

p( word | tweet ) = p( word | topic )*p( topic | tweet )

tweet topic tweet

word word topic

DT: the probability that twitterer si is interested in topic tj .

Automatically identify the topics that twitterers are interested in based on the tweets they published.

WT

Page 15: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Homophily

Page 16: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Homophily

• Hypothesis testing

Case1: >30 friendsCase2: <30 friends

Most of the twitterers (3785/4050) have less than 30 friends, which is not statistically significant. Therefore, two cases are considered. Why?

11505 pairs of twitterers with reciprocal “following” relationship.

Page 17: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Twitterrank

Graph D(V,E)

• Vertex: twitterers

• Edge: “following” relationship

ransition matrix for topic t

Page 18: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Twitterrank

• Jump why jump?

It is possible that some twitterers would “follow” one another in a looping manner without “following” other twitterers outside the loop. Such loop will accumulate high influence without distribute their influence. To tackle this, a teleportation vector E t is also introduced, which basically captures the probability that the random surfer would “jump”to some twitterers instead of following the edges of the graph D

Page 19: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Twitterrank

1. probabilities of different topics’ presence WT2. probabilities that a particular twitterer is interested in different topics DT

Page 20: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Results

• Influential twitterers identified in the Twitter dataset

Page 21: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Compared algorithmsName Algorithm Measures the influence with Problems

InD In-degreenumber of followers does not accurately capture the

notion of influence

PR PageRank

link structure of the network ignores the interests of twitterers, which affects the way twitterers influence one another

TSPR Topic-sensitive pagerank

calculating PageRank vector for each topic.

propagates a twitterer’s influence in one topic to her friends in different topics with equal probabilities

TR Twitterrank Link structure + Topic similarity between users

Page 22: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Results

Page 23: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

Application _ recommendation task

• Randomly choose | L | existing “following” relationship formed among twitterers

A

B

A

B 9

8

2

3

1

5

6

7

104

Page 24: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.

A recommended paper

• Java A, Song X, Finin T, et al. Why we twitter: understanding microblogging usage and communities[C]//Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis. ACM, 2007: 56-65.

• 引用: 2126次

Main work:• People talk about their daily activities and to seek or share information.

• Analyze the user intentions associated at a community level and show how users with similar intentions connect with each other.

Page 25: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.
Page 26: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.
Page 27: Twitter rank—finding topic- sensitive influential twitters Singapore Management University Jianshu WENG Ee Peng LIM Jing JIANG Qi He ACM International.