Magnet community identification on social networks

45
MAGNET COMMUNITY IDENTIfiCATION ON SOCIAL NETWORKS University of Illinois at Chicago, USA

Transcript of Magnet community identification on social networks

Page 1: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIfiCATION ON

SOCIAL NETWORKS

University of Illinois at Chicago, USA

Page 2: Magnet community identification on social networks

ABSTRACT

A magnet community is such a community that attracts significantly more people’s interests and attentions than other communities of similar topics.

the study of magnet community identificationproblem.

1. We observe several properties of magnet communities.

2. We formalize these properties with the combination of community feature extraction into a graph ranking formulation.

Page 3: Magnet community identification on social networks

INTRODUCTION

Magnet communities are such communities

that draw significantly more attention than

others even if they are all about the same topic.

Page 4: Magnet community identification on social networks

INTRODUCTION

Importance:

1. help people understand the trends of their

domains.

2. help people make decisions when joining

communities.

Our goal :

Given communities in a domain, we want to rank

them based on their attractiveness to people among

the communities of that domain.

In the end, the top ranked communities are the ones

people tend to adhere to.

Page 5: Magnet community identification on social networks

INTRODUCTION

Page 6: Magnet community identification on social networks

INTRODUCTION

Challenges:

1. how to extract features from these heterogeneous sources of impacting factors of a community’s attractiveness.

2. how to combine all heterogeneous information into a unified ranking model.

3. Noise handling.

common properties:

attention flow

Attention quality

persistence of people’s attention

Page 7: Magnet community identification on social networks

contributions

A new direction on social network analysis,

namely

magnet community identification.

One definition of magnet communities by

identifying their properties.

We demonstrate the effectiveness of our

framework on a particular domain of magnet

community identification, namely company’s

employee magnet community identification.

Page 8: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 9: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 10: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK Attractiveness computation framework

M =( m1 , m2 ,..., mk )

M = f ( FV , FE , M )

M∗

Our objective function:

Page 11: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK Attractiveness features

Standalone features

Attention migrating matrix as dependency

features

• an attention migrating matrix:D=(dij)k*k

• The attention vector, A =( ai )k∗1 = D · e

• Dependency features of communities:

Page 12: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK Concrete formula of magnet community

ranking framework

At least one of the following conditions hold

Page 13: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 14: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 15: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 16: Magnet community identification on social networks

MAGNET COMMUNITY

IDENTIFICA-

TION FRAMEWORK

Page 17: Magnet community identification on social networks

EVALUATION

Data collection and features extraction

Data collection

• www.linkedin.com

• Standalone features : a company’s revenue per

employee, industry, location, age

• 39527 companies’ information in 142 industries

Page 18: Magnet community identification on social networks

EVALUATION

Feature extraction

• industry – count how many people flow into it andout of it, using company level departure and arrival data.

• Locations -- popularities

• Founded year feature -- the number of companies founded for each year.

Ranking performance

Baseline Description

• PageRank

• IT and financial

• The 2011 ideal employer ranking proposed by Universumglobalthe 2011 most admired company ranking by Fortune

Page 19: Magnet community identification on social networks

EVALUATION

Case studies --- IT

Page 20: Magnet community identification on social networks

EVALUATION

Case studies --- Financial

Page 21: Magnet community identification on social networks

EVALUATION

Overall Correctness measures

Page 22: Magnet community identification on social networks

EVALUATION

Parameter sensitivity

two parameters α and μ

Page 23: Magnet community identification on social networks

CONCLUSION

提出了magnet community identification 的研究方向

对问题定义和举例、研究意义、挑战、目标、

传统思路的缺陷等有非常充分的说明

算法思路清晰,提出了三个特性,量化成目标

函数

Page 24: Magnet community identification on social networks

JOINT TOPIC MODELING

FOR EVENT

SUMMARIZATION

ACROSS

NEWS AND SOCIAL

MEDIA STREAMS

Qatar Computing Research Institute

Qatar Foundation Doha, Qatar

Page 25: Magnet community identification on social networks

ABSTRACT

a novel unsupervised approach based on topic modeling to summarize trending subjects by jointly discovering the representative and complementary information from news and tweets.

1. topic modeling formalism by combining a two-dimensional topic-aspect model and a cross-collection approach in the multi-document summarization literature.

2. co-ranking the news sentences and tweets in both sides.

Page 26: Magnet community identification on social networks

INTRODUCTION

News -- well-crafted, fact-oriented long stories

written by professionals based on the latest

past events

Tweets – personalized, more opinionated free-

style short messages posted by the average

persons in real time.

Page 27: Magnet community identification on social networks

INTRODUCTION

contributions

A novel problem of generating complementary summaries

a principled measure to assess the extent ofsentence-level complementarity

A topic modeling approach called cross-collectiontopic-aspect model (ccTAM) that combines ccLDA and topic-aspect mixture model for precisely estimating the proposed complementary measure.

a gold-standard dataset of complementary summaries

Page 28: Magnet community identification on social networks

PROBLEM DEFINITION

Page 29: Magnet community identification on social networks

PROBLEM DEFINITION

Page 30: Magnet community identification on social networks

LEARNING COMPLEMENTARY

RELATION

commonality and difference

general model and media-specific model.

Page 31: Magnet community identification on social networks

LEARNING COMPLEMENTARY

RELATION

Measuring Commonality and Difference

Page 32: Magnet community identification on social networks

LEARNING COMPLEMENTARY

RELATION

Cross-collection Topic-Aspect Model (ccTAM)

Page 33: Magnet community identification on social networks
Page 34: Magnet community identification on social networks
Page 35: Magnet community identification on social networks

Inference

Infer the general topic-word distribution φ z

Page 36: Magnet community identification on social networks

Inference

The collection-specific topic-word distribution

φcz

Page 37: Magnet community identification on social networks

GENERATE COMPLEMENTARY

SUMMARIES

G =( N ∪ T, E )

N = { n1 ,n2, ··· ,nmn}, T = { t1 ,t2 , ··· ,tnt

}

E = { ( p ( ni | tj ) ,p ( tj | ni )) | i =1 , ··· ,mn ; j

=1 , ··· ,nt } is the set of directed edges

between two sets of nodes whose values are

node-to-node jumping probabilities.

Page 38: Magnet community identification on social networks

GENERATE COMPLEMENTARY

SUMMARIES

Jumping Probability

Page 39: Magnet community identification on social networks

GENERATE COMPLEMENTARY

SUMMARIES

Sentences/Tweets Co-ranking

Summary Generation

Summary-level complementarity

Sentence-level complementarity

Page 40: Magnet community identification on social networks

EXPERIMENTS AND

RESULTS

Data Collecting

Page 41: Magnet community identification on social networks

EXPERIMENTS AND

RESULTS

gold-standard summaries

The news summaries: English Wikipedia and

Wikinews

Tweets summaries

Baseline Methods

BL-0:LexRank

BL-1: KL-divergence(KLD)

BL-2: Cosine and language modeling(LM)

BL-3:LexRank+Complementarity(LexComp)

Page 42: Magnet community identification on social networks

EXPERIMENTS AND

RESULTS

Results and Discussions

Page 43: Magnet community identification on social networks

EXPERIMENTS AND

RESULTS

Results and Discussions

Page 44: Magnet community identification on social networks

EXPERIMENTS AND

RESULTS

Example of output summaries

Page 45: Magnet community identification on social networks

CONCLUSIONS

提出了用tweets补充news summarization的想法

提出了补充度的概念,并介绍了一种度量

tweets补充度的方法。

算法结合了很多已有模型,比如topic-aspect

model, cross-collection topic model, random

walk model