Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao...

22
Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa Barbara [email protected]

Transcript of Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao...

Page 1: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Wisdom In The Social Crowd: An Analysis Of Quora

Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao

University of California at Santa [email protected]

Page 2: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

• Systems to answer user questions on the Internet• Google - general information• Wikipedia - factual knowledge

• But we often have questions that require…• Domain-specific knowledge• First-hand life experiences

2

Asking Questions on the Internet

Q: What is the most interesting souvenir you can buy in Rio?

Q: What is the population of Rio?

Page 3: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Online Q&A Services Today• Question and Answer (Q&A) sites • Web services where people ask and answer

questions• A crowd-sourced way to search information

• Large online knowledge repositories

• As the Q&A systems grow to massive scales… • More difficult for users to locate useful answers

or interesting questions• Low-value questions (spam) overwhelm the system

3

• 300+ Million questions• 1+ Billion answers

• 3.5+ Million questions• 6.8+ Million answers

Page 4: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Quora - Social Q&A

4

• “Hottest” (most successful) today• First social network based Q&A• 350% traffic growth in 2012 • Many answers are returned

as top answers to Google queries

• Quora’s advantages• High-quality questions and answers• True domain experts participation politicians, actors, startup founders, etc.

How does Quora’s internal structures contribute to its success?

Page 5: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

A Measurement Study of Quora• Limited understanding of Quora• Size of site (questions, users), growth rate• Mechanisms for content discovery, quality control

• Questions we asked in our study• How does Quora grow over time?• What’s the impact of social graph on Q&A activities? • How does Quora direct users to the valuable

content?Match experts w/ questions, and seekers w/ answers

5

Page 6: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Outline

• Introduction

• Characterizing Quora

• Analyzing Graph Structures

• Implications

6

Page 7: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

7

A Typical Question Page

Votes

Related QuestionsTopics

Question

Answer

Page 8: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Graphs, Graphs, More Graphs• User-topic graph: user following topics• Social graph: user following other users • Related question graph: connecting related

questions

8

Topics

QQQ

Q

Q

A A A

Page 9: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

• Crawling Quora• Snowball-crawled related question graph (August

2012)• Obtained the largest connected component• Slow speed, minor impact to the site

• Using the dataset of StackOverflow as a comparison

Data CollectionWebsite Data

SinceTotal

Questions

TotalTopics

TotalUsers

TotalAnswe

rs

Question

Coverage

Quora Oct. 2009

437K 56K 264K 979K 58%

StackOverflow

Jun. 2008

3.45M 22K 1.3M 6.86M 100%

9

Page 10: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Growth Over Time

10

Nu

mb

er

of

Quest

ion

s

0 5 10 15 20 25 30 35 40 45 50100

1,000

10,000

100,000

1,000,000

10,000,000

Stack Overflow

Quora (Total)

Quora (Crawled)

2008/7 2009/5 2010/3 2011/1 2011/11 2012/7

Similar growth trend with StackOverflow

761K

437K (58%)

Total # of questions

estimated by Qid

Page 11: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Outline

• Introduction

• Characterizing Quora

• Analyzing Graph Structures• Social Graph• Related Question Graph

• Implications

11

Details on User-Topic Graph in the

paper!

Page 12: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

How do social ties impact Q&A activities?

12

Page 13: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Social Graph Structure

13

1 10 100 1000 10000 1000000.001

0.01

0.1

1

10

100

FollowersFollowees

Social Degree

CC

DF (

%)

• Users can follow other users to build social connections• Asymmetric social graph• Users receive items in their newsfeed from people they

follow

Social degree has power-law distribution

Page 14: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Is the Social Graph Meaningful?

14

1 10 100

1,00

0

10,0

00

100,

000

1

10

100

1,000

10,000

100,000

User Received VotesFollow

ers

Per

User

(Avera

ge)

1 10 100

1,00

0

10,0

0010

100

1,000

10,000

100,000

User Answers

Follow

ers

Per

User

(Avera

ge)

• Correlation between user’s # of followers and• # of total answers the user wrote• # of votes the user ever received

• More answers or high-quality answers == more followers

• Social structure could indicate content quality

Page 15: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

0 2 4 6 8 10 12 14 16 18 200

20

40

60

80

100

Normal Users

Super Users

Answers Per Question

CD

F (

%)

of

Qu

esti

on

s

Using Social Ties to Attract Answers • Would social ties help to attract answers?• Defining “super-users”• Top 5% users sorted by # of followers

150.001 0.01 0.1 10

20

40

60

80

100

Normal Users

Answers Per Question(Normalized by #Followers)

CD

F (

%)

of

Qu

esti

on

s

Social ties have no effect on attracting answers

Page 16: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

How does Quora direct users to

“interesting” questions?

16

Page 17: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Related Question Graph

171 10 100 1000

1

10

100

Question Degree

CC

DF (

%)

QQ

QQ• Related question feature

• Allows users to browse a series of related questions

• Related question graph• Questions as nodes, edges indicates “related”

relationships

• Graph properties• Power-law structure• A small set of “core” questions inside each topic

Page 18: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

0 5 10 15 20 25 30 350

1000

2000

3000

4000

5000

1

2

3

4

5

6

7

ViewsAn-swers

Questions Bucketized By Degree

Avera

ge N

um

ber

of

Vie

ws

Avera

ge N

um

ber

of

An

sw

ers

Impact of Question Degree

18

• Strong correlation between question degree and user’s attention on the question

• Question graph drives users to “core” questions

Page 19: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

User Attention on Similar Questions

19

• Similar questions in Quora• Questions around very close (same) subjects• Redundant questions asked by different users

• Do users pay equal attention to similar questions?

• Locating similar questions by partitioning question graph• METIS, produce clusters, each contains similar

questionsQ Q

Q

QQ

QQ

Q

Q

Q

Page 20: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Equal Attention on Similar Questions?

20

• Is user attention evenly distributed in each cluster?• Gini coefficient (G): evaluate the uniformity of

distribution• G=0: perfect equality• G~1: extremely skewed distribution

0 50 1000

40

80

% of Questions

% o

f Tota

l V

iew

s

0 50 1000

40

80

% of Questions

% o

f Tota

l V

iew

s

0 40 800

40

80

% of Questions

% o

f Tota

l V

iew

s

• User attention is highly skewed in each cluster

• Excellent! users are not distracted by similar questions

G=0 G=0.4 G=0.9

Page 21: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Implication and Conclusion• Implication for crowdsourcing content sites • Q&A sites• Users attention is “skewed” to top questions• Avoid distraction, encourage contribution

• Other sites such as Yelp, TripAdvisor• Drive enough reviews to key venues• Ensure reliable rating

• The first large-scale measurement study on Quora

• Graph structures contribute to effective content discovery• Social graph indicates content quality• Question graph focuses user attention

21

Page 22: Wisdom In The Social Crowd: An Analysis Of Quora Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng and Ben Y. Zhao University of California at Santa.

Thank you!

Questions?

22