Why Watching Movie Tweets Won’t Tell the Whole Story?

Post on 24-Feb-2016

61 views 0 download

description

Why Watching Movie Tweets Won’t Tell the Whole Story?. Felix Ming-Fai Wong, Soumya Sen , Mung Chiang EE, Princeton University. WOSN’12. August 17, Helsinki. Twitter: A Gold Mine?. Apps: Mombo , TwitCritics , Fflick. Or Too Good to be True?. Representativeness???. - PowerPoint PPT Presentation

Transcript of Why Watching Movie Tweets Won’t Tell the Whole Story?

Why Watching Movie Tweets Won’t Tell the Whole Story?

Felix Ming-Fai Wong, Soumya Sen, Mung Chiang

EE, Princeton University

1

WOSN’12. August 17, Helsinki.

Twitter: A Gold Mine?

2

Apps: Mombo, TwitCritics, Fflick

Or Too Good to be True?

3

Representativeness???

Why representativeness?

Selection bias Gender Age Education Computer skills Ethnicity Interests

The silent majority

4

Why Movies?

Large online interest

Relatively unambiguous

Right in Timing: Oscars

5

Key Questions

Is there a bias in Twitter movie ratings? How do Twitters compare to other online site? Is there a quantifiable difference across types of

movies? Oscar nominated vs. non-nominated commercial movies

Can hype ensure Box-office gains?

6

Data Stats

12 Million Tweets 1.77M valid movie tweets

February 2 – March 12, 2012 34 movie titles

New releases (20) Oscar-nominees (14)

7

Example MoviesCommercial Movies (non-nominees) Oscar-nomineesThe Grey The DescendantsUnderworld: Awakening The ArtistContraband The HelpHaywire HugoThe Woman in Black Midnight in ParisChronicle MoneyballThe Vow The Tree of LifeJourney 2: The Mysterious Island War Horse

8

Oscar-nominees for Best Picture or Best Animated Film

Rating Comparison

Twitter movie ratings Binary Classification: Positivity/Negativity

IMDb, Rotten Tomatoes ratings Rating scale: 0 – 10, 0 – 5 Benchmark definition

Rotten Tomatoes: Positive - 3.5/5 IMDb: Positive: 7/10 Movie score: weighted average High mutual information

9

Challenges Common Words in titles

e.g., “Thanks for the help”, “the grey sky”

No API support for exact matching e.g., “a grey cat in the box”

Misrepresentations e.g., “underworld awakening”

Non-standard vocabulary e.g., “sick movie”

Non-English tweets Retweets: approves original opinion

10

Tweet Classification

11

Steps

Preprocessing Remove usernames Conversion:

Punctuation marks (e.g., “!” to a meta-word “exclmark”) Emoticons (e,g,, or :), etc. ), URLs, “@”

Feature Vector Conversion Preprocessed tweet to binary feature vector (using MALLET)

Training & Classification 11K non-repeated, randomly sampled tweets manually labeled Trained three classifiers

12

Preprocessing Training Data Classification Analysis

Classifier Comparison

SVM based classifier performance is significantly better Numbers in brackets are balanced accuracy rates

13

Temporal Context

14

Woman in Black Chronicle The Vow Safe House

Most “current” tweets are “mentions” LBS

“Before” or “After” tweets are much more positive

Fraction of tweets in joint-classes

Sentiment: Positive Bias

15

Woman in Black Chronicle The Vow Safe House Haywire Star Wars I

Twitters are overwhelmingly more positive for almost all movies

Rotten Tomatoes vs. IMDb?

Good match between RT and IMDb (Benchmark)16

Twitters vs. IMDb vs. RT ratings (1)

New (non-nominated) movies score more positively from Twitter users

17

IMDb vs. Twitter RT vs. Twitter

Twitters vs. IMDb vs. RT ratings (2)

Oscar-nominees rate more positively on IMDb and RT

18

IMDb vs. Twitter RT vs. Twitter

Quantification Metrics (1)

(x*, y*) are the medians of proportion of positive ratings in Twitter and IMDb/RT

19

B

P

Positiveness (P):

Bias (B):IM

Db/R

T sc

ore

Twitter score

x*

y*

1

1

Quantification Metrics (2)

20

Inferrability (I):

If one knows the average rating on Twitter, how well can he/she predict the ratings on IMDb/RT?

Summary Metrics

21

Oscar-nominees have higher positiveness RT-IMDb have high mutual inferrability and low bias Twitter users are more positively biased for non-

nominated new releases

Hype-approval vs. Ratings

22

Higher (lower) Hype-approval ≠ Higher (lower) IMDb/RT ratings

The VowThe Vow

This Means WarThis Means War

Hype-approval vs. Box-office

High Hype + High IMDb rating: Box-office success Other combination: Unpredictable

23

Journey 2

Summary

Tweeters aren’t representative enough Need to be careful about tweets as online polls Tweeters’ tend to appear more positive, have specific

interests and taste Need for quantitative metrics

Thanks!

24