Reddit Churn Analysis

12
Reddit Churn Analysis Boaz Gurdin Galvanize Data Science Immersive August 2015

Transcript of Reddit Churn Analysis

Reddit Churn AnalysisBoaz Gurdin

Galvanize Data Science Immersive

August 2015

$ Acquisition >

$ Retention

What factors predict whether a new user will post again?

comments

1.7Buncompressed

1TB

Pipeline

Filter'to/r/Politics

Pivot'comments'to'users

Logistic'regression

SQL

FeaturesBody of First Comment

Responses to First Comment

Time ofFirst Comment

word_countlong_postis_response(sentiment)

responses_totalhas_long_responsehas_short_responseresponses_avg_word_ctresponses_ups_avg(ups)(has_ups)(downs)(has_downs)

datetime_of_dayday_of_weekday_of_year

() = Dropped due to data quality issues

FeaturesBody of First Comment

Responses to First Comment

Time ofFirst Comment

word_countlong_postis_response(sentiment)

responses_totalhas_long_responsehas_short_responseresponses_avg_word_ctresponses_ups_avg(ups)(has_ups)(downs)(has_downs)

datetime_of_dayday_of_weekday_of_year

() = Dropped due to data quality issues

Included in final model

=

Receiving a response is the strongest predictor of commenting again

Users whose first comment in /r/politics received at least one response had 75% better odds of commenting again compared to users whose first comment received no response

75% ꜛ better odds of commenting again

Receiving a response is the strongest predictor of commenting again

Users whose first comment in /r/politics received at least one response had 75% better odds of commenting again compared to users whose first comment received no response

75% ꜛ52%'ꜛResponse <20 words

103%'ꜛResponse >20 words

better odds of commenting again

Recommendation: A/B Tests

Moderators respond to first-time commenters

Recommend active users to respond to new users’ first comments

Product Levers

Operational Levers

Thanks!Boaz [email protected]

LinkedIn: BoazGTwitter: @BoazGurdinSlideShare: BoazGurdinGitHub: BoazGurdin