Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf ·...

38
Health Misinformation in Search and Social Media Presented by: Amira Ghenai PhD Student. Cheriton School of Computer Science Supervisors: Charles L. A. Clarke, Mark D. Smucker 8/7/17

Transcript of Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf ·...

Page 1: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Health Misinformation in Search and Social Media

Presented by: Amira Ghenai

PhD Student. Cheriton School of Computer Science

Supervisors: Charles L. A. Clarke, Mark D. Smucker

8/7/17

Page 2: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Snopes: http://archive.is/bHuhe#40%Original URL: http://healtheternally.com/1562/dandelion-weed-can-boost-your-immune-system-and-cure-cancer/

Page 3: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Snopes: http://archive.is/bHuhe#40%Original URL: http://healtheternally.com/1562/dandelion-weed-can-boost-your-immune-system-and-cure-cancer/

Page 4: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in
Page 5: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Clinical trial for potential cancer-fighting using common weed

‘Snopes’ fact checking!

‘I'm living proof it works'

Page 6: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

‘I'm living proof it works'

Clinical trial for potential cancer-fighting using common weed

‘Snopes’ fact checking!

Page 7: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in
Page 8: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

PROBLEM DEFINITION§ How does online health misinformation in web search and

social media effect people’s health?

§ Misinformation: a piece of information spreading in the web confirmed to be false by reliable sources

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 8

Page 9: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

OUTLINE§ Background

§ Research Methodology

§ Current progress§ Web Search

§ Research Question

§ Experiment

§ Results

§ Social Media

§ Research Question

§ Dataset & Classification Task

§ Results

§ Future Research Plan

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 9

Page 10: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

BACKGROUND§ [White et al, TOIS 2015] found that web search engines have

an uncontrolled bias towards medical treatments “help’’§ People are biased towards “help’’ belief

§ [Dredze et al, NCBI 2016] analyzed misleading theories about Zika vaccination in Twitter § Observed the effect of vaccine-skeptic communities over other users’

vaccination opinion

§ People hold the wrong beliefs even before the vaccine is released

§ The Zika vaccine misconceptions are more influential because there were existing claims about vaccine

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 10

Page 11: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web Search§ Measure the influence of search

engine results on people health care decisions

§ Method: Controlled lab studies

§ Goal:

§ Understand how people use web online content in health search

§ Develop better search engines to support people’s health decision making process

Social Media§ Analyze the effect of health

misinformation in social media on people’s behavior

§ Method: Observational studies

§ Goal:

§ Automatically Detect/Track health rumors

§ Online behavior: sharing, spreading more information

§ Offline behavior: anxiety level, event/outcome/personal experience

RESEARCH METHODOLOGY

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 11

Page 12: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

CURRENT PROGRESS1. The Positive and Negative Influence of Search

Results on People’s Decisions about the Efficacy of Medical Treatments. Frances Pogacar, Amira Ghenai, Mark D. Smucker, Charles L. A. Clarke, 2017, October. In Proceedings of the 3rd ACM International Conference on the Theory of Information Retrieval (ICTIR17). Amsterdam

2. Catching Zika Fever: Tracking Health Misinformation in Twitter. Amira Ghenai, Yelena Mejova, 2017, January. In the Fifth IEEE International Conference on Healthcare Informatics (ICHI17), Park City, Utah

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 12

Page 13: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Measure the influence of health misinformation in search results for 10 medical treatments on people’s decisions

§ Influence of correct/incorrect bias on the decision about the efficacy of the treatment for medical condition

§ Influence of rank on the decision about the efficacy of the treatment for medical condition

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 13

RESEARCH QUESTION

Page 14: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 14

EXPERIMENT§ 60 participants in the experimental user study

§ Participants were told to pretend to be searching for the answer to a question about the effectiveness of a treatment for a health issue

§ Participants had to classify the medical treatments as helpful, inconclusive, or unhelpful

§ They either received a search engine result page, or the control condition, with no SERP

Page 15: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 15

EXPERIMENTAL CONDITIONSTopmost Correct Rank Search Result Bias

§ 8:2 ratio of results

§ 8 correct, 2 incorrect

§ 2 correct, 8 incorrect

§ Always had a correct result at rank 1 or rank 3

§ Remaining correct results were placed randomly in the lower ranks.

Page 16: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Submit Answer

Page 17: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Submit Answer

“Does X help Y?”

Page 18: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Definitions of the treatment and health issue

Submit Answer

“Does X help Y?”

Page 19: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Definitions of the treatment and health issue

Clickable link, to take to document page

Submit Answer

“Does X help Y?”

Page 20: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Definitions of the treatment and health issue

Document title, snippet, url

Clickable link, to take to document page

Submit Answer

“Does X help Y?”

Page 21: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Definitions of the treatment and health issue

Instructions & classifications

Document title, snippet, url

Clickable link, to take to document page

Submit Answer

“Does X help Y?”

Page 22: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Results biased towards incorrect information reduced people’s accuracy from 43% to 23%

§ Results biased towards correct information increased accuracy from 43% to 65%.

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 22

RESULTS - ACCURACY

Page 23: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Top most rank of a correct result appears to have some effect on people’s accuracy

§ When biased towards correct, the accuracy was 59% if the correct result was at rank 3 (incorrect at rank 1&2) compared to 70% accuracy when the rank 1 item was correct

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 23

RESULTS - RANK

Page 24: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Self-reported knowledge reduces the effect of incorrect information on accuracy (p= 0.04)

§ Like [White and Hassan, TWEB 2014] we found that participants are biased towards saying treatment are helpful

Web search Social Media

The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments, F. Pogacar, A. Ghenai, M. Smucker, and C. Clarke, ICTIR 2017

PAGE 24

RESULTS - KNOWLEDGE

Page 25: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

• Can we automatically detect tweets containing rumors about a health condition?

• Understand the behavior of rumor-related topics in social media

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 25

RESEARCH QUESTION

Page 26: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ 13 million tweets regarding the Zika outbreak from January 13 to August 22, 2016

§ 6 Zika related rumors posted by WHO

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 26

DATASET2016−01−13

2016−01−20

2016−01−27

2016−02−03

2016−02−11

2016−02−18

2016−02−25

2016−03−03

2016−03−10

2016−03−17

2016−03−24

2016−03−31

2016−04−07

2016−04−14

2016−04−21

2016−04−28

2016−05−05

2016−05−12

2016−05−19

2016−05−26

2016−06−02

2016−06−09

2016−06−16

2016−06−23

2016−06−30

2016−07−07

2016−07−14

2016−07−21

2016−07−28

2016−08−04

2016−08−11

2016−08−18

0100000

200000

300000

400000 Other

SpanishPortugueseEnglish

Page 27: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 27

RESULTS – RUMOR OR CLARIFICATION?

Page 28: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 28

RESULTS – RUMOR OR CLARIFICATION? R1: Zika virus is

linked to genetically modified mosquitoes

Page 29: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 29

RESULTS – RUMOR OR CLARIFICATION? R1: Zika virus is

linked to genetically modified mosquitoes

R5: Americans are immune to Zikavirus

Page 30: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 30

RESULTS – RUMOR OR CLARIFICATION?

R2: Zika virus symptoms are similar to seasonal flu

Page 31: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 31

RESULTS – RUMOR OR CLARIFICATION?

R2: Zika virus symptoms are similar to seasonal flu

R6: Coffee as mosquito- repellent to protect against Zika

Page 32: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 32

CLASSIFICATION TASK§ A total of 48 features grouped into five categories

§ Twitter features

§ Sentiment features

§ Linguistic features: characterize different linguistic styles in Twitter text

§ Readability features: less readable information are more credible

§ Medical features: medical lexicon of tweets and the reliability of sources shared using URLs

Page 33: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Best features to predict if a tweet is a rumor or not§ Medical features (advocacy domains count, Wikipedia domains

count)

§ Syntax of the tweet text (question marks, exclamation marks...)

§ Sentiment features (sentiment score, count positive/negative words)

§ Twitter features

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 33

RESULTS - FEATURES

Page 34: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

§ Random training/testing set selection 80/20§ Consider all rumor topics

§ High accuracy (0.92)

§ Training on 5 topics and testing on the 6th§ Low accuracy for new topics (we know truth but still new)

Web search Social media

Tracking Zika Health Misinformation on Twitter, Amira Ghenai, Yelena Mejova, ICHI 2017 PAGE 34

RESULTS - ACCURACY

Page 35: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

FUTURE RESEARCH PLAN§ Understand possible factors that influence people in search

§ Stimulated Recall user experiment (play screen recording with questions to participants)

§ Trustworthiness of resources

§ Rank? Exposure bias?

§ Do people use search engines in the wrong way?

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 35

Page 36: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

FUTURE RESEARCH PLAN§ Whether people talking about a rumor are more likely to

have some specific event or not

§ Predict possible rumor topics§ Study the difference in behavior of cohorts susceptible to rumors

§ Behavior: online (retweeting, social network behavior, etc. ) and offline (anxiety, immunization, hospital visits, etc.)

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 36

Page 37: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in

University of Waterloo§ Prof Charles L. A. Clarke, supervisor

§ Prof. Mark D. Smucker, supervisor

§ Frances A. Pogacar, colleague

Qatar Computing Research Institute§ Yelena Mejova, scientist, research collaborator

§ Luis Fernandez-Luque, scientist, research collaborator

ACKNOWLEDGEMENT

Health Misinformation in Search and Social MediaAmira Ghenai PAGE 37

SIGIR Student Travel Grant

Page 38: Health Misinformation in Search and Social Mediaaghenai/assets/presentation/DC_SIGIR.pdf · Conference on Healthcare Informatics (ICHI17), Park City, Utah Health Misinformation in