Download - Dealing with Frauds at Appsflyer

Transcript
Page 1: Dealing with Frauds at Appsflyer

Click FraudsDealing with frauds at Appsflyer

Page 2: Dealing with Frauds at Appsflyer

Business

Advertiser

Page 3: Dealing with Frauds at Appsflyer

Business

Advertiser

Publisher

Page 4: Dealing with Frauds at Appsflyer

Business

Advertiser

Publisher

Page 5: Dealing with Frauds at Appsflyer

Business

Advertiser

Publisher

Click

Page 6: Dealing with Frauds at Appsflyer

Business

Advertiser

Publisher

Click

Install

Page 7: Dealing with Frauds at Appsflyer

Business

Advertiser

Publisher

Click

Install

Page 8: Dealing with Frauds at Appsflyer

Who’s the bad guy?

Advertiser

Publisher

Click

Install

Page 9: Dealing with Frauds at Appsflyer

Who’s the bad guy?

Advertiser

Publisher

Click

Install

Page 10: Dealing with Frauds at Appsflyer

What advertiser pays for?

Cost per impression

Cost per click

Cost per install

Cost per action

Page 11: Dealing with Frauds at Appsflyer

What advertiser pays for?

Cost per impression

Cost per click

Cost per install

Cost per action

Page 12: Dealing with Frauds at Appsflyer

What advertiser pays for?

Cost per impression

Cost per click

Cost per install

Fraud techniques of a different league

Less fraudulent installs than clicks/views as CPI is usually much higher

Cost per action

Page 13: Dealing with Frauds at Appsflyer

Fraud methods

Page 14: Dealing with Frauds at Appsflyer

Fraud methods

Programmatic (bots)

Page 15: Dealing with Frauds at Appsflyer

Fraud methods

Programmatic (bots)Humans

Page 16: Dealing with Frauds at Appsflyer

Fraud detection methods

Rule-based

Need expert knowledge of past fraud behaviour

Highly effective at detecting known fraud types

Ineffective at new types

Anomaly detection

Good for new kinds of deviations

Not good for known types of fraud

Supervised learning

Need examples of past fraud

Can be effective at detecting similar occurrences

Ineffective at new types of fraud

Page 17: Dealing with Frauds at Appsflyer

Rule-based

Unrecognized user agent string

Mozilla/4.0 (compatible; MSIE 4.5; Windows 98; )

Wrong IMEI

Too many applications installed from the same device

Frequent re-installs on a specific device

Save device installs from many different geographical locations

Inadequately short time between click and install

iOS app install receipt can’t be validated by iTunes

Page 18: Dealing with Frauds at Appsflyer

Anomaly detection

k-means clustering

Page 19: Dealing with Frauds at Appsflyer

Anomaly detection

k-means clustering

Page 20: Dealing with Frauds at Appsflyer

Anomaly detection

k-means clustering

Page 21: Dealing with Frauds at Appsflyer

Choosing features

Normally distributed values (or half-normally)

Normalizing data

Custom normalizer

StandardScaler (Spark >= 1.4)

Choose number of clusters

Iterate on different clusters number

Evaluate “clustering score”

Build k-means model

Find vectors with P(x) < 𝝴

k-means clustering

Page 22: Dealing with Frauds at Appsflyer

k-means clustering - parsing

Page 23: Dealing with Frauds at Appsflyer

k-means clustering - feature selection

Page 24: Dealing with Frauds at Appsflyer

k-means clustering - finding K

Page 25: Dealing with Frauds at Appsflyer

k-means clustering - find anomalies

Page 26: Dealing with Frauds at Appsflyer

Supervised learning

Logistic regression

Decision tree

Random forests

Training set {x1, x2, …., xN} -> E

Train the model

Validate, then train again..

Test

Apply!

Page 27: Dealing with Frauds at Appsflyer

Action items

Drop fraudulent requests

Pros:

Less traffic goes through the system

Cons:

False positives

Must capture all the frauds as they come in

Mark transactions, which are fraud (in our opinion)

Pros:

Let customer decide what to do

Allows offline fraud detection

Mixed approach

Page 28: Dealing with Frauds at Appsflyer

Thank you!