Crime Forecasting Using Boosted Ensemble Classifiers

Department of Computer Science, University of Massachusetts Boston

2012 GRADUATE STUDENTS SYMPOSIUM

Presented by: Chung-Hsien Yu

Advisor: Prof. Wei Ding

Abstract

• Retaining spatiotemporal knowledge by applying multi-clustering to monthly aggregated crime data.

• Training baseline learners on the clusters obtained from this clustering.

• Adapting a greedy algorithm to find a rule-based ensemble classifier during each boosting round.

• Pruning the ensemble classifier to prevent it from overfitting.

• Constructing a strong hypothesis based on the ensemble classifiers obtained from each round.

Original Data

Residential Burglary

911 Calls

Arrest

Foreclosure

Street Robbery

Aggregated Data

Monthly Data

Monthly Clusters (k=3)

Monthly Clusters (k=4)
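The monthly clustering step can be sketched roughly as follows. This is only an illustration: the slides do not state which clustering algorithm or features were used, so the sketch assumes a plain k-means on a single feature (the monthly crime count of each grid cell). The function name `kmeans_1d` and the sample counts are hypothetical.

```python
def kmeans_1d(values, k, iters=50):
    """Plain 1-D k-means; an assumed stand-in for the clustering step."""
    # Spread the initial centers over the sorted distinct values (deterministic).
    distinct = sorted(set(values))
    centers = [distinct[(len(distinct) - 1) * j // max(k - 1, 1)] for j in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        # Assign each value to its nearest center.
        labels = [min(range(k), key=lambda j: abs(v - centers[j])) for v in values]
        # Move each center to the mean of its members (keep it if empty).
        new_centers = []
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            new_centers.append(sum(members) / len(members) if members else centers[j])
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers

# Hypothetical monthly burglary counts for nine grid cells.
counts = [0, 1, 0, 5, 6, 4, 12, 14, 13]
labels3, centers3 = kmeans_1d(counts, k=3)
labels4, centers4 = kmeans_1d(counts, k=4)
```

Running the same data with k=3 and k=4 gives two alternative partitions of the grid cells, which is one way to read the "multi-clustering" mentioned in the abstract.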

Flow Chart

Algorithm (Part I)

Algorithm (Part II)

Confidence Value

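From AdaBoost (Schapire & Singer 1998) we have

$$Z_t = \sum_i D_t(i) \exp(-\alpha_t y_i h_t(x_i))$$

Let $w(i) = D_t(i)$ and $C_R = \alpha_t h_t(x_i)$, and ignore the boosting round $t$.

$$Z = \sum_i w(i) \exp(-C_R y_i)$$

$C_R$ is defined as the confidence value for the rule $R$, and $C_R = 0$ if $x_i \notin R$.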

Objective Function

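Therefore,

$$Z = W_0 + W_+ \exp(-C_R) + W_- \exp(C_R)$$

where

$$W_0 = \sum_{\{i \mid x_i \notin R\}} w(i), \quad W_+ = \sum_{\{i \mid x_i \in R \text{ and } y_i = 1\}} w(i), \quad W_- = \sum_{\{i \mid x_i \in R \text{ and } y_i = -1\}} w(i)$$

$$W_0 + W_+ + W_- = 1$$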
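The three weight sums and the resulting value of Z can be computed directly. The sketch below assumes Z takes the form W0 + W+ exp(-C_R) + W- exp(C_R), which follows from setting C_R = 0 for examples the rule does not cover. The function names and the toy weights are hypothetical.

```python
import math

def rule_weights(w, y, covered):
    """Split the total weight into W0 (not covered), W+ and W- (covered)."""
    w0 = sum(wi for wi, c in zip(w, covered) if not c)
    w_pos = sum(wi for wi, yi, c in zip(w, y, covered) if c and yi == 1)
    w_neg = sum(wi for wi, yi, c in zip(w, y, covered) if c and yi == -1)
    return w0, w_pos, w_neg

def z_value(w0, w_pos, w_neg, c_r):
    """Z = W0 + W+ exp(-C_R) + W- exp(C_R)."""
    return w0 + w_pos * math.exp(-c_r) + w_neg * math.exp(c_r)

# Four equally weighted examples; the rule covers the first three.
w = [0.25, 0.25, 0.25, 0.25]
y = [1, 1, -1, -1]
covered = [True, True, True, False]
w0, w_pos, w_neg = rule_weights(w, y, covered)
```

With a confidence value of zero the rule has no effect, so `z_value(w0, w_pos, w_neg, 0.0)` equals the total weight, 1.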

Minimum Z Value

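$$\frac{dZ}{dC_R} = -W_+ \exp(-C_R) + W_- \exp(C_R) = 0$$

$$\rightarrow W_- \exp(C_R) = W_+ \exp(-C_R)$$

$$\rightarrow \ln(W_- \exp(C_R)) = \ln(W_+ \exp(-C_R))$$

$$\rightarrow \ln(W_-) + C_R = \ln(W_+) - C_R$$

$$\rightarrow 2C_R = \ln\left(\frac{W_+}{W_-}\right)$$

$$\rightarrow C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$$

$Z$ has the minimum value when $C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$, because

$$\frac{d^2 Z}{dC_R^2} = W_+ \exp(-C_R) + W_- \exp(C_R) > 0$$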
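A quick numerical check of this result is sketched below, assuming both W+ and W- are strictly positive (the slides do not say how a zero weight is handled). The function names and weight values are hypothetical.

```python
import math

def optimal_confidence(w_pos, w_neg):
    """C_R = 0.5 * ln(W+ / W-); requires both weights to be positive."""
    return 0.5 * math.log(w_pos / w_neg)

def z_value(w0, w_pos, w_neg, c_r):
    """Z = W0 + W+ exp(-C_R) + W- exp(C_R)."""
    return w0 + w_pos * math.exp(-c_r) + w_neg * math.exp(c_r)

w0, w_pos, w_neg = 0.25, 0.5, 0.25
c_star = optimal_confidence(w_pos, w_neg)
z_star = z_value(w0, w_pos, w_neg, c_star)

# Z evaluated slightly to either side of c_star should be larger.
z_left = z_value(w0, w_pos, w_neg, c_star - 0.1)
z_right = z_value(w0, w_pos, w_neg, c_star + 0.1)
```

Because Z is convex in C_R, the stationary point is the global minimum, so any other confidence value gives a larger Z.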

BuildChain Function

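Substituting $C_R = \frac{1}{2} \ln\left(\frac{W_+}{W_-}\right)$ into $Z$ and using $W_0 + W_+ + W_- = 1$ gives

$$Z = W_0 + 2\sqrt{W_+ W_-} = 1 - \left(\sqrt{W_+} - \sqrt{W_-}\right)^2$$

Repeatedly add a classifier to $R$ until $\left|\sqrt{W_+} - \sqrt{W_-}\right|$ is maximized. This minimizes $Z$ as well.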
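One possible reading of the greedy chain-building step is sketched below. It is not the author's implementation: it assumes that a chain covers an example only when every classifier in the chain fires on it, and that growth stops when no remaining classifier increases |sqrt(W+) - sqrt(W-)|. The names `score`, `build_chain`, and the candidate classifiers "a", "b", "c" are hypothetical.

```python
import math

def score(w, y, covered):
    """|sqrt(W+) - sqrt(W-)| for the examples a chain covers."""
    w_pos = sum(wi for wi, yi, c in zip(w, y, covered) if c and yi == 1)
    w_neg = sum(wi for wi, yi, c in zip(w, y, covered) if c and yi == -1)
    return abs(math.sqrt(w_pos) - math.sqrt(w_neg))

def build_chain(w, y, candidates):
    """Greedily add the classifier that most improves the score.

    `candidates` maps a classifier name to a list of booleans (whether it
    fires on each example). Assumption: a chain covers an example only if
    every classifier in the chain fires on it.
    """
    chain, covered = [], [True] * len(y)
    best = score(w, y, covered)
    while True:
        trials = {}
        for name, fires in candidates.items():
            if name in chain:
                continue
            trial = [c and f for c, f in zip(covered, fires)]
            trials[name] = (score(w, y, trial), trial)
        if not trials:
            break
        name = max(trials, key=lambda n: trials[n][0])
        if trials[name][0] <= best:
            break  # no candidate improves the score any further
        best, covered = trials[name]
        chain.append(name)
    return chain, covered

w = [0.25, 0.25, 0.25, 0.25]
y = [1, 1, -1, -1]
candidates = {
    "a": [True, True, True, False],
    "b": [True, True, False, True],
    "c": [True, False, True, True],
}
chain, covered = build_chain(w, y, candidates)
```

In this toy data the chain grows to "a" then "b", after which it covers exactly the two positive examples and adding "c" would only lower the score.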

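PruneChain Function

Loss Function: (equation missing in the source)

Minimize the loss by removing the last classifier from $R$.

The confidence value $\hat{C}_R$ is obtained from GrowSet. The weights used in the loss are obtained from applying $R$ to PruneSet.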
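The pruning step might look like the sketch below. Since the loss equation did not survive in the slides, the sketch assumes the loss has the same form as Z, evaluated with weights measured on PruneSet and a confidence value fitted on GrowSet. The names `prune_loss`, `prune_chain`, `stats`, and all numbers are hypothetical.

```python
import math

def prune_loss(v_pos, v_neg, c_hat):
    """Assumed loss: the Z expression evaluated with PruneSet weights."""
    return (1.0 - v_pos - v_neg) + v_pos * math.exp(-c_hat) + v_neg * math.exp(c_hat)

def prune_chain(chain, stats):
    """Drop trailing classifiers while doing so lowers the loss.

    `stats[n]` is a hypothetical tuple (v_pos, v_neg, c_hat) for the first
    n classifiers of the chain: v_pos and v_neg are the covered positive
    and negative weights on PruneSet, c_hat is fitted on GrowSet.
    """
    n = len(chain)
    while n > 1 and prune_loss(*stats[n - 1]) < prune_loss(*stats[n]):
        n -= 1
    return chain[:n]

chain = ["a", "b", "c"]
stats = {
    1: (0.40, 0.10, 0.5),
    2: (0.35, 0.02, 1.0),
    3: (0.10, 0.05, 1.5),
}
pruned = prune_chain(chain, stats)
```

Here the three-classifier chain is overconfident on PruneSet, so the last classifier is removed; removing a second one would raise the loss, so pruning stops at two.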

Update Weights

Calculate the confidence value $\hat{C}_R$ with ensemble classifier $R$ on the entire data set.
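The update formula itself is not legible in the slides. A minimal sketch, assuming the standard confidence-rated AdaBoost update (covered examples are scaled by exp(-y * C_R), uncovered examples are left unchanged, and the weights are then renormalized), is shown below. The function name and data are hypothetical.

```python
import math

def update_weights(w, y, covered, c_hat):
    """Reweight covered examples by exp(-y * c_hat), then renormalize."""
    raw = [wi * math.exp(-yi * c_hat) if c else wi
           for wi, yi, c in zip(w, y, covered)]
    total = sum(raw)
    return [r / total for r in raw]

w = [0.25, 0.25, 0.25, 0.25]
y = [1, 1, -1, -1]
covered = [True, True, True, False]
new_w = update_weights(w, y, covered, c_hat=0.5 * math.log(2.0))
```

Under this assumption the covered negative example (misclassified by a rule with positive confidence) gains weight, the covered positives lose weight, and the uncovered example sits in between after normalization.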

Strong Hypothesis

At the end of boosting, there are $T$ chains, $R_1, \dots, R_T$.

$\hat{C}_{R_t} = 0$ if $x \notin R_t$
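The combination rule is not legible in the slides. The sketch below assumes the usual boosting form, in which the strong hypothesis is the sign of the summed confidence values of the chains that cover x, and chains that do not cover x contribute zero. The function name, the feature names, the thresholds, and the tie-breaking choice of -1 at a sum of zero are all hypothetical.

```python
def strong_hypothesis(x, chains):
    """Sign of the summed confidence values of the chains that cover x.

    `chains` is a list of (covers, c_hat) pairs, where `covers` is a
    hypothetical predicate returning True when x satisfies the chain.
    """
    total = sum(c_hat for covers, c_hat in chains if covers(x))
    return 1 if total > 0 else -1

# Three illustrative chains with made-up thresholds and confidence values.
chains = [
    (lambda x: x["burglary"] >= 5, 0.8),
    (lambda x: x["calls_911"] >= 20, 0.3),
    (lambda x: x["foreclosure"] == 0, -0.6),
]
hot_cell = {"burglary": 7, "calls_911": 3, "foreclosure": 0}
quiet_cell = {"burglary": 1, "calls_911": 3, "foreclosure": 0}
```

For `hot_cell` the first and third chains fire, the sum is positive, and the prediction is +1; for `quiet_cell` only the third chain fires and the prediction is -1.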

SUMMARY

1. Grid cells with similar crime counts that are clustered together are also geographically close to each other on the map. In addition, the clustering separates high-crime-rate areas from low-crime-rate areas.

2. The original data set is randomly divided into two subsets in each round. The greedy weak-learning algorithm uses confidence-rated evaluation to "chain" the baseline classifiers on one subset, and then "trims" the chain using the other subset.

3. The strong hypothesis is easy to calculate.

THANK YOU!!
