Near-Minimax Optimal Learning with Decision Trees
University of Wisconsin-Madison and Rice University
Rob Nowak and Clay Scott
Supported by the NSF and the ONR
nowak@engr.wisc.edu
Basic Problem
Classification: build a decision rule based on labeled training data
Given n training points, how well can we do?
Smooth Decision Boundaries
Suppose that the Bayes decision boundary behaves locally like a Lipschitz function
Mammen & Tsybakov ’99
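For reference, the minimax rate for this boundary class (a reconstruction of the cited result, not the slide's own formula; $R^*$ denotes the Bayes error):

% No classifier learned from n samples can beat the rate n^{-1/d}
% uniformly over Lipschitz decision boundaries in [0,1]^d:
\[
  \inf_{\hat f_n} \; \sup \; \mathbb{E}\bigl[R(\hat f_n)\bigr] - R^* \;\asymp\; n^{-1/d}
\]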
Dyadic Thinking about Classification Trees
Recursive dyadic partition
Pruned dyadic partition
Pruned dyadic tree
Hierarchical structure facilitates optimization
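To make "hierarchical structure facilitates optimization" concrete, here is a minimal Python sketch (not the authors' code; the function name, fixed per-leaf penalty, and toy data are illustrative) of the bottom-up dynamic program over a recursive dyadic partition: each cell either becomes a leaf or keeps its dyadic split, whichever has smaller penalized empirical risk.

import numpy as np

def prune_dyadic_tree(X, y, max_depth=6, penalty=0.05):
    """Return the cost of the best pruned dyadic tree and its leaf cells.

    Recursively splits [0,1]^d in half, cycling through coordinates
    (a recursive dyadic partition), then prunes bottom-up: a split is
    kept only if its children's total cost beats the cost of labeling
    the cell by majority vote.  `penalty` is an illustrative per-leaf
    complexity charge, standing in for the error-bound penalty term.
    """
    n, d = X.shape

    def best(cell_idx, lo, hi, depth, dim):
        # Cost of making this cell a leaf: misclassified points + penalty.
        n1 = int(y[cell_idx].sum())
        leaf_err = min(n1, len(cell_idx) - n1) / n
        leaf = (leaf_err + penalty, [(lo.copy(), hi.copy())])
        if depth == max_depth or len(cell_idx) == 0:
            return leaf
        # Dyadic split: cut the current dimension exactly in half.
        mid = 0.5 * (lo[dim] + hi[dim])
        left_mask = X[cell_idx, dim] < mid
        lo_r, hi_l = lo.copy(), hi.copy()
        hi_l[dim], lo_r[dim] = mid, mid
        cl, leaves_l = best(cell_idx[left_mask], lo, hi_l, depth + 1, (dim + 1) % d)
        cr, leaves_r = best(cell_idx[~left_mask], lo_r, hi, depth + 1, (dim + 1) % d)
        split = (cl + cr, leaves_l + leaves_r)
        # Dynamic program: keep the split only if it lowers total cost.
        return min(leaf, split, key=lambda t: t[0])

    lo, hi = np.zeros(d), np.ones(d)
    return best(np.arange(n), lo, hi, 0, 0)

# Toy usage: labels determined by a smooth boundary in [0,1]^2.
rng = np.random.default_rng(0)
X = rng.random((500, 2))
y = (X[:, 1] > 0.5 + 0.2 * np.sin(4 * X[:, 0])).astype(int)
cost, leaves = prune_dyadic_tree(X, y)
print(f"pruned tree: {len(leaves)} leaves, penalized empirical risk {cost:.3f}")

Because children partition their parent cell, each subproblem is solved exactly once, and the returned tree is the exact optimum for the given penalty.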
The Classification Problem
Problem: given i.i.d. training data $(X_1, Y_1), \dots, (X_n, Y_n)$ with $X_i \in [0,1]^d$ and labels $Y_i \in \{0,1\}$, find a classifier with small probability of error.
Classifiers: maps $f : [0,1]^d \to \{0,1\}$, with risk $R(f) = \mathbb{P}(f(X) \ne Y)$.
The Bayes Classifier: $f^*(x) = \mathbf{1}\{\eta(x) \ge 1/2\}$, where $\eta(x) = \mathbb{P}(Y = 1 \mid X = x)$; its risk $R^* = R(f^*)$ is the Bayes error, the best possible.
Minimum Empirical Risk Classifier: $\hat{f} = \arg\min_{f \in \mathcal{F}} \hat{R}_n(f)$, where $\hat{R}_n(f) = \frac{1}{n} \sum_{i=1}^n \mathbf{1}\{f(X_i) \ne Y_i\}$.
Generalization Error Bounds
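A standard Occam-style bound of the kind applied to trees below (reconstructed; $c(f)$ is the length of a prefix code for classifier $f$, so $\sum_f 2^{-c(f)} \le 1$):

% Hoeffding's inequality plus a union bound weighted by 2^{-c(f)}:
% with probability at least 1 - delta, simultaneously for all f,
\[
  R(f) \;\le\; \hat{R}_n(f) \;+\; \sqrt{\frac{c(f)\log 2 + \log(1/\delta)}{2n}}
\]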
Selecting a good h
Convergence to Bayes Error
Ex. Dyadic Classification Trees
[Figure panels: labeled training data; Bayes decision boundary; complete RDP; pruned RDP; the resulting dyadic classification tree]
Codes for DCTs
[Figure: encoding a pruned dyadic tree, one code bit per node; ex. code: 0001001111 + 6 bits for leaf labels]
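A plausible reconstruction of the code-length formula, assuming one structure bit per node and one class-label bit per leaf:

% A binary tree with |T| leaves has 2|T| - 1 nodes, so
\[
  c(T) \;\approx\; (2|T| - 1) + |T| \;\approx\; 3|T| \ \text{bits}
\]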
Error Bounds for DCTs
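Plugging the roughly $3|T|$-bit prefix code into the Occam bound gives a plausible reconstruction of the DCT error bound:

% With probability at least 1 - delta, uniformly over pruned dyadic trees T:
\[
  R(T) \;\le\; \hat{R}_n(T) \;+\; \sqrt{\frac{3|T|\log 2 + \log(1/\delta)}{2n}}
\]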
Compare with CART: dyadic split positions are fixed in advance, so the penalty scales like $\sqrt{|T|/n}$; comparable bounds for trees with data-dependent split locations pick up an extra $\log n$ factor.
Rate of Convergence
Suppose that the Bayes decision boundary behaves locally like a Lipschitz function
Mammen & Tsybakov ’99; C. Scott & RN ’02
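Reconstructed from the published results, the comparison is roughly:

% Minimax rate for Lipschitz boundaries (Mammen & Tsybakov '99) versus
% the rate DCTs achieve with the global bound (Scott & Nowak '02):
\[
  \text{minimax: } n^{-1/d}
  \qquad
  \text{DCT, global bound: } \Bigl(\tfrac{\log n}{n}\Bigr)^{1/(d+1)}
\]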
Why too slow?
Because the Bayes decision boundary is a (d-1)-dimensional manifold, “good” trees are unbalanced: their leaves concentrate near the boundary.
But the global bound favors all |T|-leaf trees equally, balanced or not.
Local Error Bounds in Classification
Spatial Error Decomposition: Mansour & McAllester ’00
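A sketch of the decomposition (reconstructed notation): because the leaves of $T$ partition the input space, the deviation between true and empirical risk splits cell by cell:

% R(T, A) and R_n(T, A) restrict the error to the cell A:
\[
  R(T) - \hat{R}_n(T) \;=\; \sum_{A \in T} \Bigl( R(T,A) - \hat{R}_n(T,A) \Bigr),
  \qquad
  R(T,A) = \mathbb{P}\bigl(T(X) \ne Y,\; X \in A\bigr)
\]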
Relative Chernoff Bound
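A minimal statement of the bound and the local deviation it yields (reconstructed):

% For p-hat the empirical mean of n i.i.d. Bernoulli(p) variables:
\[
  \mathbb{P}\bigl(\hat p \le (1-\epsilon)p\bigr) \;\le\; e^{-n p \epsilon^2 / 2}
\]
% Inverting at confidence delta: the deviation scales with sqrt(p),
% so cells with small probability mass contribute small local errors,
\[
  p \;\le\; \hat p + \sqrt{\frac{2\,p\,\log(1/\delta)}{n}}
\]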
Bounded densities: if the density of $X$ is bounded, then $\mathbb{P}(X \in A) \le c \cdot \mathrm{vol}(A)$ for every cell $A$, so deep (small) cells carry little probability mass.
Global vs. Local
Key: local complexity is offset by small volumes!
Local Bounds for DCTs
Unbalanced tree: J leaves, depth J-1
Global bound vs. local bound:
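Reconstructed orders of magnitude under the bounded-density assumption:

% Global bound: every one of the J leaves is charged equally,
\[
  \text{penalty}_{\mathrm{global}} \;\approx\; \sqrt{\frac{3J\log 2 + \log(1/\delta)}{2n}} \;=\; O\bigl(\sqrt{J/n}\bigr)
\]
% Local bound: a leaf at depth j has volume (hence mass) ~ 2^{-j},
% so the per-leaf terms form a convergent series, independent of J:
\[
  \text{penalty}_{\mathrm{local}} \;\approx\; \sum_{j=1}^{J-1} \sqrt{\frac{2^{-j}\,\bigl(j + \log(1/\delta)\bigr)}{n}} \;=\; O\bigl(\sqrt{1/n}\bigr)
\]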
Convergence to Bayes Error
Mammen & Tsybakov ’99; C. Scott & RN ’03
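The published near-minimax result, reconstructed:

% With spatially adaptive (local) penalties, DCTs come within a
% logarithmic factor of the minimax rate n^{-1/d}:
\[
  \mathbb{E}\bigl[R(\hat T_n)\bigr] - R^* \;=\; O\!\Bigl(\bigl(\tfrac{\log n}{n}\bigr)^{1/d}\Bigr)
\]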
Concluding Remarks
data-dependent bound
Neural Information Processing Systems 2002, 2003
nowak@engr.wisc.edu