Beyond Worst Case Analysis in Machine Learning
Transcript of Beyond Worst Case Analysis in Machine Learning
Maria-Florina Balcan
A PAC-style framework for Semi-Supervised Learning
Machine Learning
• Image Classification
• Document Categorization
• Speech Recognition
• Branch Prediction
• Protein Classification
• Spam Detection
• Fraud Detection
Supervised Classification
Example: Decide which emails are spam and which are important.
Goal: use past emails to produce a good rule for future data.
[Figure: example emails labeled “spam” and “not spam”]
Example: Supervised Classification
Represent each message by features (e.g., keywords, spelling, etc.), giving (example, label) pairs.
Reasonable rules:
• Predict SPAM if unknown AND (money OR pills)
• Predict SPAM if 2·money + 3·pills − 5·known > 0
[Figure: + and − labeled points in feature space; the data are linearly separable]
Two Main Aspects in Machine Learning
• Algorithm Design. How to optimize? Automatically generate rules that do well on observed data.
• Confidence Bounds, Generalization Guarantees. Confidence in a rule’s effectiveness on future data.
Both aspects are well understood for supervised learning.
Supervised Learning Formalization
Classic models: PAC (Valiant), SLT (Vapnik)
• X – feature space
• S = {(x, l)} – set of labeled examples, drawn i.i.d. from distribution D over X and labeled by target concept c*
• Do optimization over S, find hypothesis h ∈ C.
• Goal: h has small error over D, err(h) = Pr_{x∼D}[h(x) ≠ c*(x)].
• c* ∈ C: realizable case; otherwise agnostic.
Sample Complexity Results
• Finite C, realizable: m ≥ (1/ε)(ln |C| + ln (1/δ)) labeled examples suffice.
• Infinite C, realizable: replace ln |C| with O(VCdim(C) · ln (1/ε)).
• Agnostic: replace 1/ε with 1/ε².
• In PAC, we also talk about efficient algorithms.
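These finite-class bounds can be made concrete with a small calculator. A minimal sketch in Python, assuming the standard forms m ≥ (1/ε)(ln |C| + ln(1/δ)) for the realizable case and m ≥ (1/(2ε²))(ln |C| + ln(2/δ)) for the agnostic case; the helper name is hypothetical:

```python
import math

def pac_sample_size(num_hypotheses, eps, delta, realizable=True):
    """Classic finite-class sample complexity bounds.

    Realizable: m >= (1/eps) * (ln|C| + ln(1/delta))
    Agnostic:   m >= (1/(2*eps^2)) * (ln|C| + ln(2/delta))  (Hoeffding + union bound)
    """
    if realizable:
        return math.ceil((math.log(num_hypotheses) + math.log(1 / delta)) / eps)
    return math.ceil((math.log(num_hypotheses) + math.log(2 / delta)) / (2 * eps ** 2))

# The agnostic case pays roughly an extra 1/eps factor over the realizable case.
print(pac_sample_size(2 ** 20, eps=0.1, delta=0.05))
print(pac_sample_size(2 ** 20, eps=0.1, delta=0.05, realizable=False))
```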
Classic Paradigm Insufficient Nowadays
Modern applications: massive amounts of raw data.
Only a tiny fraction can be annotated by human experts.
E.g., billions of webpages, images, protein sequences.
Semi-Supervised Learning
[Figure: pipeline – raw data → expert labeler → labeled data (“face” / “not face”) → classifier]
Semi-Supervised Learning
• Variety of methods and experimental results, e.g.:
  – Transductive SVM [Joachims ’98]
  – Co-training [Blum & Mitchell ’98]
  – Graph-based methods [Blum & Chawla ’01], [Zhu-Lafferty-Ghahramani ’03]
• Different SSL algorithms are based on very different assumptions.
• Worst-case models are not useful for SSL; theoretical results are scattered and very specific.
We provide: a general discriminative (PAC, SLT style) framework for SSL.
Challenge: capture many of the assumptions typically used.
[Balcan-Blum, COLT 2005; JACM 2010; book chapter, 2006]
Example of “typical” assumption: Margins
Belief: the target goes through low-density regions (large margin).
[Figure: SVM trained on labeled data only vs. Transductive SVM also using unlabeled data]
Another Example: Self-consistency
Agreement between two parts: co-training [Blum-Mitchell ’98].
– Examples contain two sufficient sets of features, x = ⟨x1, x2⟩.
– Belief: the parts are consistent, i.e., ∃ c1, c2 s.t. c1(x1) = c2(x2) = c*(x).
For example, if we want to classify web pages: x1 – text info, x2 – link info.
[Figure: a web page “My Advisor Prof. Avrim Blum” viewed through its text features and its link features]
New discriminative model for SSL
Problems with thinking about SSL in standard worst-case models:
• PAC or SLT: learn a class C under a (known or unknown) distribution D.
• Su = {xi} – xi i.i.d. from D; Sl = {(xi, yi)} – xi i.i.d. from D, yi = c*(xi).
• Unlabeled data doesn’t give any info about which c ∈ C is the target.
• There is a complete disconnect between the target and D.
Key Insight: Unlabeled data is useful if we have beliefs not only about the form of the target, but also about its relationship with the underlying distribution.
New model for SSL, Main Ideas
Augment the notion of a concept class C with a notion of compatibility χ between a concept and the data distribution.
“Learn C” becomes “learn (C, χ)” (learn class C under compatibility notion χ).
Express relationships that the target and underlying distribution possess.
Idea I: use unlabeled data & the belief that the target is compatible to reduce C down to just {the highly compatible functions in C}.
Idea II: the degree of compatibility can be estimated from a finite sample.
[Figure: class of functions C, e.g., linear separators (abstract prior), narrowed by unlabeled data to the compatible functions in C, e.g., large-margin linear separators (finite sample)]
Formally
χ(h, D) = E_{x∼D}[χ(h, x)] – compatibility of h with D, with χ(h, x) ∈ [0, 1].
err_unl(h) = 1 − χ(h, D) – incompatibility of h with D.
View incompatibility as an unlabeled error rate.
We require the compatibility χ(h, D) to be an expectation over individual examples. (We don’t need to be so strict, but this is cleanest.)
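Idea II can be sketched directly: given any example-level compatibility function χ(h, x) ∈ [0, 1], its expectation over D is estimated by an average over the unlabeled sample. A minimal sketch with hypothetical helper names:

```python
def estimate_compatibility(chi, h, unlabeled_sample):
    """Empirical estimate of chi(h, D) = E_{x~D}[chi(h, x)] from a finite
    unlabeled sample; chi(h, x) must return a value in [0, 1]."""
    values = [chi(h, x) for x in unlabeled_sample]
    return sum(values) / len(values)

def estimated_unlabeled_error(chi, h, unlabeled_sample):
    # err_unl(h) = 1 - chi(h, D): incompatibility viewed as an unlabeled error rate
    return 1.0 - estimate_compatibility(chi, h, unlabeled_sample)
```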
Margins, Compatibility
Margins: the belief is that there should exist a large-margin separator.
[Figure: highly compatible large-margin separator]
Incompatibility of h and D (the unlabeled error rate of h) – the probability mass within distance γ of h.
It can be written as an expectation over individual examples:
χ(h, D) = E_{x∼D}[χ(h, x)], where χ(h, x) = 0 if dist(x, h) ≤ γ and χ(h, x) = 1 if dist(x, h) ≥ γ.
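This hard-margin notion can be sketched as code: the distance of x to the halfspace h = (w, b) is |w·x + b| / ‖w‖, and points inside the γ-band count as incompatible. A minimal sketch with hypothetical names:

```python
import math

def margin_chi(w, b, x, gamma):
    """chi(h, x) for the halfspace h = (w, b):
    0 if dist(x, h) <= gamma (inside the margin band), else 1."""
    dist = abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / math.sqrt(sum(wi * wi for wi in w))
    return 0.0 if dist <= gamma else 1.0

def margin_compatibility(w, b, unlabeled_points, gamma):
    # Empirical chi(h, D): fraction of unlabeled points outside the gamma-band
    return sum(margin_chi(w, b, x, gamma) for x in unlabeled_points) / len(unlabeled_points)
```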
Margins, Compatibility
If we do not want to commit to γ in advance, define χ(h, x) to be a smooth function of dist(x, h).
Illegal notion of compatibility: the largest γ s.t. D has probability mass exactly zero within distance γ of h.
Co-Training, Compatibility
Co-training: examples come as pairs ⟨x1, x2⟩ and the goal is to learn a pair of functions ⟨h1, h2⟩.
The hope is that the two parts of the example are consistent.
Legal (and natural) notion of compatibility:
– the compatibility of ⟨h1, h2⟩ and D: Pr_{⟨x1, x2⟩∼D}[h1(x1) = h2(x2)]
– can be written as an expectation over examples: χ(⟨h1, h2⟩, ⟨x1, x2⟩) = 1 if h1(x1) = h2(x2), else 0.
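This agreement notion lends itself to a one-line empirical estimate: count the fraction of unlabeled pairs on which the two views agree. A minimal sketch with hypothetical names:

```python
def cotraining_compatibility(h1, h2, unlabeled_pairs):
    """Empirical compatibility of the pair <h1, h2>: the fraction of unlabeled
    pairs <x1, x2> with h1(x1) == h2(x2); incompatibility is 1 minus this."""
    agreements = sum(1 for x1, x2 in unlabeled_pairs if h1(x1) == h2(x2))
    return agreements / len(unlabeled_pairs)
```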
Types of Results in Our Model
As in PAC, we can discuss algorithmic and sample complexity issues. We can generically analyze sample complexity and instantiate the bounds for specific cases:
– How much unlabeled data we need for learning.
– The ability of unlabeled data to reduce the number of labeled examples needed for learning.
In PAC one is either in the realizable or the agnostic case; here there is the additional issue of the unlabeled error rate.
Sample Complexity, Uniform Convergence Bounds
C_{D,χ}(ε) = {h ∈ C : err_unl(h) ≤ ε} – the compatible functions in C.
Proof sketch:
• The probability that a fixed h with err_unl(h) > ε is compatible with Su is at most (1 − ε)^{mu} ≤ δ/(2|C|).
• By the union bound, with probability ≥ 1 − δ/2, only hypotheses in C_{D,χ}(ε) are compatible with Su.
• Choose ml large enough to ensure that none of the functions in C_{D,χ}(ε) with err(h) ≥ ε has an empirical error rate of 0.
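The unlabeled-sample requirement in the first step can be solved explicitly: using (1 − ε)^{mu} ≤ e^{−ε·mu}, it suffices that mu ≥ (1/ε) · ln(2|C|/δ). A small sketch (hypothetical helper name):

```python
import math

def unlabeled_sample_size(num_hypotheses, eps, delta):
    """mu large enough that (1 - eps)^mu <= delta / (2|C|), via the relaxation
    (1 - eps)^mu <= exp(-eps * mu):  mu >= (1/eps) * ln(2|C| / delta)."""
    return math.ceil(math.log(2 * num_hypotheses / delta) / eps)

# Sanity check of the union-bound step for a small finite class:
mu = unlabeled_sample_size(1024, eps=0.1, delta=0.05)
assert (1 - 0.1) ** mu <= 0.05 / (2 * 1024)
```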
Sample Complexity, Uniform Convergence Bounds
Bound the number of labeled examples via a measure of the helpfulness of D with respect to χ:
– a helpful D is one in which C_{D,χ}(ε) = {h ∈ C : err_unl(h) ≤ ε}, the set of compatible functions in C, is small.
Sample Complexity, Uniform Convergence Bounds
[Figure: a helpful distribution, under which the highly compatible functions form a small set, vs. a non-helpful distribution with 1/γ² clusters in which all partitions are separable by large margin]
Sample Complexity Subtleties
Uniform Convergence Bounds: a distribution-dependent measure of complexity that depends both on the complexity of C and on the complexity of χ.
ε-Cover bounds can be much better than Uniform Convergence bounds for algorithms that behave in a specific way:
• first use the unlabeled data to choose a representative set of compatible hypotheses,
• then use the labeled sample to choose among these.
Summary
Worst-case models (PAC, SLT) don’t capture the usefulness of unlabeled data in SSL. We develop a new model for SSL in which we can analyze fundamental sample complexity aspects:
• How much unlabeled data is needed; depends both on the complexity of C and of the compatibility notion.
• The ability of unlabeled data to reduce the number of labeled examples needed; depends on the compatibility of the target and the helpfulness of the distribution.
• Both uniform convergence bounds and ε-cover bounds.
Algorithmically, our analysis suggests better ways to do regularization based on unlabeled data.
Subsequent work using our framework:
• P. Bartlett, D. Rosenberg, AISTATS 2007
• J. Shawe-Taylor et al., Neurocomputing 2007
• K. Sridharan, S. Kakade, COLT 2008
• J. Zhu, survey in Encyclopedia of Machine Learning (2011)
The Agnostic Case
There is an inherent tradeoff between the quality of the comparison function f_t*, which improves as t increases, and the estimation error ε_t, which gets worse as t increases. We achieve the desired balance by using an estimate of ε_t.