Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... ·...

15
Non-Linear Regression GOAL: Keep the math of linear regression, but extend to more general functions KEY IDEA: You can make a non-linear function from a linear weighted sum of non-linear basis functions 1 Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Transcript of Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... ·...

Page 1: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Non-Linear Regression

GOAL:

Keep the math of linear regression, but extend to more general functions

KEY IDEA:

You can make a non-linear function from a linear weighted sum of non-linear basis functions

1Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 2: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Non-linear regression

Linear regression:

Non-Linear regression:

where

In other words, create z by evaluating x against basis functions, then linearly regress against z.

2Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 3: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Example: polynomial regression

3Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

A special case of

Where

Page 4: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Radial basis functions

4Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 5: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Arc Tan Functions

5Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

note:sigmoid-like functions

Page 6: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Non-linear regression

Linear regression:

Non-Linear regression:

where

In other words, create z by evaluating x against basis functions, then linearly regress against z.

6Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 7: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Maximum Likelihood

7Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Same as linear regression, but substitute in Z for X:

N

Page 8: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Bayesian Approach

Learn s2 from marginal likelihood as before

Final predictive distribution:

8Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 9: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

9Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 10: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

The Kernel TrickNotice that the final equation doesn’t need the

data itself, but just dot products between data items of the form zi

Tzj

So, we take data xi and xj pass through non-linear function to create zi and zj and then take dot products of different zi

Tzj

10Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 11: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

The Kernel TrickSo, we take data xi and xj pass through non-linear function to create zi and zj and then take dot products of different zi

Tzj

Key idea:

Define a “kernel” function that does all of this together. • Takes data xi and xj• Returns a value for dot product zi

Tzj

If we choose this function carefully, then it will correspond to some underlying z=f[x].

Never compute z explicitly - can be very high or infinite dimension

11Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 12: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Kernelized RegressionBefore

12Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

After

Page 13: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Example Kernels

(Equivalent to having an infinite number of radial basis functions at every position in space. Wow!)

13Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 14: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

RBF Kernel Fits

14Computer vision: models, learning and inference. ©2011 Simon J.D. Prince

Page 15: Non-Linear Regression - Penn State College of Engineeringrtc12/CSE586/lectures/regrNonlinear... · 2013-04-01 · Non-linear regression Linear regression: Non-Linear regression: where

Fitting Variance• We’ll fit the variance with maximum likelihood• Optimize the marginal likelihood (likelihood after

gradients have been integrated out)

• Have to use non-linear optimization

15Computer vision: models, learning and inference. ©2011 Simon J.D. Prince