CORRECTIONS L2 regularization ||w|| 2 2, not ||w|| 2 Show second derivative is positive or negative...

CORRECTIONS

• L2 regularization ||w||22

, not ||w||2

• Show second derivative is positive or negative on exams, or show convex– Latter is easier (e.g. x2)

• Loss = error associated with one data point• Risk = sum of all losses• Pseudoinverse gives least-squares solution, NOT

exact solutions• Magnitude of w matters for SVMs.

• Will be released today.• Probably harder than HW1 or HW2• Due Oct 6 (two Tuesdays from now)• HW party: Oct 1.• I wrote (some of) it.

Downsides of using kernels

• Speed & memory– Need to store all training data, each test point

must be computed against each training point• SVMs only need subset of data (support vectors)

• Overfit

3 Perspectives on Linear Regression

1. Minimize Loss (see lecture)

• Take derivative of ||Xw – y||2, set to 0• Result: X’Xw = X’y

2. Projections

3. Gaussian noise

• HW 3 – first problem has a question on this

Bias & Variance

• Bias:– Incorrect assumptions in your model – Your algorithm is only able to capture models of

complexity <= C, but the true model complexity is C’ > C

• Variance– Sensitivity of your algorithm to noise in the data.– How much your model changes per “unit” change

in the data.

Bias & Variance

• Bias vs. variance is a tradeoff• Bias– you assume data is linear, when it’s nonlinear.

• Variance– you assume data could be polynomial, when it’s

always linear.– By assuming data could be polynomial, lots of free

parameters that move around if the training data changes.

– High variance = “overfitting”

Bias & Variance

• If variance if too high, will often add bias in order to reduce variance.

• This is the reason regularization exists.– Increase bias, reduce variance.

• Usually depends on amount of data– More data fix down all those free parameters.

• Will revisit this with random forests.

Problem 1

• a) Do at home• b) Follow the Gaussian noise interpretation of

linear regression

Problem 2Credit: Yun Park

Problem 3 & 4

• 3) Write loss function, find derivative.• 4) Practice problems– “Extra for experts” is inaccurate – there is a very

simple answer.

CORRECTIONS L2 regularization ||w|| 2 2, not ||w|| 2 Show second derivative is positive or negative...

Documents

Transcript of CORRECTIONS L2 regularization ||w|| 2 2, not ||w|| 2 Show second derivative is positive or negative...

prml regularization

Non-local Regularization of Inverse Problems · 1.1.3 Non-local regularization of inverse problems. For some class of inverse problems, the weights w x;y can be estimated from the

Robust Attribution Regularization

Spectral Regularization and its Applications in Quantum ... · of view and satisﬁes both requirements is the Spectral Regularization. Spectral regularization was ﬁrst introduced

MEESEVA USER MANUAL FOR REGULARIZATION OF ENCROACHMENTS …ap.meeseva.gov.in/DeptPortal/Manuals/Revenue/Regularization of... · MEESEVA USER MANUAL FOR REGULARIZATION OF ENCROACHMENTS

Regularization Paths for Generalized Linear Models via … · 2017. 5. 5. · 2 Regularization Paths for GLMs via Coordinate Descent 4. ‘ 1 regularization paths for generalized

Sparsity regularization for inverse problems using curvelets€¦ · 1.1 Inverse problems, regularization and curvelets Inverse problems and regularization are widely used in di erent

Hybrid Regularization and Sparse Reconstruction of · PDF fileHybrid Regularization and Sparse Reconstruction of ... is a technique ... Hybrid Regularization and Sparse Reconstruction

W Hole Persian sLide SHow-Tester

LNCS 5304 - Non-local Regularization of Inverse Problemscohen/mypapers/... · framework improves over wavelet and total variation regularization. 2 Non-local Regularization 2.1 Inverse

N*W*C The Race Show

L1 Regularization

Regularization of linear inverse problems with total ... · regularization and convergence behavior for multiple parameters and functionals. However, all e orts aim towards regularization

Sliid show w short sale

Regularization, Ridge Regression - University of …courses.cs.washington.edu/.../regularization-xvalidation-lasso.pdf · Ridge Regression: Effect of Regularization 14 ! Solution

Fully Automatic Video Colorization With Self-Regularization ......Self-Regularization 4.1. Self regularization for colorization network Consider colorizing a textureless balloon. Although

Smooth regularization of bang-bang optimal control · PDF fileSmooth regularization of bang-bang optimal control ... Smooth regularization of bang-bang optimal control problems ...

Sparsity Based Regularization

Regularization in Neural Networks - Welcome to CEDARcedar.buffalo.edu/~srihari/CSE574/Chap5/Chap5.5-Regularization.pdf · Regularization in Neural Networks ... Need for Regularization

Geometry of Optimization and Implicit Regularization in ... · GEOMETRY OF OPTIMIZATION AND IMPLICIT REGULARIZATION DEEP LEARNING Geometry of Optimization and Implicit Regularization

NWC The Race Show