Post on 26-Dec-2015
Multiple Linear Regression - Matrix Formulation
Let x = (x1, x2, … , xn)′ be an n × 1 column vector and let g(x) be a scalar function of x. Then, by definition,

∂g/∂x = (∂g/∂x1, ∂g/∂x2, … , ∂g/∂xn)′

For example, let g(x) = x′x = Σ(i=1 to n) xi². Then ∂g/∂x = 2x.
Let a = (a1, a2, … , an)′ be an n × 1 column vector of constants. It is easy to verify that

∂(a′x)/∂x = a

and that, for symmetric A (n × n),

∂(x′Ax)/∂x = 2Ax
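These two identities can be checked numerically. Below is a minimal sketch in Python with NumPy (rather than the R used later in these notes); the helper num_grad is our own illustrative function, not part of any library. It compares the analytic gradients with central finite differences:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
x = rng.standard_normal(n)
a = rng.standard_normal(n)
M = rng.standard_normal((n, n))
A = (M + M.T) / 2                  # make A symmetric

def num_grad(g, x, h=1e-6):
    # central finite differences, one coordinate at a time
    grad = np.zeros_like(x)
    for i in range(len(x)):
        e = np.zeros_like(x)
        e[i] = h
        grad[i] = (g(x + e) - g(x - e)) / (2 * h)
    return grad

# d(a'x)/dx = a
assert np.allclose(num_grad(lambda v: a @ v, x), a, atol=1e-5)
# d(x'Ax)/dx = 2Ax for symmetric A
assert np.allclose(num_grad(lambda v: v @ A @ v, x), 2 * A @ x, atol=1e-4)
```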
Theory of Multiple Regression
Suppose we have response variables Yi ,
i = 1, 2, … , n and k explanatory variables/predictors X1, X2, … , Xk .
Yi = b0 + b1x1i + b2x2i + … + bkxki + ei,  i = 1, 2, … , n

There are k + 2 parameters: b0, b1, b2, …, bk and σ².
In matrix form,

Y = (Y1, Y2, … , Yn)′   (n × 1)

X =
[ 1  x11  x21  …  xk1 ]
[ 1  x12  x22  …  xk2 ]
[ ⋮   ⋮    ⋮        ⋮  ]
[ 1  x1n  x2n  …  xkn ]

(n × (k+1)). X is called the design matrix.

b = (b0, b1, … , bk)′   ((k+1) × 1)

Model: Y = Xb + e
OLS (ordinary least-squares) estimation

S = (Y − Xb)′(Y − Xb)
  = (Y′ − b′X′)(Y − Xb)
  = Y′Y − 2b′X′Y + b′X′Xb

∂S/∂b = −2X′Y + 2X′Xb = 0

which gives the normal equations

X′Xb̂ = X′Y

so that

b̂ = (X′X)⁻¹X′Y = AY, where A = (X′X)⁻¹X′

E(b̂) = AE(Y) = AXb = (X′X)⁻¹X′Xb = b, so b̂ is unbiased.
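The normal equations give a direct recipe for b̂. A small sketch (Python with NumPy, on made-up data; np.linalg.lstsq serves only as an independent check) solving X′Xb̂ = X′Y:

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 50, 3
# design matrix with an intercept column of ones
X = np.column_stack([np.ones(n), rng.standard_normal((n, k))])
beta_true = np.array([1.0, 2.0, -0.5, 0.3])
Y = X @ beta_true + 0.1 * rng.standard_normal(n)

# normal equations: X'X b_hat = X'Y
b_hat = np.linalg.solve(X.T @ X, X.T @ Y)

# agrees with NumPy's least-squares solver
b_lstsq, *_ = np.linalg.lstsq(X, Y, rcond=None)
assert np.allclose(b_hat, b_lstsq)
```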
Fitted values are given by

Ŷ = Xb̂ = X(X′X)⁻¹X′Y = HY,  where H = X(X′X)⁻¹X′

H is called the “hat matrix” (… it puts the hats on the Y’s)
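The hat matrix’s defining properties (symmetry, idempotence, and HY = Ŷ) are easy to verify numerically. A sketch in Python with NumPy, using an arbitrary design matrix:

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 10, 3
X = np.column_stack([np.ones(n), rng.standard_normal((n, p - 1))])
Y = rng.standard_normal(n)

H = X @ np.linalg.solve(X.T @ X, X.T)   # H = X (X'X)^{-1} X'

assert np.allclose(H, H.T)              # H is symmetric
assert np.allclose(H @ H, H)            # H is idempotent

b_hat = np.linalg.solve(X.T @ X, X.T @ Y)
assert np.allclose(H @ Y, X @ b_hat)    # HY gives the fitted values
```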
The error sum of squares, SSRES, is the minimised value of S:

SSRES = Y′Y − 2b̂′X′Y + b̂′X′Xb̂
      = Y′Y − 2b̂′X′Y + b̂′X′X(X′X)⁻¹X′Y
      = Y′Y − b̂′X′Y   (using X′Xb̂ = X′Y)

The estimate of σ² is based on this.
Example: Find a model of the form

Yi = b0 + b1x1i + b2x2i + ei

for the data below.

y     x1    x2
3.5   3.1   30
3.2   3.4   25
3.0   3.0   20
2.9   3.2   30
4.0   3.9   40
2.5   2.8   25
2.3   2.2   30
Y = (3.5, 3.2, 3.0, 2.9, 4.0, 2.5, 2.3)′

X =
[ 1  3.1  30 ]
[ 1  3.4  25 ]
[ 1  3.0  20 ]
[ 1  3.2  30 ]
[ 1  3.9  40 ]
[ 1  2.8  25 ]
[ 1  2.2  30 ]

X is called the design matrix. The model in matrix form is given by Y = Xb + e.
We have already seen that

X′Xb̂ = X′Y,  i.e.  b̂ = (X′X)⁻¹X′Y

Now calculate this for our example.
R can be used to calculate X′X and the answer is:

X′X =
[   7.0    21.6    200.0 ]
[  21.6    68.3    626.0 ]
[ 200.0   626.0   5950.0 ]
To input the matrix in R use

X = matrix(c(1,1,1,1,1,1,1,
             3.1,3.4,3.0,3.2,3.9,2.8,2.2,
             30,25,20,30,40,25,30),
           7, 3)      # 7 = number of rows, 3 = number of columns
t(X) %*% X            # %*% is the command for matrix multiplication

The inverse of X′X can also be obtained using R, with solve(t(X) %*% X).
We also need to calculate X′Y:

X′Y = (21.4, 67.67, 623.5)′

Now b̂ = (X′X)⁻¹X′Y = (−0.2138, 0.8984, 0.01745)′.

Notice that this is the same result as obtained previously using lm in R.

So y = −0.2138 + 0.8984x1 + 0.01745x2 + e
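As a cross-check on the hand calculation, b̂ can be recomputed from the data above. A sketch in Python with NumPy (an illustrative translation, not the notes’ own R code):

```python
import numpy as np

# data from the example table
y  = np.array([3.5, 3.2, 3.0, 2.9, 4.0, 2.5, 2.3])
x1 = np.array([3.1, 3.4, 3.0, 3.2, 3.9, 2.8, 2.2])
x2 = np.array([30, 25, 20, 30, 40, 25, 30], dtype=float)
X  = np.column_stack([np.ones(7), x1, x2])   # design matrix

XtX = X.T @ X                                # matches the 3x3 matrix computed in R
b_hat = np.linalg.solve(XtX, X.T @ y)        # b_hat ≈ (-0.2138, 0.8984, 0.01745)
```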
The “hat matrix” is given by

H = X(X′X)⁻¹X′

and the fitted Y values are obtained by

Ŷ = HY

Recall once more that we are looking at the model Y = Xb + e; the fitted values Ŷ can be compared with the observed values Y.
Error Terms and Inference
A useful result is:

σ̂² = (Y′Y − b̂′X′Y) / (n − k − 1)

n: number of points
k: number of explanatory variables
In addition we can show that:

(b̂i − bi) / s.e.(b̂i) ~ t(n−k−1)

where s.e.(b̂i) = σ̂ √c(i+1)(i+1) and c(i+1)(i+1) is the (i+1)th diagonal element of (X′X)⁻¹.
For our example:

Y′Y − b̂′X′Y = 67.44 − 67.1031 = 0.3369

σ̂² = 0.3369 / (7 − 2 − 1) = 0.08422,  so  σ̂ = 0.2902

(X′X)⁻¹ was calculated earlier; its diagonal elements are

c11 = 6.683, c22 = 0.7600, c33 = 0.0053

Note that c11 is associated with b0, c22 with b1 and c33 with b2.

We will calculate the standard error for b̂1:

s.e.(b̂1) = √0.7600 × 0.2902 = 0.2530

The value of b̂1 is 0.8984.
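The same machinery reproduces σ̂² and the standard error of b̂1. A Python/NumPy sketch continuing the example:

```python
import numpy as np

# data from the example table
y  = np.array([3.5, 3.2, 3.0, 2.9, 4.0, 2.5, 2.3])
x1 = np.array([3.1, 3.4, 3.0, 3.2, 3.9, 2.8, 2.2])
x2 = np.array([30, 25, 20, 30, 40, 25, 30], dtype=float)
X  = np.column_stack([np.ones(7), x1, x2])
n, k = 7, 2

b_hat = np.linalg.solve(X.T @ X, X.T @ y)
sigma2_hat = (y @ y - b_hat @ (X.T @ y)) / (n - k - 1)   # ≈ 0.08422
C = np.linalg.inv(X.T @ X)             # c11, c22, c33 are its diagonal elements
se_b1 = np.sqrt(sigma2_hat * C[1, 1])  # ≈ 0.2530
```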
Now carry out a hypothesis test.

H0: b1 = 0
H1: b1 ≠ 0

The standard error of b̂1 is 0.2530. The test statistic is

t = (b̂1 − b1) / s.e.(b̂1)

This calculates as (0.8984 − 0)/0.2530 = 3.55.

t tables using 4 degrees of freedom give a cut-off point of 2.776 for 2.5%.

Since 3.55 > 2.776 we reject H0. There is evidence at the 5% level that b1 is non-zero.

The process can be repeated for the other b values and confidence intervals calculated in the usual way.
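The test itself is a one-liner once b̂1 and its standard error are known. A small sketch using the rounded values from above (the 2.776 cut-off is taken from t tables, as in the text):

```python
b1_hat, se_b1 = 0.8984, 0.2530   # rounded values computed in the example above
t_stat = (b1_hat - 0) / se_b1    # tests H0: b1 = 0
t_crit = 2.776                   # two-sided 5% point of t with 4 d.f. (from tables)
# t_stat ≈ 3.55 > 2.776, so H0 is rejected
```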
CI for σ² — based on the χ²(4) distribution of 4σ̂²/σ²:

((4 × 0.08422)/11.14 , (4 × 0.08422)/0.4844)

i.e. (0.030 , 0.695)
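The interval can be reproduced directly from σ̂² and the tabulated χ²(4) points. A sketch using the values quoted above:

```python
sigma2_hat, df = 0.08422, 4
chi2_hi, chi2_lo = 11.14, 0.4844   # upper/lower 2.5% points of chi-square, 4 d.f. (from tables)
ci = (df * sigma2_hat / chi2_hi, df * sigma2_hat / chi2_lo)
# ci ≈ (0.030, 0.695)
```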
The sum of squares of the residuals can also be calculated:

SSRES = (Y − Xb̂)′(Y − Xb̂)