Real Analysis Lecture Notes

download Real Analysis Lecture Notes

of 124

Transcript of Real Analysis Lecture Notes

  • 8/3/2019 Real Analysis Lecture Notes

    1/124

    REAL ANALYSIS LECTURE NOTES

    Contents1. Introduction and Preliminaries 12. The Structure ofR 73. Lebesgue Measure 113.1. Lebesgue Measure and Measurable Sets 113.2. Measurable Functions 174. Lebesgue Integration 235. Differentiation and Integration 325.1. Motivation 325.2. Derivatives and Derivates 34

    5.3. Vitalis Covering Lemma 375.4. Variation and Absolute Continuity 395.5. Convexity 466. Lp Spaces 497. Metric Spaces 597.1. The Basics 597.2. Compactness 607.3. Continuous Functions 627.4. The StoneWeierstrass Theorem 638. General Measure and Integration 67

    8.1. Introduction 678.2. Measurable Functions 708.3. Integration 738.4. Lp Spaces 798.5. Signed Measures 828.6. Radon-Nikodym Theorem 909. Outer Measure and Extensions of Measures 929.1. Outer Measure 929.2. The Extension Theorem 959.3. Baire Measures on R 999.4. Product Measures 1019.5. Integral Operators 11010. The AscoliArzela Theorem 113Appendix A. Convergence in Measure 116References 118Index 119

  • 8/3/2019 Real Analysis Lecture Notes

    2/124

    c 2009, Jonathan R. Kish

  • 8/3/2019 Real Analysis Lecture Notes

    3/124

    REAL ANALYSIS LECTURE NOTES 1

    1. Introduction and Preliminaries

    The following notes are an offshoot of a year-long course in graduate analysis that Itook from Judith Packer at the University of Colorado, Boulder in 2005-2006. We usedRoydens text Real Analysis, 3rd edition (see [5]). There are no proofs in what follows,only statements of results as well as definitions, examples and remarks. I wanted alist of results that I could easily and quickly navigate and that was extensively cross-referenced to itself and the original source. I started with my lecture notes from class,added an index and did a bit of reorganizing. It proved enormously helpful while doinghomework and studying for exams. I may one day fill in all the proofs and providedetailed explanations, but until then the book Measure, Integral and Probability byCapinski and Kopp (see [3]) more than suffices. Another book I like is [6]. For anaccount of the history of the subject, see [4]. For undergraduate analysis, I used ARadical Approach to Real Analysis by David Bressoud (see [1]), a book that I have justrecently begun to appreciate and revisit with great interest. I have also just becomeaware that Bressoud has written a book on Lebesgue integration (see [2]). While I

    havent looked at it yet, if its anywhere near as good as [1] it will become part ofmy library. Any comments or suggestions are welcome, as are corrections. Errors are ofcourse mine. We begin with a brief motivational review of Riemann integration, followedby some elementary set theory and structure of the real line.

    Jonathan KishBoulder, ColoradoNovember 24, [email protected]

  • 8/3/2019 Real Analysis Lecture Notes

    4/124

    2 REAL ANALYSIS LECTURE NOTES

    Definition 1.1. Let a, b R such that a < b. Suppose that f : [a, b] R is boundedin the sense that

    < infx[a,b]

    f(x) supx[a,b]

    f(x) < .

    Let P= {a = x0 < x1 < < xn = b} be a partition of [a, b]. For each i, 1 i n, letmi = inf

    xi1xxif(x) and Mi = sup

    xi1xxif(x).

    Define the lower Riemann sum of f with respect to P by

    Lp(f) =n

    i=1

    mi(xi xi1).

    Similarly, define the upper Riemann sum of f with respect to P by

    Up(f) =n

    i=1

    Mi(xi xi1).

    Note that, by definition,

    (1) Lp(f) Up(f).The lower Riemann integral of f over [a, b], written

    a

    b

    f(x)dx,

    is defined by

    (2)

    a

    b

    f(x)dx = supP

    Lp(f)

    where sup

    Pis the supremum taken over all partitions

    Pof [a, b]. Moreover, note that the

    set Lp(f) is bounded above and so has a supremum. Similarly, the upper Riemannintegral of f over [a, b], written b

    a

    f(x)dx,

    is defined by

    (3)

    ba

    f(x)dx = infP

    Up(f)

    .

    We always have that

    a

    b

    f(x)dx ba

    f(x)dx.

    We say that f is Riemann integrable over [a, b] ifa

    b

    f(x)dx =

    ba

    f(x)dx =

    ba

    f(x)dx.

  • 8/3/2019 Real Analysis Lecture Notes

    5/124

    REAL ANALYSIS LECTURE NOTES 3

    Remark 1.2. Recall that the Riemann integral has some nice properties:

    (i) Every continuous function is Riemann integrable.(ii) The Fundamental Theorem of Calculus holds: If F is differentiable on [a, b] and

    if f(x) = F(x) is continuous on [a, b], then

    b

    a

    f(x)dx = F(b)

    F(a).

    Remark 1.3. There are drawbacks of the Riemann integral, however:

    (i) Recall that f has to be bounded. Consider, for example, f : [0, 1] R definedby

    f(x) =

    0 if x = 0

    1x

    if 0 < x 1.Note that of course we can get around this with improper integrals.

    (ii) Many functions that are easy to define are not Riemann integrable. Consider

    f : [0, 1] R defined byf(x) =

    0 if x is rational

    1 if x is irrational.

    For any partition Pof [0, 1], we have Lp(f) = 0 (there is a rational number inany interval), whereas Up(f) = 1. It follows that

    a

    b

    f(x)dx = 0 < 1 =

    ba

    f(x)dx.

    (iii) The Riemann integral does not work well with limiting processes. That is, it is

    possible to have a sequence of functions {fn}n=1 defined on [a, b], each Riemannintegrable, and limn fn(x) = f(x) exists, but f(x) might not be Riemannintegrable over [a, b].

    Our main aim is Lebesgue integration theory, in which it is easier to impose conditionsunder which

    limn

    fn(x)dx =

    limn

    fn(x)

    if limn

    fn(x) = f(x) exists.

    Remark 1.4.We denote the

    power setof a set

    Xby 2X or

    P(X

    ). The latter shouldcause no confusion with the notation Pthat stands for a partition, as the meaning ofthe symbol Pwill always be clear from the context.

    Definition 1.5. Let A be a subset of the underlying set X. The complement of A inX, A, is defined by A = {x X : x / A}.

  • 8/3/2019 Real Analysis Lecture Notes

    6/124

    4 REAL ANALYSIS LECTURE NOTES

    Similarly, if A X and B X, we define the relative complement of A in B,written B A, by

    B A = {x B : x / A}= B

    A.

    Definition 1.6. C is called a collection of subsets of X if C P(X).

    Lemma 1.7 (Generalized DeMorgans Laws). Let C be a collection of subsets ofX. Then

    AC

    A =AC

    A and AC

    A =AC

    A.See Exercise 13, p. 16 of [5].

    Definition 1.8. Let X be a set and suppose A and B are subsets ofX. The symmetricdifference of A and B, written AB, is defined by

    AB = (A B) (B A)= (A B) (B A).

    Note that AB = BA by properties of unions.

    Definition 1.9. Let X be a nonempty set. Assume that A P(X) is a nonemptycollection of subsets of X. We say that

    Ais an algebra of subsets of X if

    (i) Whenever A A and B A, then A B A (A is closed under the unionoperation).

    (ii) Whenever A A, A A (A is closed under taking complements).Example 1.10. Consider the following examples:

    (i) Let X be a set. Then A = P(X) is always an algebra.(ii) Let X = {a, b}. Then A = , {a, b} is an algebra.

    Remark 1.11 (Facts about algebras). Let A be an algebra of subsets of X. Then(i) If A, B A, then A B A.

    (ii) , X A.(iii) If A1, . . . , An A, then

    ni=1

    Ai A.

    (iv) If A1, . . . , An A, thenn

    i=1

    Ai A.

  • 8/3/2019 Real Analysis Lecture Notes

    7/124

    REAL ANALYSIS LECTURE NOTES 5

    (v) If A, B A, then B A A.(vi) If A, B A, then AB A.

    Remark 1.12. One way to construct a nontrivial algebra of sets is to let C be anynonempty collection of subsets of X and let

    Abe the smallest algebra of sets of X

    containing C. See Proposition 1.13.

    Proposition 1.13 (Proposition 1, 1.4, p. 17 of [5]). LetX be a nonempty set and letC be a nonempty collection of subsets of X. Then there is a unique algebra A containingC that is the smallest such algebra in the sense that if B is any other algebra of subsetsof X containing C, thenB also contains the algebra A.

    Definition 1.14. The smallest algebra containing

    Cas in Proposition 1.13 is called the

    algebra generated by C.

    Proposition 1.15 (Proposition 2, 1.4, p. 17 of [5]). LetX be a nonempty set and letA be an algebra of subsets of X. Suppose {Ai}i=1 A, that is, Ai A for all i 1.Then there is a collection of subsets {Bi}i=1 A such that

    (i) Bm Bn = for m = n.(ii) For every n N,

    ni=1

    Bi =n

    i=1

    Ai. (these are in the algebraA)

    (iii)

    i=1 Bi =

    i=1 Ai. (these infinite unions might not be inA, but as subsets of Xthey are equal)Remark 1.16. Proposition 1.15(iii) motivates the following definition.

    Definition 1.17. Let X be a nonempty set and let A be an algebra of subsets of X.A is a -algebra of subsets of X if, whenever {Ai}i=1 A,

    i=1

    Ai A. Thus we areallowing countably infinite unions now, not just finite unions.

    Remark 1.18 (Facts about -algebras).

    (i) Every -algebra of subsets is an algebra of subsets, but the converse is not truein general.

    (ii) Let {Ai}i=1 A for A a -algebra. Theni=1

    Ai A.

  • 8/3/2019 Real Analysis Lecture Notes

    8/124

    6 REAL ANALYSIS LECTURE NOTES

    Example 1.19. Let X be a nonempty set. Then P(X) is a -algebra of subsets of X.

    Proposition 1.20 (Proposition 3, 1.4, p. 19 of [5]). LetX be a nonempty set and let Cbe a nonempty collection of subsets of X. Then there is a unique -algebraA containing

    Cthat is the smallest possible in the sense that if

    Bis any other -algebra and

    C B,

    thenA B. Compare to Proposition 1.13, and see Exercise 19, p. 19 of [5].

    Definition 1.21. The smallest -algebra containing C as in Proposition 1.20 is calledthe -algebra generated by C.

  • 8/3/2019 Real Analysis Lecture Notes

    9/124

    REAL ANALYSIS LECTURE NOTES 7

    2. The Structure ofR

    Definition 2.1. Let O R. We say O is open in R if for each x O > 0 suchthat whenever y R and |y x| < , y O.

    Remark 2.2. Note that is open as there is nothing to verify. Moreoever, all ofR isopen. Now, ifx (a, b), let = min(x a, x b). Then (x , x + ) (a, b), whichimplies that (a, b) is open.

    Proposition 2.3 (Properties of open sets).

    (i) LetC be any collection of open subsets ofR. Then OC

    O is open (Proposition 7,2.5, p. 41 of [5]).

    (ii) If

    {Oi

    }ni=1 is any finite collection of open subsets ofR, then

    n

    i=1 Oi is open (Corol-lary 6, 2.5, p. 41 of [5]).Remark 2.4. If we have a countably infinite family of open subsets ofR, {Oi}i=1, theni=1

    Oi need not be open (it could be, if the intersection is empty). For example, letOn =

    1n ,

    1n

    for every n N, each of which is open. Then

    n=1On =

    n=1

    1n

    ,1

    n

    = {0}.

    This is not open since > 0 such that (0 , 0 + ) {0}.

    Proposition 2.5 (Proposition 8, 2.5, p. 42 of [5]). Any nonempty, open subset O ofR can be written as a countable union of disjoint open intervals.

    Definition 2.6. Let E R. We say that x R is a point of closure for E if forevery > 0, (x , x + ) E= .

    Definition 2.7. Let

    E = {x R : x is a point of closure of E}.E is called the closure of E. Note that E E, for if x E, then > 0, (x , x +

    ) E= . However, E E in general.

  • 8/3/2019 Real Analysis Lecture Notes

    10/124

    8 REAL ANALYSIS LECTURE NOTES

    Example 2.8. Consider a, b R with a < b. Let E = (a, b). It can be shown that(a, b) = [a, b].

    Proposition 2.9 (Properties of closures).

    (i) If A, B

    R, then A

    B = A

    B .

    (ii) If A B, then A B (Proposition 10, 2.5, p. 43 of [5]).(iii) Note, however, that in general, A B = A B.(iv) If E R, then E = E (Proposition 11, 2.5, p. 43 of [5]).

    Definition 2.10. We say a set F R is closed if F = F.

    Proposition 2.11 (Properties of closed sets).

    (i) Let

    Cbe any collection of closed subsets ofR. Then FC F is closed (Proposition

    13, 2.5, p. 44 of [5]).(ii) If {Fi}ni=1 is any finite collection of closed subsets of R, then

    ni=1

    Fi is closed

    (Proposition 12, 2.5, p. 44 of [5]).Compare to Proposition 2.3.

    Remark 2.12. If we have a countably infinite family of closed subsets ofR, {Fi}i=1,then

    i=1Fi need not be closed. For example, let Fn =

    2 + 1n , 3 1nfor every n N,

    each of which is closed. Thenn=1

    Fn =n=1

    2 + 1

    n, 3 1

    n

    = (2, 3),

    which is open. Compare to Remark 2.4.

    Proposition 2.13 (Proposition 14, 2.5, p. 44 of [5]). The complement of an open setis closed and the complement of a closed set is open.

    Example 2.14. F1 = {a} is closed, and F1 = (, a) (a, ) is open. Similarly,F2 = [a, b] is closed and F2 = (, a) (b, ) is open. See Proposition 2.13.Definition 2.15. Let B(R) (or just B) denote the smallest -algebra containing all theopen intervals, that is, containing C = {(a, b) : a < b }. B(R) is calledthe -algebra of Borel sets of R. B(R) also contains every open subset ofR since

  • 8/3/2019 Real Analysis Lecture Notes

    11/124

    REAL ANALYSIS LECTURE NOTES 9

    every open set can be written as a countable union of (disjoint) open intervals (seeProposition 2.5). Since B(R) is closed under taking complements, B(R) also contains allclosed subsets ofR (see Proposition 2.11(iii)).

    Definition 2.16. A set A

    R is said to be a G-set if we can write

    A =i=1

    Oi,

    where each Oi is open ( is from the d in Durchschnitt, German for intersection).Note that a G-set need not be open itself (cf. Remark 2.4)

    Definition 2.17. A set B R is said to be an F-set if we can write

    B =

    i=1 Fi,where each Fi is closed. Note that an F-set need not be closed (cf. Remark 2.12).

    Remark 2.18. Note that G-sets and F-sets are all Borel sets. Moreover, we could con-sider (G)-sets (the countable union ofG-sets) or (F)-sets (the countable intersectionof F-sets), and so on, all of which are also Borel sets.

    Definition 2.19. Let {xk}k=1 be a bounded sequence of real numbers. For each n N,let

    n = inf{xn, xn+1, xn+2, . . . } and n = sup{xn, xn+1, xn+2, . . . }.Thus we have two new sequences {n}n=1 and {n}n=1 where the ns are increasing,the ns are decreasing, and n n for all n.

    Define the lower limit of{xk}k=1 bylim xk = liminfxk = sup{n}n=1 = lim

    nn

    (the limit of an increasing sequence that is bounded above is its supremum).

    Similarly, define the upper limit of{xk}k=1 bylim xk = limsup xk = inf{n}n=1 = lim

    nn

    (the limit of a decreasing sequence which is bounded below is its infimum). Note that

    lim xk = limn

    n limn

    n = lim xk.

  • 8/3/2019 Real Analysis Lecture Notes

    12/124

    10 REAL ANALYSIS LECTURE NOTES

    Example 2.20. Consider the sequence

    {x1, x2, . . . , x2n1, x2n, . . . } =

    1, 3, 12

    ,5

    2, . . . , 1

    n, 2 +

    1

    n, . . .

    .

    Thenlim {xk} = 0 and lim {xk} = 2.

  • 8/3/2019 Real Analysis Lecture Notes

    13/124

    REAL ANALYSIS LECTURE NOTES 11

    3. Lebesgue Measure

    3.1. Lebesgue Measure and Measurable Sets.

    Remark 3.1. Our ideal situation is to generalize the notion of length of intervals to

    every subset ofR so that, letting denote this generalized length,(i) For all A R, 0 (A) . (could have infinite length)

    (ii) If I is an interval, then (I) = (I).(iii) Countable Additivity: If {Ai}i=1 is a countable family of pairwise disjoint

    subsets ofR, we want

    i1

    Ai

    =i1

    (Ai).(iv) Translation invariance: If A R and r R, then

    (A + r) =

    (A), where

    A + r =

    {x + r : x

    A

    }.

    We will see that it is possible to define this generalized length , which we will denotem, on P(R), and m will satisfy conditions (i), (ii) and (iv), but we wont be able toget countable additivity in general (condition (iii)). We can satisfy all four conditionsfor Borel sets, but not for every subset ofR.

    Definition 3.2. If A R is any subset ofR, the Lebesgue outer measure of A,denoted m(A), is given by

    m(A) = infn1 (In) :all countable collections

    of open intervals

    {In

    }n1 suchthat A

    n1 In .Here we allow

    n1 (In) to equal as some of the intervals might have infinite length.

    We also allow, for r [0, ), r + = + r = . Note that it could be the casethat for every countable collection of open intervals {In}n1 such that A

    n1 In,

    n1 (In) could be . In this case, m(A) = (e.g. A = R).

    Remark 3.3 (Basic facts about m).

    (i) 0 m(A) for all A R. (note the sums are nonnegative)(ii) m(

    ) = 0.

    (iii) Monotone Property: If A B R, then m(A) m(B).(iv) m

    {x} = 0 for every x R.Proposition 3.4 (Proposition 1, 3.2, p. 56 of [5]). Let I be an interval inR. Thenm(I) = (I).

  • 8/3/2019 Real Analysis Lecture Notes

    14/124

    12 REAL ANALYSIS LECTURE NOTES

    Proposition 3.5 (Countable Subadditivity - Proposition 2, 3.2, p. 57 of [5]). If{An}n1 is a countable collection of subsets ofR, then

    m

    n1An

    n1

    m(An).

    Corollary 3.6 (Corollary 3, 3.2, p. 58 of [5]). If E is a countable subset ofR, thenm(E) = 0.

    Corollary 3.7 (Corollary 4, 3.2, p. 58 of [5]). The set [0, 1] is uncountable.

    Corollary 3.8. R is an uncountable set.

    Corollary 3.9. m(Q) = 0.

    Corollary 3.10. m

    [0, 1] Q = 0.Note that [0, 1] Q is dense in [0, 1], yet m[0, 1] Q = 0 < 1 = m[0, 1].Corollary 3.11. R

    \Q is uncountable.

    Remark 3.12. Note that we made no assumption of pairwise disjointness of the {An}n1in Proposition 3.5. What if they are pairwise disjoint? Do we get equality? ConsiderA1 = [a, b) and A2 = [b, c]. Then A1 A2 = [a, c] where A1 A2 = . Moreover,m(A1 A2) = c a and m(A1) + m(A2) = b a + c b = c a, so we do get equalityin this particular case. This is not true in general, however. Using the Axiom ofChoice, it is possible to construct A1, A2 R with A1 A2 = and m(A1 A2) 0, there exists an open set U A such that

    m(U) m(A) + .(ii) Given A R, there exists a G-set G with G A and m(G) = m(A).

    See Exercise 6, p. 58 of [5].

    Proposition 3.33 (Proposition 15, 3.3, p. 63 of [5]). LetE R. Then the followingfive conditions are equivalent:

    (i) E M.(ii) For every > 0, there exists an open set U E such that m(U E) < .

    (iii) For every > 0, there exists a closed set F E such that m(E F) < .(iv) There exists a G-set G E such that m(G E) = 0.(v) There exists an F-set F E such that m(E F) = 0.

    If, in addition, the outer measure of E is finite, all of the conditions (i) (v) areequivalent to

    (vi) For every > 0, there exists a finite collection{Ui}Ni=1 of open intervals such that

    (11) m

    Ni=1

    Ui

    E

    < .

    See Exercise 13, p. 64 of [5].

    Remark 3.34. Part (vi) of Proposition 3.33 is the precise statement of LittlewoodsFirst Principle: Every Lebesgue measurable set is almost a finite union of open

    intervals. In other words, you can approximate any finite, measurable E R with afinite union of open intervals.

    Remark 3.35 (Existence of nonmeasurable sets). Start with E = [0, 1] R,which is measurable. Define an equivalence relation on E by x y x y Q.Let (E) denote the family of equivalence classes of E with respect to . For eachequivalence class (E), choose x E = [0, 1] (using the Axiom of Choice). Let

    P =

    (E){x}.

    We claim that P is nonmeasurable. If it were measurable, then there are two possibilities:(i) m(P) = 0.

    (ii) 0 < m(P) 1 since P [0, 1].Assuming (i), one can show that m(R) = 0, which is a contradiction. Similarly, assuming(ii), one can show that m

    [1, 2] = , which is again a contradiction. Thus P must

    be nonmeasurable. See 3.5 of [5] for the full details.

  • 8/3/2019 Real Analysis Lecture Notes

    19/124

    REAL ANALYSIS LECTURE NOTES 17

    3.2. Measurable Functions.

    Remark 3.36. At this point, we wish to generalize the notion of continuous functions,which should of course include the continuous functions. See Proposition 3.37, Definition3.38 and Example 3.39.

    Proposition 3.37 (Proposition 18, 3.5, p. 66 of [5]). Let f be an extended real-valued function whose domain E is measurable. Then the following four conditions areequivalent:

    (i) R {x E : f(x) > } M.(ii) R {x E : f(x) } M.

    (iii) R {x E : f(x) < } M.(iv) R {x E : f(x) } M.

    All of the conditions (i) (iv) imply(v)

    R

    {x

    E : f(x) =

    } M(actually this is true

    R

    {,}

    ).

    Note that conditions (i) (iv) can be rewritten as(i) R f1(, ] M.

    (ii) R f1[, ] M.(iii) R f1[, ) M.(iv) R f1[, ] M.

    Definition 3.38. If any of the conditions (i)(iv) in Proposition 3.37 hold, the functionf is said to be Lebesgue measurable.

    Example 3.39. Suppose that f : R R is continuous. Fix R. Considerx R : f(x) > = f1(, ).

    Note that (, ) is open. Since f is continuous, f1(, ) is open and hencef1

    (, ) M (see Proposition 18, 2.6, p. 47 of [5]). Thus f satisfies condi-

    tion (i) of Proposition 3.37 and is therefore Lebesgue measurable. It follows that theLebesgue measurable functions include the continuous functions, as desired (see Remark3.36).

    Definition 3.40. Let A R. We define the characteristic function (a.k.a indicatorfunction) A : R R by

    A(x) =

    1 if x A0 if x A .

    Note that if A / {,R}, then range A = {0, 1}. If A = , then range A = {0} and ifA = R, range A = {1}.

  • 8/3/2019 Real Analysis Lecture Notes

    20/124

    18 REAL ANALYSIS LECTURE NOTES

    Lemma 3.41. LetA R. Then A is measurable iff A M.

    Proposition 3.42. Let E M and suppose that D E is a measurable subset ofE. Let f be a measurable extended real-valued function defined on E. Then f|D ismeasurable.

    Proposition 3.43 (Proposition 19, 3.5, p. 67 of [5]). Let c R and let f and g bemeasurable, real-valued functions defined on E M. Then

    (i) f + g is measurable.(ii) c f is measurable.

    (iii) g f is measurable.(iv) gf is measurable (product, not composition).

    Definition 3.44. We say that a property P holds almost everywhere (abbreviateda.e.) on a measurable subset E ofR if

    m{x E : P is not true for x} = 0.

    Proposition 3.45 (Proposition 21, 3.5, p. 69 of [5]). If f and g are defined onE Msuch that f = g a.e. on E and f is measurable, then g is measurable (Note f and g canbe extended real-valued functions).

    Theorem 3.46 (Theorem 20, 3.5, p. 68 of [5]). Let{fi}i=1 be a sequence of measurableextended real-valued functions defined on E M. Then

    (i) sup{fi}ni=1 is measurable n N.(ii) sup{fi}i=1 is measurable.

    (iii) inf{fi}ni=1 is measurable n N.(iv) inf{fi}i=1 is measurable.(v) Hence lim fn and lim fn are measurable extended real-valued functions.

    Corollary 3.47. If{

    fn}n=1 is a sequence of measurable functions defined on E

    Mand if f(x) = limn

    fn(x) exists in the extended real numbers for every x E, then f ismeasurable.

    Definition 3.48. Let E M. We say that : E R is simple if is Lebesguemeasurable and the range of is finite. The simple functions are the building-blockfunctions.

  • 8/3/2019 Real Analysis Lecture Notes

    21/124

    REAL ANALYSIS LECTURE NOTES 19

    Proposition 3.49. Let be a simple function defined on E M. Then we can write

    (12) (x) =n

    i=1

    ciAi(x)

    where the {Ai}ni=1 are pairwise disjoint measurable sets such thatn

    i=1Ai = E, we have

    c1 < c2 < < cn and range = {ci}ni=1.

    Remark 3.50 (Converse to Proposition 3.49). If

    g(x) =n

    i=1

    ciAi(x)

    where the Ai M are pairwise disjoint, then g is a simple function defined on R.

    Definition 3.51. Equation (12) is called the canonical representation of the simple

    function . Note that corresponds one set to each element in the range of .

    Remark 3.52. If1 and 2 are simple functions and c, d R, then c1 +d2, c1 d2and 12 (the product) are all simple.

    Definition 3.53. Let {fn}n=1 be a sequence of real-valued functions defined on E R,and let f be a real-valued function defined on E R. We say that fn convergespointwise to f on E if for every x E, lim

    nfn(x) = f(x). That is, > 0, N N

    (depending on and on x) such that

    |fn(x) f(x)| < whenever n N.

    Definition 3.54. Let {fn}n=1 be a sequence of real-valued functions defined on E R,and let f be a real-valued function defined on E R. We say that fn convergesuniformly to f on E if > 0, N N (depending only on and not on x) such that

    |fn(x) f(x)| < for every n

    N and for every x

    E (the same N

    N works for all x

    E). This means

    limn

    fn(x) = f(x) and fn(x) f(x) at a rate that can be made independent of x E.

    Theorem 3.55. Letf : E [0, ] be a measurable, nonnegative extended real-valuedfunction defined on E M. Then there is an increasing sequence {n}n=1 of simple functions with lim

    nn(x) = f(x) for all x E. If f is bounded, this convergence is

    uniform. See Exercise 4, p. 89 of [5].

  • 8/3/2019 Real Analysis Lecture Notes

    22/124

    20 REAL ANALYSIS LECTURE NOTES

    Definition 3.56. For a measurable extended real-valued function f defined on E M,let

    P = {x E : f(x) 0}and

    N = {x E : f(x) < 0}.Note that P, N

    Mby Proposition 3.37, P

    N =

    and P

    N = E. Let

    (13) f+ = fP =

    f(x) if x P0 if x N = E P

    (where we define 0 = 0) and

    (14) f = fN =

    f(x) if x N0 if x P = E N

    (where we define 0 = 0). f+ and f are called the positive and negative parts off, respectively.

    Remark 3.57. Note that,

    (15) f(x) = f+(x) f(x)and

    (16) |f(x)| = f+(x) + f(x).Furthermore, f+ and f are measurable whenever f is (note that P and N aremeasurable, now use Proposition 3.43 with (13) and (14)). Therefore |f| is measurablesince it is the sum of two measurable functions (see (16) and Proposition 3.43). This

    motivates the following corollary.

    Corollary 3.58. Let f : E [, ] be a measurable function defined on E M. Then there exists a sequence {n}n=1 of simple functions defined on E such thatlimn

    n(x) = f(x) for every x E. If f is bounded, this convergence is uniform.

    Remark 3.59. An intuitive way of stating Corollary 3.58 is that Every measurablefunction is nearly a simple function. Technically this means that if f is a measurablefunction defined on E

    Mwith m(E) 0 there exists a simple

    function such that

    m{x E : |f(x) (x)| } < .

    That is, if we let

    B = {x E : |f(x) (x)| },then for all x E B, |f(x) (x)| < . In other words, approximates f very wellon E B. Related to this is Littlewoods Second Principle: Every measurable

  • 8/3/2019 Real Analysis Lecture Notes

    23/124

    REAL ANALYSIS LECTURE NOTES 21

    function is nearly a continuous function. The technical form of this principle is givenin Proposition 3.60.

    Proposition 3.60 (Proposition 22, 3.5, p. 69 of [5]). Letf be a measurable functiondefined on a closed interval [a, b]. Assume f takes on the values

    on a set of measure

    0. Then given any > 0 there exists a step function g and a continuous function h suchthat

    m{x [a, b] : |f(x) g(x)| } <

    and

    m{x [a, b] : |f(x) h(x)| } < .

    That is,

    |f(x) g(x)| < except on a set of measure less than and

    |f(x) h(x)| < except on a set of measure less than .Moreover, if f is bounded, m f(x) M for every x [a, b], then we can choose g andh such that m g(x) M and m h(x) M for all x [a, b]. See Exercise 23, p. 71of [5].

    Remark 3.61. The intuitive form of Littlewoods Third Principle states that ev-ery sequence of measurable functions that is pointwise convergent is nearly uniformlyconvergent. See Proposition 3.65 for the technical form of this principle.

    Proposition 3.62 (Proposition 23, 3.6, p. 72 of [5]). LetE be a measurable subset ofR such that m(E) < . Suppose that {fn}n=1 is a sequence of measurable real-valued

    functions defined onE that converges pointwise to some real-valued function f definedon E. Then given > 0 and > 0 there is a measurable set A E with m(A) < and N N such that |fn(x) f(x)| < for every x E A and for every n N.Remark 3.63. Note that we are not assuming fn converges uniformly on E

    A. The

    set A depends on both and so that as you change and the set A changes.

    Corollary 3.64 (Proposition 24, 3.6, p. 73 of [5]). In the statement of Proposition3.62, it is enough to assume that fn(x) f(x) a.e. on E. The conclusion of theProposition will still hold.

  • 8/3/2019 Real Analysis Lecture Notes

    24/124

    22 REAL ANALYSIS LECTURE NOTES

    Proposition 3.65 (Egoroffs Theorem - See Exercise 30, p. 73 of [5]). Let{fn}n=1be a sequence of measurable, real-valued functions defined on E M with m(E) < .Suppose that {fn}n=1 converges pointwise a.e. to the measurable, real-valued function fdefined on E. Then given any > 0 there exists a measurable set A E with m(A) < and {fn}n=1 converges uniformly to f on E

    A.

  • 8/3/2019 Real Analysis Lecture Notes

    25/124

    REAL ANALYSIS LECTURE NOTES 23

    4. Lebesgue Integration

    Remark 4.1. Recall that if is a simple function, we can write

    (17) (x) =n

    i=1ciAi(x)

    where

    Ai = {x R : (x) = ci}and c1 < c2 < < cn (see Proposition 3.49). Note that if ci0 = 0 for some i0,1 i0 n, then

    ci0Ai0 (x) = 0 Ai0 = 0,so that this term does not contribute to the sum (17). Thus we will henceforth assumethat all the cis are not equal to zero.

    Definition 4.2. We say a simple function is zero outside a set of finite measureor vanishes outside a set of finite measure if

    m{x R : (x) = 0} < .

    This will happen if and only ifn

    i=1

    m(Ai) < .

    Definition 4.3. Let be a simple function that is zero outside a set of finite measure.Then the Lebesgue integral of over R, written

    or

    R

    is defined by

    (18)

    =

    ni=1

    cim(Ai)

    where is given by (17). Similarly, for E M, the Lebesgue integral of overE R is defined by

    (19) E

    =n

    i=1

    cim(Ai E)

    Lemma 4.4. If E M, then E

    =

    E.

  • 8/3/2019 Real Analysis Lecture Notes

    26/124

    24 REAL ANALYSIS LECTURE NOTES

    Lemma 4.5 (Lemma 1, 4.2, p. 78 of [5]). Let be a simple function that vanishesoutside a set of finite measure. Suppose that we can write

    (20) (x) =m

    j=1

    bjEj(x)

    where the{

    Ej}mj=1 are measurable, pairwise disjoint subsets ofR. Note that (20) might

    not be the canonical representation of (the bjs might not be increasing, or they mightrepeat if we split an Ai into two parts). But then

    (21)

    =

    mj=1

    bjm(Ej).

    So as long as the Ejs are pairwise disjoint, we may use the same formula for the integralas in (18).

    Proposition 4.6 (Proposition 2, 4.2, p. 78 of [5]). Let and be simple functionsthat both vanish outside a set of finite measure. Then(i) a, b R,

    (a + b) = a

    + b

    .

    (ii) If (x) (x) onR, then

    .

    Definition 4.7. Let f be a bounded, measurable real-valued function defined on E Mwhere m(E) < . We define the Lebesgue integral of f over E, denoted

    (22)

    E

    f

    by

    (23)

    E

    f = inf

    E

    : is simple and (x) f(x) x E

    .

    Remark 4.8. Note that we could have taken

    (24) sup

    E

    : is simple and (x) f(x) x E

    in Definition 4.7. It turns out these two definitions of the integral are the same, but thiswill take some work (see Proposition 4.12).

  • 8/3/2019 Real Analysis Lecture Notes

    27/124

    REAL ANALYSIS LECTURE NOTES 25

    Lemma 4.9. If f is a measurable, bounded function defined on E M such thatm(E) < , and if A f(x) B for all x E, then(25) A m(E)

    E

    f B m(E).

    Definition 4.10. Recall Definition 4.7. Iff is a bounded measurable function we define

    (26) A = inf

    E

    : is simple and (x) f(x) x E

    ,

    called the upper envelope of f over E. Similarly, we define

    (27) B = sup

    E

    : is simple and (x) f(x) x E

    ,

    called the lower envelope of f over E.

    Remark 4.11. We can always show that B A regardless of whether or not f isLebesgue measurable.

    Proposition 4.12 (Proposition 3, 4.2, p. 79 of [5]). Let f be a bounded real-valued function defined onE M with m(E) < . Then

    (i) If f is Lebesgue measurable, then

    (28)

    E

    f = A = B

    (ii) If A = B, then f is Lebesgue measurable,

    where A and B are as defined in Definition 4.10. In other words,A = B if and only if f is Lebesgue measurable.

    Corollary 4.13 (Proposition 4, 4.2, p. 81 of [5]). Letf be a bounded function definedon [a, b], < a < b < . If f is Riemann integrable over [a, b], then f is Lebesguemeasurable on [a, b] and

    (29) Riem

    ba

    f(x)dx

    = Leb

    [a,b]

    f

    .

    Proposition 4.14 (Properties of the Lebesgue integral coinciding with theRiemann integral - Proposition 5, 4.2, p. 82 of [5]). Let f, g be bounded Lebesguemeasurable functions defined on E M such that m(E) < . Then

    (i) For every a R, E

    af = a

    E

    f.

  • 8/3/2019 Real Analysis Lecture Notes

    28/124

    26 REAL ANALYSIS LECTURE NOTES

    (i) E

    (f + g) =

    E

    f +

    E

    g.

    Hence for every a, b R,

    E(af + bg) = a

    Ef + b

    Eg.

    (ii) If f g a.e. on E, then E

    f E

    g.

    (iii) If f = g a.e. on E, then E

    f =

    E

    g.

    (iv) If M1 f(x) M2 a.e. on E, thenM1 m(E)

    E

    f M2 m(E).(v) IfA andB are disjoint measurable subsets ofE with m(A) 0. Then

    (i) E

    af = a

    E

    f

    where a = in case f is not integrable.(ii)

    E

    (f + g) =

    E

    f +

    E

    g.

  • 8/3/2019 Real Analysis Lecture Notes

    30/124

    28 REAL ANALYSIS LECTURE NOTES

    (iii) If f(x) g(x) a.e. on E, thenE

    f E

    g.

    (iv) If E1 and E2 are disjoint measurable subsets ofR, then

    E1E2 f = E1 f + E2 f.We use this Proposition to prove Fatous Lemma (Lemma 4.20).

    Theorem 4.20 (Fatous Lemma - Theorem 9, 4.3, p. 86 of [5]). Let{fn}n=1 be asequence of nonnegative, Lebesgue measurable functions defined on E M. Supposethat there is a function f defined on E such that fn(x) f(x) a.e. on E (note that fis thus measurable by Corollary 3.47). Then

    (34) Ef lim Efn.Note that were not saying the RHS of (34) isnt , or that both sides of (34) arent.

    Theorem 4.21 (Monotone Convergence Theorem, or MCT - Theorem 10, 4.3,p. 87 of [5]). Let{fn}n=1 be a sequence of nonnegative, Lebesgue measurable functionsdefined on E M. Suppose also that the fns are increasing, that is,

    0 f1(x) f2(x) fn(x) fn+1(x) for every n N and for every x E (hence the monotone). Thenf(x) = lim

    nfn(x)

    exists as a measurable, extended real-valued function on E and

    (35)

    E

    f = limn

    E

    fn.

    Since f(x) = limn

    fn(x), (35) is the same as

    (36)

    E

    limn

    fn = limn

    E

    fn.

    Corollary 4.22 (Corollary 11, 4.3, p. 87 of [5]). Let{un}n=1 be a sequence of nonneg-ative, Lebesgue measurable functions defined on E M. Then(37)

    E

    n=1

    un =n=1

    E

    un.

    Note that both sides of the equals sign in (37) might be . Note also that if 0 a band un = anx

    n for an 0, then anxn 0 on [a, b].

  • 8/3/2019 Real Analysis Lecture Notes

    31/124

    REAL ANALYSIS LECTURE NOTES 29

    Corollary 4.23 (Proposition 12, 4.3, p. 87 of [5]). Let f be a nonnegative integrable function defined onE M. Let{Ei}i=1 be pairwise disjoint measurable subsets of E.Then

    (38)

    i=1

    Ei

    f =i=1

    Ei

    f.

    Remark 4.24. Note that, up to this point, we have only been able to integrate functionsthat are nonnegative. The natural question arises: how do we integrate functions thatare not necessarily nonnegative?

    Definition 4.25. Suppose that f is a measurable function defined on E M. We saythat f is Lebesgue integrable over E if f+ and f are both integrable over E. Inthis case, we define

    (39) E

    f = E

    f+ E

    f.

    See Definition 3.56.

    Proposition 4.26. Let f be a measurable function defined on E M. Then f isintegrable over E if and only if |f| is integrable over E. See Exercise 10(a), p. 93 of [5].

    Proposition 4.27 (Proposition 13, 4.3, p. 88 of [5]). Let f and g be nonnegative,measurable functions defined on E

    M. Suppose that f is integrable and g(x)

    f(x)

    for every x E. Then g and f g are integrable and(40)

    E

    (f g) =E

    fE

    g.

    Proposition 4.28 (Proposition 14, 4.3, p. 88 of [5]). Letf be a nonnegative, integrableextended real-valued function defined on E M. Then for any > 0, > 0 such thatwhenever A E is measurable with m(A) < , we have

    Af < .

    This is sometimes referred to as the continuity of the Lebesgue integral. This

    shows that

    A

    f can be as small as you want as long as you take the set A small enough.

  • 8/3/2019 Real Analysis Lecture Notes

    32/124

    30 REAL ANALYSIS LECTURE NOTES

    Proposition 4.29 (Proposition 15, 4.4, p. 90 of [5]). Letf andg be integrable functionsover E M. Then

    (i) For every c R, cf is integrable over E andE

    cf = c

    E

    f.

    (ii) f + g is integrable over E andE

    (f + g) =

    E

    f +

    E

    g.

    (iii) If f(x) g(x) a.e. on E, thenE

    f E

    g.

    In particular,

    (41)

    E

    f

    E

    |f|.

    (iv) If E1 and E2 are disjoint measurable subsets of E, thenE1E2

    f =

    E1

    f +

    E2

    f.

    Compare Proposition 4.19.

    Corollary 4.30 (Corollary to Proposition 4.28 and Proposition 4.29 (iii)).If f is an integrable function over E M, then > 0, > 0 such that wheneverA E and m(A) < ,

    (42) A

    f < .Theorem 4.31 (Lebesgue Dominating Convergence Theorem, or DCT -Theorem 16, 4.4, p. 91 of [5]). Let g be a nonnegative, measurable function definedon E M that is integrable over E. Let {fn}n=1 be a sequence of measurable func-tions all defined on E such that |fn(x)| g(x) x E and n N. Suppose thatlimn

    fn(x) = f(x) a.e. for some function f defined on E. Then f is integrable over E

    and

    (43) E

    f = limn

    E

    fn.

    Since f(x) = limn

    fn(x), (43) is equivalent to

    (44)

    E

    limn

    fn = limn

    E

    fn.

  • 8/3/2019 Real Analysis Lecture Notes

    33/124

    REAL ANALYSIS LECTURE NOTES 31

    Remark 4.32. As an application of DCT, recall that MCT (see Theorem 4.21) neednot hold for a decreasing sequence of functions unless we impose some further conditions(see Exercise 7(b), p. 89 of [5]). This motivates the following corollary.

    Corollary 4.33. Let

    {fn

    }n=1 be a decreasing sequence of nonnegative, Lebesgue mea-

    surable functions defined onE M. Suppose further thatf1 is integrable overE. Then,letting f = lim

    nfn, we have

    (45)

    E

    f = limn

    E

    fn.

    Note that, since the fns are decreasing and nonnegative (i.e., bounded below by 0),limn

    fn exists and is equal to its infimum. See also Assignment 4, Exercise 2(b).

    Theorem 4.34 (Generalized Lebesgue Dominating Convergence Theorem,

    or GDCT - Theorem 17, 4.4, p. 92 of [5]). Let{gn}n=1 be a sequence of nonnegative,integrable functions defined on E M. Suppose that(i) lim

    ngn(x) = g(x) a.e. on E, where g is integrable over E.

    Now, let{fn}n=1 be a sequence of measurable functions all defined on E such that(ii) lim

    nfn(x) = f(x) a.e. on E.

    Suppose also that

    (iii) |fn(x)| gn(x) x E and n N.Then, if

    (46) limnEgn = Eg,then

    (47) limn

    E

    fn =

    E

    f.

    Since f(x) = limn

    fn(x), (47) is equivalent to

    (48) limn

    E

    fn =

    E

    limn

    fn.

    Remark 4.35.Note that in Theorem 4.34 each

    fnis dominated by

    gn, which might

    change. If all the gns are equal, we get the regular DCT (see Theorem 4.31).

  • 8/3/2019 Real Analysis Lecture Notes

    34/124

    32 REAL ANALYSIS LECTURE NOTES

    5. Differentiation and Integration

    5.1. Motivation.

    Remark 5.1. Recall the Fundamental Theorem of Calculus: Let f be continuouson [a, b] and suppose that F : [a, b]

    R is differentiable and satisfies F(x) = f(x) for

    every x [a, b]. Then

    (49)

    ba

    f(x)dx = F(b) F(a) orba

    F(x)dx = F(b) F(a).

    If F is measurable on [a, b], can we obtain equalities resembling (49)? There are threepossible roadblocks to answering this question in the affirmative:

    (i) F(x) might not be defined everywhere, that is,

    {x [a, b] : F is not differentiable }

    might have positive measure.(ii) Suppose that F(x) is defined a.e. on [a, b]. F(x) might not be integrable over

    [a, b], which implies that [a,b]

    F

    might not be defined.(iii) Even if F exists a.e. on [a, b] and is integrable over [a, b], how would we know

    that

    (50) [a,b] F = F(b) F(a)holds? In other words, why would (50) be true for more general functions than

    just continuous functions?

    Indeed, all three roadblocks above can occur, which we illustrate in the following threeexamples.

    Example 5.2. (See Remark 5.1(i)) In the late 1800s Karl Weierstrass provided anexample of a function F which is continuous at every x

    [a, b] but differentiable at no

    point in [a, b].

    Example 5.3. (See Remark 5.1(ii)) Note that

    F(x) =

    x2 sin

    1x2

    if 0 < x 1

    0 if x = 0

  • 8/3/2019 Real Analysis Lecture Notes

    35/124

    REAL ANALYSIS LECTURE NOTES 33

    is continuous at every x [0, 1]. Furthermore, one can check that F is differentiable atall points in [0, 1] where

    F(x) =

    2x sin

    1x2

    2x

    cos

    1x2

    if 0 < x 1

    0 = limx0+

    F(x)F(0)x0 if x = 0.

    This function F is not integrable over [0, 1], however, so the integral[0,1]

    F(x)

    is not defined and thus we cant define an analog of the Fundamental Theorem of Cal-culus.

    Example 5.4. (See Remark 5.1(iii)) Let

    F(x) = 0 if 0 x 0

    f(x + h) f(x)h

    = inft>0

    sup

    0

  • 8/3/2019 Real Analysis Lecture Notes

    37/124

    REAL ANALYSIS LECTURE NOTES 35

    and

    (59) Df(x) Df(x) Left-hand derivates.What if theyre all equal? It turns out that

    (60) limh0+f(x + h)

    f(x)

    h exists and is finite D+f(x) = D+

    f(x)

    and

    (61) limh0

    f(x + h) f(x)h

    exists and is finite Df(x) = Df(x).Thus

    f(x) = limh0

    f(x + h) f(x)h

    exists and is finite if and

    only if Df(x) = Df(x) = D+f(x) = D+f(x)

    by (60) and (61). This can be used to determine when a function is not differentiable.

    Remark 5.8. If a measurable function f is differentiable a.e. on [a, b], put h = 1n in thedefinition of derivative so that

    (62) limn

    f(x + 1n

    ) f(x)1n

    = limn

    n

    f

    x +

    1

    n

    f(x)

    exists a.e. on [a, b]. Note that fn(x) = n

    f

    x + 1n

    f(x)

    is measurable for every

    nN, so if the limit in the right-hand side of (62) exists and is finite a.e., the limit

    function limn

    fn(x) will be Lebesgue measurable. Thus the derivative is measurable in

    this case.

    Definition 5.9. A function f : [a, b] R is called monotone increasing if wheneverx1, x2 [a, b] and x1 < x2, then f(x1) f(x2).

    Definition 5.10. A function f : [a, b] R is called monotone decreasing if when-ever x1, x2 [a, b] and x1 < x2, then f(x1) f(x2).

    Remark 5.11. Note that monotone functions are always bounded since f(a) f(x) f(b) for all x [a, b] if f is monotone increasing, whereas f(a) f(x) f(b) for allx [a, b] if f is monotone decreasing.

  • 8/3/2019 Real Analysis Lecture Notes

    38/124

    36 REAL ANALYSIS LECTURE NOTES

    Remark 5.12. If f(x) is monotone, then limtx+

    f(t) and limtx

    f(t) exist. For example, if

    f is monotone increasing, then

    (63) limtx+

    f(t) = inft>x

    {f(t)} and limtx

    f(t) = supt

  • 8/3/2019 Real Analysis Lecture Notes

    39/124

    REAL ANALYSIS LECTURE NOTES 37

    5.3. Vitalis Covering Lemma.

    Definition 5.15. Let E R. Let Ibe a collection of nondegenerate intervals ofR.We say that Iis a Vitali cover for E, or that Icovers E in the sense of Vitali, if x E and > 0, I Isuch that x I and m(I) = l(I) < .

    Example 5.16. Let E = R and let

    I= {(a, b) : a < b, a, b R}.This is a Vitali cover for R since if x R and > 0 is given, let

    I =

    x 3

    , x +

    3

    .

    Then x I and l(I) = 23 < .

    Example 5.17. Let E = [0, 1] and letI= {(s, t) : 0 s < t 1, s ,t Q}.

    You should convince yourself that this Iis a Vitali cover for [0, 1].

    Lemma 5.18 (Vitalis Covering Lemma - Lemma 1, 5.1, p. 98 of [5]). SupposeE R with m(E) < (E need not be measurable here). LetIbe a Vitali cover forE.Then > 0 there exist pairwise disjoint intervals {In}Nn=1 with{In}Nn=1 I, IiIj =

    for 1 i j N, i = j and such that

    (66) mE Ni=1

    Ii < .Theorem 5.19 (Theorem 3, 5.1, p. 100 of [5]). Letf be monotone increasing on [a, b].Then f is differentiable a.e. on [a, b]. Moreover, f is measurable (see Exercise ??) andintegrable over [a, b], with

    (67) 0

    [a,b]

    f f(b) f(a).

    Note that the left-hand inequality in (67) follows from the fact that all the derivatives

    are nonnegative.

    Corollary 5.20. Let f : [a, b] R be monotone decreasing. Then f exists a.e. on[a, b], f is integrable over [a, b] and

    (68) < f(b) f(a)

    [a,b]

    f.

  • 8/3/2019 Real Analysis Lecture Notes

    40/124

    38 REAL ANALYSIS LECTURE NOTES

    Note that f is monotone increasing. The result follows by applying Theorem 5.19 tothe function f.

    Remark 5.21. Theorem 5.19 and Corollary 5.20 show that all monotone functions fare differentiable a.e. on [a, b] and that f is integrable. Furthermore, these two results

    show that all finite linear combinations of monotone functions are differentiable a.e. on[a, b] and that their derivatives are integrable.

  • 8/3/2019 Real Analysis Lecture Notes

    41/124

    REAL ANALYSIS LECTURE NOTES 39

    5.4. Variation and Absolute Continuity.

    Definition 5.22. Let f : [a, b] R. Let(69) P= {a = x0 < x1 < x2 < < xn = b}be a partition of [a, b]. Define the total variation of f with respect to P, T(f,

    P),

    by

    (70) T(f, P) =n

    i=1

    f(xi) f(xi1) < (note that (70) is finite since its a finite sum). We say that f is of bounded variationover [a, b], written f BV[a, b], if(71) sup

    P partitionsof [a,b]

    T(f, P) < .

    If f

    BV[a, b], let

    (72) Tba(f) = supPT(f, P),

    Tba(f) is called the total variation of f over [a, b].

    Example 5.23. Let

    (73) f(x) =

    sin 1x if 0 < x 10 if x = 0

    What happens when you take the supremum of all of these sums (70) over all possiblepartitions P of [0, 1]? Surely f(x) is not of bounded variation as defined above (seeDefinition 5.22).

    Remark 5.24 (Properties and examples of functions of bounded variationover [a, b]).

    (i) If f is monotone on [a, b], then f BV[a, b].

    (ii) There exist examples of functions that are continuous on [a, b] yet are not of

    bounded variation over [a, b]. Thus continuity alone is not enough to ensure afunctions is of bounded variation over [a, b]. See Exercise ??.

    (iii) If f and g are of bounded variation over [a, b] and c R, thenf + g BV[a, b]

    and

    cf BV[a, b].

  • 8/3/2019 Real Analysis Lecture Notes

    42/124

    40 REAL ANALYSIS LECTURE NOTES

    This implies that the class of functions of bounded variation over [a, b] is a vectorspace over R.

    (iv) If f exists at all x [a, b], and if f is bounded on [a, b], then f BV[a, b]. SeeExercise ?? and use the Mean Value Theorem.

    (v) If f BV[a, b], then f is bounded on [a, b], i.e., M > 0 such that |f(x)| M.

    Remark 5.25. We fix a bit of notation. For r R, let

    (74) r+ =

    r if r 00 if r < 0

    and

    (75) r =

    0 if r > 0

    r if r

    0

    .

    Note that

    (76) r = r+ r,(77) |r| = r+ + r,(78) 0 r+ |r|,and

    (79) 0 r |r|.Note the similarity to Definition 3.56.

    Definition 5.26. Let f : [a, b] R. If P= {a = x0 < x1 < < xn = b} is somepartition of [a, b], let the positive variation of f with respect to P be

    (80) p(f, P) =n

    i=1

    f(xi) f(xi1)

    +.

    Similarly, define the negative variation of f with respect to P by

    (81) n(f, P) =n

    i=1 f(xi) f(xi1)

    .

    Remark 5.27. Note that by (78) and (79) we get that

    (82) 0 p(f, P) T(f, P)and

    (83) 0 n(f, P) T(f, P).

  • 8/3/2019 Real Analysis Lecture Notes

    43/124

    REAL ANALYSIS LECTURE NOTES 41

    This is true for all partitions Pof [a, b]. Note that, using (76), one can easily derive that(84) p(f, P) n(f, P) = f(b) f(a),that is,

    (85) p(f, P) = f(b) f(a) + n(f, P)

    for any partition Pof [a, b]. Similarly, using (77), we see that(86) p(f, P) + n(f, P) = T(f, P)for every partition Pof [a, b].

    Definition 5.28. Let f BV[a, b]. We define the positive variation of f over [a, b]as

    (87) Pba(f) = supP

    p(f, P).

    Similarly, we define the negative variation of f over [a, b] as(88) Nba(f) = supP

    n(f, P).

    Remark 5.29. Letting f BV[a, b], since(89) p(f, P) T(f, P) Tba(f) < ,(where the last inequality on the right comes from the assumption that f BV[a, b]),we see that

    (90) 0 Pba(f) < since Tba(f) is an upper bound on p(f, P) for any partition Pof [a, b] by (89), hencethere exists a least upper bound on

    p(f, P) for all partitions P of [a, b], which we

    have defined as Pba(f) (see Definition 5.28). Similar to (89) and (90), we see that

    (91) 0 Nba(f) Tba(f) < where again the last inequality on the right follows from the hypothesis that f BV[a, b].

    Lemma 5.30 (Lemma 4, 5.2, p. 103 of [5]). Letf : [a, b] R be of bounded variationover [a, b]. Then(92) Pba(f) Nba(f) = f(b) f(a)and

    (93) Pba(f) + Nba(f) = T

    ba(f).

  • 8/3/2019 Real Analysis Lecture Notes

    44/124

    42 REAL ANALYSIS LECTURE NOTES

    Definition 5.31. Let f BV[a, b]. For a x b, considerTxa (f) : [a, b] [0, )

    defined by

    (94) Txa (f) = 0 if x = a

    Tx

    a

    (f) if a < x

    b

    Similarly, define

    Pxa (f) : [a, b] [0, )and

    Nxa (f) : [a, b] [0, )in the corresponding way. Note that if f BV[a, b], then f is of bounded variation

    over [a, x] for all x (a, b]. Thus, by Lemma 5.30 we have, for every x [a, b],(95) f(x) f(a) = Pxa (f) Nxa (f)and

    (96) Txa (f) = Pxa (f) + Nxa (F).

    If x = a, everything is zero.

    Proposition 5.32. If f BV[a, b], then Txa (f), Pxa (f) and Nxa (f) are monotone in-creasing functions on [a, b].

    Theorem 5.33 (Theorem 5, 5.2, p. 103 of [5]). Let f : [a, b] R. Then f is

    of bounded variation on [a, b] if and only if f can be written as the difference of twomonotone increasing (real-valued) functions on [a, b].

    Corollary 5.34 (Corollary 6, 5.2, p. 104 of [5]). Suppose that f : [a, b] R and thatf is of bounded variation over [a, b]. Then f exists a.e. on [a, b] and f is integrableover [a, b]. Note that f integrable over [a, b] implies that f is measurable over [a, b].

    Remark 5.35. Note that we still dont have an analogue of the Fundamental Theoremof Calculus. Recall Exercise 5,

    4.3, p. 89 of [5]: If f : [a, b]

    [0,

    ], and f is

    integrable over [a, b], then

    (97) F(x) =

    [a,x]

    f(t)dt or F(x) =

    [a,x]

    f

    is continuous on [a, b]. It turns out this is true in more generality. We have the followingLemma 5.36.

  • 8/3/2019 Real Analysis Lecture Notes

    45/124

    REAL ANALYSIS LECTURE NOTES 43

    Lemma 5.36 (Lemma 7, 5.3, p. 105 of [5]). Suppose that f : [a, b] [, ] isintegrable over [a, b]. Then

    (98) F(x) =

    xa

    f(t)dt

    is continuous on [a, b] and F BV[a, b].

    Remark 5.37. Note that by Theorem 5.33 and its Corollary 5.34, F is differentiablea.e. on [a, b] and F is integrable over [a, b] where

    F(x) =

    xa

    f(t)dt.

    The natural question arises: What is F(x)? It turns out that F(x) = f(x), but thisis not as easy to show as in the Riemann case. We need to develop some further toolsfirst.

    Lemma 5.38 (Lemma 8, 5.3, 105 of [5]). Suppose that f : [a, b] [, ] isintegrable over [a, b], and suppose that

    (99) F(x) =

    xa

    f(t)dt = 0

    for every x [a, b]. Then f(x) = 0 a.e. on [a, b].

    Lemma 5.39 (Lemma 9, 5.3, p. 106 of [5]). Letf : [a, b] R be Lebesgue measurableand bounded on [a, b] (note that f is then integrable over [a, b]). Suppose that

    (100) F(x) = F(a) + xa

    f(t)dt.

    Then F(x) = f(x) a.e. on [a, b].

    Theorem 5.40 (Theorem 10, 5.3, p. 107 of [5]). Letf be a Lebesgue integrable functionover [a, b] and suppose that

    (101) F(x) = F(a) +

    xa

    f(t)dt

    for F(a)

    R. Then F(x) = f(x) a.e. on [a, b].

    Definition 5.41. Let g : [a, b] R. We say that g is absolutely continuous on[a, b] if > 0, > 0 such that whenever (ci, di)ni=1 is a pairwise disjoint collectionof intervals in [a, b] with

    (102)n

    i=1

    (di ci) < ,

  • 8/3/2019 Real Analysis Lecture Notes

    46/124

    44 REAL ANALYSIS LECTURE NOTES

    then

    (103)n

    i=1

    g(di) g(ci) < .Remark 5.42. It turns out that g is absolutely continuous on [a, b] if and only if g(x)exists a.e. on [a, b], g is integrable over [a, b], and

    (104) g(x) g(a) =xa

    g(t)dt

    for every x [a, b]. We will see shortly how to prove this. This result extends theFundamental Theorem of Calculus to the class of absolutely continuous functions. Notethe integrand in (104) need not be continuous as in the Riemann case, it only needs tobe Lebesgue integrable.

    Remark 5.43. Note that any absolutely continuous function is uniformly continuous on[a, b] (take n = 1 in Definition 5.41 and you get the definition of uniformly continuous).The converse is not true in general, however. We will shortly characterize all absolutelycontinuous functions. See Theorem 5.50.

    Remark 5.44. If

    F(x) =

    xa

    f(t)dt + F(a)

    where f is Lebesgue integrable over [a, b], then F is absolutely continuous over [a, b].To prove this, use the continuity of the Lebesgue integral (see Proposition 4.28). The

    converse of this will turn out to be true, but this is harder to prove. We will show thatany absolutely continuous function is differentiable first, and the easiest way to do thisis to show it is of bounded variation over [a, b].

    Lemma 5.45 (Lemma 11, 5.4, p. 108 of [5]). Let f : [a, b] R be absolutelycontinuous on [a, b]. Then f is of bounded variation over [a, b].

    Corollary 5.46. If f is absolutely continuous on [a, b], then f exists a.e. on [a, b] andf is Lebesgue integrable over [a, b].

    Proof. See Corollary 5.34.

    Remark 5.47. Linear combinations of absolutely continuous functions are absolutelycontinuous, but compositions are not necessarily. See Exercise 17(a), p. 111 of [5] (andbe careful - the Exercise as stated in the book is incorrect. Can you come up with acounterexample?).

  • 8/3/2019 Real Analysis Lecture Notes

    47/124

    REAL ANALYSIS LECTURE NOTES 45

    Lemma 5.48 (Lemma 13, 5.4, p. 109 of [5]). If f is absolutely continuous on [a, b]and f(x) = 0 a.e., then f(x) = f(a) is constant for every x [a, b].

    Remark 5.49. Note that Lemma 5.48 is not true in general for functions of boundedvariation. Consider the Cantor ternary function (see Exercise 48, p. 50 and Exercise 15,

    p. 111 of [5]). Call this function f. Then f : [0, 1] [0, 1], f is monotone increasing, fexists a.e. and f(x) = 0 a.e., but f is not constant. The problem is that f is continuousbut not absolutely continuous.

    Theorem 5.50 (Characterization of absolutely continuous functions -Theorem 14, 5.4, p. 110 of [5]). A function F : [a, b] R can be expressed as anindefinite integral of a Lebesgue integrable function over[a, b] if and only ifF is absolutelycontinuous on [a, b].

    Corollary 5.51 (Corollary 15, 5.4, p. 110 of [5]). Every absolutely continuous functionF on [a, b] can be written as the indefinite integral of its derivative plus a constant, F(a).

  • 8/3/2019 Real Analysis Lecture Notes

    48/124

    46 REAL ANALYSIS LECTURE NOTES

    5.5. Convexity.

    Definition 5.52. Suppose that : (a, b) R is a function defined on an open interval(a, b) in R. We say that is convex if for each x, y (a, b), x < y and for every [0, 1],(105) x + (1 )y (x) + (1 )(y).This relates to concave up in the Calc 1 sense of the term (whereas concave meansconcave down in Calc 1 language). Consider the chord joining

    x, (x)

    and

    y, (y)

    .

    The point on this chord whose first coordinate is x + (1 )y has second coordinateequal to (x) + (1 )(y). Hence is convex iff the graph of lies on or below anychord joining two points on the graph.

    Lemma 5.53 ((Lemma 16, 5.5, p. 113 of [5]). Let be convex on (a, b). Supposec1, d1, c2, d2 are points in (a, b) such that c1 c2 < d2 and c1 < d1 d2, then

    (106) (d1) (c1)d1 c1

    (d2) (c2)d2 c2 .

    The inequality in (106) is equivalent to

    (107) (d2 c2)(d1) + (d1 c1)(c2) (d1 c1)(d2) + (d2 c2)(c1).

    Definition 5.54. Suppose f : (a, b) R and let x0 (a, b). IfDf(x0) = Df(x0)exist, are finite and equal to each other, we say that f is left differentiable at x0 andwrite the common value Df(x0) = Df(x0) as fL(x0). Thus

    (108) limxx

    0

    f(x) f(x0)x x0 = f

    L(x0).

    Similarly, if we have D+f(x0) = D+f(x0) and both are finite, we say that f is right

    differentiable at x0 and write the common value D+f(x0) = D+f(x0) as f

    R(x0). Thus

    (109) limxx+

    0

    f(x) f(x0)x x0 = f

    R(x0).

    Remark 5.55. Note that, if perchance m

    R is such that L(x0)

    m

    R(x0), then

    we always have

    (110)(x) (x0)

    x x0 m (y) (x0)

    y x0whenever a < x < x0 < y < b. We will need this to prove Jensens inequality (seeProposition 5.61).

  • 8/3/2019 Real Analysis Lecture Notes

    49/124

    REAL ANALYSIS LECTURE NOTES 47

    Proposition 5.56 (Proposition 17, 5.5, p. 113 of [5]). Let : (a, b) R be convex.Then on every closed subinterval of(a, b) is absolutely continuous. Moreover, the right-and left-hand derivatives L and

    R exist at each point in (a, b) and

    L(x) =

    R(x)

    except on a countable subset of [a, b] (Note that in general if a function is absolutelycontinuous then the derivative exists a.e. on its domain. See Corollary 5.46). Further-more, L and

    R are monotone increasing functions and

    L(x)

    R(x) at each point

    x (a, b).

    Proposition 5.57 (Proposition 18, 5.5, p. 114 of [5]). Suppose is continuous on(a, b) and suppose that one derivate, say D+ is finite and monotone increasing on(a, b). Then is convex on [a, b].

    Definition 5.58. If is convex on (a, b), m R and x0 (a, b), we say the line(111) y = m(x

    x0) + (x0)

    through the point (x0, (x0)) is a supporting line for if the graph of is always onor above the line in (125). This is equivalent to saying that (x) m(x x0) + (x0)for every x (a, b). In other words, (125) is a supporting line if

    (x) (x0)x x0 m

    for x > x0 and(x) (x0)

    x x0 mfor x < x0. Thus if is differentiable and m

    L(x0),

    R(x0)

    , then (125) is a

    supporting line for . Thus convex functions always have supporting lines (althoughthis is not true for more general functions). See Proposition 5.56 and Remark 5.55.

    Corollary 5.59 (Corollary 19, 5.5, p. 115 of [5]). Let : (a, b) R be twicedifferentiable on (a, b). Then is convex on (a, b) if and only if (x) 0 for everyx (a, b).

    Example 5.60. To illustrate Corollary 5.59, consider

    (i) Let (x) = e

    x

    = exp(x) for every x R. Since (x) = ex

    > 0 for every x R, is convex.(ii) Let (x) = x2. Since (x) = 2x and (x) = 2 > 0 for every x R, is convex.

    Proposition 5.61 (Jensens Inequality - Proposition 20, 5.5, p. 115 of [5]). Letbe convex on (, ) = R and suppose that f : [0, 1] R is Lebesgue integrable over[0, 1]. Suppose that either

  • 8/3/2019 Real Analysis Lecture Notes

    50/124

    48 REAL ANALYSIS LECTURE NOTES

    (i) f is integrable over [0, 1], or(ii) (x) 0 for all x R.

    Then

    (112)

    [0,1]

    f(t)

    dt

    [0,1]

    f(t)dt

    .

    Corollary 5.62 (Corollary 21, 5.5, p. 116 of [5]). Let f be Lebesgue integrable over[0, 1]. Then

    (113)

    [0,1]

    exp

    f(t)

    dt exp

    [0,1]

    f(t)dt

    .

    Remark 5.63. The concave version of Jensens inequality and its corollary (See Propo-sition 5.61 and Corollary 5.62) are as follows: If is concave and h is Lebesgue integrableover [0, 1], then

    (114)

    [0,1]

    h(t)

    dt

    [0,1]

    h(t)dt

    .

    Otherwise, let h(x) be positive and (x) = ln x. Then

    (115)

    [0,1]

    ln

    h(t)

    dt ln

    [0,1]

    h(t)dt

    .

    Remark 5.64. Here is a useful fact about convex functions. Let : (a, b) R beconvex. Suppose that t1, t2, . . . , tn [0, 1] satisfy

    ni=1 ti = 1. Ifx1, x2, . . . , xn (a, b),

    then

    ni=1

    tixi

    ni=1

    ti(xi).

  • 8/3/2019 Real Analysis Lecture Notes

    51/124

    REAL ANALYSIS LECTURE NOTES 49

    6. Lp Spaces

    Definition 6.1. Let X be either [0, 1] or R. Let f : X [, ] be Lebesguemeasurable. Recall f is said to be integrable over X if f+ and f are integrable overX (see Definition 4.25). This is equivalent to |f| being integrable over X (see Exercise10, p. 93 of [5]) and we say in this case that f is in L1(X,m), and write

    (116) f1 =X

    |f(x)|dm.

    Remark 6.2. Recall if h : X [, ] is Lebesgue measurable and finite a.e.,and g : R R is continuous, then g h is Lebesgue measurable. Now, suppose thatf : X [, ] is Lebesgue measurable on X and finite a.e. Then |f| is Lebesguemeasurable on X and finite a.e. as well. Fix p [1, ). Let gp : [0, ) [0, )be defined by g

    p(t) = tp. Note that g

    pis continuous, thus g

    p|f| = |f|p is Lebesguemeasurable and finite a.e.Definition 6.3. Let f be Lebesgue measurable on X, f : X [, ]. Fix p [1, ). We say f is in Lp(X,m) if

    (117)

    X

    |f(x)|pdm(x) < ,

    and we write

    (118) fp = X |f(x)|pdm(x)1/p

    (note that we will need the 1/p in (118) to prove the triangle inequality later).

    Example 6.4. Let X = [0, 1] and let

    f(x) =

    if x = 0

    1x

    if 0 < x 1.It can be shown that f is in L1

    [0, 1], m

    . f is not in L2

    [0, 1], m

    , however, since

    |f(x)|2 = if x = 01x if 0 < x 1

    and [0,1]

    1

    xdm(x) = .

  • 8/3/2019 Real Analysis Lecture Notes

    52/124

    50 REAL ANALYSIS LECTURE NOTES

    Remark 6.5. Example 6.4 shows that, even on [0, 1], just because |f(x)| < a.e., itdoes not necessarily follow that f is in Lp[0, 1] for all p.

    Example 6.6. Let X = R and let

    g(x) = 0 if < x 11x

    if 1 x < .Recall that g is not integrable over R since

    R

    |g(x)|dm(x) =

    [1,)

    1

    xdx = lim

    tln x

    t1

    = ,

    so that g / L1(R, m). However,

    |g(x)|2 =

    0 if < x 11x2

    if 1 x <

    and R

    |g(x)|2dx < so that g L2(R, m).

    Remark 6.7. Powers of 1x are useful for constructing examples, as in Examples 6.4 and6.6.

    Definition 6.8. We turn to the p =

    case. Let f : X

    [

    ,

    ] and suppose that

    f is Lebesgue measurable. We say that f is essentially bounded on X if M 0such that

    |f(x)| M a.e. on (X, m),that is,

    m{x : |f(x)| > M} = 0.

    M is called an essential bound for f in this case.

    Example 6.9. Let X = R and let

    f(x) =0 if x = 01 if x R\Q1

    x if x Q\{0}.This function is not bounded since lim

    nf( 1n) = limn

    11n

    = . Note, however, that f isessentially bounded by 1 since |f| = 1 a.e.

  • 8/3/2019 Real Analysis Lecture Notes

    53/124

    REAL ANALYSIS LECTURE NOTES 51

    Definition 6.10. We say that f is in L(X,m) iff is essentially bounded on (X, m).In this case we define

    (119) f = M = inf{M 0 : M is an essential bound for f}.Note that the set in (119) is nonempty and bounded below by 0, so it has an infimum.

    Proposition 6.11. If f L(X, m), thenf = M is an essential bound for f. Inother words, M is an element of the set in (119).

    Remark 6.12. For 1 p , we want to show that Lp(X, m) is a normed vectorspace. Thus the key elements of the norm we want to show are

    (i) For p : Lp(X, m) [0, ), fp = 0 f = 0 (this shows uniqueness ofthe zero vector).

    (ii) For every R and for all f Lp

    (X, m), f Lp

    (X, m) and fp = ||fp.(iii) If f1, f2 Lp(X, m), then f1 + f2 Lp(X, m) and (Minkowskis inequality)f1 + f2p f1p + f2p.

    The problem that arises is that iff = 0 a.e., then fp = 0 but f is not identically zero.To remedy this situation, if fp < and gp < , we say f and g are equivalent,written f g, if(120) m

    {x : f(x) = g(x)} = 0,that is, if f(x) = g(x) a.e. on X. You should check that is an equivalence relation.We are now ready to give the proper definition of Lp(X, m).

    Definition 6.13. The elements ofLp(X,m) are equivalence classes of functions

    (121)

    [f] : f : X [, ] is Lebesgue measurable and fp <

    ,

    where the equivalence relation is as defined in Remark 6.12. It can be shown easily thatLp(X, m) thus defined is still a vector space. We get a unique zero vector in this case,fixing the problem from Remark 6.12. We will immediately become less formal and usemostly function notation (i.e. f and g) to denote elements ofLp(X, m) instead of [f], [g].

    This should cause no confusion.

    Proposition 6.14 (See 1, 6.2, p. 120 of [5]). Fix p [1, ]. Let f, g Lp(X, m).Then f + g Lp(X, m). Note that this actually works for all p (0, ], but we dontget a norm for p (0, 1).

  • 8/3/2019 Real Analysis Lecture Notes

    54/124

    52 REAL ANALYSIS LECTURE NOTES

    Proposition 6.15. We now show that for 1 p , Lp(X, m) is a normed vectorspace with norm

    (122) fp =

    X

    |f(x)|pdx1/p

    if 1 p <

    ess sup |f(x)| if p = .That is, p : Lp(X, m) [0, ) defined in (122) satisfies

    (i) fp = 0 f = 0.(ii) For every R and for every f Lp(X, m), fp = ||fp.

    (iii) Minkowskis inequality: If f1, f2 Lp(X, m), thenf1 + f2p f1p + f2p.

    (See 1, 6.2, p. 120 of [5] and Theorem 6.16).

    Theorem 6.16 (Minkowskis Inequality - See 1, 6.2, p. 120 of [5]). This is theproof of Proposition 6.15(iii). If f, g Lp(X, m) with 1 p , then(123) f + gp fp + gp.Furthermore, if 1 < p < , then equality will hold in (123) if and only if , Rsuch that f = g.

    Remark 6.17. Thus for 1 p , Lp(X, m) is a normed vector space. In fact, we willshow that Lp(X, m) is a Banach space for 1 p , that is, Lp(X, m) is complete in

    the metric induced by its norm (see Remark 6.22 and Theorem 6.34). First we establishthe next important inequality for Lp-spaces, the Holder inequality.

    Definition 6.18. Ifp [1, ], we defined the conjugate exponent q to p as follows:

    q =

    if p = 1

    pp1 if 1 < p <

    1 if p =

    .

    Note that 1 < pp1 < in the middle case above and limp1+

    pp1 = . Furthermore,

    limp

    pp1 = 1 so our definition is a continuous choice of conjugate exponents. Note that

    1

    p+

    1

    q=

    1

    p+

    1p

    p1=

    1

    p+

    p 1p

    = 1.

    The special case where p = q occurs when p = q = 2.

  • 8/3/2019 Real Analysis Lecture Notes

    55/124

    REAL ANALYSIS LECTURE NOTES 53

    Proposition 6.19 (Youngs Inequality - See Exercise 8, p. 123 of [5]). Leta 0,b 0 and p (1, ). Let q be the conjugate exponent to p. Then(124) ab a

    p

    p+

    bq

    q

    with equality holding in (124) if and only if ap = bq.

    Theorem 6.20 (Holders Inequality - See 4, 6.2, p. 121 of [5]). Let (X, ,m)be

    [0, 1], M, m or (R, M, m). Letp [1, ] and let q be the conjugate exponent top. Letf Lp(X, m) and g Lq(X, m). Thenf g is Lebesgue integrable over X (i.e.,f g L1(X, m)) and(125) f g1 fpgq,with equality holding in (125) if and only if , 0 with |f(x)|p = |g(x)|q a.e.Corollary 6.21 (Cauchy-Schwarz Inequality). If f, g L2(X, m), then f g L1(X, m) (i.e., f g is Lebesgue integrable over X) and

    f g1 f2g2.This is why L2 is an important space - we are able to define an inner product.

    Remark 6.22. Note that the norm p in (122) makes Lp(X, m) into a metric spacewith metric

    dp

    [f], [g]

    = f gp.

    Minkowskis inequality can be used to prove that, given [f], [g], [h]

    Lp(X, m),

    dp

    [f], [g] dp[f], [h]+dp[h], [g],

    the triangle inequality in the metric space.

    Definition 6.23. We say that a sequence {fn}n=1 in a normed linear space is a Cauchysequence if for any > 0 there is an N N such that for all n N and all m N,fn fm < . It is easy to show that every convergent sequence is a Cauchy sequence.

    Definition 6.24. Recall a metric space (Y, d) is said to be complete if every Cauchysequence {yn}n=1 Y converges to some y Y.

    Example 6.25. Note that (R, d = | |) (i.e. R with the normal absolute value) iscomplete, whereas (Q, | |) is not complete.

  • 8/3/2019 Real Analysis Lecture Notes

    56/124

    54 REAL ANALYSIS LECTURE NOTES

    Definition 6.26. Let (Y, ) be a normed vector space over R. We say that (Y, ) isa Banach space over R if (Y, d) is a complete metric space (where d is the metricinduced by the norm. See Remark 6.22 for an example). We will show that Lp(X, m) isa Banach space for 1 p .

    Remark 6.27. For the record, everything we do from now on will be over R. Ourresults generalize to over C, but this is not the focus of our course.

    Definition 6.28. Let (Y, ) be a normed vector space (over R). Let {yk}k=1 Y bea sequence. We say that

    k=1 yk is summable in (Y, ) if s Y such that taking

    sn =n

    k=1 yk, then the sequence of partial sums {sn}n=1 converges to s in (Y, ), thatis,

    limn

    s sn = 0.

    Example 6.29. Consider the following:

    (i) Let (Y, ) = (R, | |). Let

    yk =(1)k

    k

    k=1

    R. Thenk=1

    (1)kk is summable

    (the alternating harmonic series).

    (ii)

    yk =(1)kk2

    k=1

    R is also summable.

    Definition 6.30. Let {yk}k=1 be a sequence in (Y, ), a normed vector space. Wesay that the sum k=1 yk is absolutely summable if(126)

    k=1

    yk < ,

    that is, if the sum in (126) converges.

    Remark 6.31. Absolutely summable does not imply summable in general, althoughit does in R. Take (Q, | |), for example, and let y1 = 3, y2 = 110 , y3 = 4100 , y4 = 11000 ,y5 =

    510000 , etc. That is, let

    yn = nth digit of

    10n.

    Thenk=1

    yk =k=1

    |yk| <

    converges to in R but

    k=1 yk is not summable in (Q, | |).

  • 8/3/2019 Real Analysis Lecture Notes

    57/124

    REAL ANALYSIS LECTURE NOTES 55

    For the converse, i.e. that summability does not imply absolute summability, consider

    the sequence from Example 6.29(i),

    yk =(1)k

    k

    k=1

    (R, | |). Recall that k=1 yk issummable, but

    k=1

    |yk| =k=1

    1

    k=

    so the sum is not absolutely summable. Note that the series from Example 6.29(ii) isabsolutely summable.

    Theorem 6.32 (Proposition 5, 6.3, p. 124 of [5]). A normed vector space (Y, ) overR is a Banach space (in the metric induced by its norm) if and only if every absolutelysummable series in (Y, ) is summable in (Y, ).

    Remark 6.33. As an application of Theorem 6.32, we will show that Lp(X, m) isa Banach space for 1 p . Suppose that {yk}k=1 Lp(X, m) is such that

    k=1 fkp < . It must be shown that there exists an f Lp(X, m) such that

    limn

    fn

    k=1

    fk

    p

    = 0.

    This is not easy, but we prove it in the following Theorem 6.34.

    Theorem 6.34 (Riesz-Fischer Theorem - Theorem 6, 6.3, p. 125 of [5]). Letp [1, ]. Then L

    p

    (X, m) is a Banach space.

    Remark 6.35. Recall that we showed in Section 3 that for f a Lebesgue integrablefunction over X, for every > 0 there exists a step function and a continuous function such that

    X

    f(x) (x)dm < and X

    f(x) (x)dm < with |(x)| < |f(x)| and |(x)| < |f(x)| (see Proposition 3.60). We want to do the same

    thing in L

    p

    (X, m), but we will need p < . We first do the bounded case.

    Lemma 6.36 (Lemma 7, 6.4, p. 127 of [5]). Let 1 p < and let f Lp(X, m).Then there exists anM > 0 and a bounded Lebesgue measurable function fM Lp(X, m)such that |fM(x)| |f(x)| for every x X and f fMp < for any > 0.

  • 8/3/2019 Real Analysis Lecture Notes

    58/124

    56 REAL ANALYSIS LECTURE NOTES

    Proposition 6.37 (Proposition 8, 6.4, p. 128 of [5]). Letf Lp[0, 1], p [1, ). Given > 0 there exists a step function Lp[0, 1] and a continuous function Lp[0, 1]such that

    f p < and f p < .Note that we already proved this for the p = 1 case (see Proposition 3.60). We provethis for X = [0, 1] and leave the case X = R as an exercise (Hint: Use DCT).

    Definition 6.38. Let (Y, ) be a normed linear space over R. A map F : Y R iscalled a bounded linear functional if:

    (i) F(f1 + f2) = F(f1) + F(f2) for all , R and for all f1, f2 Y (linearity)(ii) There exists an M > 0 such that |F(f)| Mf for every f Y (boundedness).

    Proposition 6.39. The bounded linear functionals on (Y, ) are precisely the contin-uous linear functionals on Y.

    Proof. (=) If F is a bounded linear functional on Y, then M > 0 such that for anyf1, f2 Y, |F(f1 f2)| Mf1 f2 |F(f1) F(f2)| Mf1 f2, which canbe made arbitrarily small if f1 and f2 are close together.(=) The proof of this direction requires functional analysis and so we will assume itfor now.

    Definition 6.40. Given a bounded linear functional F on (Y, ), we define the normof F, written F, by

    (127) F = sup |F(f)|f : f Y {0} .Note that, by definition,

    |F(f)|f F f Y {0},

    which implies that

    (128) |F(f)| F f f Ysince (128) is also true for f = 0.

    Example 6.41 (See Proposition 11, 6.5, p. 131 of [5]). Fix g L2

    [0, 1]. Note thatby Holders inequality, gf L1[0, 1] (i.e., gf is Lebesgue integrable over [0, 1]) for allf L2[0, 1] (recall 2 is the conjugate exponent of itself). Define Tg : L2[0, 1] R byTg(f) =

    [0,1]

    gf dx and note that Tg is linear (which follows from the linearity of the

    integral). Tg is also bounded:

    |Tg(f)|

    [0,1]

    gf dx

    [0,1]

    |g||f| dx

    [0,1]

    |g|2dx 12

    [0,1]

    |f|2dx 12

  • 8/3/2019 Real Analysis Lecture Notes

    59/124

    REAL ANALYSIS LECTURE NOTES 57

    by Holders inequality again, which implies that |Tg(f)| g2f2. Thus for [f] = 0 inL2[0, 1],

    |Tg(f)|f2 g2 = Tg g2

    since g2 is an upper bound on the set in (127), but Tg is the least upper bound onthat set by Definition 6.40, thus

    Tg

    g

    2.

    Remark 6.42. Example 6.41 is true more generally for conjugate exponents p and q.The same argument shows that for any fixed g Lq(X, m), Tg : Lp(X, m) R definedby

    Tg(f) =

    X

    gf dx

    is a bounded linear functional on Lp(X,m, p) with(129) Tg gq.It turns out that

    Tg

    =

    g

    q, which we now show. First we need a definition.

    Definition 6.43. Define

    sgn g(x) =

    1 if g(x) > 0

    0 if g(x) = 0

    1 if g(x) < 0Note that sgn g(x) is bounded and measurable (its a simple function). Note also thatg(x) [sgn g(x)] = |g(x)| and [sgn g(x)] |g(x)| = g(x).

    Proposition 6.44 (Proposition 11, 6.5, p. 131 of [5]). If p and q are conjugate expo-nents in [1, ], thenTg = gq.

    Remark 6.45. Our main aim of this section is to show that if p [1, ), any boundedlinear functional on Lp[0, 1] must be of the form Tg, where g Lq[0, 1] and q is theconjugate exponent to p (Tg is as defined in Remark 6.42). This g is in fact unique,which establishes a one-to-one correspondence between the bounded linear functionalson Lp[0, 1] and the elements of Lq[0, 1], a result known as the Riesz RepresentationTheorem (see Theorem 6.47). This is also true for Lp(R, m), but this is beyond thescope of this course.

    Lemma 6.46 (Lemma 12, 6.5, p. 131 of [5]). Let p [1, ). Letg be a real-valued,Lebesgue measurable function on [0, 1] with the following property: M > 0 such that

    for every bounded, Lebesgue measurable function f on [0, 1], fg is integrable with[0,1]

    fg

    Mfp.

  • 8/3/2019 Real Analysis Lecture Notes

    60/124

    58 REAL ANALYSIS LECTURE NOTES

    Then g Lq[0, 1] and gq M, where q is the conjugate exponent to p.

    Theorem 6.47 (Riesz Representation Theorem - Theorem 13, 6.5, p. 132 of[5]). LetF be a bounded linear functional on Lp[0, 1], 1 p < . Letq be the conjugateexponent to p. Then there exists a unique g

    Lq[0, 1] (up to equivalence class) such that

    F(f) = [0,1]

    gf dx

    for every f Lp[0, 1]. Moreover, F = gq.

    Corollary 6.48. LetF be a bounded linear functional on L2[0, 1]. Then there exists aunique g L2[0, 1] such that

    F(f) =

    [0,1]

    f(x)g(x) dx.

    Proof. Take p = q = 2 in Theorem 6.47.

  • 8/3/2019 Real Analysis Lecture Notes

    61/124

    REAL ANALYSIS LECTURE NOTES 59

    7. Metric Spaces

    7.1. The Basics.

    Definition 7.1. Let X be a nonempty set. We say that a map d : X

    X

    [0,

    ) isa metric on X if

    (i) d(x, y) = 0 x = y, x, y X.(ii) d(x, y) = d(y, x) for every x, y X (symmetry)

    (iii) If x,y,z X, then d(x, z) d(x, y) + d(y, z) (triangle inequality).In this case, (X, d) is said to be a metric space.

    Example 7.2 (Examples of metric spaces).

    (i) X = R, d(x, y) = |x y|.(ii) X = L

    p

    [0, 1], d(f, g) = f gp, p [1, ].(iii) X = Rn, d(x, y) =

    ni=1

    (xi yi)21/2

    , where x = (x1, . . . , xn) and y = (y1, . . . , yn)

    are in Rn.

    Definition 7.3. Let {xn}n=1 X.(i) We say {xn}n=1 converges to x0 X if for every > 0 there exists an N N

    such that d(x0, xn) < whenever n N.(ii) The sequence {xn}n=1 X is Cauchy if and only if for every > 0 there exists

    an N N such that if m > n N, d(xm, xn) < .(iii) (X, d) is called a complete metric space if every Cauchy sequence converges tosome x0 X.

    Definition 7.4. Let U X. We say U is open in (X, d) if, given any x U, thereexists a > 0 such that whenever y X and d(x, y) < , then y U.

    Remark 7.5 (See Proposition 1, 7.2, p. 142 of [5]). Note that and U = X are bothopen subsets of X.

    Example 7.6. Let x X, fix > 0 and define B(x, ) = {y X : d(x, y) < }. ThenB(x, ) is open in X (prove this - it doesnt simply follow from the definition), calledthe open ball at x with radius .

  • 8/3/2019 Real Analysis Lecture Notes

    62/124

    60 REAL ANALYSIS LECTURE NOTES

    7.2. Compactness.

    Definition 7.7. Let (X, d) be a metric space and let K X. We say that {U}I isan open cover of K if

    I

    U K and each U is open in X.

    Example 7.8. Let X = R, d(x, y) = |x y|. Let K = (0, 1) R and Un =

    1n+1 ,

    nn+1

    .

    Each Un is open in R andn=1

    Un = (0, 1) = K so {Un}n=1 is an open cover of K.

    Definition 7.9. Let (X, d) be a metric space and K X. We say that K is compactin (X, d) if every open cover {U}I of K admits a finite subcover U1, . . . , U N withN

    j=1Uj K. X is said to be a compact metric space ifX is compact in (X, d) itself.

    Example 7.10. (See Example 7.8) Consider (0, 1) (R, d||). Let Un =

    1n+1

    , nn+1

    .

    Thenn=1

    = (0, 1) but {Un}n=1 contains no finite subcover since U1 U2 U3 (0, 1) and thus

    Ni=1

    Ui = UN =

    1

    N + 1,

    N

    N + 1

    (0, 1)

    for every N N.

    Remark 7.11. It turns out that [0, 1] R is compact however (see Example 7.10).More generally, [a, b] R is compact. In fact, any closed and bounded subset of Ris compact, which follows from the HeineBorel Theorem: a subset K ofRn withEuclidean metric (see Example 7.2(iii)) is compact if and only ifK is closed and boundedin Rn (see Theorem 15, 2.5, p. 44 of [5] for the proof in R). This is not true for moregeneral metric spaces.

    Definition 7.12. Let (X, d) be a metric space. We say that (X, d) is sequentially

    compact if for any sequence {xn}n=1 X there is a convergent subsequence {xnk}k=1 {xn}n=1 (i.e., there exists an x0 X with lim

    kxnk = x0 in the d-metric). This will turn

    out to be equivalent to a metric space being compact (see Theorem 7.18).

    Remark 7.13. [a, b] R are all sequentially compact, which follows from the Bolzano-Weierstrass Theorem (see Lemmas 16 and 17, 7.7, pp. 153-154 of [5]).

  • 8/3/2019 Real Analysis Lecture Notes

    63/124

    REAL ANALYSIS LECTURE NOTES 61

    Definition 7.14. We say that K (X, d) is bounded if there exists an M [0, )such that for every x, y K, d(x, y) M. Note that [a, b] R is bounded. We saythat (X, d) is bounded if X is bounded as a subset of itself.

    Example 7.15. Let > 0. Fix x0

    X and let K = B(x0, ) (see Example 7.6). Then

    if x, y K,d(x, y) d(x, x0) + d(x0, y) by the triangle inequality

    = d(x, x0) + d(y, x0) by symmetry

    < +

    = 2.

    Thus B(x0, ) is bounded.

    Definition 7.16. We say that K

    (X, d) is totally bounded if for every > 0 there

    exist x1, . . . , xn X such that ni=1

    B(xi, ) K, that is, we can cover K with a finitenumber of -balls.

    Remark 7.17. Note that totally bounded implies bounded but not conversely.

    Theorem 7.18 (Theorem 21, 7.7, p. 155 and Proposition 25, 7.7, p. 156 of [5]).Let(X, d) be a metric space. Then the following are equivalent:

    (i) (X, d) is compact (open covers have finite subcovers).(ii) (X, d) is sequentially compact (every sequence has a convergent subsequence).(iii) (X, d) is complete and totally bounded.

  • 8/3/2019 Real Analysis Lecture Notes

    64/124

    62 REAL ANALYSIS LECTURE NOTES

    7.3. Continuous Functions.

    Definition 7.19. Let (X, d) be a metric space and let f : X R be a function. Wesay f is continuous at x0 X if for every > 0 there exists a > 0 such that wheneverx X and d(x, x0) < , then |f(x) f(x0)| < . We say that f is continuous on X iff is continuous at each point x

    X.

    Remark 7.20. Let (X, d) be a compact metric space. Let f : X R be continuouson X. Then the range of f, f(X) = {f(x) : x X} is a compact subset ofR. Inparticular, the range off is bounded. Hence there exists an M > 0 such that |f(x)| Mfor every x X.

    Definition 7.21. If (X, d) is a compact metric space, we let C(X) denote the familyof all real-valued continuous functions on X.

    Remark 7.22. Note that C(X) is a vector space over R:

    (i) C(X) has a zero function f0 where f0(x) = 0 for every x X.(ii) If R and f C(X), then f C(X) where (f)(x) = f(x).

    (iii) If f, g C(X), then (f + g)(x) = f(x) + g(x) is continuous on X.Note also that C(X) has a norm, defined by

    f = supxX

    |f(x)| = maxxX

    |f(x)|.

    Note that this definition makes sense since the supremum exists (the fs are bounded

    see Remark 7.20) and continuous functions achieve their maximum. We have the usualproperties of the norm:

    (i) f = 0 f = f0 = 0.(ii) If R and f C(X), then f = ||f.

    (iii) If f, g C(X), then f + g f + g (triangle inequality).Thus C(X) is a normed vector space.

    Proposition 7.23. Let (X, d) be a compact metric space. Let {fn}n=1 C(X) andf

    C(X). Then lim

    n fn

    f

    = 0 iff fn

    f uniformly on x.

    Proof. See Exercise 10, p. 136 of [5], Assignment 1. We can use this result to prove thatC(X) is complete just like L (see Exercise 11, p. 126 of [5], Assignment 1).

    Proposition 7.24. Let (X, d) be a compact metric space. Then C(X) is complete inthe metric determined by its norm (see Definition 7.21 and Remark 7.22).

  • 8/3/2019 Real Analysis Lecture Notes

    65/124

    REAL ANALYSIS LECTURE NOTES 63

    7.4. The StoneWeierstrass Theorem.

    Remark 7.25. A natural question to ask is what subspaces are dense in C(X)? Thisis the aim of the StoneWeierstrass Theorem (see Theorem 7.35).

    Definition 7.26. Let S C(X). We say S separates the points of X if for everyx1, x2 X with x1 = x2, there exists an f S with f(x1) = f(x2). In other words, fdistinguishes between the points of X.

    Example 7.27. Let S = C(X). Then C(X) separates the points of X. See Exercise 2,Assignment 2.

    Definition 7.28. Let

    A C(X) and suppose

    Ais a subspace (over R) of C(X). We

    say that A is an algebra of functions in C(X) if whenever f, g A, then f g Awhere (f g)(x) = f(x)g(x).

    Remark 7.29. Note that C(X) is not only a normed vector space, but it is also analgebra: if f and g are continuous, then so is f g.

    Example 7.30. Let X = [0, 1]. Let

    A=

    {p : [0, 1]

    R : p is a polynomial of arbitrary degree with R coefficients

    }.

    Then A is an algebra of C[0, 1] since A is a subspace and is closed under multiplication.

    Definition 7.31. Let L be a nonempty subset of C(X). We say that L is a lattice inC(X) if whenever f, g L, then fg and fg are in L where fg(x) = max{f(x), g(x)}and f g(x) = min{f(x), g(x)} for every x X.

    Remark 7.32. Note that if L C(X) is a lattice and if {f1, . . . , f n} L, it followseasily by induction that f

    1 f

    2 fn

    (x) = minf1(x), . . . , f n(x) and f1 f2 fn(x) = maxf1(x), . . . , f n(x) define functions f1 f2 fn and f1 f2 fnin L.

    Remark 7.33. C(X) is a lattice in itself since if f, g C(X), then

    (130) f g(x) = maxf(x), g(x) = f(x) + g(x) + |f(x) g(x)|2

    C(X)

  • 8/3/2019 Real Analysis Lecture Notes

    66/124

    64 REAL ANALYSIS LECTURE NOTES

    since the absolute value of a continuous function is continuous (prove (130). There aretwo cases: f(x) g(x) and g(x) f(x)). Similarly,

    f g(x) = minf(x), g(x) = f(x) + g(x) |f(x) g(x)|2

    C(X).

    Remark 7.34. Not every algebra is a lattice. Consider C[0, 1] and let A denote thealgebra of all polynomials on [0, 1] (see Example 7.30). Take f(x) = x and g(x) = 1 xon [0, 1]. Then neither f g(x) = maxf(x), g(x) nor f g(x) = minf(x), g(x) isin A since neither is a polynomial (graph it