2_pdfsam_iq.pdf

162
Master de mathématiques UFR Mathématiques Quantum mechanics Mathematical foundations and applications Lecture notes Dimitri Petritis Rennes (draft of 2 October 2013 at 13:45)

description

Matematicas de la mecanica cuantica

Transcript of 2_pdfsam_iq.pdf

Page 1: 2_pdfsam_iq.pdf

Mas

ter

dem

athé

mat

ique

s

UFR Mathématiques

Quantum mechanicsMathematical foundations

and applications

Lecture notes

Dimitri Petritis

Rennes (draft of 2 October 2013 at 13:45)

Page 2: 2_pdfsam_iq.pdf

© 2003 – 2013 D Petritis

Page 3: 2_pdfsam_iq.pdf

Contents

1 Physics, mathematics, and mathematical physics 1

2 Phase space, observables, measurements, and yes-no experiments 7

2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

2.2 Classical physics as a probability theory with a dynamical law . . 9

2.2.1 Some reminders from probability theory . . . . . . . . . . . 9

2.2.2 Postulates for classical systems . . . . . . . . . . . . . . . . . 11

2.3 Classical probability does not suffice to describe Nature! . . . . . . 15

2.3.1 Bell’s inequalities . . . . . . . . . . . . . . . . . . . . . . . . . 17

2.3.2 The Orsay experiment . . . . . . . . . . . . . . . . . . . . . . 17

2.4 Quantum systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

2.4.1 Postulates of quantum mechanics: the Hilbert space ap-proach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23

2.4.2 Interpretation of the basic postulates . . . . . . . . . . . . . 24

2.5 First consequences of quantum formalism . . . . . . . . . . . . . . 29

2.5.1 Irreducibility of quantum randomness and Heisenberg’s un-certainty principle . . . . . . . . . . . . . . . . . . . . . . . . 29

2.5.2 Quantum explanation of the Orsay experiment . . . . . . . 30

i

Page 4: 2_pdfsam_iq.pdf

3 Short resumé of Hilbert spaces 31

3.1 Scalar products and Hilbert spaces . . . . . . . . . . . . . . . . . . . 31

3.2 Orthogonality and projection; direct and orthogonal sums . . . . 33

3.3 Duality and Fréchet-Riesz theorem; adjoint . . . . . . . . . . . . . 36

3.4 Tensor product of Hilbert spaces . . . . . . . . . . . . . . . . . . . . 36

3.5 Orhonormal systems, ortonormal bases; criteria of completenessof a basis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

3.6 Dirac’s bra and ket notation . . . . . . . . . . . . . . . . . . . . . . . 38

3.7 Some more involved quantum mechanical phenomena . . . . . . 39

3.7.1 Composite systems and entanglement . . . . . . . . . . . . 39

3.7.2 Decoherence and quantum to classical transition . . . . . . 39

3.8 Rigged Hilbert spaces and generalised kets . . . . . . . . . . . . . . 39

4 Algebras of operators 41

4.1 Introduction and motivation . . . . . . . . . . . . . . . . . . . . . . 41

4.2 Algebra of operators . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

4.3 Convergence of sequences of operators . . . . . . . . . . . . . . . . 45

4.4 Classes of operators in B(H) . . . . . . . . . . . . . . . . . . . . . . 45

4.4.1 Self-adjoint and positive operators . . . . . . . . . . . . . . . 46

4.4.2 Projections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

4.4.3 Unitary operators . . . . . . . . . . . . . . . . . . . . . . . . . 47

4.4.4 Isometries and partial isometries . . . . . . . . . . . . . . . . 47

4.4.5 Normal operators . . . . . . . . . . . . . . . . . . . . . . . . . 49

4.5 States on algebras, GNS construction, representations . . . . . . . 49

Page 5: 2_pdfsam_iq.pdf

5 Spectral theory in Banach algebras 51

5.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

5.2 The spectrum of an operator acting on a Banach space . . . . . . . 53

5.3 The spectrum of an element of a Banach algebra . . . . . . . . . . 56

5.4 Relation between diagonalisability and the spectrum . . . . . . . . 58

5.5 Spectral measures and functional calculus . . . . . . . . . . . . . . 60

5.6 Some basic notions on unbounded operators . . . . . . . . . . . . 66

6 Propositional calculus and quantum formalism based on quantum logic 69

6.1 Lattice of propositions . . . . . . . . . . . . . . . . . . . . . . . . . . 70

6.2 Classical, fuzzy, and quantum logics; observables and states onlogics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

6.2.1 Logics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74

6.2.2 Observables associated with a logic . . . . . . . . . . . . . . 75

6.2.3 States on a logic . . . . . . . . . . . . . . . . . . . . . . . . . . 77

6.3 Pure states, superposition principle, convex decomposition . . . . 80

6.4 Simultaneous observability . . . . . . . . . . . . . . . . . . . . . . . 82

6.5 Automorphisms and symmetries . . . . . . . . . . . . . . . . . . . . 84

7 Standard quantum logics 87

7.1 Observables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

7.2 States . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

7.3 Symmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92

8 States, effects, and the corresponding quantum formalism 95

Page 6: 2_pdfsam_iq.pdf

8.1 States and effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

8.2 Operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

8.3 General quantum transformations, complete positivity, Kraus the-orem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

9 Quantum formalism based on the informational approach 97

10 Two illustrating examples 99

10.1 The harmonic oscillator . . . . . . . . . . . . . . . . . . . . . . . . . 99

10.1.1 The classical harmonic oscillator . . . . . . . . . . . . . . . . 99

10.1.2 Quantum harmonic oscillator . . . . . . . . . . . . . . . . . 103

10.1.3 Comparison of classical and quantum harmonic oscillators 107

10.2 Schrödinger’s equation in the general case, rigged Hilbert spaces . 108

10.3 Potential barriers, tunnel effect . . . . . . . . . . . . . . . . . . . . . 108

10.4 The hydrogen atom . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108

11 Quantifying information: classical and quantum 109

11.1 Classical information, entropy, and irreversibility . . . . . . . . . . 109

12 Turing machines, algorithms, computing, and complexity classes 113

12.1 Deterministic Turing machines . . . . . . . . . . . . . . . . . . . . . 113

12.2 Computable functions and decidable predicates . . . . . . . . . . 116

12.3 Complexity classes . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

12.4 Non-deterministic Turing machines and the NP class . . . . . . . . 117

12.5 Probabilistic Turing machine and the BPP class . . . . . . . . . . . 118

12.6 Boolean circuits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

Page 7: 2_pdfsam_iq.pdf

12.7 Composite quantum systems, tensor products, and entanglement 121

12.8 Quantum Turing machines . . . . . . . . . . . . . . . . . . . . . . . 121

13 Cryptology 125

13.1 An old idea: the Vernam’s code . . . . . . . . . . . . . . . . . . . . . 126

13.2 The classical cryptologic scheme RSA . . . . . . . . . . . . . . . . . 127

13.3 Quantum key distribution . . . . . . . . . . . . . . . . . . . . . . . . 129

13.3.1 The BB84 protocol . . . . . . . . . . . . . . . . . . . . . . . . 129

13.3.2 Simple eavesdropping strategies, disturbance and informa-tional gain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133

13.3.3 Other cryptologic protocols . . . . . . . . . . . . . . . . . . . 133

14 Elements of quantum computing 135

14.1 Classical and quantum gates and circuits . . . . . . . . . . . . . . . 135

14.2 Approximate realisation . . . . . . . . . . . . . . . . . . . . . . . . . 136

14.3 Examples of quantum gates . . . . . . . . . . . . . . . . . . . . . . . 140

14.3.1 The Hadamard gate . . . . . . . . . . . . . . . . . . . . . . . 140

14.3.2 The phase gate . . . . . . . . . . . . . . . . . . . . . . . . . . 140

14.3.3 Controlled-NOT gate . . . . . . . . . . . . . . . . . . . . . . . 140

14.3.4 Controlled-phase gate . . . . . . . . . . . . . . . . . . . . . . 141

14.3.5 The quantum Toffoli gate . . . . . . . . . . . . . . . . . . . . 141

15 The Shor’s factoring algorithm 143

16 Error correcting codes, classical and quantum 145

Page 8: 2_pdfsam_iq.pdf

References 146

Index 150

Page 9: 2_pdfsam_iq.pdf
Page 10: 2_pdfsam_iq.pdf

1Physics, mathematics, and

mathematical physics

La mathématique est une science expérimentale. Contrairementen effet à un contresens qui se répand de nos jours (. . . ), les objetsmathématiques préexistent à leurs définitions ; celles-ci ont étéélaborées et précisées par des siècles d’activité scientifique et, sielles se sont imposées, c’est en raison de leur adéquation aux objetsmathématiques qu’elles modélisent.

Michel Demazure: Calcul différentiel, Presses de l’École Polytech-

nique, Palaiseau (1979).

Physics relies ultimately on experiment. Observation of many different ex-periments of similar type establishes a phenomenology revealing relations be-tween the experimentally measured physical observables. A phenomenology,even relying on false hypotheses, can still be useful if it predicts correctly quan-titative relationships occurring in yet unrealised experiments1 The next step isinductive: physical models are proposed satisfying the phenomenological rela-tions. Then, new phenomenology is predicted, new experiments designed to

1For instance the phenomenology prevailing in the Anticythera mechanism had useful pre-dictive power although the underlying hypothesis of a geocentric solar system was false.

1

Page 11: 2_pdfsam_iq.pdf

Chapter 1

verify it, and new models are proposed. When sufficient data are available, aphysical theory is proposed verifying all the models that have been developedso far and all the phenomenological relations that have been established. Thetheory can deductively predict the outcome for yet unrealised experiments. Ifit is technically possible, the experiment is performed. Either the subsequentphenomenology contradicts the theoretical predictions — and the theory mustbe rejected — or it is in accordance with them — and this precise experimentserves as an additional validity check of the theory.2 Therefore, physical the-ories have not a definite status: they are accepted as long as no experimentcontradicts them!

It is a philosophical debate how mathematical theories emerge. Some scien-tists — among them the author of these lines — share the opinion expressed byMichel Demazure (see quotation), claiming that Mathematics is as a matter offact an experimental science. Accepting, for the time being, this view, hypothe-ses for particular mathematical branches are the pendants of models. Whatdifferentiates strongly mathematics from physics is that once the axioms arestated, the proved theorems (phenomenology) need not be experimentally cor-roborated, they exist per se. The experimental nature of mathematics is hiddenin the mathematician’s intuition that served to propose a given set of axiomsinstead of another.

Mathematical physics is physics, i.e. its truth relies ultimately on experimentbut it is also mathematics, in the sense that physical theories are stated as aset of axioms and the resulting physical phenomenology must derive both astheorems and as experimental truth.

A general physical theory must describe all physical phenomena in the uni-verse, extending from elementary particles to cosmological phenomena. Nu-merical values of the fundamental physical quantities, i.e. mass, length, andtime span vast ranges: 10−31kg≤ M ≤ 1051kg, 10−15m≤ L ≤ 1027m, 10−23s≤ T ≤1017s. Units used in measuring fundamental quantities, i.e. kilogramme (kg),metre (m), and second (s) respectively, were introduced after the French Rev-olution so that everyday life quantities are expressed with reasonable numer-ical values (roughly in the range 10−3 − 103.) The general theory believed todescribe the universe3 is called Quantum Field Theory; it contains two funda-mental quantities, the speed of light in the vacuum, c = 2.99792458×108m/s,

2See an example of this procedure in le Monde of 20 September 2002 (reproduced on theweb page of the author http://perso.univ-rennes1.fr/dimitri.petritis/).

3Strictly speaking, there remain unsolved theoretical difficulties in order to succesfully in-clude gravitational phenomena.

/Users/dp/a/ens/mq/iq-intro.tex 2 Stabilised version of 16 September 2013

Page 12: 2_pdfsam_iq.pdf

and the Plank’s constant ħ = 1.05457×10−34J·s. These constants have extraor-dinarily atypical numerical values. Everyday velocities are negligible comparedto c, everyday actions are overwhelmingly greater than ħ. Therefore, everydayphenomena can be thought as the c → ∞ and ħ → 0 limits of quantum fieldtheory; the corresponding theory is called classical mechanics.

It turns out that considering solely the c →∞ limit of quantum field theorygives rise to another physical theory called quantum mechanics; it describesphenomena for which the action is comparable with ħ. These phenomena areimportant when dealing with atoms and molecules.

The other partial limit, ħ → 0, is physically important as well; it describesphenomena involving velocities comparable with c. These phenomena lead toanother physical theory called special relativity.

Although quantum field theory is still mathematically incomplete, the theo-ries obtained by the limiting processes described above, namely quantum me-chanics, special relativity, and classical mechanics are mathematically closed,i.e. they can be formulated in a purely axiomatic fashion and all the experi-mental observations made so far (within the range of validity of these theories)are compatible with the derived theorems.

Among those three theories, quantum mechanics has a very particular sta-tus. It can be formulated in a totally axiomatic way; all its predictions have beenverified with unprecedented accuracy; not a single experiment has ever put thetheory in difficulty; quantum phenomena play a prominent role in the globaleconomy (a very conservative estimate is that 35% of the global wealth relieson exploiting quantum phenomena). In spite of this tremendous success in thepredictive/explanatory power of quantum mechanics and its mathematicallyclosed form, its axiomatic setting is not very satisfactory; the theory looks as ifa conceptual building block were missing in the description of the theory.

The purpose of this course is twofold. Firstly, the mathematical founda-tions of quantum mechanics are presented. Algebra, analysis, probability, andstatistics are necessary to describe and interpret this theory. Its predictions areoften totally counter-intuitive. Hence it is interesting to study this theory thatprovides a useful application of the mathematical tools, a source of inspira-tion4 for new developments for the underlying branches of mathematics, and

4Recall that entire branches of mathematics have been developed on purpose, to give pre-cise mathematical meaning to — initially — ill-defined mathematical objects introduced byphysicists to formulate and handle quantum theory. To mention but the few most prominentexamples of such mathematical theories: von Neumann algebras, spectral theory of operators,

/Users/dp/a/ens/mq/iq-intro.tex 3 Stabilised version of 16 September 2013

Page 13: 2_pdfsam_iq.pdf

Chapter 1

a description of unusual physical phenomena. All these phenomena are ver-ified experimentally nowadays. Quantum mechanics intervenes in a decisivemanner in the explanation of vast classes of phenomena in other fundamentalsciences and in technology. Without being exhaustive, here are some examplesof such quantum phenomena:

• atomic and molecular physics (e.g. stability of matter, physical propertiesof matter), quantum optics (e.g. lasers),

• on which rely chemistry (e.g. valence theory) and biology (e.g. photosyn-thesis, structure of DNA),

• solid state physics (e.g. physics of semiconductors, transistors),• tunnel effect (e.g. atomic force microscope) and nanotechnology,• supraconductivity (e.g. used in magnetic levitation for ultra-fast trains)

and superfluidity• . . .

There is however another major technological breakthrough that is foreseenwith a tremendous socio-economical impact: if the integration of electroniccomponents continues at the present pace (see figure 1.1), within 10–15 years,only some tenths of silicium atoms will be required to store a single bit of infor-mation. Classical (Boolean) logics does not apply any longer to describe atomiclogical gates, quantum (orthocomplemented lattice) logics is needed instead.

Theoretical exploration of this new type of informatics has started and it isproven [30] that some algorithmically complex problems, like the integer primefactoring problem — for which the best known algorithm [18] requires a timeis superpolynomial in the number of digits — can be achieved in polynomialtime using quantum logic. The present time technology does not yet allow theprime factoring of large integers but it demonstrates that there is no funda-mental physical obstruction to its achievement for the rapidly improving com-puter technology. Should such a breakthrough occur, all our electronic trans-missions, protected by classical cryptologic methods could become vulnerable.The following table gives a very rough estimate of the time needed to factor ann = 1000 digits numbers, assuming an operation per nanosecond for varioushypotheses of algorithmic complexity.

On the other hand, present day technology allows to securely and unbreak-ably cipher messages using quantum cryptologic protocols. Thus the secondpurpose of this course is to present the applications of quantum mechanics

theory of distributions, non-commutative probabilities.

/Users/dp/a/ens/mq/iq-intro.tex 4 Stabilised version of 16 September 2013

Page 14: 2_pdfsam_iq.pdf

Figure 1.1: The evolution of the number of transistors on integrated circuits inthe period 1971–2011. (Source: Wikipedia, transistor count.)

into the rapidly developing field of quantum information, computing, commu-nication, and cryptology.

/Users/dp/a/ens/mq/iq-intro.tex 5 Stabilised version of 16 September 2013

Page 15: 2_pdfsam_iq.pdf

Chapter 1

O (exp(n)) O (exp(n1/3(logn)2/3)) O (n3)10417 yr 0.2 yr 1 s

Table 1.1: A very rough estimate of the order of magnitude of the time neededto factor an n-digit number, under the assumption of execution of the algo-rithm on a hypothetical computer performing an operation per nanosecond, asa function of the time complexity of the used algorithm. When the cryptologicprotocol RSA has been proposed [24] in 1978, the best factoring algorithm hada time complexity in O (exp(n)) (for comparison: age of the universe 1.5×1010

yr). The best known algorithm (the general number field sieve algorithm) re-ported in [19] requires time O (exp(n1/3 log2 n)) to factor a n-digit number. TheShor’s quantum factoring algorithm [30] requires a time of O (n3).

/Users/dp/a/ens/mq/iq-intro.tex 6 Stabilised version of 16 September 2013

Page 16: 2_pdfsam_iq.pdf

2Phase space, observables,

measurements, and yes-noexperiments

2.1 Introduction

As is the case in all experimental sciences, information on a physical systemis obtained through observation (also called measurement) of the values —within a prescribed set — that can take the physical observables. The acqui-sition procedure of the information must be described operationally in termsof

• macroscopic instruments, and• prescriptions on their application on the observables of an objectified

physical system.

The biggest the set of observables whose values are known, the finest isthe knowledge about the physical system. Since crude physical observables(e.g. number of particles, energy, velocity, etc.) can take values in various sets

7

Page 17: 2_pdfsam_iq.pdf

2.1. Introduction

(N,R+,R3, etc.), to have a unified treatment for general systems, we reduce anyphysical experiment into a series of measurements of a special class of observ-ables, called yes-no experiments. This is very reminiscent of the approxima-tion of any integrable random variable by a sequence of step functions. There-fore, ultimately, we can focus on observables taking values in the set 0,1.

To become quantifiable and theoretically exploitable, experimental obser-vations must be performed under very precise conditions, known as the ex-perimental protocol. Firstly, the objectified system must be carefully preparedin an initial condition known as the state of the system. Mathematically, thestate incorporates all the a priori information we have on the system, it belongsto some abstract space of states. Secondly, the system enters in contact witha measuring apparatus, specifically designed to measure the values of a givenobservable, returning the experimental data with values in some space (X,X );this is precisely the measurement process.

The whole physics relies on the postulate of statistical reproducibility ofexperiments: if the same measurement is performed a very large number oftimes on a system prepared in the same given state, the experimentally ob-served data for a given observable are scattered around some mean value inX with some fluctuations around the mean value. However, when the num-ber of repetitions tends to infinity, the empirical distribution of the observeddata tends to some probability distribution on (X,X ). Thus, abstractly, a mea-surement is a black box transforming states into probability measures on somespace of observations. Mathematically, it will be shown that the measurementcorresponds to a transformation kernel.

Dealing with random variables, the natural question that arises is: what isthe appropriate (abstract) probability space, if any, on which random variablesentering a given problem can be defined? For sequences of classical randomvariables, the answer is well known: such an abstract probability space exists,provided that the sequence verifies the Kolmogorov’s compatibility conditions;in that case, there exists a canonical (minimal) realisation of the abstract prob-ability space on which the whole sequence is defined. Elements of this prob-ability space are called trajectories of the sequence. The physical analogue ofthe minimal realisation of the abstract probability space is called phase space.Elements of the phase space are called (pure) phases. The physical analogueof a random variable is called observable.

It turns out that physical observables for classical systems are just randomvariables so that the phase space for such systems is a genuine probability space,

/Users/dp/a/ens/mq/iq-phase.tex 8 Stabilised version of 16 September 2013

Page 18: 2_pdfsam_iq.pdf

while for quantum systems, observables are (generally non-commuting) Her-mitean operators acting on an abstract Hilbert space that plays the rôle ofquantum phase space.

2.2 Classical physics as a probability theory with adynamical law

2.2.1 Some reminders from probability theory

Let us start with the mathematical notion of a random variable.

Definition 2.2.1. (Random variable) Let (Ω,F ,P) be an abstract probabilityspace and (X,X ) a measurable space1. A function X : Ω→ X that is (F ,X )-measurable is called (a X-valued) random variable. The induced probabilitymeasure PX on (X,X ) (i.e. PX (A) = P(ω ∈ Ω : X (ω) ∈ A, A ∈ X ) is called thelaw (or distribution) of X .

Remark 2.2.2. We cannot refrain from stressing that in the above definitionof a random variable the pertinent property is that of a measurable functionX : Ω→ X. Now, it is elementary to show that the datum of X is equivalent tothe datum of a deterministic Markovian kernel KX : Ω×X → [0,1] such thatKX (ω, A) = εX (ω)(A) = 1 X −1(A)(ω) = 1 A(X (ω). The kernel KX acts on its right tobX = f :X→R, f bounded measurable by

bX 3 f 7→ KX f (ω) :=∫X

KX (ω,d x) f (x) ∈ bF , ∀ω ∈Ω,

and on its left to M1(F ) = probability measures on F , by

M1(F ) 3µ 7→µKX (A) :=∫Ωµ(dω)KX (ω, A) ∈M1(X ), ∀A ∈X .

Note also that X = KX 1 X and PX = PKX . If X is fixed, on denoting Q A the ran-dom variable Q A(ω) = 1 A(X (ω)), for A ∈X , we remark that Q2

A =Q A (hence Q A

is a projection) and Q AQB = 0 if A ∩B = ;. Such random variables are calledquestions.

1The spaceX can be any Polish space (i.e. a metric, complete, and separable space.) We shallonly consider the case X=Rd , for some d , in this course.

/Users/dp/a/ens/mq/iq-phase.tex 9 Stabilised version of 16 September 2013

Page 19: 2_pdfsam_iq.pdf

2.2. Classical physics as a probability theory with a dynamical law

Example 2.2.3. Let X = 0,1, X be the algebra of subsets of X, and PX (0) =PX (1) = 1/2 the law of a random variable X (the honest coin tossing). A possi-ble realisation of (Ω,F ,P) is ([0,1],B([0,1]),λ), where λ denotes the Lebesguemeasure, and a possible realisation of the random variable X is

X (ω) =

0 if ω ∈ [0,1/2[1 if ω ∈ [1/2,1].

Notice however that the above realisation of the probability space involvesthe Borel σ-algebra over an uncountable set, quite complicated an object in-deed. A much more economical realisation should be given by Ω = 0,1, F =X , and P(0) = P(1) = 1/2. In the latter case the random variable X shouldread X (ω) = ω: on this smaller probability space, the random variable is theidentity function. Such a realisation is minimal.

Exercise 2.2.4. (An elementary but important exercise; please solve it beforereading the sequel!) Generalise the above minimal construction to the case weconsider two random variables Xi :Ω→X, for i = 1,2. Are there some plausiblerequirements on the joint distributions for such a construction to be possible?

The canonical construction of the minimal probability space carrying aninfinite family of random variables is also possible.

Definition 2.2.5. (Consistency) Let T be an infinite set (countable or uncount-able) and for each t ∈ T denote by Rt a copy of the real line, indexed by t . De-note by RT = ×t∈TRt and for n ≥ 1 by τ = (t1, . . . , tn) a finite ordered set of dis-tinct indices ti ∈ T, i = 1, . . . ,n. Denote Pτ a probability measure on (Rτ,B(Rτ))where Rτ = Rt1 ×·· ·×Rtn . We say that the family (Pτ), where τ runs through allfinite ordered subsets of T , is consistent, if

1. P(t1,...tn )(A1 ×·· ·× An) =P(tσ(1),...tσ(n) (Aσ(1) ×·· ·× Aσ(n)), where σ is an arbi-trary permutation of (1, . . . ,n) and Ai ∈B(Rti ), and

2. P(t1,...tn )(A1 ×·· ·× An−1 ×R) =P(t1,...tn−1)(A1 ×·· ·× An−1).

Definition 2.2.6. Let T be a subset of R. A family of random variables X ≡(X t )t∈T is called a stochastic process with time domain T .

If T = N or Z, the process is called a discrete time process or random se-quence, if T = [0,1] or R or R+, the process is a continuous time process.

/Users/dp/a/ens/mq/iq-phase.tex 10 Stabilised version of 16 September 2013

Page 20: 2_pdfsam_iq.pdf

The natural question that arises is whether there exists a probability space(Ω,F ,P) carrying the whole process. In other words, if PX denotes the dis-tribution of the process X , what are the conditions it must fulfil so that thereexists a probability space (Ω,F ,P) such that P(B) = P(ω ∈Ω : X (ω) ∈ B for allB ∈B(RT )? The answer is given by the following

Theorem 2.2.7 (Kolmogorov’s existence). Suppose that for n ≥ 1, the familyP(X t1 ,...,X tn ), with t1 < . . . tn and ti ∈ T ⊆R, for i = 1, . . . ,n, is a consistent family ofprobability measures. Then, there are

1. a probability space (Ω,F ,P) and

2. a stochastic process X = (X t )t∈T such that P(X t1 ,...,X tn )(]−∞, x1]× ·· ·×]−∞, x −n] =P(ω ∈Ω : X t1 (ω) ≤ x1, . . . , X tn (ω) ≤ xn.

Proof. See, for instance, in [29, Theorem II.2.1, p. 247]. äRemark 2.2.8. The canonical construction of the minimal probability spaceis Ω = RT , F = B(RT ) and for every t ∈ T , X t (ω) = ωt . This minimal space isalso called space of trajectories of the random process and the realisation of Xcoordinate method.

2.2.2 Postulates for classical systems

A rough definition of the notion of classical phase space is: the (minimal) spaceon which all legitimate physical observables can be defined, or, equivalently,all legitimate questions can receive a definite answer. From this conceptualview, the classical phase space shares the same indeterminacy as the probabil-ity space. The only objects having physical relevance are the physical observ-ables (as is the case for random variables in probability theory.) Therefore, thesame system can be minimally described by two different phase spaces, de-pending on the set of questions to be answered.

Example 2.2.9. (Die rolling) Let the physical system be a die and the completeset of questions to be answered the set Q1, . . . ,Q6, where Qi , i = 1, . . . ,6 standsfor the question: “When the die lies at equilibrium on the table, does the topface read i ?” An obvious choice for the phase space isΩ= 1, . . . ,6. The randomvariable X corresponding to the physical observable “value of the top face” isrealised by X (ω) =ω,ω ∈Ω and the questions by Qi = 1 X=i , for i = 1, . . . ,6.

/Users/dp/a/ens/mq/iq-phase.tex 11 Stabilised version of 16 September 2013

Page 21: 2_pdfsam_iq.pdf

2.2. Classical physics as a probability theory with a dynamical law

Example 2.2.10. Consider the same physical system as in example 2.2.9 andthe same set of questions but think of the die as a solid body that can evolvein the space. To completely describe its state, we need 3 coordinates for itsbarycentre, 3 coordinates for the velocity of the barycentre, 3 coordinates forthe angular velocity, and 3 Euler angles for the unit exterior normal at the cen-tre of face “6”. Thus, Ω= R9 ×S2. Now the realisation X :Ω→ 1, . . . ,6 is muchmore involved (but still possible in principle) and the questions are again rep-resented by Qi = 1 X=i , for i = 1, . . . ,6. Yet, obviously, the representation givenin example 2.2.9 is much simpler than the present one.

Exercise 2.2.11. Determine the phase space for a point mass in dimension 1subject to the force exerted by a spring of elastic constant k.

Solution: Recall that a point mass m in dimension 1 obeys Newton’s equation:

md 2x

d t 2(t ) = F (x(t )),

subject to the initial conditions x(0) = x0 and x(0) = v0, where x(t ) denotesthe position of the mass at instant t and F (y) denotes the force exerted by thespring on the particle when it is at position y . It reads F (y) = k(y − y0) where y0

is the equilibrium elongation of the spring. The kinetic energy, K , of the particleis a quadratic form in the velocity

K (x) = m

2x2

and the potential energy, U , is given by

U (x) =−∫ x

x0

F (y)d y.

In order to conclude, we need the following

Theorem 2.2.12. The total energy H(x, x) = K (x)+U (x) is a constant of motion,i.e. does not depend on t.

Proof.

d

d t(K (x)+U (x)) = mxx + ∂U

∂x(x)x

= x(mx −F (x))

= 0.

ä

/Users/dp/a/ens/mq/iq-phase.tex 12 Stabilised version of 16 September 2013

Page 22: 2_pdfsam_iq.pdf

p

q

H ′

H

Figure 2.1: The phase space for a point mass in dimension one.

Hence the Newton’s equation is equivalent to the system of first order dif-ferential equations, known as Hamilton’s equations:

d p

d t= −∂H

∂qd q

d t= ∂H

∂p,

subject to the initial condition (q(0)p(0)

)=

(q0

p0

),

where p = mx, q = x, and H = p2

2m +U (q). Therefore, the phase space for thepoint mass in dimension one is R2 (one dimension for the position, q , and onefor the momentum p.) Moreover, this space is stratified according to constantenergy surfaces that are ellipses for the case of elastic spring, because potentialenergy is quadratic in q (see figure 2.1.)

If ω(t ) =(

q(t )p(t )

)∈ R2 represents the coordinate and momentum of the sys-

tem at time t , the time evolution induced by the system of Hamilton’s equationscan be thought as the flow on R2, described by ω(t ) = Ttω(0), with initial con-

dition ω(0) =(

q0

p0

).

Postulate 2.2.13. The phase space of a classical system is an abstract measur-able space (Ω,F ). The states of a classical system are the probability measureson (Ω,F ). Pure states correspond to Dirac masses. When two systems, respec-tively described by (Ω1,F1) and (Ω2,F2) are merged and considered as a single

/Users/dp/a/ens/mq/iq-phase.tex 13 Stabilised version of 16 September 2013

Page 23: 2_pdfsam_iq.pdf

2.2. Classical physics as a probability theory with a dynamical law

system, their phase space is (Ω1×Ω2,F1⊗F2), where F1⊗F2 is the sigma algebragenerated by F1 ×F2.

Postulate 2.2.14. Any time evolution of an isolated classical system is imple-mented by an invertible measurable transformation T :Ω→Ω leaving the statesinvariant.

Postulate 2.2.15. To any physical observable of a classical system correspondsa random variable X :Ω→X, where (X,X ) is a measurable space. Yes-no ques-tions are special observables of the form Q : Ω→ 0,1. Measurement of a clas-sical observable X when the system is in state µ, corresponds in determining itslaw under µ.

Remark 2.2.16. Questions are special kinds of random variables. They alwayscan be written as Q = 1 A X , where X : Ω→ X ⊆ R is a random variable, andA ∈ X . Any random variable X is termed physical observable. Questions arespecial types of physical observables. Since any question is an indicator, it ver-ifies Q2 = Q i.e. it is a projector. When dealing with a single random variable(physical observable), the complete set of possible questions is in bijection withtheσ-algebra X of measurable subsets ofX. If Q A = 1 A X and QB = 1 B X aretwo different questions and moreover A ∩B = ; then Q AQB = 0, i.e. questionstesting disjoint sets in the range of a random variable are orthogonal projectors.

Exercise 2.2.17. Let µ be a state on (Ω,F ), X aX-valued random variable (X⊆R), and Q A the question 1 X∈A for some fixed A ∈ X . Compute µ(Q A). Whathappens if µ is a pure state? What happens if (Ω,F ) is minimal for the randomvariable X ?

Solution:

µ(Q A) =∫Ω

1 X∈A(ω)µ(dω)

=∫Ω

1 A(X (ω))µ(dω)

If µ= δω0 for some ω0.

δω0 (Q A) =∫Ω

1 X∈A(ω)δω0 (dω)

= 1 A(X (ω0)).

If the space is minimal for X , then X (ω) = ω and we get respectively: µ(Q A) =∫Ω1 A(X (ω))µ(dω) =µ(A) and δω0 (Q A) = δω0 (A). ä

Exercise 2.2.18. What is the minimal phase space for a mechanical systemcomposed by N point particles in dimension 3?

/Users/dp/a/ens/mq/iq-phase.tex 14 Stabilised version of 16 September 2013

Page 24: 2_pdfsam_iq.pdf

2.3 Classical probability does not suffice to describeNature!

Quantum mechanics has been proposed as a theory explaining physical phe-nomena occurring mainly in microscopic systems. Nowadays quantum me-chanics provides us with a formalism that has been successfully tested in allknown experimental situations; not a single prediction made by quantum the-ory has ever been falsified by an experiment! Nevertheless, the quantum for-malism remains highly counter-intuitive and several physicists have advancedthe hypothesis that the theory is incomplete. The most prominent among thosephysicists was Einstein2 who refused to admit3 the intrinsically stochastic na-ture of quantum mechanics in an influential paper [13] written with Podolsky4

and Rosen5 in 1935, based on a Gedankenexperiment6 and known as the EPRparadox7.

When dealing with physical theories, we are confronted with two basic no-tions, locality and realism. Locality means that we can always take actions thathave consequences only within a small region of space. In physics, localitystems from the finiteness of the speed of light. Since no interaction can propa-gate faster than light, no influence can be sensed in space points lying beyondthe wave front of light. Realism means that although experiments have alwaysrandom outputs, the observed randomness is nothing else than the reflectionof the imperfection of the measuring instruments. In a theory where realismapplies, there is no conceptual obstruction to think that there exists a state inwhich the system can be perfectly described in principle. The observed ran-domness is only a reflection of our incomplete knowledge of the precise state

2Albert Einstein, (Ulm) 1879 – (Princeton) 1955. Developed the theory of special (1905) andgeneral (1913) relativity, pillars of modern theoretical physics but has been awarded the NobelPrize of Physics in 1921 not for this achievement but for his explanation of the photoelectriceffect. Published more than 300 scientific papers of extreme originality. Beyond his scien-tific contributions Einstein was a humanist worried about war; he has signed with the Britishphilosopher and mathematician Bertrand Russell the Russell-Einstein manifesto against nu-clear weapons.

3Cf. for instance his famous aphorism “God does not play dice with the world” (quoted, forinstance, from the conversations with Hermanns, in William Hermanns, Einstein and the Poet:in search of the cosmic man, Branden Press, Brookline, MA (1983).

4Boris Yakovlevich Podolsky, (naturalised) American physicist 1896 – 1966.5Nathan Rosen, American physicsist1909 – 1995.6Literally: thought experiment. A very powerful epistemological method — developed by

Einstein — of questioning the validity of the theoretical predictions.7See the “Analyse” item at http://bibnum.education.fr/physique/physique-quantique/le-

paradoxe-epr for the unconventional beginnings and fate of the EPR paper.

/Users/dp/a/ens/mq/iq-phase.tex 15 Stabilised version of 16 September 2013

Page 25: 2_pdfsam_iq.pdf

2.3. Classical probability does not suffice to describe Nature!

of the system. The role of the EPR paradox was to show the incompleteness ofthe quantum theory.

Based on Einstein’s refutations, Bohm8 proposed, in [9, 10], a formalism ofquantum mechanics postulating the existence of “hidden variables” (i.e. nonobserved ones), allowing to describe the same phenomenology as quantummechanics without postulating an intrinsically stochastic nature of the theory.Therefore, the hidden variables formalism intended to restore the realism ofquantum mechanics. The introduction of hidden variables did not predict anynew phenomenon beyond those predicted by standard quantum theory andfor many years, it was the root of a mainly philosophical controversy betweenthe tenants of standard quantum theory (the only one we shall present in thesequel of this course) and the tenants of the hidden variables description.

A major conceptual step was performed by Bell9 who established in [7] thatif hidden variables existed then they should have predictable consequencesthat could be experimentally tested. In spite of the fundamental interest suchan experiment would have in the conceptual foundations of the theory, it wasdismissed for several years as an uninteresting philosophical quest not worththe efforts of respectable scientists10. It was only thanks to the ingeniousness ofseveral groups of experimental physicists around the world (Clauser, Shimony,Horne, Holt, Aspect, Dalibard, Roger who persevered in willing to experimen-tally test the hypothesis of hidden variables) that the existence of both local andrealistic physical theories has been refuted in the three seminal papers [4, 5, 3]of the group at the Université d’Orsay. This experiment, of the utmost funda-mental importance, refutes all physical theories that are both realistic and lo-cal. It established that quantum theory is local but does not satisfies realism;it describes phenomena that — when interpreted within classical probability— appear as non-local. This fact is sometimes wrongly termed quantum non-locality in the literature. This term will never used in the sequel of this course.

Therefore, before presenting the nowadays accepted form of quantum me-chanics, we spend some lines to describe Bell inequalities and explain in somedetails the Orsay experiment. We follow the exposition of [21].

8David Joseph Bohm, 1997 – 1992. American physicist who introduced the hidden variablesformalism in the quest of restoring realism of quantum mechanics.

9John Stewart Bell, 1928 – 1990. A Northern-Irish physicist with a major contribution knownas Bell’s theorem, establishing the experimental consequences that should have hidden vari-ables in quantum mechanics.

10Readers so inclined to philosophical meditation are invited to consider the ravages“fashion-led” or “project-oriented” research can cause to the advancement of science.

/Users/dp/a/ens/mq/iq-phase.tex 16 Stabilised version of 16 September 2013

Page 26: 2_pdfsam_iq.pdf

2.3.1 Bell’s inequalities

Proposition 2.3.1 (The three-variable Bell’s inequality). Let X1, X2, X3 be anarbitrary triple of 0,1-valued random variables defined on some probabilityspace (Ω,F ,P). Then

P(X1 = 1, X3 = 0) ≤P(X1 = 1, X2 = 0)+P(X2 = 1, X3 = 0).

Proof:

P(X1 = 1, X3 = 0) = P(X1 = 1, X2 = 0, X3 = 0)+P(X1 = 1, X2 = 1, X3 = 0)

≤ P(X1 = 1, X2 = 0)+P(X2 = 1, X3 = 0).

ä

Proposition 2.3.2 (The four-variable Bell’s inequality). Let X1, X2,Y1,Y2 be anarbitrary quadruple of 0,1-valued random variables defined on some proba-bility space (Ω,F ,P). Then

P(X1 = Y1) ≤P(X1 = Y2)+P(X2 = Y2)+P(X2 = Y1).

Proof: The random variables being 0,1-valued, it is enough to check on all 16possible realisations of the quadruple (X1(ω), X2(ω),Y1(ω),Y2(ω)) that

X1 = Y1 ⊆ [X1 = Y2]∨ [X2 = Y2]∨ [X2 = Y1].

ä

2.3.2 The Orsay experiment

The idea behind the Orsay experiment is to associate precise physical quanti-ties with 0,1-valued quantities. Classical theory of light assigns electromag-netic waves; the electric field oscillates in a plane perpendicular to the prop-agation direction known as polarisation. When monochromatic light emittedfrom a random source (i.e. unpolarised) of some intensity I passes through apolariser, the emerging beam is polarised in the direction of the polariser andhas intensity I /2. Now, it has been established that light beam is composed of a

/Users/dp/a/ens/mq/iq-phase.tex 17 Stabilised version of 16 September 2013

Page 27: 2_pdfsam_iq.pdf

2.3. Classical probability does not suffice to describe Nature!

α β

Figure 2.2: When a photon passes through the first polariser — oriented indirection α — emerges polarised in that direction. When it encounters a sec-ond polariser — oriented in direction β — passes through with probabilitycos2(α−β). If the photon is initially already polarised in direction α, nothingchanges if the first polariser is removed.

great number of elementary light quanta called photons; the corpuscular na-ture of light has been conjectured already by Gassendi11and Newton12 and ir-refutably confirmed by the photoelectric effect whose theoretical explanationwas given by Einstein. Therefore, the statement on intensities made above haveonly a statistical meaning; if a photon passing through a polariser oriented in agiven direction α, encounters a second polariser oriented in a direction β, hasprobability 1

2 cos2(α−β) to pass through (see figure 2.2). This is an experimen-tal fact, in accordance with both quantum mechanical prescriptions and withclassical electromagnetic theory of light.

If the experiment is to be explained in terms of classical probability, with ev-ery polariser in direction α ∈ [0,π/2] is associated a random variable Xα ∈ 0,1;the random variables X are defined on a probability spaces (Ω,F ,P) whereω ∈Ω represent the microscopic state of the photon. Now for the experimentalsetting depicted in figure 2.2, the random variables X are correlated as

E(XαXβ) =P(Xα = 1, Xβ = 1) = 1

2cos2(α−β).

But now there is a problem because this correlation cannot be that of classicalrandom variables. Choosing in fact three polarisations α1, α2, and α3, we haveP(Xαi = 1, Xα j = 0) = P(Xαi = 1)−P(Xαi = 1, Xα j = 1) = 1

2 (1− cos2(αi −α j )) =11Pierre Gasendi, French philosopher, priest, scientist, and mathematician 1592 – 1655.12Sir Isaac Newton, English physicist and mathematician, 1642 – 1727. In his very influential

work, its Philosophiæ naturalis principia mathematica established the classical theory of uni-versal gravitation and discovered what later became differential calculus to solve the equationsof motion.

/Users/dp/a/ens/mq/iq-phase.tex 18 Stabilised version of 16 September 2013

Page 28: 2_pdfsam_iq.pdf

αi

PM1

Coincidence monitoring

PM2

β j

Ca

Figure 2.3: Schematic view of the Orsay experiment [5]. A beam of calcium (Ca)atoms is triggered by a laser. When thus excited, calcium atom emit simultane-ously two photons (at different frequencies) in opposite directions and havingcorrelated polarisations. An ingenious system of optical switches is used whosenet effect can be described by the following equivalent description. The left po-lariser is oriented in one of the anglesα1 orα2, the right polariser into one ofβ1

or β2. After passing through the polarisers, the photons are counted by photo-multipliers (PM1) and (PM2) and only photons detected in synchronisation arerecorded. A careful design is made so that all photons travel on same opticallengths and the choice of left and right polarisation is made after the photonsare emitted (so that any causal influence of the choice of orientations on themanner the photons are emitted can be excluded).

12 sin2(αi −α j ) for i , j ∈ 1,2,3. The three-variable Bell inequality 2.3.1 readsthen

1

2sin2(α1 −α3) ≤ 1

2sin2(α1 −α2)+ 1

2sin2(α2 −α3).

The choice α1 = 0, α2 = π/6, and α3 = π/3 leads to the impossible inequality3/8 ≤ 1/8+1/8. Therefore, classical probability cannot describe this simple ex-periment.

On a second reading, this experiment is not very convincing because onarranging polarisers on the optical table as described above, there is nothingpreventing conceptually the second random variable Xβ to depend in fact onboth α and β. But then the correlation reads E(XαXα,β) = P(Xα = 1, Xα,β = 1) =12 cos2(α−β) and this can be satisfied by choosing, for instance, Xα and Xα,β

independent withP(Xα = 1) = 1/2 andP(Xα,β = 1) = cos2(α−β) which of coursecan be easily conceived.

The irrefutable evidence of the impossibility of describing Nature with merelyclassical probability is provided through the second experiment Aspect, Dal-

/Users/dp/a/ens/mq/iq-phase.tex 19 Stabilised version of 16 September 2013

Page 29: 2_pdfsam_iq.pdf

2.3. Classical probability does not suffice to describe Nature!

ibard and Roger performed in 1982, [3], schematically described in figure 2.3.The same analysis can be made as in the previous experimental setting. De-noting by Xα the 0,1-valued random variable quantifying the passage of thephoton through the left polariser and Yβ through the right one, it is experimen-tally established in [3] that,

P(Xα = Yβ) = 1

2sin2(α−β),

for every choice of α and β. (Note incidentally that the same conclusion is ob-tained using — the not yet presented — quantum formalism). Now the choiceα1 = 0, α2 =π/3, β1 =π/2, and β2 =π/6, should read 1 ≤ 1/4+1/4+1/4, mani-festly violating the four variable Bell’s inequality 2.3.2.

To better grasp the significance of this experiment, it has been proposed,see [21] for instance, to think of it as a card game between two players X and Ywho can pre-agree on any conceivable strategy in order to win the game. Thegame is described in the following

Exercise 2.3.3. (The Orsay experiment as a card game [21]) The game is playedbetween players X , Y (see figure 2.3), and A who acts as an arbiter and as gameleader (A like . . . Aspect).

Description of the game

• A disposes of a well shuffled deck of red and black cards (consider it as aninfinite sequence of i.i.d. red,black-valued random variables uniformlydistributed on red,black := r,b).

• X and Y are free to use random resources (e.g. dice) if they wish.• Before the game starts, X and Y agree on given strategy (deterministic,

non-deterministic, or random) how to determine a yes,no-valued vari-able out of the colour of the card they will be presented. Once the gamestarts, the players are not allowed any longer to communicate.

• A picks two cards from the deck and presents the one to X and the otherto Y (mind that X and Y don’t know each other’s card).

• X and Y apply their own pre-agreed strategy to the colour the are pre-sented with and simultaneously say yes or no.

• After the announcement of the players, the cards are laid on the table.Four different card pairs are possible (r r ), (r b), (br ), (bb), where the firstcolour refers to the colour of the X ’s card and the second to the Y ’s one(consider these colour pairs as boxes in a 2×2 board). If both players havegiven the same answer then 1 is written in the corresponding box, else 0is marked.

/Users/dp/a/ens/mq/iq-phase.tex 20 Stabilised version of 16 September 2013

Page 30: 2_pdfsam_iq.pdf

• In the course of the game, the boxes get filled by sequences of 0’s and 1’s.• Let πcc ′ , with c,c ′ ∈ r,b, be the limit of the empirical probability of 1’s

in the box corresponding to colours (cc ′) when the game runs indefi-nitely. The players win the game if πr r > πr b +πbr +πbb . The purposeis to show that there exists no strategy (deterministic, non-deterministic,or random) allowing the players to win the game.

Questions

1. Suppose that X and Y have agreed on the following strategy: X alwayssays “yes”, independently of the colour of the card presented to her/himand Y answers the question “is my card red?”. Compute explicitly thevalues of πcc ′ for c,c ′ ∈ r,b and show that with this deterministic strat-egy, the numbers πcc ′ satisfy the four variable Bell’s inequality πr r ≤πr b+πbr +πbb .

2. In the above strategy, the decision making process is described throughthe matrices DX and DY with DX : r,b× 0,1 → [0,1] (and similarly forDY ), defined respectively by

DX =(0 10 1

)and DY =

(0 11 0

),

interpreted as meaning P(Answer of X is a | card colour is c) = DX (c, a)for c ∈ r,b and a ∈ 0,1 (and similarly for Y ). Hence the previouslydescribed strategies are termed deterministic strategies. Determine alldeterministic strategies and show that for all of them, Bell’s inequality isverified.

3. Propose a plausible parametrisation of the space of all strategies (deter-ministic and random) and show that this space is convex. Is it a simplex?Show that the previous deterministic strategies are extremal points of thisspace.

4. Conclude that in general any strategy can be written as a convex com-bination of deterministic strategies. Is this decomposition unique? Howsuch a convex combination is related to hidden variables? Conclude thatno classical strategy exists allowing to win this game.

(Some hints concerning convexity and convex decomposition of stochastic ma-trices needed for the the two last questions can be found in the chapter “Markovchains on finite state spaces” of the lecture notes [22]).

/Users/dp/a/ens/mq/iq-phase.tex 21 Stabilised version of 16 September 2013

Page 31: 2_pdfsam_iq.pdf

2.4. Quantum systems

This exercise shows that any system described in terms of classical proba-bilities, even augmented to incorporate hidden variables, cannot win this game.But the Orsay experiment proved that Nature wins! Therefore, the quantumstrategy used by Nature is strictly more powerful than any classical strategy.Probably the first person to really understand this power was Feynman13 (see[15, chap. 7 and 15] for instance), who first conjectured the computational powerof quantum mechanics and was the first who proposed to use atoms as quan-tum computers.

2.4 Quantum systems

Quantum mechanics emerged thanks to various experimental and theoreticaladvances, due mainly to Bohr14, Planck15, Schrödinger16, Heisenberg17, Dirac18,and many others. Various formulations of quantum mechanics are possible.We start from the most straightforward one, historically introduced by von Neu-mann19 [35], based on the Hilbert space formalism. Later on, a more general

13Richard Phillips Feynman, 1918 – 1988. American physicist with major contributions inquantum mechanics, especially known for his work in the path integral formulation of quan-tum mechanics. Has been awarded the Nobel prize in Physics in 1965. Feynman was an un-paralleled populariser of physics; he left — co-authored with Leighton and Sands — as legacyto generations of young physicists the seminal 3-volume textbook known as “The Feynmanlecture on Physics”.

14Niels Henrik David Bohr, 1865 – 1962,. Danish physicist with foundational contributions inquantum mechanics.

15Max Karl Ernst Ludwig Planck, 1858 –1947. German physicist; has been awarded the NobelPrize in Physics in 1918. One of the originators of quantum theory.

16Erwin Rudolf Josef Alexander Schrödinger, Austrian physicist. Developed quantum theory.17Werner Karl Heisenberg, 1901 – 1976. German physicist with major contributions in quan-

tum mechanics. The uncertainty relations are named after him.18Paul Adrien Maurice Dirac, 1902 – 1984. English physicist with major contributions in

quantum electrodynamics. Predicted the existence of antimatter. Has been awarded the No-bel Prize in Physics in 1933. For the needs of quantum mechanics he introduced the notionof δ-“function” and its derivatives, known nowadays as Dirac measure and Dirac distributionsrespectively after they have been given a rigorous mathematical status by Laurent Schwartz.Another happy achievement of Dirac was the so-called “Dirac’s notation in terms of bras andkets” (it will presented in §3.6).

19 János / Johann / John von Neumann, 1903 – 1954, mathematician, physicist, and computerscientist (before this last term has been coined by lack of . . . any computer), born as MargittaiNeumann János Lajos in Hungary, achieved his PhD under the direction of Fejér Lipot at theInstitute of theoretical physics of the university of Budapest. Moved to Italy then Switzerlandwhere he completed studies as chemical engineer at the Eidegenössische technische HohcschuleZürich. Exerced as Privatdozent in Göttingen, Berlin and Hamburg but finaly emigrated in 1930

/Users/dp/a/ens/mq/iq-phase.tex 22 Stabilised version of 16 September 2013

Page 32: 2_pdfsam_iq.pdf

formulation [32], based on quantum logics and C∗-algebras, will be given; thisformulation has the advantage of allowing a unified treatment for both classicaland quantum systems. Another possible formulation is provided by the infor-mational formulation of quantum mechanics [11, 12]. The reader may won-der why so many formulations have been proposed so far. The answer is thatalthough all the formulations are totally satisfactory from the computationalpoint of view, their predictive power, and their adaptedness to explain diverseexperiments, none of the existing ones is philosophically and epistemologicallysatisfactory. Quantum mechanics is a partial theory, describing fragile systems,i.e. systems that eventually leave the quantum realm to enter the classical one;such a fragility demands for a unified treatment of classical and quantum for-malism, not guaranteed by any of the existing formalisms.

2.4.1 Postulates of quantum mechanics: the Hilbert space ap-proach

For the time being, we proclaim that a quantum system verifies the followingpostulates.

Postulate 2.4.1. The phase space of a quantum mechanical system is a complexHilbert space H. Unit vectors20 of H correspond to pure quantum states. Whentwo systems, respectively described by Hilbert spaces H1 and H2, are consideredas a single system, the composite system is described by the Hilbert spaceH1 ⊗H2

where ⊗ denotes the tensor product.

Postulate 2.4.2. Any time evolution of an isolated quantum system is describedby a unitary operator acting on H. Conversely, any unitary operator acting on Hcorresponds to a possible time evolution of the system.

Postulate 2.4.3. With every physical observable, OX , of a quantum system isassociated a self-adjoint operator X acting on the phase space H of the system.Yes-no questions are special self-adjoint operators that are projections. Measure-ment of an observable represented by the self-adjoint operator X for a quantum

to the United States (Institute of Adanced Studies - Princeton) to escape from the prosecutionsagainst Jews that had started in Germany. (Source the biography of von Neumann: [6]).

Among his major achievements are the mathematical foundations of quantum mechanics, aclass of operator algebras (known nowadays as von Neumann algebras), the construction of thefirst computer in the world, the introduction of the theory of games, . . . . (See the impressivesum of his collected works [36, 37, 37, 38, 39, 40, 41]).

20Strictly speaking, equivalence classes of unit vectors differing by a global phase, called rays.For the sake of simplicity we stick to unit vectors in this introductory section.

/Users/dp/a/ens/mq/iq-phase.tex 23 Stabilised version of 16 September 2013

Page 33: 2_pdfsam_iq.pdf

2.4. Quantum systems

system being in the pure state described by the unit vector ψ corresponds to thespectral measure on the real line induced by ⟨ψ |Xψ⟩.

These axioms will be revisited later. For the time being, it is instructing toillustrate the implications of these axioms on a very simple non-trivial quantumsystem and try to interpret their significance.

2.4.2 Interpretation of the basic postulates

In this subsection we study a quantum system whose phase spaceH=C2. Thisis the simplest non-trivial situation that might occur and could describe, forinstance, the internal degrees of freedom of an atom having two states. Noticehowever that in general, even for very simple finite systems, the phase space isnot necessarily finite-dimensional.

Interpretation of postulate 2.4.1

Every f ∈ H can be decomposed into f = f1ε1 + f2ε2 with f1, f2 ∈ C and ε1 =(10

)and ε2 =

(01

). If ‖ f ‖ 6= 0, denote by φ = f

‖ f ‖ the corresponding normalised

vector21.

Now φ=φ1ε1+φ2ε2 with |φ1|2+|φ2|2 = 1 is a pure state. The numbers |φ1|2and |φ2|2 are non-negative reals summing up to 1; therefore, they are inter-preted as a probability on the finite set of coordinates 1,2. Consequently, thecomplex numbers φ1 = ⟨ε1 |φ⟩ and φ2 = ⟨ε2 |φ⟩ are complex probability ampli-tudes, their squared modulus represents the probability that a system in a purestate φ is in the pure state ε1 or ε2.

Some precisions are needed to explain the tensor product construction ap-pearing in composite systems.

Definition 2.4.4. Let V ,W be two vector spaces. Their (algebraic) tensor prod-uct is a vector space, denoted V ⊗W with a bilinear tensor mapping τ definedby

V ×W 3 (x, y) 7→ τ(x, y) := x ⊗ y ∈V ⊗W

21General (unnormalised) vectors of H are denoted by small Latin letters f , g ,h, etc.; nor-malised vectors, representing rays, by small Greek letters φ,χ,ψ, etc.

/Users/dp/a/ens/mq/iq-phase.tex 24 Stabilised version of 16 September 2013

Page 34: 2_pdfsam_iq.pdf

satisfying the following universality property: let Z be an arbitrary vector spaceand β : V ×W → Z an arbitrary bilinear map. Then there exists a unique linearmap T : V ⊗W → Z such that β= T τ.

Remark 2.4.5. The meaning of this definition for finite dimensional spacesis the following: if (e1, . . . ,edimV ) and ( f1, . . . , fdimW ) are bases of V and W , then(ei ⊗ f j )1≤i≤dimV ;1≤ j≤dimW is a basis of V ⊗W , i.e. if v =∑

i vi ei and w =∑j w j f j ,

then τ(v, w) = v⊗w =∑i , j vi w j ei ⊗ f j . On the other hand, bilinearity of β reads

β(v, w) = ∑i , j vi w jβ(ei , f j ) ∈ Z . The unique linear map T is then defined by

its action on basis vectors by T (ei ⊗ f j ) = β(ei , f j ). Then obviously, β(v, w) =T (τ(x, y)) by construction.

In case of infinite-dimensional vector spaces some care must be paid ondefining the decomposition of vectors on the respective bases. When thesespaces are (infinite-dimensional separable) Hilbert spaces, their tensor prod-uct will be defined in §3.4.

Corollary 2.4.6. dim(V ⊗W ) = dimV dimW .

Definition 2.4.7. Let ψ ∈ H1 ⊗ ·· · ⊗Hn , for n ≥ 2. The pure state ψ is calledentagled if it cannot be written as a tensor product ψ=ψ1 ⊗·· ·⊗ψn , with ψi ∈Hi , for all i = 1, . . . ,n.

Example 2.4.8. Let n = 2 and H1 =H2 =H=C2. If we denote by (e0,e1) a basisofC2, a basis ofH⊗2 is given by (e0⊗e0,e0⊗e1,e1⊗e0,e1⊗e1). An arbitrary vectorψ ∈H⊗2 is decomposed as

ψ=ψ0e0 ⊗e0 +ψ1e0 ⊗e1 +ψ2e1 ⊗e0 +ψ3e1 ⊗e1.

If ψ2 = ψ3 = 0 while ψ1ψ1 6= 0, then ψ = ψ0e0 ⊗ e0 +ψ1e0 ⊗ e1 = e0 ⊗ (ψ0e0 +ψ1e1) and the state can still be written as a tensor product. If ψ1 =ψ2 = 0 whileψ0ψ3 6= 0 then the state cannot be written as a tensor product. Such states arecalled entangled22.

The entanglement is a distinctive property of quantum mechanics withoutclassical analogue. It is exploited from all the informational applications ofquantum mechanics (quantum computing, quantum cryptography, quantumcommunication, etc.)

22The notion was introduced by Schrödinger himself who named this property Ver-schränkung in German and translated into English (by Schrödinger himself) as entanglement.The term is commonly translated intrication in French, although the author of these linesprefers the term enchevêtrement.

/Users/dp/a/ens/mq/iq-phase.tex 25 Stabilised version of 16 September 2013

Page 35: 2_pdfsam_iq.pdf

2.4. Quantum systems

Interpretation of postulate 2.4.2

A unitary operator on H is a 2×2 matrix U , verifying UU∗ =U∗U = I . If φ is apure state, then ψ=Uφ verifies ‖ψ‖2 = ⟨Uφ |Uφ⟩ = ⟨φ |U∗Uφ⟩ = ‖φ‖2. There-fore quantum evolution preserves pure states. Moreover, due to the unitarity ofU , we have φ=U∗ψ, and since U∗ is again unitary, it corresponds to a possibletime evolution (as a matter of fact to the time reversed evolution of the one cor-responding to U .) This shows that time evolution of isolated quantum systemsis reversible.

Interpretation of postulate 2.4.3

This axiom has the most counter-intuitive consequences. Recall that any linearoperator X admits a spectral decomposition X = ∫

spec(X )λP (dλ): If X is self-adjoint, then spec(X ) ⊆ R. Let us illustrate with a very simple example: chose

for X the matrix X =(

1 2i−2i −2

). We compute easily

Eigenvalues Eigenvectors Projectorsλ u(λ) P (λ)

−3 1p5

( −i2

)15

(1 −2i2i 4

)2 1p

5

(2i1

)15

(4 2i

−2i 1

)

Hence

X = ∑λ∈−3,2

λP (λ)

= (−3)1

5

(1 −2i2i 4

)+2

1

5

(4 2i

−2i 1

).

The operators P (−3) and P (2) are self-adjoint (hence they correspond toobservables) and are projectors to mutually orthogonal subspaces. They playthe role of yes-no questions for a quantum system (recall remark 2.2.16.)

Now, let ψ ∈ H be a pure phase; since u(−3) and u(2) are two orthonor-mal vectors of H (hence also pure phases), they serve as basis to decompose

/Users/dp/a/ens/mq/iq-phase.tex 26 Stabilised version of 16 September 2013

Page 36: 2_pdfsam_iq.pdf

ψ = α−3u(−3)+α2u(2), with ‖ψ‖2 = |α−3|2 + |α2|2 = 1. Thus any pure state ψ,with probability |⟨ψ |u(−3)⟩|2 is in the pure state u(−3) and with probability|⟨ψ |u(2)⟩|2 is in the pure state u(2).

Compute further

⟨ψ |Xψ⟩ = ∑λ,λ′,λ′′

α∗λαλ′′λ′⟨u(λ) |P (λ′)u(λ′′)⟩

= ∑λ∈spec(X )

λ|αλ|2.

Yet (|αλ|2)λ∈spec(X ) can be interpreted as a probability on the set of the spectralvalues. Hence, the scalar product ⟨ψ |Xψ⟩ is the expectation of the spectralvalues with respect to the decomposition ofψ on the basis of eigenvectors. It isworth noticing that expectation of a classical random variable X taking valuesin a finite set x1, . . . , xn with probabilities p1, . . . , pn respectively, is

EX =n∑

i=1xi pi

=n∑

i=1

ppi xi

ppi

=n∑

i=1

ppi exp(−iθi )xi

ppi exp(iθi ),

with arbitrary θi ∈ R, i = 1, . . . ,n. Hence, classically, EX = ⟨ψ |Xψ⟩ with ψ =p

p1 exp(iθ1)...p

pn exp(iθn)

, verifying ‖ψ‖ = 1 and with X =

x1 0. . .

0 xn

. We have more-

over seen that classical probability is equivalent to classical physics; thanks tothe previous lines, it turns out that that it is also equivalent to quantum physicsinvolving solely diagonal self-adjoint operators as observables. The full flavourof quantum physics is obtained only when the observables are represented bynon-diagonal self-adjoint operators.

Consider now,

fλ = P (λ)ψ

= ⟨u(λ) |ψ⟩u(λ) if λ ∈ spec(X )

0 otherwise.

The vector fλ is in general unnormalised; the corresponding normalised state

φλ = P (λ)ψ‖P (λ)ψ‖ , well defined when λ ∈ spec(X ), has a very particular interpreta-

tion. Suppose we ask the question: “does the physical observable OX takes the

/Users/dp/a/ens/mq/iq-phase.tex 27 Stabilised version of 16 September 2013

Page 37: 2_pdfsam_iq.pdf

2.4. Quantum systems

value −3?” The answer, as in the classical case, is a probabilistic one: P(OX =−3) = |α−3|2 = ⟨ f−3 | f−3 ⟩ = ‖P (−3)ψ‖2. What is new, is that once we haveasked this question, the state ψ is projected on the eigenspace P (−3)H and isrepresented by the state φ−3. This means that asking a question on the systemchanges its state! This is a totally new phenomenon without classical coun-terpart. Asking questions about a quantum system corresponds to a quan-tum measurement. Hence, the measurement irreversibly changes (projects)the state of the system.

Summarising the interpretation of the three axioms, we have learnt that

• quantum mechanics has a probabilistic interpretation, generalising theclassical probability theory to a quantum (non-Abelian) one,

• quantum evolution is reversible,• quantum measurement is irreversible.

Were only to consider this generalisation of probability theory to a non-commutative setting and to explore its implications for explaining quantumphysical phenomena, should the enterprise be already a fascinating one. Butthere is even much more fascination about it: there has been demonstratedlately that quantum phenomena can serve to cipher messages in an unbreak-able way and these theoretical predictions have already been exemplified bycurrently working pre-industrial prototypes23.

In a more speculative perspective, it is even thought that in the near fu-ture there will be manufactured computers capable of performing large scalecomputations using quantum algorithms24. Should such a construction be re-alised, a vast family of problems in the (classical) complexity class of “exponen-tial time” could be solved in polynomial time on a quantum computer.

23See the article [14], articles in Le Monde (they can be found on the website of this course),the website www.idquantique.com of the company commercialising quantum cryptologic andteleporting devices, etc.

24Contrary to the quantum transmitters and cryptologic devices that are already available(within pre-industrial technologies), the prototypes of quantum computers that have beenmanufactured so far have still extremely limited scale capabilities.

/Users/dp/a/ens/mq/iq-phase.tex 28 Stabilised version of 16 September 2013

Page 38: 2_pdfsam_iq.pdf

2.5 First consequences of quantum formalism

2.5.1 Irreducibility of quantum randomness and Heisenberg’suncertainty principle

The first and more spectacular direct consequence of the quantum mechanicalformalism is the so called Heisenberg’s uncertainty principle establishing theconceptual and practical impossibility of considering systems with arbitrarilysmall randomness. Spectral decomposition allows computation of the expec-tation of an operator X , in a pure state, ψ, by

EψX = ⟨ψ |Xψ⟩ = ∑λ∈spec(X )

λ|ψλ|2

and when the operator X is self-adjoint, the spectrum is real and the expec-tation is then a real number. What makes quantum probability different fromclassical one, is (among other things) the impossibility of simultaneous diago-nalisation of two non-commuting operators. Following the probabilistic inter-pretation, denote by Varψ(X ) = Eψ(X 2)− (Eψ(X ))2.

Theorem 2.5.1 (Heisenberg’s uncertainty). Let X ,Y be two bounded self-adjointoperators on a Hilbert spaceH and suppose a fixed pure state ψ is given. Then√

Varψ(X )Varψ(Y ) ≥ |⟨ψ | [X ,Y ]ψ⟩|2

.

Proof. First notice that (i [X ,Y ])∗ = i [X ,Y ] thus the commutator is skew-adjoint.Without loss of generality, we can assume that EψX = EψY = 0 (otherwise con-sider X − EψX and similarly for Y .) Now, ⟨ψ |X Y ψ⟩ = α+ iβ, with α,β ∈ R.Hence, ⟨ψ | [X ,Y ]ψ⟩ = 2iβ and obviously

0 ≤ 4β2 = |⟨ψ | [X ,Y ]ψ⟩|2≤ 4|⟨ψ |X Y ψ⟩|2≤ 4⟨ψ |X 2ψ⟩⟨ψ |Y 2ψ⟩,

the last inequality being Cauchy-Schwarz. ä

This is a typically quantum phenomenon without classical counterpart. Infact, given two arbitrary classical random variables X ,Y on a measurable space(Ω,F ), there exists always states (i.e. probability measures) on (Ω,F ) such thatVar(X )Var(Y ) = 0 (for instance chose P(dω) = δω0 (dω).

/Users/dp/a/ens/mq/iq-phase.tex 29 Stabilised version of 16 September 2013

Page 39: 2_pdfsam_iq.pdf

2.5. First consequences of quantum formalism

2.5.2 Quantum explanation of the Orsay experiment

/Users/dp/a/ens/mq/iq-phase.tex 30 Stabilised version of 16 September 2013

Page 40: 2_pdfsam_iq.pdf

3Short resumé of Hilbert spaces

For the sake of completeness, some standard results on Hilbert spaces are re-minded in this chapter. Most of the proofs in this chapter are omitted becausethey are considered as exercises; they can be found in the classical textbooks[1, 16, 23, 25] which are strongly recommended for further reading.

3.1 Scalar products and Hilbert spaces

Hilbert spaces have many distinct features. They are C-vector spaces (henceare algebraic objects) equipped with a Hermitean scalar product (hence anglesand geometry follow) from which a Hilbert norm can be defined (they thus be-come analytic objects) for which they are complete s (hence a unit vectorψ hascomponents on an orthonormal basis (en) reading

∑n |ψn |2 = 1; therefore the

components can be interpreted as probability amplitudes).

Definition 3.1.1. Let X be a C-vector space, x, y, z ∈ X, and α,β ∈ C. A forms :X×X→C is a a scalar product if it verifies

• it is linear with respect to the second argument: s(x,αy +βz) =αs(x, y)+βs(x, z),

31

Page 41: 2_pdfsam_iq.pdf

3.1. Scalar products and Hilbert spaces

• it is Hermitean: s(x, y) = s(y, x),• it is positive: s(x, x) ≥ 0, and• it is definite: s(x, x) = 0 ⇔ x = 0.

The scalar product s is denoted usually ⟨ · | · ⟩ and the pair (X,⟨ · | · ⟩) is called apre-Hilbert space.

Lemma 3.1.2. Let (X,⟨ · | · ⟩) be a pre-Hilbert space and x, y ∈ X. The followinghold:

• Cauchy-Schwarz-Buniakovski inequality: |⟨x | y ⟩| ≤ ⟨x |x ⟩⟨ y | y ⟩,• The scalar product defines uniquely a norm ‖x‖ =p⟨x |x ⟩, called the Hilbert

norm, and is compatible with the Hilbert norm, i.e.

⟨x | y ⟩ = 1

4

3∑k=0

i k‖x + i k y‖2.

• The function ⟨ · | · ⟩ :X×X→C is continuous.

The scalar product of a pre-Hilbert space (X,⟨ · | · ⟩) induces a Hilbert norm‖x‖ = p⟨x |x ⟩ turning (X,‖ · ‖) into a normed space; furthermore, the Hilbertnorm defines a distance d(x, y) = ‖x − y‖, turning thus (X,d) into a metricspace. We can therefore define the notion of a fundamental (Cauchy) sequenceon it as being a sequence x = (xn)n∈N of vectors xn ∈X such that for every ε> 0

there exists N = N (ε) ∈ N such that for m,n ≥ N , we have d(xn , xm) < ε. Nev-ertheless, nothing imposes that any fundamental sequence converges withinX. If such is the case, the metric space (X,d) (or the normed space (X,‖ · ‖) istermed complete or Banach space. A pre-Hilbert space that is complete for themetric induced by its Hilbert norm is called a Hilbert space.

We recall that a complete metric space (X, d) is called a completion of ametric space (X,d) if there exists an isometric embedding ι : X→ X such thatthe image ι(X) is dense in X. An arbitrary normed space (not necessarily com-plete) can be completed via a standard procedure we recall here briefly. LetCS(X) be the set of Cauchy sequences onX, ∼ an equivalence relation on CS(X)defined by

x ∼ y ⇐⇒ limn→∞‖xn − yn‖ = 0,

and X = CS(X)/ ∼ the set of equivalence classes of ∼. The space X acquiresnaturally a vector space structure and through the definition

X 3 [x] 7→ ‖[x]‖ = limn→∞‖xn‖

/Users/dp/a/ens/mq/iq-hilbe.tex 32 lud on 2 October 2013 at 13:45

Page 42: 2_pdfsam_iq.pdf

— that can be shown to be independent of the representative x of [x] — be-comes a complete normed space. On identifying elements of X with constantsequences X we establish a canonical embedding ι :X→ X. One can show thatthe completion is unique up to isomorphisms (see [16, §1.6, pp. 17–21] for thedetails).

Exercise 3.1.3. The following are classical examples.

1. The finite-dimensional vector spaceX=Cd with the ordinary scalar prod-uct ⟨x | y ⟩ =∑d

n=1 xn yn is obviously a Hilbert space.2. The space X= `p (N) = x :N→ C;

∑n∈N |xn |p <∞ is a complete normed

space for all p ≥ 1, with norm ‖x‖p = (∑

n∈N |xn |p )1/p . In the particularcase p = 2 it becomes a Hilbert space with a scalar product, compatiblewith ‖ · ‖2, defined by ⟨x | y ⟩ = ∑

n∈N xn yn . It is the infinite-dimensionalgeneralisation of the previous example.

3. The space X= Lp ([a,b],λ) with p ≥ 1 can be equipped with a norm ‖ · ‖p

defined by ‖x‖p = (∫

[a,b] |x(t )|pλ(d t ))1/p . Then (X,‖·‖p ) is a Banach space.For p = 2, it becomes also a Hilbert space for the scalar product ⟨x | y ⟩ =∫

[a,b] x(t )y(t )λ(d t ).

4. The space X = C k ([a,b]) can be equipped with a scalar product ⟨x | y ⟩ =∑kj=0

∫[a,b] x( j )(t )y ( j )(t )d t . The corresponding norm is denoted ‖ · ‖W k,2

but the normed space (X,‖·‖W k,2 ) is not complete. Its completion is calledSobolev space W k,2([a,b]) and is a Hilbert space. In particular, W 0,2 = L2.

Definition 3.1.4. Let X :H1 →H2 be a linear operator between Hilbert spacesH1 andH2. Its norm is defined by

‖X ‖ := ‖X ‖H1,H2 = suph∈H1:‖h‖H1

‖X h‖H2

‖h‖H1

.

The definition holds also in the special case when H2 = C (in which case theoperator is a form). When ‖X ‖ < ∞ the operator is termed bounded. If anoperator is unbounded it can only be defined on its domain Dom(X ).

3.2 Orthogonality and projection; direct and orthog-onal sums

In this sectionHwill always denote a Hilbert space equipped with a scalar prod-uct ⟨ · | · ⟩ and corresponding Hilbert norm ‖ ·‖.

/Users/dp/a/ens/mq/iq-hilbe.tex 33 lud on 2 October 2013 at 13:45

Page 43: 2_pdfsam_iq.pdf

3.2. Orthogonality and projection; direct and orthogonal sums

Definition 3.2.1. 1. Two vectors g ,h ∈H are orthogonal if ⟨g |h ⟩ = 0.2. Two subsets A,B ⊂H are called orthogonal if ∀a ∈ A and ∀b ∈ B , we have

⟨a |b ⟩ = 0.3. If A ⊂H, its orthogonal complement is defined as

A⊥ := h ∈H : ∀a ∈ A,⟨a |h ⟩ = 0 .

Theorem 3.2.2. If A ⊂H, then A⊥ is a Hilbert subspace (closed vector subspace)ofH.

Theorem 3.2.3. 1. Let K ⊂H be a closed convex subset ofH. Then

∀h ∈H,∃!k0 ∈ K : ‖h −k0‖ = infk∈K

‖h −k‖.

2. LetK be a Hilbert subspace ofH. Then

(a) ∀h ∈H,∃!k0 ∈ K : ‖h −k0‖ = infk∈K ‖h −k‖, and(b) the vector k0 is the unique element ofK such that (h −k0) ⊥K.

Exercise 3.2.4. Let L 1(Ω,F ,P;R) denote the vector space of integrable ran-dom variables over some probability space (Ω,F ,P) and G a sub-σ-algebra ofF . Denote by X = L 2(Ω,F ,P;R) and Y = L 2(Ω,G ,P;R). On X a sesquilinearform s can be defined by s(X ,Y ) = ∫

Ω X (ω)Y (ω)P(dω) that can be turned into ascalar product by considering the space L2 instead of L 2.

1. Use theorem 3.2.3 to establish that for every X ∈ X there exists a Y ∈ Y(unique up to modifications differing from X on P-negligible sets), suchthat X −Y ⊥ Z for all Z ∈Y.

2. Using the previous result, with Z = 1 G for an arbitrary G ∈G , to establishthat Y is a version of the (classical) conditional expectation E(X |G ).

3. Use the density of L 2 into L 1 and the monotone convergence theoremto establish that for every random variable X ∈ L 1(Ω,F ,P;R), there ex-ists a random variable Y ∈L 1(Ω,G ,P;R) verifying

∀G ∈G ,∫

GX (ω)P(dω) =

∫G

Y (ω)P(dω).

Definition 3.2.5. Let X be a vector space and V,W vector subspaces of X.

1. If for every x ∈X, there exists a unique v ∈V (and consequently a uniquew ∈W) such that x = v +w , we say that X is the direct sum of V and Wand write X=V⊕W.

/Users/dp/a/ens/mq/iq-hilbe.tex 34 lud on 2 October 2013 at 13:45

Page 44: 2_pdfsam_iq.pdf

2. If X=V⊕W, define a linear operator PX→V by

X 3 x = v +w 7→ P x = P (v +w) := v ∈V.

Remark 3.2.6. The decompositionX=V+W is unique if and only ifV∩W= 0;we write then X =V⊕W. If P is the operator defined in 3.2.5, obviously P 2x =P 2(v +w) = P v = v = P x. Hence P 2 = P . Additionally imP =V and kerP =W.

Definition 3.2.7. A projection on a vector spaceX is a linear operator P :X→X

satisfying the condition P 2 = P .

There exists a bijection between projections and decompositions in directsums as stated in the following

Theorem 3.2.8. Let X be a vector space.

1. If a linear operator P is a projection on X, then X= imP ⊕kerP.2. If V and W are vector subspaces of X such that X = V⊕W, there exists a

projection P on X such that imP =V and kerP =W.

When the vector space X is a Hilbert space (hence equipped with a scalarproduct) and V a Hilbert subspace of X, we can consider the case W=V⊥. Ob-viously then we can decompose the space into the direct sum X=V⊕V⊥. Theprojection operator is then defined analogously, thanks to the theorem 3.2.8,but now we have for x = v +w and x ′ = v ′+w ′ that

⟨P x |x ′ ⟩ = ⟨v |v ′+w ′ ⟩ = ⟨v |v ′ ⟩ = ⟨v |P x ′ ⟩ = ⟨x |P x ′ ⟩.

We have thus:

Definition 3.2.9. A linear operator P : H → H is an orhtoprojection on H ifP is a projection (i.e. P 2 = P ) and for every pair h,h′ ∈ H, we have ⟨Ph |h′ ⟩ =⟨h |Ph′ ⟩ (i.e.1 P∗ = P ).

Again we can establish a bijection between orthoprojections and decompo-sitions into orthogonal Hilbert subspaces as shown in the next

Theorem 3.2.10. 1. If P is an orthoprojection on H, then imP is closed andH= imP ⊕kerP.

2. If V is a Hilbert subspace of H then there exists an orthoprojection P on Hsuch that imP =V and kerP =V⊥.

1See ?? below.

/Users/dp/a/ens/mq/iq-hilbe.tex 35 lud on 2 October 2013 at 13:45

Page 45: 2_pdfsam_iq.pdf

3.3. Duality and Fréchet-Riesz theorem; adjoint

Exercise 3.2.11. LetH= L2(R).

1. IfV is the Hilbert subspace of even square integrable functions, thenV⊥ isthe Hilbert subspace of odd square integrable functions, then P,Q definedby

Ph(x) = h(x)+h(−x)

2, and Qh(x) = h(x)−h(−x)

2are orthoproejections onH.

2. If A ∈B(R) andV= h ∈H : h = 1 Ah, thenV is a vector subspace ofH notnecessarily closed. Nevertheless, an orthoprojection P can be associatedwith the decomposition H = V⊕V⊥, where V is the Hilbert subspace offunctions with support contained in A.

Exercise 3.2.12. An orthoprojection P 6= 0 onH has norm ‖P‖ = 1.

Compléter: exemples de mesures étrangères et de somme infinie dénom-brable d’espaces.

3.3 Duality and Fréchet-Riesz theorem; adjoint

Compléter.

3.4 Tensor product of Hilbert spaces

The notion of algebraic tensor product is given in 2.4.4 Here we extend this no-tion to hold in the case where the factor spaces of the tensor product are notmere vector spaces but (infinite dimensional) separable Hilbert spaces. Intu-itively, we interpret the tensor map τ :G×H→L as a kind of product τ(g ,h) =g ⊗h ∈L . We wish to equip L with a scalar product rendering it a pre-Hilbertspace that will be ultimately be completed for the Hilbert norm to a Hilbertspace L. This construction has been carried on in [42, §3.4, pp. 47–49] andmore extensively in [17, §II.4, pp. 49–54] that, although based on the ideas of[42] is more direct. When the factors of the tensor product are Banach spaces,this construction is carried on in [27]. The construction proposed here followsthe [23, pp. 49–54] account of [42].

Let us now explain the main steps of the construction.

/Users/dp/a/ens/mq/iq-hilbe.tex 36 lud on 2 October 2013 at 13:45

Page 46: 2_pdfsam_iq.pdf

Let G,H be separable Hilbert spaces. For every ζ ∈ G and η ∈ H, constructthe coniugate bilinear form ζ⊗η defined by

G×H 3 (g ,h) 7→ ζ⊗η(g ,h) := ⟨g |ζ⟩G⟨h |η⟩H,

and consider the linear manifold of the finite linear combination of such forms

L :=

n∑i=1

ciζi ⊗ηi ,ζi ∈G,ηi ∈H, i = 1, . . . ,n,n ∈N

.

Obviously L is a vector space. A sesquilinear form ⟨ · | · ⟩L is defined by its ac-tion on simple tensor products in L , by

⟨ζ⊗η |ζ′⊗η′ ⟩L := ⟨ζ |ζ′ ⟩G⟨η |η′ ⟩H

and extended by linearity on all elements of L .

Lemma 3.4.1. The sesquilinear form ⟨ · | · ⟩L is

1. well defined, and2. positive defined.

Proof. 1. No particular hypothesis has been made on the vectors ζi and ηi

entering in the decomposition of forms in L ; therefore, the decompo-sition of any bilinear form β ∈ L into elementary tensor product formsis not necessarily unique. To establish well definiteness of ⟨β |β′ ⟩L , wemust show that the result is independent of the representation used inβ and β′. It is enough to show that if µ denotes the zero form, then⟨β |µ⟩L = 0 for any β ∈ L . Let β = ∑M

i=1 ciζi ⊗ηi be an arbitrary form.

Then

⟨β |µ⟩L =M∑

i=1c i ⟨ζi ⊗ηi |µ⟩ =

M∑i=1

c iµ(ζi ,ηi ) = 0,

establishing thus well definiteness.2. Let again β=∑M

i=1 ciζi ⊗ηi and Compléter.

/Users/dp/a/ens/mq/iq-hilbe.tex 37 lud on 2 October 2013 at 13:45

Page 47: 2_pdfsam_iq.pdf

3.5. Orhonormal systems, ortonormal bases; criteria of completeness of a basis

3.5 Orhonormal systems, ortonormal bases; crite-ria of completeness of a basis

3.6 Dirac’s bra and ket notation

Dirac’s notation is a very convenient shorthand notation for dealing with Hilbertspaces, scalar products, tensor products and forms in quantum mechanics.

Usual notation Dirac’s notation

n symbols, eg. e1, . . . ,enOrthonormal basis (e1, . . . ,en) |e1 ⟩, . . . |en ⟩

ψ=∑i ψi ei |ψ⟩ =∑

i ψi |ei ⟩⟨φ|ψ⟩ =∑

φiψi ⟨φ|ψ⟩ =∑φiψi

H∗ = f :H→C, linear H∗ = f :H→C, linear† :H→H∗ † :H→H∗

† :φ 7→ f (φ(·) = ⟨φ|· ⟩ † : |φ⟩ 7→ ⟨φ|⟨φ|ψ⟩ = fφ(ψ) ⟨φ|ψ⟩ = ⟨φ||ψ⟩

X = X ∗ X = X ∗

⟨φ|Xψ⟩ = ⟨X ∗φ|ψ⟩ = ⟨Xφ|ψ⟩ ⟨φ|X |ψ⟩X u(λi ) =λi u(λi ) X |λi ⟩ =λi |u(λi )⟩P (λi ) projector |u(λi )⟩⟨u(λi )|X =∑

i λi P (λi ) X =∑i λi |u(λi )⟩⟨u(λi )|

Tensor product φ⊗ψ |φ⟩⊗ |ψ⟩ = |φψ⟩

Exercise 3.6.1. Let (en)n∈N be an orthonormal basis of a Hilbert spaceH.

1. What is the interpretation of |en ⟩⟨en | for some n?2. If φ and ψ are unit vectors ofH, what is the interpretation of |φ⟩⟨ψ |?3. What is the interpretation of the identity

∑n∈N |en ⟩⟨en | s= I (where

s= de-notes the strong limit of the partial sums)?

4. Let H = L2(T) and (en) the basis of trigonometric polynomials en(t ) =exp(i nt ). Derive the Parseval formula using the Dirac formalism.

Compléter avec décomposition matricielle, produit tensoriels.

/Users/dp/a/ens/mq/iq-hilbe.tex 38 lud on 2 October 2013 at 13:45

Page 48: 2_pdfsam_iq.pdf

3.7 Some more involved quantum mechanical phe-nomena

3.7.1 Composite systems and entanglement

3.7.2 Decoherence and quantum to classical transition

3.8 Rigged Hilbert spaces and generalised kets

/Users/dp/a/ens/mq/iq-hilbe.tex 39 lud on 2 October 2013 at 13:45

Page 49: 2_pdfsam_iq.pdf

3.8. Rigged Hilbert spaces and generalised kets

/Users/dp/a/ens/mq/iq-hilbe.tex 40 lud on 2 October 2013 at 13:45

Page 50: 2_pdfsam_iq.pdf

4Algebras of operators

4.1 Introduction and motivation

Let V = Cn , with n ∈ N. Elementary linear algebra establishes that the set oflinear mappings L(V) = T :V→V : T linear is a C-vector space of (complex)dimension n2, isomorphic to Mn(C), the space of n ×n matrices with complexcoefficients. Moreover, if S,T ∈ L(V), the maps S and T can be composed,their composition T S being represented by the corresponding matrix prod-uct. Thus, on the vector space L(V), is defined an internal multiplication

L(V)×L(V) 3 (T,S) 7→ T S ∈L(V)

turning this vector space into an algebra.

When the underlying vector space V is of infinite dimension, caution mustbe paid on defining linear maps. In general, linear mappings T :V→V, called(linear) operators, are defined only on some proper subset ofVdenoted Dom(T )and called the domain1 of T . WhenV is a normed space, there is a natural wayto define a norm on L(V). We denote by B(V) the vector space of bounded lin-

1The set Dom(T ) is generally a linear manifold, i.e. algebraically a vector subspace of Vwhich is not necessarily topologically closed.

41

Page 51: 2_pdfsam_iq.pdf

4.2. Algebra of operators

ear operators on V, i.e. linear maps T :V→V such that ‖T ‖ <∞ (equivalently,verifying Dom(T ) = V.) When H is a Hilbert space, bounded linear operatorson H, whose set is denoted by B(H), with operator norm ‖T ‖ = sup‖T x‖, x ∈H,‖x‖ ≤ 1, share the properties of linear operators defined on more algebraicsetting. Sometimes it is more efficient to work with explicit representations ofoperators in B(H) (that play the rôle of matrices in the infinite dimensionalsetting) and some others with abstract algebraic setting.

Since all operators encountered in quantum mechanics are linear, we drophenceforth the adjective linear.

4.2 Algebra of operators

Definition 4.2.1. An algebra is a set A endowed with three operations:

1. a scalar multiplication C×A 3 (λ, a) 7→λa ∈A,

2. a vector addition A×A 3 (a,b) 7→ a +b ∈A, and

3. a vector multiplication A×A 3 (a,b) 7→ ab ∈A,

such that A is a vector space with respect to scalar multiplication and vectoraddition and a ring (not necessarily commutative) with respect to vector addi-tion and vector multiplication. Moreover, λ(ab) = (λa)b = a(λb) for all λ ∈ Cand all a,b ∈ A. The algebra is called commutative if ab = ba, for all a,b ∈ A;it is called unital if there exists (a necessarily unique) element e ∈A (often alsowritten 1 or 1 A) such that ae = ea = a for all a ∈A;

A linear map from an algebra A1 to an algebra A2 is a homomorphism if itis a ring homomorphism for the underlying rings, it is an isomorphism if it is abijective homomorphism.

Definition 4.2.2. An involution on an algebra A is a map A 3 a 7→ a∗ ∈A thatverifies

1. (λa +µb)∗ =λa∗+µb∗,

2. (ab)∗ = b∗a∗, and

/Users/dp/a/ens/mq/iq-algop.tex 42 lud on 1 October 2013

Page 52: 2_pdfsam_iq.pdf

3. (a∗)∗ = a.

Involution is also called adjoint operation and a∗ the adjoint of a. An involu-tive algebra is termed a ∗-algebra.

An element a ∈A is said normal if aa∗ = a∗a, an isometry if a∗a = 1 , uni-tary if both a and a∗ are isometries, self-adjoint or Hermitean if a = a∗. Ondenoting h : A1 → A2 a homomorphism between two ∗-algebras, we call it a∗-homomorphism if it preserves adjoints, i.e. h(a∗) = h(a)∗.

A normed (respectively Banach) algebra A is an algebra equipped with anorm map ‖·‖ : A→R+ that is a normed (respectively Banach) vector space forthe norm and verifies ‖ab‖ ≤ ‖a‖‖b‖ for all a,b ∈A. A is normed (respectivelyBanach) ∗-algebra if it has an involution verifying ‖a∗‖ = ‖a‖ for all a ∈A.

Theorem 4.2.3. Let T :H1 →H2 be a linear map between two Hilbert spaces H1

andH2. Then the following are equivalent:

1. ‖T ‖ = sup‖T f ‖H2 , f ∈H1,‖ f ‖H1 ≤ 1 <∞,

2. T is continuous,

3. T is continuous at one point ofH1.

Proof: Analogous to the proof of the theorem ?? for linear functional. (Pleasecomplete the proof!) äNotation 4.2.4. We denote by B(H1,H2) the algebra of bounded operators withrespect to the aforementioned norm:

B(H1,H2) = T ∈L(H1,H2) : ‖T ‖ <∞.

WhenH1 =H2 =H, we write simply B(H).

Proposition 4.2.5. Let H1 and H2 be two Hilbert spaces and T ∈ B(H1,H2).Then, there exists a unique bounded operator T ∗ :H2 →H1 such that

⟨T ∗g | f ⟩ = ⟨g |T f ⟩ for all f ∈H1, g ∈H2.

Proof: For each g ∈H2, the map H1 3 f 7→ ⟨g |T f ⟩H2∈ C is a continuous (why?)

linear form. By Riesz-Fréchet theorem ??, there exists a unique h ∈ H1 suchthat ⟨h | f ⟩H1

= ⟨g |T f ⟩H2, for all f ∈ H1. Let T ∗ : H2 → H1 be defined by the

assignment T ∗g = h; it is obviously linear and easily checked to be bounded(exercise!) ä

/Users/dp/a/ens/mq/iq-algop.tex 43 lud on 1 October 2013

Page 53: 2_pdfsam_iq.pdf

4.2. Algebra of operators

Proposition 4.2.6. For all T ∈B(H1,H2),

1. ‖T ∗‖ = ‖T ‖,

2. ‖T ∗T ‖ = ‖T ‖2.

Proof:

1. By Cauchy-Schwarz inequality, for all f ∈H2, g ∈H1,

|⟨ f |T g ⟩H2| ≤ ‖ f ‖H2‖T g‖H2

≤ ‖T ‖B(H1,H2)‖g‖H1‖ f ‖H2

so that‖T ‖ ≥ sup|⟨ f |T g ⟩| : ‖g‖ ≤ 1,‖ f ‖ ≤ 1.

Conversely, we may assume that ‖T ‖ 6= 0, and therefore choose some ε ∈]0,‖T ‖/2[. Choose now g ∈H1 with ‖g‖ ≤ 1, such that ‖T g‖ ≥ ‖T ‖−ε andf = T g

‖T g‖ ∈H2, ‖ f ‖ = 1. For this particular choice of f and g :

|⟨ f |T g ⟩H2| ≥ ‖T g‖ ≥ ‖T ‖−ε.

Hence,sup|⟨ f |T g ⟩| : ‖g‖ ≤ 1,‖ f ‖ ≤ 1 ≥ ‖T ‖−ε.

Since ε is arbitrary, we get ‖T ‖ = sup|⟨ f |T g ⟩H2| : g ∈ H1, f ∈ H2,‖g‖ ≤

1,‖ f ‖ ≤ 1. As ⟨ f |T g ⟩ = ⟨T ∗ f |g ⟩ for all f and g , we get ‖T ∗‖ = ‖T ‖

2. B(H1,H2) being a normed algebra, ‖T ∗T ‖ ≤ ‖T ∗‖‖T ‖ = ‖T ‖2. Conversely,

‖T ‖2 ≤ sup|T f ‖ : f ∈H1,‖ f ‖ ≤ 1

= sup|⟨T f |T f ⟩| : f ∈H1,‖ f ‖ ≤ 1

= sup|⟨ f |T ∗T f ⟩| : f ∈H1,‖ f ‖ ≤ 1

≤ ‖T ∗T ‖.

äDefinition 4.2.7. A C∗-algebra A is an involutive Banach algebra verifying ad-ditionally

‖a∗a‖ = ‖a‖2, for all a ∈A.

/Users/dp/a/ens/mq/iq-algop.tex 44 lud on 1 October 2013

Page 54: 2_pdfsam_iq.pdf

Example 4.2.8. LetXbe a compact Hausdorff2 space and A= f :X→C | f continuous ≡C (X) Define

1. C×A 3 (λ, f ) 7→λ f ∈A by (λ f )(x) =λ f (x),∀x ∈X,

2. A×A 3 ( f , g ) 7→ f + g ∈A by ( f + g )(x) = f (x)+ g (x),∀x ∈X,

3. A×A 3 ( f , g ) 7→ f g ∈A by ( f g )(x) = f (x)g (x),∀x ∈X,

4. A 3 f 7→ f ∗ ∈A by f ∗(x) = f (x),∀x ∈X,

Then A is a unital (specify the unit!) C∗-algebra for the norm ‖ f ‖ = supx∈X | f (x)|.(Prove it!) The algebra A is moreover commutative.

Example 4.2.9. LetH1 andH2 be two Hilbert spaces. Then B(H1,H2) is a unitalC∗-algebra. In general, this algebra is not commutative.

This example has also a converse, given in theorem 4.5.2, below.

Example 4.2.10. Let X be a compact Hausdorff space. Then A = C (X) := f :X→ C, continuous equipped with the uniform norm and pointwise multipli-cation is a unital Banach commutative algebra; further equipped with an invo-lution defined by complex conjugation, becomes a B∗-algebra.

Example 4.2.11. A= `1(Z), with à compléter.

Example 4.2.12. A= L1(R), with à compléter.

The previous example must not induce the reader to erroneously concludethat non-unital algebras have natural approximate identities since à compléter.

4.3 Convergence of sequences of operators

4.4 Classes of operators in B(H)

We shall see that any C∗-algebra can be faithfully represented on some Hilbertspace H; the different classes of abstract elements of the algebra, introduced

2Recall that a topological space is called Hausdorff when every two distinct of its pointsposses disjoint neighbourhoods.

/Users/dp/a/ens/mq/iq-algop.tex 45 lud on 1 October 2013

Page 55: 2_pdfsam_iq.pdf

4.4. Classes of operators in B(H)

in the previous section, have a counterpart in the context of this representa-tion. But additionally, B(H) is a very special C∗-algebra because is closed forthe weak operator topology (defined in §4.3). This fact endows B(H) with avery rich family of projections allowing to generate3 back the unital C∗-algebraB(H).

4.4.1 Self-adjoint and positive operators

Definition 4.4.1. An operator T ∈B(H) is called self-adjoint or Hermitean4 ifT = T ∗. The set of Hermitean operators onH is denoted by Bh(H).

Exercise 4.4.2. The operator T ∈B(H) is self-adjoint if and only if ⟨ f |T f ⟩ ∈ Rfor all f ∈H. (Hint: use the polarisation equality ??.)

Exercise 4.4.3. If T ∈B(H) is self-adjoint then ‖T ‖ = sup⟨ f |T f ⟩, f ∈H,‖ f ‖ ≤1.

Definition 4.4.4. An operator T ∈ B(H) is called positive if ⟨ f |T f ⟩ ≥ 0 for allf ∈H. Such an operator is necessarily self-adjoint. We denote by B+(H) the setof positive operators.

Exercise 4.4.5. Show that T ∈ B+(H) if and only if there exists S ∈ B(H) suchthat T = S∗S.

4.4.2 Projections

Definition 4.4.6. Let P,P1,P2 ∈B(H).

1. P is a projection if P 2 = P .

2. P is an orthoprojection if is a projection satisfying further P∗ = P .

3. Two orthoprojections P1,P2 ∈ B(H) are orthogonal, denoted P1 ⊥ P2 oftheir images are orthogonal subspaces ofH (equivalently P1P2 = 0).

3In some general unital C∗-algebras there are only two trivial projections 0 and 1. Thereforethe situation arising in B(H) is far from being a general property of C∗-algebras.

4Strictly speaking, the term Hermitean is more general; it applies also to unbounded op-erators and it means self-adjoint on a dense domain. The two terms coincide for boundedoperators.

/Users/dp/a/ens/mq/iq-algop.tex 46 lud on 1 October 2013

Page 56: 2_pdfsam_iq.pdf

Projections are necessarily positive (why?). The set of orhoprojections is de-noted by P(H). All projections considered henceforth will be orthoprojections.

Exercise 4.4.7. (A very important one!) Let (Pn) be a sequence of orthoprojec-tions. We have already shown that there is a bijection between P(H) and the setof closed subspaces of H and orthoprojections, given by P(H) 3 P 7→ P (H) ⊂H,with P (H) closed.

1. Show that that P(H) is partially ordered, i.e. P1 ≤ P2 if P1(H) subspace ofP2(H) (equivalently P1P2 = P1.)

2. For general orthoprojections P1 and P2, is P1P2 an orthoprojection?

3. Show that P1 and P2 have a least upper bound.

4. Is Q = P1 + . . .+Pn an orthoprojection?

5. Is Q = P2 −P1 an orthoprojection?

6. Show that a monotone sequence of orthoprojections converges stronglytowards an orthoprojection.

4.4.3 Unitary operators

Definition 4.4.8. An operator U ∈B(H) is unitary if U∗U =UU∗ = 1 . The setof unitary operators is denoted by U(H) = U ∈B(H) : U∗U =UU∗ = 1 (it is infact a group; forH=Cn it is the Lie group denoted by U (n).)

4.4.4 Isometries and partial isometries

Definition 4.4.9. An operator T ∈ B(H1,H2) is an isometry if T ∗T = 1 (orequivalently ‖T f ‖ = ‖ f ‖, for all f ∈H1.)

Exercise 4.4.10. LetH= `2(N) and for x = (x1, x2, x3, . . .) ∈H, define the left andright shifts by

Lx = (x2, x3, . . .) ∈H,

andRx = (0, x1, x2, x3, . . .) ∈H.

/Users/dp/a/ens/mq/iq-algop.tex 47 lud on 1 October 2013

Page 57: 2_pdfsam_iq.pdf

4.4. Classes of operators in B(H)

1. Show that R∗ = L.

2. Show that R is an isometry.

3. Determine RanR.

This exercise demonstrates that, in infinite dimensional spaces, isometriesare not necessarily surjective.

Theorem 4.4.11. For T ∈ B(H1,H2), the five following conditions are equiva-lent:

1. (T ∗T )2 = T ∗T ,

2. (T T ∗)2 = T T ∗,

3. T T ∗T = T ,

4. T ∗T T ∗ = T ∗,

5. there exist closed subspaces E1 ⊆ H1 and E2 ⊆ H2 such that T = I S Pwhere P :H1 → E1 is a projection, S : E1 → E2 an isometry, and I : E2 →H2

the inclusion map.

If one (hence all) condition holds then T ∗T is the projection H1 → E1 and T T ∗

is the projection H2 → E2. In this situation T is called a partial isometry withinitial space E1, initial projection T ∗T , final space E2, and final projection T T ∗.

Proof. Exercise! (See [2] or [26].) ä

Exercise 4.4.12. Let (Ω,F ,P) be a probability space and T :Ω→Ω a measurepreserving transformation i.e. P(T −1B) = P(B) for all B ∈ F . On the HilbertspaceH= L2(Ω,F ,P) define U :H→H by U f (ω) = f (T −1ω).

1. Show that U is a partial isometry.

2. Under which condition is U surjective (hence unitary)?

/Users/dp/a/ens/mq/iq-algop.tex 48 lud on 1 October 2013

Page 58: 2_pdfsam_iq.pdf

4.4.5 Normal operators

Definition 4.4.13. An operator T ∈ B(H) is normal if T ∗T = T T ∗ (or equiva-lently if ‖T ∗ f ‖ = ‖T f ‖ for all f ∈H.)

Exercise 4.4.14. A vector f ∈H \ 0 is called an eigenvector corresponding toan eigenvalueλ of an operator T ∈B(H) if T f =λ f for some λ ∈C. Show that ifT is normal and f1, f2 are eigenvectors corresponding to different eigenvaluesthen f1 ⊥ f2. (The proof goes as for the finite dimensional case.)

Exercise 4.4.15. Let M be the multiplication operator on L2[0,1] defined byM f (t ) = t f (t ), t ∈ [0,1]. Show that

1. M is self-adjoint (hence normal),

2. M has no eigenvectors.

Exercise 4.4.16. Choose some z ∈ C with |z| < 1 and consider ζ ∈ `2(N) givenby ζ= (1, z, z2, z3, . . .). Let L and R be the left and right shifts defined in exercise4.4.10.

1. Show that R is not normal,

2. compute R∗ζ,

3. conclude that R∗ has uncountably many eigenvalues.

4.5 States on algebras, GNS construction, represen-tations

Paragraphe incomplet.

Definition 4.5.1. Let A be an involutive Banach algebra. A representation ona Hilbert space H of A is a ∗-homomorphism of A into B(H), i.e. a linear mapπ : A→B(H) such that

1. π(ab) =π(a)π(b),∀a,b ∈A,

2. π(a∗) =π(a)∗,∀a ∈A,

/Users/dp/a/ens/mq/iq-algop.tex 49 lud on 1 October 2013

Page 59: 2_pdfsam_iq.pdf

4.5. States on algebras, GNS construction, representations

The space H is called the representation space. We write (π,H), or Hπ if nec-essary. Two representations (π1,H1) and (π2,H2) are said to be unitarily equiv-alent if there exists an isometry U : H1 → H2 such that for all a ∈ A, it holdsUπ1(a)U∗ = π2(a). If moreover for every non zero element of A, π(a) 6= 0, thenthe representation is called faithful.

Theorem 4.5.2 (Gel’fand-Naïmark). If A is an arbitrary C∗-algebra, there existsa Hilbert spaceH and a linear mapping π : A→B(H) that is a faithful represen-tation of A.

Proof: It can be found in [17, theorem 4.5.6, page 281]. ä

/Users/dp/a/ens/mq/iq-algop.tex 50 lud on 1 October 2013

Page 60: 2_pdfsam_iq.pdf

5Spectral theory in Banach algebras

5.1 Motivation

In linear algebra one often encounters systems of linear equations of the type

T f = g (5.1)

with f , g ∈Cn and T = (ti , j )i , j=1,...,n a n×n matrix with complex coefficients. El-ementary linear algebra establishes that this system of equations has solutionsprovided that the map f 7→ T f is surjective and the solution is unique providedthat this map is injective. Thus the system has a unique solution for each g ∈Cn

provided that the map is bijective, or equivalently the matrix T is invertible.This happens precisely when detT 6= 0. However, this criterion of invertibil-ity is of limited practical use even for the elementary (finite-dimensional) casebecause det is too complicated an object to be efficiently computed for largen. For infinite dimensional cases, this criterion becomes totally useless sincethere is no infinite dimensional analogue of det that discriminates between in-vertible and non-invertible operators T (see exercise 5.1.1 below!)

Another general issue connected with the system (5.1) is that of eigenval-ues. For every λ ∈ C, denote by Vλ = f ∈ Cn : T f = λ f . For most choices of λ,

51

Page 61: 2_pdfsam_iq.pdf

5.1. Motivation

the subspace Vλ is the trivial subspace 0; this subspace is not trivial only whenT −λ1 is not injective (i.e. ker(T −λ1 ) 6= 0.) On defining the spectrum of T by

spec(T ) = λ ∈C : T −λ1 is not invertible ,

one easily shows that spec(T ) 6= ; and cardspec(T ) ≤ n (why?) Not always thefamily (Vλ)λ∈spec(T ) spans the whole space Cn . When it does, on decomposingg = g (1) + . . .+ g (k) where g ( j ) ∈ Vλ j and spec(T ) = λ1, . . . ,λk , the solution of(5.1) is given by

f = g (1)

λ1+ . . .+ g (k)

λk.

(Notice that λi 6= 0, for all i = 1, . . . ,k; why?) When the family (Vλ)λ∈spec(T ) doesnot spanCn , the problem is more involved but the rôle of the spectrum remainsfundamental.

A final issue involving the spectrum of T is the functional calculus associ-ated with T . If p ∈ R[t ], this polynomial can be naturally extended on B(H).In fact, if p(t ) = an t n + . . . a0 is the expression of the polynomial p; the expres-sion p(T ) = anT n + . . . a01 is well defined for all T ∈ B(H). Moreover, if T ∈Bh(H) then p(T ) ∈Bh(H). Suppose now that T ∈Bh(H), m = inf‖ f ‖=1 ⟨ f |T f ⟩,M = sup‖ f ‖=1 ⟨ f |T f ⟩, and p(t ) ≥ 0 for all t ∈ [m, M ]; then p(T ) ∈ B+(H). Nowevery f ∈ C [m, M ] can be uniformly approximated by polynomials, i.e. thereis a sequence (pl )l∈N, with pl ∈ R[t ] such that for all ε > 0, there exists n0 ∈ Nsuch that for l ≥ n0, maxt∈[m,M ] | f (t )− pl (t )| < ε. It is natural then to definef (T ) = liml pl (T ). However, the computations involved in the right hand sideof this equation can be very complicated. Suppose henceforth that H = Cn

and T is a Hermitean n ×n matrix that is diagonalisable, i.e. T = U DU∗ with

D =

λ1. . .

λn

and U unitary. Then pl (T ) = Upl (D)U∗ and letting l →∞

we get f (T ) =U f (D)U∗. Thus, if T is diagonalisable, the computation of f (T ) isequivalent to the knowledge of f (t ) for t ∈ spec(T ). For the infinite dimensionalcase, the problem is more involved but again the spectrum remains fundamen-tal.

The rest of this chapter, based on [2], is devoted to the appropriate general-isation of the spectrum for infinite dimensional operators.

Exercise 5.1.1. (Infinite-dimensional determinant) Let H= `2(N) and (tn)n∈Nbe a fixed numerical sequence. Suppose that there exist constants K1,K2 > 0such that 0 < K1 ≤ tn ≤ K2 <∞ for all n ∈N. For every x ∈ `2(N) define (T x)n =tn xn ,n ∈N.

/Users/dp/a/ens/mq/iq-sptba.tex 52 lud on 1 October 2013

Page 62: 2_pdfsam_iq.pdf

1. Show that T ∈B(H).

2. Exhibit a bounded operator S onH such that ST = T S = 1 .

3. Assume henceforth that (tn)n∈N is a monotone sequence. Let ∆n(T ) =t1 · · · tn . Show that ∆n(T ) converges to a non-zero limit ∆(T ) if and only if∑

n(1− tn) <∞.

4. Any plausible generalisation, δ, of det in the infinite dimensional settingshould verify δ(1 ) = 1, δ(AB) = δ(A)δ(B), and if T is diagonal δ(T ) =∆(T ).Choosing tn = n

n+1 , for n ∈ N, conclude that although T is diagonal andinvertible, nevetheless has δ(T ) = 0.

5.2 The spectrum of an operator acting on a Banachspace

Let V be a C-Banach space. Denote by B(V) the set of bounded operators T :V→ V. This space is itself a unital Banach algebra for the induced operatornorm.

Exercise 5.2.1. If X and Y are metric spaces and dX and dY denote their re-spective metrics

1. verify that

dp ((x1, y1), (x2, y2)) = (dX(x1, x2)p +dY(y1, y2)p )1/p ,

with p ∈ [1,∞[ and

d∞((x1, y1), (x2, y2)) = max(dX(x1, x2),dY(y1, y2))

are metrics on X×Y; (the corresponding metric space (X×Y,dp ), p ∈[1,∞] is denoted1 X⊕Y)

2. show that the sequence (xn , yn)n in X×Y converges to a point (ξ,ψ) ∈X×Ywith respect to any of the metrics dp if and only if dX(xn ,ξ) → 0 anddY(yn ,ψ) → 0.

1more precisely X⊕`p Y.

/Users/dp/a/ens/mq/iq-sptba.tex 53 lud on 1 October 2013

Page 63: 2_pdfsam_iq.pdf

5.2. The spectrum of an operator acting on a Banach space

Exercise 5.2.2. Let X and Y be metric spaces and f : X→ Y be a continuousmap. We denote by

Γ( f ) = (x, f (x)) : x ∈X

the graph of f . Show that Γ( f ) is closed (i.e. if (xn)n is a sequence in X andif there exists (x, y) ∈ X×Y such that xn → x and f (xn) → y , then necessarilyy = f (x).)

Exercise 5.2.3. (The closed graph theorem) SupposeX andY are Banach spacesand T :X→Y a linear map having closed graph. Show that T is continuous.

Theorem 5.2.4. For every T ∈B(V), the following are equivalent:

1. for every y ∈V there is a unique x ∈V such that T x = y,

2. there is an operator S ∈B(V) such that ST = T S = 1 .

Proof: Only the part 1 ⇒ 2 is not trivial to show. Condition 1 implies that T isinvertible; call S its inverse. The only thing to show is the boundedness of S. Asa subset ofV⊕V, the graph of S is related to the graph of T . In fact

Γ(S) = (y,Sy) : y ∈V = (T x, x), x ∈V.

Now T is bounded, hence continuous, so that that the set (T x, x), x ∈ V isclosed (see exercise 5.2.2.) Thus the graph of S is closed, and by the closedgraph theorem (see exercise 5.2.3), S is continuous hence bounded. äDefinition 5.2.5. Let T ∈B(V) where V is a Banach space.

1. T is called invertible if there exists an operator S ∈ B(V) such that ST =T S = 1 .

2. The spectrum of T , denoted by spec(T ), is defined by

spec(T ) = λ ∈C : T −λ1 is not invertible.

3. The resolvent set of T , denoted by Res(T ), is defined by

Res(T ) =C\ spec(T ).

Notice that in finite dimension, invertibility of an operator R reduces es-sentially to injectivity of R since surjectivity of R can be trivially verified if wereduce the space V into Ran(R). In infinite dimension, several things can go

/Users/dp/a/ens/mq/iq-sptba.tex 54 lud on 1 October 2013

Page 64: 2_pdfsam_iq.pdf

wrong: of course injectivity may fail as in finite dimension; but a new phe-nomenon can appear when Ran(R) is not closed: in this latter case, Ran(R) canfurther be dense in V or fail to be dense in V . All these situations may occurand correspond to different types of sub-spectra.

Definition 5.2.6. Let T ∈B(V) where V is a Banach space.

1. The point spectrum of T is defined by

specp

(T ) = λ ∈C : T −λ1 is not injective.

Every λ ∈ specp (T ) is called an eigenvalue of T .

2. The continuous spectrum, specc (T ), of T is defined as the set of complexvalues λ such that T −λ1 is injective but not surjective and Ran(T −λ1 )is dense in V.

3. The residual spectrum, specr (T ), of T is defined as the complex valuesλ such that T −λ1 is injective but not surjective and Ran(T −λ1 ) is notdense in V.

Example 5.2.7. Let V be a finite dimensional Banach space and T : V→ V alinear transformation (hence bounded.) Since dimker(T −λ1 )+dimRan(T −λ1 ) = dimV, it follows that T −λ1 is injective if and only if Ran(T −λ1 ) = V.Therefore specr (T ) =;. Further, if T −λ1 is injective, then it has an inverse onV. Since any linear transformation of a finite dimensional space is continuous,it follows that (T −λ1 )−1 is continuous, hence specc (T ) =;. Therefore, in finitedimension we always have spec(T ) = specp (T ).

Exercise 5.2.8. LetV= `2(N) and consider the right shift, R, on V.

1. Show that R −λ1 is injective for all λ ∈C. Conclude that specp (R) =;.

2. Show that for |λ| > 1, Ran(R−λ1 ) =V. Conclude that all λ ∈Cwith |λ| > 1belong to Res(R).

3. For |λ| < 1, show that Ran(R−λ1 ) is orthogonal to the vectorΛ= (1,λ,λ2, . . .).Show that for |λ| < 1, Ran(R−λ1 ) = y ∈V : y ⊥Λ. Conclude that allλ ∈Cwith |λ| < 1 belong to specr (R).

4. The case |λ| = 1 is the most difficult. Try to show that Ran(R−λ1 ) is densein V so that the unit circle coincides with specc (R).

/Users/dp/a/ens/mq/iq-sptba.tex 55 lud on 1 October 2013

Page 65: 2_pdfsam_iq.pdf

5.3. The spectrum of an element of a Banach algebra

5.3 The spectrum of an element of a Banach algebra

In the previous section we studied spectra of bounded operators acting on Ba-nach spaces. They form a Banach algebra with unit. Spectral theory can beestablished also abstractly on Banach algebras. Before stating spectral proper-ties, it is instructive to give some more examples.

Example 5.3.1. Let Cc (R) be the set of continuous functions onRwhich vanishoutside a bounded interval; it is a normed vector space (with respect to the L1

norm for instance; its completion is the Banach space L1(R,λ), where λ standsfor the Lebesgue measure.) A product can be defined by the convolution

f ? g (x) =∫R

f (y)g (x − y)λ(d y)

turning this space into a commutative Banach algebra. This algebra is not uni-tal (this can be seen by solving the equation f ? f = f in L1), but it has an ap-proximate unit (i.e. a sequence ( fn)n of integrable functions with ‖ fn‖ = 1 forall n and such that for all g ∈ L1(R), ‖g ? fn − g‖→ 0. (Give an explicit exampleof such an approximate unit!)

Example 5.3.2. The algebra Mn(C) is a unital non-commutative algebra. Thereare many norms that turn it into a finite-dimensional Banach algebra, for in-stance:

1. ‖A‖ =∑ni , j=1 |ai , j |

2. ‖A‖ = sup‖x‖≤1‖Ax‖‖x‖ .

Definition 5.3.3. Let A be a unital Banach algebra. (We can always assumethat ‖1 ‖ = 1, may be after re-norming the elements of A.) An element a ∈ A iscalled invertible if there is an element b ∈ A such that ab = ba = 1 . The set ofall invertible elements of A is denoted by GL(A) and called the general lineargroup of invertible elements of A.

Theorem 5.3.4. Let A be a unital Banach algebra. If a ∈ A and ‖a‖ < 1 then1 −a is invertible and

(1 −a)−1 =∞∑

n=0an .

Moreover,

‖(1 −a)−1‖ ≤ 1

1−‖a‖

/Users/dp/a/ens/mq/iq-sptba.tex 56 lud on 1 October 2013

Page 66: 2_pdfsam_iq.pdf

and

‖1 − (1 −a)−1‖ ≤ ‖a‖1−‖a‖ .

Proof: Since ‖an‖ ≤ ‖a‖n for all n, we can define b ∈ A as the sum of the abso-lutely convergent series b =∑∞

n=0 an . Moreover, b(1−a) = (1−a)b = limN→∞∑N

n=0 bn =limN→∞(1 −bN+1) = 1 . Hence 1 −a is invertible and (1 −a)−1 = b. The first ma-jorisation holds because ‖b‖ ≤∑∞

n=0 ‖a‖n = 11−‖a‖ . The second one follows from

remarking that 1 −b =−∑∞n=1 an =−ab, hence ‖1 −b‖ ≤ ‖a‖‖b‖. ä

Exercise 5.3.5. 1. Prove that GL(A) is an open set in A and that the map-ping a 7→ a−1 is continuous on GL(A).

2. Justify the term “general linear group” of invertible elements, i.e. showthat GL(A) is a topological group in the relative norm topology.

Definition 5.3.6. Let A be a unital Banach algebra. For every a ∈ A, the spec-trum of a is the set

spec(a) = λ ∈C : a −λ1 6∈GL(A).

In the rest of this section, A will be a unital algebra and we shall write a −λinstead of a −λ1 .

Proposition 5.3.7. For every a ∈ A, the set spec(a) is a closed subset of the diskλ ∈C : |λ| ≤ ‖a‖.

Proof: Consider the resolvent set

Res(a) = λ ∈C : a −λ ∈GL(A) =C\ spec(a).

Since the set GL(A) is open (see exercise 5.3.5) and the map C 3 λ 7→ a −λ ∈ A

continuous, the set Res(a) is open hence the set spec(a) is closed. Moreover,if |λ| > ‖a‖, on writing a −λ = (−λ)[1−a/λ] and remarking that ‖a/λ‖ < 1, weconclude that a −λ ∈GL(A). äTheorem 5.3.8. For every a ∈A, the set spec(a) is non-empty.

Proof: Fix some λ0 ∈ Res(a). Since Res(a) is open, there is a small neighbour-hood Vλ of λ0 contained in Res(a). The A-valued function λ 7→ (a −λ)−1 is welldefined for all λ ∈ Vλ′ . Moreover, for λ,λ0 ∈Res(a),

(a −λ)−1 − (a −λ0)−1 = (a −λ)−1[(a −λ0)− (a −λ)](a −λ0)−1

= (λ−λ0)(a −λ)−1(a −λ0)−1.

/Users/dp/a/ens/mq/iq-sptba.tex 57 lud on 1 October 2013

Page 67: 2_pdfsam_iq.pdf

5.4. Relation between diagonalisability and the spectrum

Thus

limλ→λ0

1

λ−λ0[(a −λ)− (a −λ0)] = (a −λ0)−2.

Assume now that spec(a) = ; and choose an arbitrary bounded linear func-tional φ : A → C. Then, the scalar function f : C→ C defined by λ 7→ f (λ) =φ((a −λ)−1) is defined on the whole C. By linearity, the function f has every-where a complex derivative, satisfying f ′(λ) = φ((a −λ)−2). Thus f is an entirefunction. Notice moreover that f is bounded and for |λ| > ‖a‖, by theorem5.3.4,

‖(a −λ)−1‖ = ‖(1−a/λ)−1‖|λ|

≤ 1

|λ|(1−‖a‖/|λ|)= 1

|λ|−‖a‖ .

Thus limλ→∞ f (λ) = 0 and since this function is bounded and entire, by Liou-ville’s theorem (see [?] for instance), it is constant, hence f (λ) = 0 for all λ ∈ Cand every linear functional φ. The Hahn-Banach theorem implies then that(a −λ)−1 = 0 for all λ ∈ C. But this is absurd because (a −λ) is invertible and1 6= 0 in A. äDefinition 5.3.9. For every a ∈A, the spectral radius of a is defined by r (a) =sup|λ| :λ ∈ spec(a).

Exercise 5.3.10. 1. Let p ∈R[t ] and a ∈A. Show that p(spec(a)) ⊆ spec(p(a)).(Hint: if λ ∈ spec(a), the map λ′ 7→ p(λ′)−p(λ) is a polynomial vanishingat λ′ =λ. Conclude that p(a)−p(λ) cannot be invertible.)

2. For every a ∈A show that r (a) = limn→∞ ‖an‖1/n .

5.4 Relation between diagonalisability and the spec-trum

Motivated again by elementary linear algebra, we recall that a self-adjoint n ×n matrix T can be diagonalised, i.e. it is possible to find a diagonal matrix

D =

d1. . .

dn

and a unitary matrix U such that T =U DU∗; we have then

/Users/dp/a/ens/mq/iq-sptba.tex 58 lud on 1 October 2013

Page 68: 2_pdfsam_iq.pdf

spec(T ) = d1, . . . ,dn. We shall generalise this result to infinite dimensionalspaces.

An orthonormal basis forH is a sequence E = (e1,e2, . . .) of mutually orthog-onal unit vectors of H such that2 spanE =H. On fixing such a basis, we define aunitary operator U : `2(N) →H by

U f = ∑i∈N

fi ei

for f = ( f1, f2, . . .). Specifying a particular orthonormal basis in H is equiva-lent to specifying a particular unitary operator U . Suppose now that T ∈B(H)is a normal operator and admits the basis vectors of E as eigenvectors, i.e.Tek = tk rk , tk ∈ C, k ∈ N. Then t = (tk )k ∈ `∞(N) and U∗TU = M where Mis the multiplication operator defined by (M f )k = (U∗TU f )k = (U−1TU f )k =(U−1T

∑i fi ei )k = fk tk . Thus an operator T on H is diagonalisable in a given

basis E if the unitary operator associated with E implements an equivalencebetween T and a multiplication operator M acting on `2(N). This notion is stillinadequate since it involves only normal operators with pure point spectrum;it can nevertheless be appropriately generalised.

Definition 5.4.1. An operator T acting on a Hilbert space H is said diagonal-isable if there exist a (necessarily separable) σ-finite measure space (Ω,F ,µ), afunction m ∈ L∞(Ω,F ,µ), and a unitary operator U : L2(Ω,F ,µ) →H such that

U Mm = TU

where Mm denotes the multiplication operator by m, defined by Mm f (ω) =m(ω) f (ω), for all ω ∈Ω and all f ∈ L2(Ω,F ,µ)

Example 5.4.2. Let H = L2([0,1]) and T : H→ H defined by T f (t ) = t f (t ), fort ∈ [0,1] and f ∈H. This operator is diagonalisable since it is already a multipli-cation operator.

Notice that a diagonalisable operator is always normal because the multi-plication operator is normal. The following theorem asserts the converse.

Theorem 5.4.3. Every normal operator acting on a Hilbert space is diagonalis-able.

Proof: Long but without any particular difficulty; it can be found in [2], pp. 52–55. ä

Le reste du chapitre doit être re-écrit.

2Recall thatH is always considered separable.

/Users/dp/a/ens/mq/iq-sptba.tex 59 lud on 1 October 2013

Page 69: 2_pdfsam_iq.pdf

5.5. Spectral measures and functional calculus

5.5 Spectral measures and functional calculus

Start again from some heuristic ideas. Let (Ω,F ,µ) be a probability space andf : Ω→ R a bounded measurable function. Standard integration theory statesthat f can be approximated by simple functions. More precisely, for every ε> 0,there exists a finite family (Ei )i of disjoint measurable sets Ei ∈ F and a finitefamily of real numbers (αi )i such that | f (ω)−∑

i αi 1 Ei (ω)| < ε for all ω ∈Ω. It isinstructive to recall the main idea of the proof of this elementary result.

m

M

Ij

f (I )j!1

Figure 5.1: The approximation of a bounded measurable function by simplefunctions

Let m = inf f (ω), M = sup f (ω), and subdivide the interval [m, M ] into afinite family of disjoint intervals (I j ) j , with |I j | < ε (see figure 5.1.) For each j ,select an arbitrary α j ∈ I j ; in the subset f −1(I j ) ∈ F , the values of f lie withinε from α j . Therefore, we get the desired result by setting E j = f −1(I j ). If forevery Borel set B ∈B(R), we define P (B) = 1 f −1(B) (this is a function-valued set

/Users/dp/a/ens/mq/iq-sptba.tex 60 lud on 1 October 2013

Page 70: 2_pdfsam_iq.pdf

function!), the approximation result can be rewritten as

| f (ω)−∑jα j P (I j )(ω)| < ε, ∀ω ∈Ω.

Now, P is set function (a measure actually) and the sum∑

j α j P (I j ) tends (insome sense3) to

∫αP (dα). More precisely, the function f is equivalent to a

deterministic stochastic kernel K := K f from (Ω,F ) to (R,B(R)), defined by theformula

Ω×B(R) 3 (ω,B) 7→ K (ω,B) = ε f (·)(B) = 1 f −1(B)(·).

The action of the kernel on µ is

(µK f )(B) :=∫Ωµ(dω)K f (ω,B) =µ f (B),

where µ f denotes the law of f . Conversely, the kernel acts on measurable pos-itive functions g defined on the real axis by

(K f g )(ω) :=∫R

K f (ω,d x)g (x) = g ( f (ω)).

In particular, if g = id then

(K f id)(ω) =∫R

K f (ω,d x)x = f (ω).

Summarising the heuristics developed so far: the approximability of a real-valued, bounded, measurable function f by simple functions can be expressedby writing f = ∫

αP (dα), where P is the function-valued measure

B(R) 3 B 7→ P (B) = 1 f −1(B)

with the following properties:

1. P is idempotent: i.e. P (B)2 = P (B) for all B ∈B(R),

2. P is multiplicative: i.e. P (B ∩C ) = P (B)P (C ) for all B ,C ∈B(R),

3. P is supported by Ran( f ): i.e. P (B) ≡ 0 for all B ∈ B(R) such that B ∩Ran( f ) =;.

3As a matter of fact, it is possible to construct a descent theory of integration in which∫αP (dα) acquires a precise meaning.

/Users/dp/a/ens/mq/iq-sptba.tex 61 lud on 1 October 2013

Page 71: 2_pdfsam_iq.pdf

5.5. Spectral measures and functional calculus

The measure P reflects the properties of f ; it is called the spectral measure off .

In the non-commutative setting, the analogue of a bounded, real-valued,measurable function is a bounded Hermitean operator on H. Idempotence,characterising indicators in the commutative case, is verified by projections be-longing to P(H). Hence, we are seeking approximations of bounded Hermiteanoperators by complex finite combinations of projections. Now we can turn intoprecise definitions.

Definition 5.5.1. Let (X,F ) be a measurable space and H a Hilbert space. Afunction P : F →P(H) is called a spectral measure on (X,F ) if

1. P (X) = 1 ,

2. if (Fn)n∈N is a sequence of disjoint elements in F , then P (tn∈NFn) =∑n∈NP (Fn).

Example 5.5.2. Let (X,F ,µ) be a probability space and H= L2(X,F ,µ). Thenthe mapping F 3 F 7→ P (F ) ∈ P(H), defined by P (F ) f = 1 F f for all f ∈ H, is aspectral measure.

Exercise 5.5.3. If P is a spectral measure on (X,F ), then P (;) = 0 and P isfinitely disjointly additive.

Theorem 5.5.4. Let (X,F ) be a measurable space and H a Hilbert space. If Pis a finitely disjointly additive function F → P(H) such that P (X) = 1 then (forF,G ∈F )

1. P is monotone: F ⊆G ⇒ P (F ) ≤ P (G),

2. P is subtractive: F ⊆G ⇒ P (G \ F ) = P (G)−P (F ),

3. P is modular: P (F ∪G)+P (F ∩G) = P (F )+P (G),

4. P is multiplicative: P (F ∩G) = P (F )P (G).

Proof: The statements 1 and 2 are immediate by noticing that F1 ⊆ F2 ⇒ F2 =F1 t (F2 \ F1).

3) Since F ∪G = (F \G)t (F ∩G)t (G \ F ) we have: P (F ∪G)+P (F ∩G) = [P (F \G)+P (F ∩G)]+ [P (G \ F )+P (G ∩F )] = P (F )+P (G).

4) By 1)P (F ∩G) ≤ P (F ) ≤ P (F ∪G). (∗)

/Users/dp/a/ens/mq/iq-sptba.tex 62 lud on 1 October 2013

Page 72: 2_pdfsam_iq.pdf

Multiplying the first inequality of (*) by P (F ∩G), we get P (F ∩G) ≤ P (F )P (F ∩G) and since P (F ) ≤ 1 , the right hand side of the latter inequality is boundedfurther by P (F ∩G). Hence P (F )P (F ∩G) = P (F ∩G). Similarly, multiplying thesecond inequality of (*) by P (F ) and since again P (F ∪G) ≤ 1 , we get P (F )P (F ∪G) = P (F ). Adding the thus obtained equalities, we get:

P (F )[P (F ∪G)+P (F ∩G)] = P (F ∩G)+P (F )

and we conclude by modularity. äExercise 5.5.5. Show that for all F,G ∈F ,

1. P (F ) is an orthoprojection, and

2. we have [P (F ),P (G)] = 0.

Theorem 5.5.6. Let (X,F ) be a measurable space andH a Hilbert space. A mapP : F →P(H) is a spectral measure if and only if

1. P (X) = 1 , and

2. for all f , g ∈H, the set function µ f ,g : F →C, defined by

µ f ,g (F ) = ⟨ f |P (F )g ⟩,F ∈F ,

is countably additive.

Proof:

(⇒): If P is a spectral measure, then statements 1 and 2 hold trivially.

(⇐): Suppose, conversely, that 1 and 2 hold. If F ∩G =; then ⟨ f |P (F ∪G)g ⟩ =⟨ f |P (F )g ⟩ + ⟨ f |P (G)g ⟩ = ⟨ f | [P (F )+P (G)]g ⟩, hence P is finitely addi-tive (hence multiplicative). Let now (Fn)n be a sequence of disjoint setsin F . Multiplicativity of P implies (P (Fn))n is a sequence of orthogo-nal projections and hence (P (Fn)g )n a sequence of orthogonal vectorsfor any g ∈H. Let F =∪nFn . Hence, for all f , g ∈H, we have: ⟨ f |P (F )g ⟩ =⟨ f | ∑n P (Fn)g ⟩, due to the countable additivity property of µ f ,g . We aretempted to conclude that P (F ) =∑

n P (Fn). Yet, it may happen that∑

n P (Fn)does not make any sense because weak convergence does not imply con-vergence in the operator norm. However,

∑n ‖P (Fn)g‖2 =∑

n ⟨g |P (Fn)g ⟩ =⟨g |P (F )g ⟩ = ‖P (F )g‖2. It follows that the sequence (P ( fn)g )n is summable.If we write

∑n P ( fn)g = T g , it defines a bounded operator T coinciding

with P (F ).

/Users/dp/a/ens/mq/iq-sptba.tex 63 lud on 1 October 2013

Page 73: 2_pdfsam_iq.pdf

5.5. Spectral measures and functional calculus

äNotation 5.5.7. Let (X,F ) be a measurable space and F :X→C. We denote by‖F‖ ≡ sup|F (x)| : x ∈X, and B(X) = F :X→C | measurable, ‖F‖ <∞.

Henceforth, the Hilbert space H will be fixed and B(H) (respectively P(H))will denote as usual the set of bounded operators (respectively projections) onH.

Theorem 5.5.8. Let (X,F ) be a measurable space and H a Hilbert space. If P isa spectral measure on (X,F ) and F ∈ B(X), then there exists a unique operatorTF ∈B(H) such that

⟨ f |TF g ⟩ =∫X

F (x)⟨ f |P (d x)g ⟩,

for all f , g ∈H. We write TF = ∫XF (x)P (d x).

Proof: The boundedness of F implies that the right hand side of the integralgives rise to a well-defined sesquilinear functionalφ( f , g ) = ∫

XF (x)⟨ f |P (d x)g ⟩,for f , g ∈ H. Moreover, |φ( f , f )| ≤ ∫

X |F (x)|‖P (d x) f ‖2 ≤ ‖F‖‖ f ‖2, hence thefunctional φ is bounded. Existence and uniqueness of TF follows from theRiesz-Fréchet theorem. äTheorem 5.5.9 (Spectral decomposition theorem). If T ∈Bh(H) then there ex-ists a spectral measure on (C,B(C)), supported by spec(T ) ⊆R, such that

T =∫spec(T )

λP (λ).

Proof: Let p ∈ R[t ] and f , g ∈ H be two arbitrary vectors. Denote by L f ,g (p) =⟨ f |p(T )g ⟩. Then |L f ,g (p)| ≤ ‖p(T )‖‖ f ‖‖g‖ and since p(T ) ∈ B(H) we havealso ‖p(T )‖ = sup|p(λ)| : λ ∈ spec(T ) (exercise!). Since spec(T ) is a boundedset, ‖p(T )‖ <∞ for all p ∈ R[t ]. Hence the linear functional L f ,g is a boundedlinear functional on R[t ]. By Riesz-Fréchet theorem, there exists consequentlya unique complex measure µ f ,g , supported by spec(T ), such that

L f ,g (p) ≡ ⟨ f |p(T )g ⟩ =∫spec(T )

p(λ)µ f ,g (dλ),

for all p ∈R[t ], verifying |µ f ,g (B)| ≤ ‖ f ‖‖g‖, for all B ∈B(C). Using the unique-ness of µ f ,g , it is immediate to show that for every B ∈B(C), SB ( f , g ) = µ f ,g (B)is a sesquilinear form. Now, |SB ( f , g )| = |µ f ,g (B)| ≤ ‖ f ‖‖g‖, for all B . Hence the

/Users/dp/a/ens/mq/iq-sptba.tex 64 lud on 1 October 2013

Page 74: 2_pdfsam_iq.pdf

sesquilinear form is bounded; therefore, there exists an operator P (B) ∈Bh(H)such that SB ( f , g ) = ⟨ f |P (B)g ⟩ for all f , g ∈ H. Recall that neither µ f ,g , norSB , nor P depend on the initially chosen polynomial p. Choosing p0(λ) = 1, weget

∫spec(T ) ⟨ f |P (dλ)g ⟩ = ⟨ f |P (spec(T ))g ⟩ = ⟨ f |g ⟩ and choosing p1(λ) =λ, we

get∫spec(T ) ⟨ f |λP (dλ)g ⟩ = ⟨ f |T g ⟩, for all f , g ∈H. To complete the proof, it re-

mains to show that P is a projection-valued measure. It is enough to show themultiplicativity property. For any fixed pair f , g ∈ H and any fixed real poly-nomial q , introduce the auxiliary complex measure ν(B) = ∫

B q(λ)⟨ f |P (dλ)g ⟩,with B ∈B(C). For every real polynomial p, we have∫

p(λ)ν(dλ) =∫

p(λ)q(λ)⟨ f |P (dλ)g ⟩= ⟨ f |P (p(T )q(T )g ⟩= ⟨q(T ) f |p(T )g ⟩=

∫p(λ)⟨q(T ) f |P (dλ)g ⟩.

Therefore,

ν(B) =∫

q(λ)1 B (λ)⟨ f |P (dλ)g ⟩= ⟨q(T ) f |P (B)g ⟩= ⟨ f |q(T )P (B)g ⟩=

∫q(λ)⟨ f |P (dλ)P (B)g ⟩.

Since q is arbitrary,

⟨ f |P (B ∩C )g ⟩ =∫

C⟨ f |P (dλ)P (B)g ⟩

= ⟨ f |P (B)P (C )g ⟩,

and since f , g ∈H are arbitrary, we get P (B ∩C ) = P (B)P (C ). ä

Theorem 5.5.10. If T is a normal operator in B(H), then there exists a necessar-ily unique complex spectral measure on (C,B(C)), supported by spec(T ), suchthat

T =∫spec(T )

λP (dλ).

Proof: Exercise! (Hint: T = T1 + i T2 with T1,T2 ∈Bh(H).) ä

/Users/dp/a/ens/mq/iq-sptba.tex 65 lud on 1 October 2013

Page 75: 2_pdfsam_iq.pdf

5.6. Some basic notions on unbounded operators

5.6 Some basic notions on unbounded operators

The operators arising in quantum mechanics are very often unbounded.

Definition 5.6.1. Let H be a Hilbert space. An operator on H, possibly un-bounded, is a pair (Dom(T ),T ) where Dom(T ) ⊆ H is a linear manifold andT : Dom(T ) →H is a linear map. The set of operators onH is denoted L(H).

The graph of an operator T ∈L(H) is the linear sub-manifold ofH⊕H of theform

Γ(T ) = (( f ,T f ) ∈H×H : f ∈Dom(T ).

The operator T is closed if Γ(T ) is closed. The operator T is closable if thereexists T ∈L(H) such that Γ(T ) = Γ(T ) in H⊕H. Such an operator is unique andis called the closure of T . An operator T is said densely defined if Dom(T ) =H.

If T1,T2 ∈L(H) with Dom(T1) ⊆Dom(T2) and T1 f = T2 f for all f ∈Dom(T1),then T2 is called an extension of T1 and T1 the restriction of T2 on Dom(T1); wewrite T1 ⊆ T2. If T is bounded on its domain and Dom(T ) = H, then T can beextended by continuity on the whole space.

The definitions of null space and range are also modified for unboundedoperators:

ker(T ) = f ∈Dom(T ) : T f = 0

Ran(T ) = T f ∈H : f ∈Dom(T ).

The operator T is invertible if ker(T ) = 0 and its inverse, T −1 is the operatordefined on Dom(T −1) =Ran(T ) by T −1(T f ) = f for all f ∈Dom(T ).

If T1,T2 ∈L(H), then T1+T2 is defined on Dom(T1+T2) =Dom(T1)∩Dom(T2)by (T1+T2) f = T1 f +T2 f . Similarly, the product T1T2 is defined on Dom(T1T2) = f ∈Dom(T2) : T2 ∈Dom(T1) by (T1T2) f = T2(T1 f ).

Definition 5.6.2. Suppose that T is densely defined. Then T is the adjointoperator with Dom(T ∗) = g ∈ H : sup |⟨g |T f ⟩| < ∞, f ∈ Dom(T ),‖ f ‖ = 1;since Dom(T ) = H, by Riesz theorem, there exists a unique g∗ ∈ H such that⟨g∗ | f ⟩ = ⟨g |T f ⟩ for all f ∈Dom(T ). We define then T ∗g = g∗.

Example 5.6.3. (The position operator) Let (Ω,F ,µ) be any separable,σ-finitemeasure space, H = L2(Ω,F ,µ;C), and f ∈ H measurable. Let T ∈ L(H) be theoperator defined by Dom(T ) = g ∈ H :

∫(1 + | f |2)|g |2dµ < ∞ and T g (ω) =

f (ω)g (ω) for g ∈ Dom(T ) and ω ∈ Ω. Then T is closed, densely defined, with

/Users/dp/a/ens/mq/iq-sptba.tex 66 lud on 1 October 2013

Page 76: 2_pdfsam_iq.pdf

Dom(T ∗) = Dom(T ) and T ∗g (ω) = f (ω)g (ω). When Ω = R, F = B(R), and µ isthe Lebesque measure, we say that T is the position operator; it is obviouslyself-adjoint.

Example 5.6.4. (The momentum operator) LetH= L2(R). A function u :R→R

is called absolutely continuous, (a.c.) if there exists a function v : R→ R suchthat

u(b)−u(a) =∫ b

av(x)d x, for all a < b.

In such a case, we write u′ = v , u′ is called the derivative of u. The function v isdetermined almost everywhere. Define now T ∈L(H) on

Dom(T ) = f ∈H : f a.c.,∫

(| f |2 +| f ′|2)d x <∞

by T f = f ′. Then T is a closed, densely defined operator with T ∗ = −i T . Theoperator −i T is called the momentum operator.

Exercise 5.6.5. Let q be the position operator, p the momentum operator.Show that [q, p] ⊆ i 1 .

Exercise 5.6.6. (Heisenberg’s uncertainty principle) Denote by S(R) the Schwartzspace of indefinitely differentiable functions of rapid decrease. If f ∈ S(R), de-note by f its Fourier transform f (ξ) = ∫

R f (x)exp(−iξx)d x. Let p : S(R) → S(R)be defined by p f = −i f ′ and q : S(R) → S(R) by q f (x) = x f (x), for all x ∈ R.Show that [q, p] = i 1 . If ⟨ · | · ⟩ denotes the L2 scalar product on S(R), show that

|⟨ f | f ⟩| ≤ 2‖p f ‖2‖q f ‖2.

Conclude that for any f ∈ S(R),

‖ f ‖2 ≤ 4π‖x f ‖L2(R)‖ξ f ‖L2(R).

Below are depicted the graphs of pairs | f (x)|2 and | f (ξ)|2, chosen among aclass of Gaussian functions, for different values of some parameter. How do youinterpret these results?

/Users/dp/a/ens/mq/iq-sptba.tex 67 lud on 1 October 2013

Page 77: 2_pdfsam_iq.pdf

5.6. Some basic notions on unbounded operators

0.2

0.4

0.6

0.8

1

–1 –0.8 –0.6 –0.4 –0.2 0 0.2 0.4 0.6 0.8 1

x

0

0.5

1

1.5

2

–6 –4 –2 2 4 6

xi

0

1

2

3

4

–1 –0.8 –0.6 –0.4 –0.2 0.2 0.4 0.6 0.8 1

x

0.1

0.2

0.3

0.4

0.5

–10 –8 –6 –4 –2 0 2 4 6 8 10

xi

0

2

4

6

8

10

12

14

16

–1 –0.8 –0.6 –0.4 –0.2 0.2 0.4 0.6 0.8 1

x

0.02

0.04

0.06

0.08

0.1

0.12

–40 –30 –20 –10 0 10 20 30 40

xi

0

20

40

60

80

100

–1 –0.8 –0.6 –0.4 –0.2 0.2 0.4 0.6 0.8 1

x

0.002

0.004

0.006

0.008

0.01

0.012

0.014

0.016

0.018

0.02

–300 –200 –100 0 100 200 300

xi

/Users/dp/a/ens/mq/iq-sptba.tex 68 lud on 1 October 2013

Page 78: 2_pdfsam_iq.pdf

6Propositional calculus and quantum

formalism based on quantum logic

Phenomenology is an essential step in constructing physical theories. Phe-nomenological results are of the following type: if a physical system is subjectto conditions A,B ,C , . . ., then the effects X ,Y , Z , . . . are observed. We further in-troduced yes-no experiments consisting in measuring questions in given states.However, there may exist questions that depend on other questions and holdindependently of the state in which they are measured. More precisely, sup-pose for instance that Q A denotes the question: “does the physical particle liein A, for some A ∈ B(R3)?” Let now B ⊇ A be another Borel set in R3. When-ever Q A is true (i.e. for every state for which Q A is true) QB is necessarily true.This remark defines a natural order relation in the set of questions. Consideringquestions on given physical system more abstractly, as a logical propositions,it is interesting to study first the abstract properties of a partially ordered set ofpropositions. This abstract setting allows the statement of the basic axioms forclassical or quantum systems on an equal footing.

69

Page 79: 2_pdfsam_iq.pdf

6.1. Lattice of propositions

6.1 Lattice of propositions

Let Λ be a set of propositions and for any two propositions a and b, denote bya ≤ b the implication “whenever a is true, it follows that b is true”

Definition 6.1.1. The pair (Λ,≤) is a partially ordered set (poset) if the relation≤ is a partial order (i.e. a reflexive, transitive, and antisymmetric binary opera-tion). For a,b ∈Λ, we say that u is a least upper bound if

1. a ≤ u and b ≤ u,

2. if a ≤ v and b ≤ v for some v ∈Λ, then u ≤ v .

If a least upper bound of two elements a and b exists, then it is unique anddenoted by sup(a,b) ∈Λ,

Definition 6.1.2. A lattice is a set Λ with two binary operations, denoted re-spectively by ∨ (’join’) and ∧ (’meet’), and two constants 0 ∈Λ and 1 ∈Λ, satis-fying, for all a,b,c ∈Λ the following properties:

1. idempotence: a ∧a = a = a ∨a,

2. commutativity: a ∧b = b ∧a and a ∨b = v ∨a,

3. associativity: a ∧ (b ∧ c) = (a ∧b)∧ c and a ∨ (b ∨ c) = (a ∨b)∨ c,

4. identity: a ∧1 = a and a ∨0 = a,

5. absorption: a ∧ (a ∨b) = a = a ∨ (a ∧b).

Theorem 6.1.3. Let (Λ,≤) be a poset. Suppose that

1. Λ has a least element 0 and a greatest element 1, i.e. for all a ∈Λ, we have0 ≤ a ≤ 1,

2. any two elements a,b ∈Λ have a least upper bound inΛ, denoted by a∨b,and a greatest lower bound in Λ, denoted by a ∧b. Then (Λ,∧,∨,0,1) is alattice.

Conversely, if (Λ,∧,∨,0,1) is a lattice, then, on defining a ≤ b whenever a∧b = b,the pair (Λ,≤) is a poset verifying properties 1 and 2 of definition 6.1.1

/Users/dp/a/ens/mq/iq-proca.tex 70 lud on 17 February 2013

Page 80: 2_pdfsam_iq.pdf

12

3

1, 21, 3

2, 3

1, 2, 3

Figure 6.1: The Hasse diagram of the lattice of subsets of the set 1,2,3.

Proof: : Exercise! äDefinition 6.1.4. A lattice (Λ,∧,∨,0,1) is called distributive if it verifies, for alla,b,c ∈Λ,

a ∨ (b ∧ c) = (a ∨b)∧ (a ∨ c),

anda ∧ (b ∨ c) = (a ∧b)∨ (a ∧ c).

Remark 6.1.5. A finite lattice (or finite poset) can be represented by its Hassediagram in the plane. The points of the lattice are represented by points inthe plane arranged so that if a ≤ b then the representative of b lies higher inthe plane than the representative of a. We join the representatives of a and bby a segment when b covers a, i.e. when a ≤ b but there is no c ∈ Λ such thata < c < b.

Example 6.1.6. Let S be a finite set and P (S) the collection of its subsets. Then(P (S),⊆) is a poset, equivalent to the lattice (P (S),∩,∪,;,S), called the latticeof subsets of S. This lattice is distributive. For the particular choice S = 1,2,3its Hasse diagram is depicted in figure 6.1.

Exercise 6.1.7. Let V= R2 (viewed as a R-vector space) and E1,E2,E3 be threedistinct one-dimensional subspaces of V. Denote by ≤ the order relation “be avector subspace of”. Show that there is a finite set S of vector subspaces of Vcontaining E1,E2, and E3 such that (S,≤) is a lattice. Is this lattice distributive?

/Users/dp/a/ens/mq/iq-proca.tex 71 lud on 17 February 2013

Page 81: 2_pdfsam_iq.pdf

6.1. Lattice of propositions

In any latticeΛ, a complement of a ∈Λ is an element a′ ∈Λ such that a∧a′ =0 and a ∨ a′ = 1. Complements may fail to exist and they may be not unique.However, in a distributive lattice, any element has at most one complement.

Definition 6.1.8. A Boolean algebra is a complemented distributive lattice (i.e.a distributive lattice in which any element has a — necessarily unique — com-plement.)

When the latticeΛ is infinite, one can consider infinite subsets F ⊆Λ. Whenboth ∧a∈F a and ∨a∈F a exist (in Λ) for any countable subset F , the lattice iscalledσ-complete. A Booleanσ-algebra is a Boolean algebra that isσ-complete.

Definition 6.1.9. A lattice Λ is called modular of it satisfies the modularitycondition:

a ≤ c ⇒∀b ∈Λ, a ∨ (b ∧ c) = (a ∨b)∧ c.

If Λ is a modular and complemented lattice then satisfies, for every comple-ment a′ of a, the modularity condition,

a ≤ b ⇒ b = a ∨ (a′∧b).

If the complement of a is an orthocomplement, then the complemented mod-ular lattice is called orthomodular.

Example 6.1.10. The Dilworth lattice, whose Hasse diagram is depicted in fig-ure 6.2, is a complemented modular but not distributive.

Exercise 6.1.11. Show that a Boolean algebra is always modular.

Definition 6.1.12. An atom in a lattice is a minimal non-zero element, i.e. a ∈Λis an atom if a 6= 0 and if x < a for some x ∈Λ then x = 0. A lattice is atomic ifevery point is the join of a finite number of atoms.

Definition 6.1.13. A homomorphism from a complemented lattice Λ1 into acomplemented latticeΛ2 is a map h :Λ1 →Λ2 such that

1. h(01) = 02 and h(11) = 12,

2. h(a′) = h(a)′ for all a ∈Λ1,

3. h(a ∨b) = h(a)∨h(b) and h(a ∧b) = h(a)∧h(b), for all a,b ∈Λ1

An isomorphism is a lattice homomorphism that is bijective. If the condition 3above holds also for countable joins and meets, h is called aσ-homomorphism.IfΛ1 =Λ2 a lattice isomorphism is called lattice automorphism.

/Users/dp/a/ens/mq/iq-proca.tex 72 lud on 17 February 2013

Page 82: 2_pdfsam_iq.pdf

0

a b c d e f g

a’ b’ c’ d’ e’ f’ g’

1

Figure 6.2: The Hasse diagram of the Dilworth lattice.

Theorem 6.1.14. Let Λ be a Boolean σ-algebra. Then there exist an abstract setX, a σ-algebra, X , of subsets of X and a σ-homomorphism h : X →Λ.

Proof: It is first given in [?] and later reproduced in [33]. ä

This theorem serves to extend the notion of measurability, defined for mapsbetween measurable spaces, to maps defined on abstract Boolean σ-algebras.Recall that ifX is an arbitrary set of points equipped with a Booleanσ-algebra ofsubsets X , andY a complete separable metric space equipped with its Borelσ-algebra B(Y), a map f :X→Y is called measurable if for all B ∈B(Y), f −1(B) ∈X .

Definition 6.1.15. Let Λ be an abstract Boolean σ-algebra and (Y,B(Y)) acomplete separable metric space equipped with its Borelσ-algebra. AY-valuedclassical observable associated with Λ is a σ-homomorphism h : B(Y) → Λ. IfY=R, the observable is called real-valued.

The careful reader will have certainly remarked that the previous definitionis compatible with axiom 2.2.15. As a matter of fact, with every real randomvariable X on an abstract measurable space (Ω,F ) is associated a family ofpropositions Q X

B = 1 X∈B, for B ∈B(R). The aforementionedσ-homomorphismh : B(R) → F , stemming from X (·) through the spectral measure KX (·,B), is

/Users/dp/a/ens/mq/iq-proca.tex 73 lud on 17 February 2013

Page 83: 2_pdfsam_iq.pdf

6.2. Classical, fuzzy, and quantum logics; observables and states on logics

given byh(B) = ω ∈Ω : Q X

B (ω) = 1 = X −1(B) ∈F .

Notice that this does not hold for quantum systems where some more generalnotion is needed.

6.2 Classical, fuzzy, and quantum logics; observablesand states on logics

6.2.1 Logics

Definition 6.2.1. Let (Λ,≤) be a poset (hence a lattice). By an orthocomple-mentation onΛ is meant a mapping ⊥:Λ 3 a 7→ a⊥ ∈Λ, satisfying for a,b ∈Λ:

1. ⊥ is injective,

2. a ≤ b ⇒ b⊥ ≤ a⊥,

3. (a⊥)⊥ = a,

4. a ∧a⊥ = 0.

A lattice with an orthocomplementation operation is called orthocomplemented.

We remark that from condition 2 it follows that 0⊥ = 1 and 1⊥ = 0. Fromcondition 3 it follows that ⊥ is also surjective. Finally, conditions 1, 2, and 3imply that a ∨a⊥ = 1.

Definition 6.2.2. An orthocomplemented lattice,Λ, is said to be a logic if

1. for any countable sequence (an)n∈N of elements of Λ, both ∨n∈Nan and∧n∈N exist inΛ,

2. if a1, a2 ∈ Λ and a1 ≤ a2, then there exists b ∈ Λ, such that b ≤ a⊥1 and

b ∨a1 = a2.

Without loss of generality, we can always assume that an orthocomplementedlattice verifies orthomodularity for a⊥ = a′. Remark also that the element whose

/Users/dp/a/ens/mq/iq-proca.tex 74 lud on 17 February 2013

Page 84: 2_pdfsam_iq.pdf

existence is postulated in item 2 of the previous definition is unique and equalin fact to a⊥

1 ∧ a2. In fact, if b ≤ a⊥1 is such that b ∨ a1 = a2, then necessarily,

a1 ≤ b⊥ and b⊥∧a⊥1 = a⊥

2 . Using orthomodularity, a1∨(b⊥∧a⊥1 ) = b⊥ and sub-

stituting the left hand side parenthesis by a⊥2 , we get the dual of the required

equality. Dualising, we conclude.

The element a⊥ is called the orthogonal complement of a in Λ. If a ≤ b⊥

and b ≤ a⊥, then a and b are said orthogonal and we write a ⊥ b.

Exercise 6.2.3. Assume that (Λ,≤) is a poset (hence a lattice) that is orthocom-plemented. Let a,b ∈Λ be such that a < b. Denote by

Λ[a,b] = c ∈Λ : a ≤ c ≤ b.

Show that

1. Λ[0,b] becomes a lattice in which countable joins and meets exist andwhose zero element is 0 and unit element is b,

2. if we define, for x ∈ Λ[0,b], x ′ = x⊥∧b, then the operation ′ : Λ[0,b] →Λ[0,b] is an orthocomplementation,

3. conclude thatΛ[0,b] is a logic.

Example 6.2.4. Any Boolean σ-algebra is a logic provided we define, for anyelement a, its orthocomplement to be its complement a′. Boolean σ-algebrasare called classical logics.

Example 6.2.5. LetH be aC-Hilbert space. LetΛ be the collection of all Hilbertsubspaces ofH. If ≤ is meant to denote “be a Hilbert subspace of” and ⊥ the or-thogonal complementation in the Hilbert space sense, then Λ is a logic, calledstandard quantum logic.

Postulate 6.2.6. In any physical system (classical or quantum), the set of allexperimentally verifiable propositions is a logic (classical or standard quantum).

6.2.2 Observables associated with a logic

Suppose that Λ is the logic of verifiable propositions of a physical system andlet X be any real physical quantity relative to this system. Denoting x(B) theproposition “the numerical results of the observation of X lie in B”, it is naturaland harmless to consider that B ∈ B(R); obviously then, x is a mapping x :

/Users/dp/a/ens/mq/iq-proca.tex 75 lud on 17 February 2013

Page 85: 2_pdfsam_iq.pdf

6.2. Classical, fuzzy, and quantum logics; observables and states on logics

B(R) → Λ. We regard to physical quantities X and X ′ as identical wheneverthe corresponding maps x, x ′ : B(R) → Λ are the same. If f : R→ R is a Borelfunction, we mean by X ′ = f X a physical quantity taking value f (r ) wheneverX takes value r . The corresponding map is given by B(R) 3 B : x ′ 7→ x ′(B) =x( f −1(B)) ∈Λ. Hence we are led naturally to the following

Definition 6.2.7. LetΛ be a logic. A real observable associated withΛ is a map-ping x : B(R) →Λ verifying:

1. x(;) = 0 and x(R) = 1,

2. if B1,B2 ∈B(R) with B1 ∩B2 =; then x(B1) ⊥ x(B2),

3. if (Bn)n∈N is a sequence of mutually disjoint Borel sets, then x(∪n∈NBn) =∨n∈Nx(Bn).

We write O (Λ) for the set of all real observables associated withΛ.

Exercise 6.2.8. Let Λ be a logic and x ∈ O (Λ). Show that for any sequence ofBorel sets (Bn)n∈N we have

x(∪n∈NBn) =∨n∈Nx(Bn)

andx(∩n∈NBn) =∧n∈Nx(Bn).

Definition 6.2.9. LetΛ be a logic and O (Λ) the set of its associated observables.A real number λ is called a strict value of an observable x ∈ O (Λ), if x(λ) 6= 0.The observable x ∈ O (Λ) is called discrete if there exists a countable set C =c1,c2, . . . such that x(C ) = 1; it is called constant if there exists c ∈ R such thatx(c) = 1. It is called bounded if there exists a compact Borel set K such thatx(K ) = 1.

Definition 6.2.10. We call spectrum of x ∈O (Λ) the closed set defined by

spec(x) =∩C closed :x(C )=1C .

The numbers λ ∈ spec(x) are called spectral values of x.

Any strict value is a spectral value; the converse is not necessarily true.

Exercise 6.2.11. Show that λ ∈ spec(x) if and only if any open set U containingλ verifies x(U ) 6= 0.

/Users/dp/a/ens/mq/iq-proca.tex 76 lud on 17 February 2013

Page 86: 2_pdfsam_iq.pdf

If (an)n∈N is a partition of unity, i.e. a family of mutually orthogonal propo-sitions in Λ such that ∨n∈Nan = 1, there exists a unique discrete observableadmitting as spectral values a given discrete subset c1,c2, . . . of the reals. Infact, it is enough to define for all n ∈ N, x(cn) = an and for any B ∈ B(R),x(B) = ∨n:cn∈B an . Notice however that discrete observables do not exhaustall the physics of quantum mechanics; important physical phenomena involvecontinuous observables.

6.2.3 States on a logic

We have seen that to every classical system is attached a measurable space(Ω,F ) (its phase space); observables are random variables and states are prob-ability measures that may degenerate to Dirac masses on particular points ofthe phase space. This description is incompatible with the experimental obser-vation for quantum systems. For the latter, the Heisenberg’s uncertainty princi-ple stipulates that no matter how carefully the system is prepared, there alwaysexist observables whose values are distributed according to some non-trivialprobability distribution.

Definition 6.2.12. Let Λ be a logic and O (Λ) its set of associated observables.A state function is a mapping ρ : O (Λ) 3 x 7→ ρx ∈M+

1 (R,B(R)).

For every Borel function f : R→ R, for every observable x, and every Borelset B on the line, we have:

ρ f x(B) = ρx( f −1(B)).

Denoting by o the zero observable and 0 the zero of R, we have that ρo = δ0.In fact, suppose that f :R→R is the identically zero map. Then f o = o and

f −1(B) =R if 0 ∈ B; otherwise.

Hence, if 0 ∈ B , then ρo(B) = ρ f o(B) = ρo( f −1(B)) = 1, because ρo is a prob-ability on R; if 0 6∈ B then similarly ρo(B) = 0. Therefore, in all circumstances,ρo(B) = δ0(B).

If x ∈ O (Λ) is any observable and B ∈ B(R) is such that x(B) = 0 ∈ Λ, thenρx(B) = 0. In fact, for this B , we have 1 B x = o and ρx(B) = ρo(1) = δ0(1) = 0.This implies that if x is discrete, the measure ρx is supported by the set of thestrict values of x.

/Users/dp/a/ens/mq/iq-proca.tex 77 lud on 17 February 2013

Page 87: 2_pdfsam_iq.pdf

6.2. Classical, fuzzy, and quantum logics; observables and states on logics

Definition 6.2.13. An observable q ∈O (Λ) is a question if q(0,1) = 1. A ques-tion is the necessarily discrete. If q(1) = a ∈Λ, then q is the only question suchthat q(1) = a; we call it question associated with the proposition a and denoteby qa if necessary.

Definition 6.2.14. LetΛ be a logic. A function p :Λ→ [0,1] satisfying

1. p(0) = 0 and p(1) = 1,

2. if (an)n∈N is a sequence of mutually orthogonal propositions of Λ, anda =∨n∈Nan , then p(a) =∑

n∈Np(an)

is called state (or probability measure) on the logic Λ. The set of states on Λ isdenoted by S (Λ).

The concept of probability measure on a logic coincides with a classicalprobability measure when the logic is a Boolean σ-algebra. For non distribu-tive logics however, the associated probability measures are genuine generali-sations of the classical probabilities. For standard quantum logics, the associ-ated states are called quantum probabilities.

Theorem 6.2.15. Let p ∈S (Λ), whereΛ is a logic.

1. On defining a map ρp : O (Λ) →M+1 (R,B(R)), by the formula: for every x ∈

O (Λ) and for every B ∈B(R), ρpx (B) = p(x(B)), then ρp is a state function.

2. Conversely, if ρ is an arbitrary state function, then for every x ∈O (Λ), thenthere exists a unique probability measure p ∈S (Λ) such that for every x ∈O (Λ) and for every B ∈B(R), ρx(B) = p(x(B)).

Proof:

1. The map ρpx : B(R) → [0,1] is certainly a σ-additive, non-negative map.

Moreover, ρpx (R) = p(1) = 1, hence it is a probability. If f :R→R is a Borel

function,

ρpf x(B) = p( f x(B)) = p(x( f −1(B))) = ρp

x (( f −1(B)).

Hence ρp is a state function.

/Users/dp/a/ens/mq/iq-proca.tex 78 lud on 17 February 2013

Page 88: 2_pdfsam_iq.pdf

2. Let ρ be a state function. If a ∈Λ and qa ∈ O (Λ) the question associatedwith proposition a, then ρqa is a probability measure on B(R). Since qa isa question, ρqa (0,1) = 1. Define p(a) = ρqa (1). Obviously, for all a ∈Λ,p(a) is well defined and is taking values in [0,1]. It remains to show thatp is a probability measure onΛ, that is to say verify σ-additivity and nor-malisation. For 0 ∈ Λ, q0(1) = 0. Hence ρq0(1) = 0 = p(0). Similarly,we show that = p(1) = 1. This shows normalisation.

Let (an)n∈N be a sequence of mutually orthogonal elements ofΛ, and de-note by a = ∨n∈Nan . Let x ∈ O (Λ) be the discrete observable defined byx(0) = a⊥ and x(n) = an , for n = 1,2, . . .. Then, 1 n x(1) = x(n) =an . Hence qan = 1 n x and p(an) = ρx(n). Since ρx is a probabil-ity measure,

∑n p(an) = ρx(1,2,3, . . .) = ρx(N). Similarly, 1N x = qa

because 1N x(1) = x(N) = ∨n∈Nx(n) = ∨n∈Nan = a. Hence, finally,p(a) = ∑

n p(an) establishing thus σ-additivity of p. Finally, for x ∈ O (Λ)and B ∈B(R),

ρx(B) = ρ1 Bx(1) = ρqx(B) (1) = p(x(B)).

ä

If p ∈ S (Λ) and x ∈ O (Λ), the map B(R) 3 B 7→ p(x(B)) ∈ [0,1] defines aprobability measure on B(R). It is called the probability distribution inducedon the space of its values by the observable x when the system is in state p andis denoted ρp

x . The expected value of x in state p is

Ep (x) =∫R

tρpx (d t )

and for a Borel function f :R→R, we have

Ep ( f x) =∫R

f (t )ρpx (d t )

(provided the above integrals exist.) If Ep (x2) < ∞, the variance of x in p isVarp (x) = Ep (x2)− (Ep (x))2.

Postulate 6.2.16. The phase space of a physical system described by the logic Λ.States of the system are given by S (Λ).

Postulate 6.2.17. Observables of a physical system described by the logic Λ areO (Λ).

Postulate 6.2.18. Measuring whether the values of a physical observable x ∈O (Λ) lie in B ∈B(R) when the system is prepared in state p ∈S (Λ) means deter-mining ρp

x (B).

/Users/dp/a/ens/mq/iq-proca.tex 79 lud on 17 February 2013

Page 89: 2_pdfsam_iq.pdf

6.3. Pure states, superposition principle, convex decomposition

6.3 Pure states, superposition principle, convex de-composition

Proposition 6.3.1. Let S (Λ) be the set of states on the logic Λ. Let (pn)n∈N bea sequence in S (Λ) and (cn)n∈N a sequence in R+ such that

∑n∈N cn = 1. Then

p =∑n∈N cn pn , defined by p(a) =∑

n∈N cn pn(a) for all a ∈Λ, is a state.

Proof: Exercise! äCorollary 6.3.2. For any logicΛ, the set S (Λ) is convex.

Remark 6.3.3. Notice that if p = ∑n∈N cn pn as above, for every x ∈ O (Λ), we

have that ρpx =∑

n∈N cnρpnx . In fact, for all B ∈B(R),

ρpx (B) = p(x(B)) = ∑

n∈Ncn pn(x(B)) = ∑

n∈Ncnρ

pnx (B).

This decomposition has the following interpretation: the sequence (cn)n∈N de-fines a classical probability onNmeaning that in the sum defining p, each pn ischosen with probability cn . Therefore, for each integrable observable x ∈O (Λ),the expectation Ep (x) = ∑

n∈N cnEpn (x) consists in two averages: a classical av-erage on the choice of pn and a (may be) quantum average Epn (x).

Exercise 6.3.4. Give a plausible definition of the notion of integrable observ-able used in the previous remark and then prove the claimed equality: Ep (x) =∑

n∈N cnEpn (x)

Definition 6.3.5. A state p ∈ S (Λ) is said to be pure if the equation p = cp1 +(1−c)p2, for p1, p2 ∈S (Λ) and c ∈ [0,1] implies p = p1 = p2. We write Sp (Λ) forthe set of pure states ofΛ. Obviously Sp (Λ) =ExtrS (Λ).

Definition 6.3.6. Let D ⊆ S (Λ) and p0 ∈ S (Λ). We say that p0 is a superposi-tion of states in D if for a ∈Λ,

∀p ∈D, p(a) = 0 ⇒ p0(a) = 0.

It is an exercise to show that the state p =∑n∈N cn pn defined in the proposi-

tion ?? is a superposition of states in D = p1, p2, . . .. In the case Λ is a Booleanσ-algebra, the next theorem 6.3.7 shows that this is in fact the only kind of pos-sible superposition. This implies, in particular, the unicity of the decomposi-tion of a classical state into extremal (pure) states. If Λ is a standard quantumlogic, unicity of the decomposition does not hold any longer!

/Users/dp/a/ens/mq/iq-proca.tex 80 lud on 17 February 2013

Page 90: 2_pdfsam_iq.pdf

Theorem 6.3.7. Let Λ be a Boolean σ-algebra of subsets of a space X. Supposethat

1. Λ is separable1,

2. for all a ∈X, a ∈Λ.

For any a ∈X and any A ⊆X, let δa be the state defined by

δa(A) =

1 if a ∈ A0 otherwise.

Then, (δa)a∈X is precisely the set of all pure states in Λ. If D ⊆ Sp (Λ) and p0 ∈Sp (Λ), then p0 is a superposition of states in D if and only if p0 ∈D.

Proof: Denote A1, A2, . . . a denumerable collection of subsets of X generatingΛ. Purity of δa is trivially verified. Suppose that p is a pure state. If for someA0 ∈Λwe have 0 < p0(A) < 1, then, on putting for A ∈Λ

p1(A) = 1

p(A0)p(A∪ A0) (∗)

and

p2(A) = 1

1−p(A0)p(A∩ Ac

0), (∗∗)

we get p(A) = p(A0)p1(A)+ (1− p(A0))p2(A). Yet, applying (*) and (**) to A0,we get p1(A0) = 1 and p2(A0) = 0, hence p1 6= p2. This is in contradiction withthe assumed purity of p. Therefore, we conclude that for all A ∈ Λ, we havep(A) ∈ 0,1. Replacing An by Ac

n if necessary, we can assume without loss ofgenerality that p(An) = 1 for all the sets of the collection generating Λ. Let B =∩n An . Then p(B) = 1 and consequently B cannot be empty. Now B cannotcontain more than one point either. In fact, the collection of all sets C ∈Λ suchthat either B ⊆ C or B ∩C = ; is a σ-algebra containing all the sets An , n ∈ N.Hence, it coincides with Λ. As singletons are members of Λ, the set B must bea singleton, i.e. B = a for some a ∈ X. Put then p = δa . Finally, let p0 be asuperposition of states in D (all its elements are pure states). If p0 = δa0 butp0 6∈D, then p(a0) = 0 for all p ∈D but p0(a0) 6= 0, a contradiction. ä

1i.e. there is a countable collection of subsets An ⊆X, n ∈N, generating Λ by complementa-tion, intersections, and unions.

/Users/dp/a/ens/mq/iq-proca.tex 81 lud on 17 February 2013

Page 91: 2_pdfsam_iq.pdf

6.4. Simultaneous observability

6.4 Simultaneous observability

In quantum systems, the Heisenberg’s uncertainty principle, already shown inchapter 2, there are observables that cannot be simultaneously observed witharbitrary precision.

Definition 6.4.1. Let a,b ∈ Λ. Propositions a and b are said to be simultane-ously verifiable, denoted by a ↔ b, if there exists elements a1,b1,c ∈Λ such that

1. a1,b1,c are mutually orthogonal and,

2. a = a1 ∨ c and b = b1 ∨ c hold.

Observables x, y ∈O (Λ) are simultaneously observable if for all B ∈B(R), x(B) ↔y(B). For A,B ⊆Λ, we write A ↔ B if for all a ∈ A and all b ∈ B we have a ↔ b.

Lemma 6.4.2. Let a,b ∈Λ. The following are equivalent:

1. a ↔ b,

2. a ∧ (a ∧b)⊥ ⊥ b,

3. b ∧ (a ∧b)⊥ ⊥ a,

4. there exist x ∈O (Λ) and A,B ∈B(R) such that x(A) = a and x(B) = b,

5. there exists a Boolean sub-algebra ofΛ containing a and b.

Proof:

1 ⇒ 2:

a ↔ b ⇔ a = a1 ∨ c and b = b1 ∨ c

⇒ c ≤ a and c ≤ b

⇒ c ≤ a ∧b.

From the definition 6.2.2 (logic), it follows that there exists d ∈Λ such thatc ⊥ d and c ∨d = a ∧b.

Now d ≤ c∨d = a∧b ≤ a and d ≤ c⊥ (since d ⊥ d .) Hence, d ≤ a∧c⊥ = a1

(see remark immediately following the definition 6.2.2.) Similarly, d ≤b1 ⇒ d ≤ b1 ∧ q1 = 0. Therefore d = 0 and consequently c = a ∧ b. Itfollows a1 = a ∧ (a ∧b)⊥. Yet, a1 ⊥ c and a1 ⊥ b1 so that a1 ⊥ (b1 ⊥ c) = b.Summarising, a ∧ (a ∧b)⊥ ⊥ b.

/Users/dp/a/ens/mq/iq-proca.tex 82 lud on 17 February 2013

Page 92: 2_pdfsam_iq.pdf

1 ⇒ 3: By symmetry.

2 ⇒ 1: Since a ∧ (a ∧b)⊥ ⊥ b, on writing a1 = a ∧ (a ∧b)⊥, b1 = b ∧ (a ∧b)⊥, andc = a∧b, we find a = a1∨c and b = b1∨c. Since a1 ⊥ b, it follows that a1 ⊥b1 and a1 ⊥ c, while, by definition, c ⊥ b1 which proves the implication.

Henceforth, the equivalence 1 ⇔ 2 ⇔ 3 is established.

1 ⇒ 4: If a = a1 ∨ c, b = b1 ∨ c and a1,b1,c mutually orthogonal, write d = a1 ∨b1 ∨ c and define x to be the discrete observable such that x(0) = a1,x(1) = b1, x(2) = c, and x(3) = d . Then x(0,2) = a and x(1,2) = b.

4 ⇒ 5: x(A∩(A∩B)c ) = a∧(a∧b)⊥ and x(B ∩(A∩B)c ) = b∧(a∧b)⊥. On writinga1 = a ∧ (a ∧b)⊥, a2 = a ∧b, a3 = b ∧ (a ∧b)⊥, and a4 = (a ∨b)⊥, we seethat (ai )i=1,...,4 are mutually orthogonal and a1 ∨a2 ∨a3 ∨a4 = 1. If

A = ai1 ∨ . . .∨aik : k ≤ 4;1 ≤ i1 ≤ . . . ≤ ik ≤ 4,

it is easily verified that A is Boolean sub-algebra ofΛ. Since a,b ∈A , thisproves the implication.

5 ⇒ 2: Let A be a Boolean sub-algebra of Λ containing a and b. Now, [a ∧ (a ∧b)⊥]∧b = 0. As a,b, a ∧ (a ∧b)⊥,b⊥ ∈A , it follows that

a ∧ (a ∧b)⊥ = [(a ∧ (a ∧b)⊥)∧b]

∨[(a ∧ (a ∧b)⊥)∧b⊥]

= [(a ∧ (a ∧b)⊥)∧b⊥]

≤ b⊥.

Therefore a ∧ (a ∧b)⊥ ⊥ b.

ä

The significance of this lemma is that if two propositions are simultaneouslyverifiable, we can operate on them as if they were classical.

Theorem 6.4.3. Let Λ be any logic and (xλ)λ∈D a family of observables. Sup-pose that xλ ↔ xλ′ for all λ,λ′ ∈ D. Then there exist a space X, a σ-algebra X

of subsets of X, a family of measurable functions gλ : X→ R, λ ∈ D, and a σ-homomorphism τ : X → Λ such that τ(g−1

l (B)) = xλ(B) for all λ ∈ D and allb ∈B(R). Suppose further that either Λ is separable or D is countable. Then, forall λ ∈ D, there exist a x ∈ O (Λ) and a measurable function fλ : R→ R such thatxλ = fλ x.

/Users/dp/a/ens/mq/iq-proca.tex 83 lud on 17 February 2013

Page 93: 2_pdfsam_iq.pdf

6.5. Automorphisms and symmetries

The proof of this theorem is omitted. Notice that it allows to construct func-tions of several observables that are simultaneously observable. This latter re-sult is also stated without proof.

Theorem 6.4.4. Let Λ be any logic and (x1, . . . , xn) a family of observables thatare simultaneously observable. Then there exists aσ-homomorphism τ : B(Rn) →Λ such that for all B ∈B(R) and all i = 1, . . . ,n,

xi (B) = τ(π−1i (B)), (∗)

where πi : Rn → R is the projection π(t1, . . . , tn) = ti , i = 1, . . . ,n. If g is a Borelfunction on Rn , then g (x1, . . . , xn)(B) = τ(g−1(B)) is an observable. If g1, . . . , gk

are real valued Borel functions on Rn and yi = gi (x1, . . . , xn), then y1, . . . , yk aresimultaneously observable and for any real valued Borel function h on Rk , wehave h(y1, . . . , yk ) = h(g1, . . . , gk )(x1, . . . , xn) where, for t = (t1, . . . , tn), h(g1, . . . , gk )(t) =h(g1(t), . . . , gk (t)).

An immediate consequence of this theorem is that if p is a probability mea-sure on Λ, then ρ

px1,...,xn

(B) = p(τ(B)), for B ∈ B(Rn), is the joint probabilitydistribution of (x1, . . . , xn) in state p.

6.5 Automorphisms and symmetries

Let Λ be a logic. The set Aut(Λ), of automorphisms of Λ, acquires as usual agroup structure; they induce naturally automorphisms on S (Λ), called convexautomorphisms.

Let, in fact, α ∈ Aut(Λ) and p ∈ S (Λ). If we define α to be the inducedaction of α on p, by α(p)(a) = p(α−1(a)), for all a ∈Λ, then α is a convex auto-morphism of S (Λ).

Definition 6.5.1. A map β : S (Λ) →S (Λ) is a convex automorphisms if

1. β is bijective and

2. if (cn)n∈N is a sequence of non-negative reals such that∑

n∈N cn = 1 and(pn)n∈N is a sequence of states in S (Λ), then

β(∑

n∈Ncn pn) = ∑

n∈Ncnβ(pn).

/Users/dp/a/ens/mq/iq-proca.tex 84 lud on 17 February 2013

Page 94: 2_pdfsam_iq.pdf

The set of convex automorphisms of S (Λ) is denoted Aut(S (Λ)).

Lemma 6.5.2. Let α ∈ Aut(Λ). Then the induced automorphism α on S (Λ) isconvex.

Proof: Bijectivity of α follows immediately from the bijectivity ofα. If p =∑n∈N cn pn ∈

S (Λ) (with the notation of definition 6.5.1), then α(p)(a) = p(α−1(a)) =∑n∈N cn pn(α−1(a)) =∑

n∈N cnα(pn)(a) for all a ∈Λ. äRemark 6.5.3. It is obvious that convex automorphisms map pure states ofSp (Λ) into pure states.

Dynamics, i.e. time evolution of a system described by a logic Λ can bedefined in the following manner. For each t ∈ R, there exists a unique mapD(t ) : S (Λ) →S (Λ) having the following interpretation: if p ∈S (Λ) is the stateof the system at time t0, then D(t )(p) will represent the state of the system attime t + t0.

Definition 6.5.4. Let G be a locally compact topological group. By a represen-tation of G into Aut(S (Λ)), we mean a map π : G →Aut(S (Λ)) such that

1. π(g1g2) =π(g1)π(g2) for all g1, g2 ∈G ,

2. for each a ∈Λ and each p ∈ S (Λ), the mapping g 7→ π(g )(p)(a) is B(G)-measurable.

Postulate 6.5.5. Time evolution of an isolated physical system described by alogic Λ, is implemented by a map R 3 t 7→ D(t ) ∈Aut(S (Λ)). This map providesa representation of the Abelian group (R,+) into Aut(S (Λ)). More generally, anyphysical symmetry, implemented by the action of a locally compact topologicalgroup G, induces a representation into Aut(S (Λ)).

Here is an interpretation and/or justification of this axiom. If p =∑n∈N cn pn

represents the initial state of the system, we can realise this state as follows.First chose an integer n ∈N with probability cn and prepare the system at statepn . Let the system evolve under the dynamics. Then at time t it will be at statep ′

n = D(t )(pn) with probability cn . Assuming now that D(t ) is a convex auto-morphism means that D(t )(p) =∑

n∈N cnD(t )(pn), i.e. at time t , the system is instate p ′

n = D(t )(pn) with probability cn , exactly the result we obtained with thefirst procedure.

To further exploit the notions of logic, states, observables, and convex auto-morphisms, we must specialise the physical system.

/Users/dp/a/ens/mq/iq-proca.tex 85 lud on 17 February 2013

Page 95: 2_pdfsam_iq.pdf

6.5. Automorphisms and symmetries

/Users/dp/a/ens/mq/iq-proca.tex 86 lud on 17 February 2013

Page 96: 2_pdfsam_iq.pdf

7Standard quantum logics

We recall that a standard quantum logicΛwas defined in chapter 5 to be the setof Hilbert subspaces of C-Hilbert space H. For every Hilbert subspace M ∈ Λ,we denote by PM the orthogonal projection to M . If x ∈ O (Λ), then B 7→ Px(B),for B ∈ B(R), is a projection-valued measure on B(R). Conversely, for everyprojection-valued measure P on B(R), there exists an observable x ∈O (Λ) suchthat P (B) = Px(B), for all B ∈ B(R). We identify henceforth Hilbert subspaceswith the orthogonal projectors mapping the whole space on them (recall exer-cise 4.4.7.)

7.1 Observables

Lemma 7.1.1. Let M1, M2 ∈ Λ. Then propositions associated with M1 and M2

are simultaneously verifiable if and only if [PM1 ,PM2 ] = 0.

Proof:

• (⇒): Propositions M1 and M2 are simultaneously verifiable if there existmutually orthogonal elements N1, N2, N ∈ Λ such that Mi = Ni ∨ N , for

87

Page 97: 2_pdfsam_iq.pdf

7.2. States

i = 1,2. Then PMi = PNi +PN and the commutativity of the projectorsfollows immediately.

• (⇐): If [PM1 ,PM2 ] = 0, let P = PM1 PM2 . Then P is a projection. DefineQi = PMi −P , for i = 1,2; it is easily verified that Qi are projections andPQi = Qi P = 0. Therefore Q1Q2 = Q2Q1 = 0. If we define Ni = Qi (H),for i = 1,2 and N = P (H), then N1, N2, N are mutually orthogonal andMi = Ni ∨N which proves that M1 ↔ M2.

äTheorem 7.1.2. Let Λ be a standard logic with associated Hilbert space H. Forany x ∈O (Λ), denote X the self-adjoint (not necessarily bounded) operator on Hwith spectral measure given by the mapping B(R) 3 B 7→ Px(B) ∈Λ. Then

1. the map x 7→ X is a bijection between O (Λ) and self-adjoint operators onH,

2. the observable x is bounded if and only if X ∈Bh(H),

3. two bounded observables x1 and x2 are simultaneously observable if andonly if the corresponding bounded operators X1 and X2 commute,

4. if x is a bounded observable and Q ∈ R[t ], then the operator associatedwith Q x is Q(X ),

5. more generally, if x1, . . . , xr are bounded observables any two of them beingsimultaneously observable, and Q ∈ R[t1, . . . , t2], then the observable Q (x1, . . . , xr ) has associated operator Q(X1, . . . , Xr ).

Proof: Assertions 1–4 are simple exercises based on the spectral theorem forself-adjoint operators. Assertion 5 is a direct consequence of theorem 6.4.4. ä

7.2 States

In chapter 2, we defined (pure) quantum states to be unit vectors ofH. In chap-ter 6, states have been defined as probability measures on a logic. We first showthat in fact rays correspond to states viewed as probability measures onΛ.

/Users/dp/a/ens/mq/iq-stqlo.tex 88 lud on 17 February 2013

Page 98: 2_pdfsam_iq.pdf

Unit vectors of H are called rays. Let ξ ∈H, with ‖ξ‖ = 1 be a ray and denoteby pξ :Λ→ [0,1] the map defined by

Λ 3 M 7→ pξ(M) = ⟨ξ |PMξ⟩ = ‖PMξ‖2.

We have: pξ(1) ≡ pξ(H) = 1, pξ(0) ≡ pξ(0) = 0, and if (Mn)n∈N is a sequence ofmutually orthogonal Hilbert subspaces ofH and M =∨n∈NMn , then

pξ(M) = ‖PMξ‖2 = ∑n∈N

⟨ξ |PMnξ⟩ =∑

n∈Npξ(Mn).

Hence pξ ∈S (Λ). If c ∈C, with |c| = 1, then pcξ = pξ.

Theorem 7.2.1. Let H be a Hilbert space, (εn)n∈N an orthonormal basis in it anT ∈B+(H). We define the trace of T by

tr(T ) = ∑n∈N

⟨εn |T εn ⟩ ∈ [0,+∞].

Then for all T,T1,T2 ∈B+(H) the trace has the following properties

1. is independent of the chosen basis,

2. tr(T1 +T2) = tr(T1)+ tr(T2),

3. tr(λT ) =λtr(T ) for all λ≥ 0,

4. tr(U TU∗) = tr(T ), for all U ∈U(H).

Proof: (To be filled in a later version.) äDefinition 7.2.2. Let T ∈B(H). The operator T is called trace-class operator iftr(|T |) <∞. The family of trace-class operators is denoted by T1(H).

Lemma 7.2.3. The space T1(H) is a two-sided ideal of B(H) and tr(T B) = tr(BT )for all B ∈B(H).

Proof: This will be shown in several steps.

1. Every B ∈B(H) can be decomposed as a linear combination of four uni-tary operators. In fact, writing B = 1

2 (B +B∗)− i2 [i (B −B∗)], the opera-

tor B is decomposed into a sum of two self-adjoint operators. Now, ifA ∈ Bh(H), we can w.l.o.g. assume that ‖A‖ ≤ 1 and thence A ±

pI − A2

are unitary. We conclude that B = c1U1 + . . .+ c4U4 with Ui ∈ U(H) andc ∈C.

/Users/dp/a/ens/mq/iq-stqlo.tex 89 lud on 17 February 2013

Page 99: 2_pdfsam_iq.pdf

7.2. States

2. We show then that T1(H) is a vector space. In fact, for every λ ∈C, due tothe fact that |λA| = |λ||T |, it follows that if A ∈ T1(H) then λA ∈ T1(H) aswell.

For T1,T2 ∈ T1(H), denote by U ,V ,W the partial isometries arising intothe polar decompositions T1+T2 =U |T1+T2|, T1 =C |T1|, and T2 =W |T2|.Then,∑

n⟨en | |T1 +T2|en ⟩ = ∑

n⟨en |U∗(T1 +T2)en ⟩

= ∑n⟨en |U∗V |T1|en ⟩+

∑n⟨en |U∗W |T2|en ⟩

≤ ∑n|⟨en |U∗V |T1|en ⟩|+

∑n|⟨en |U∗W |T2|en ⟩|.

Now,∑n⟨en |U∗V |T1|en ⟩ = ∑

n⟨ |T1|

12 V ∗U en | |T1|

12 en ⟩

≤ ∑n‖|T1|

12 V ∗U en‖‖|T1|

12 en‖ Cauchy-Scwharz onH

≤ (∑n‖|T1|

12 V ∗U en‖2)1/2(

∑n‖|T1|

12 en‖2)1/2 Cauchy-Scwharz on `2(N).

We conclude that∑n‖|T1|

12 V ∗U en‖2 = ∑

n⟨ |T1|

12 V ∗U en | |T1|

12 V ∗U en ⟩

= ∑n⟨en |U∗V |T1|V ∗U en ⟩

≤ ∑n⟨en |V |T1|V ∗en ⟩

≤ ∑n⟨en | |T1|en ⟩

= tr(|T1|),

because U ,V are partial isometries. The second term is majorised sim-ilarly so that tr(|T1 +T2|) ≤ tr(|T1|)+ tr(|T2|) < ∞, showing that T1 +T2 ∈T1(H).

3. Using the decomposition of every B ∈B(H) into the combination of fourunitary operators B = ∑4

i=1 ciUi , we get tr(T B) = ∑4i=1 ci tr(TUi ) so that

it becomes sufficient to prove that T ∈ T1(H) and U ∈ U(H) implies thatTU ,U T ∈T1(H). But |U T | =p

(U T )∗U T =pT ∗T = |T | and |TU | =p

(TU )∗TU =√U∗|T |2U =U∗|T |U ; furthemore U∗|T |U ≥ 0. Hence tr(|TU |) = tr |T | =

tr(|U T |).

/Users/dp/a/ens/mq/iq-stqlo.tex 90 lud on 17 February 2013

Page 100: 2_pdfsam_iq.pdf

äExercise 7.2.4. Show the T ∈ T1(H) implies that T ∗ ∈ T1(H) (hence T 1(H) is abilateral ∗-ideal of T1(H).

Exercise 7.2.5. Show that T ∈ T1(H) is not necessarily closed with respect tothe operator norm stemming from the Hilbert norm. Nevertheless, T1(H) is aBanach space for the ‖ ·‖1 norm defined by ‖T ‖1 = tr |T |.Definition 7.2.6. If D is a bounded, self-adjoint, non-negative, trace-class op-erator onH, then D is called a von Neumann operator. If further tr(D) = 1, thenD is said to be a density matrix (operator). The set of density matrices on H isdenoted by D(H).

The states pξ, for ξ a ray of H, can also be described in another way. Let Dξ

be the projection operator on the one-dimensional subspace1 Cξ. Then Dξ istrace-class and for every X ∈ B(H), it follows that DξX is also trace-class. Let(εn)n∈N be an arbitrary orthonormal basis of H; without loss of generality, wecan then assume that ε1 = ξ. We have

tr(DξX ) = tr(X Dξ)

= ∑n∈N

⟨εn |X Dξεn ⟩= ⟨ξ |X ξ⟩= Eξ(X ).

In particular, if X = PM for M ∈Λ,

pξ(M) = ⟨ξ |PMξ⟩ = tr(DξPM ).

Lemma 7.2.7. Let (ξn)n∈N be an arbitrary sequence of rays in H and (cn)n∈N anarbitrary sequence of non-negative reals such that

∑n∈N cn = 1. Denote by Dn the

projection operator on the one-dimensional subspace Cξn , for n ∈N. Then

D = ∑n∈N

cnDn

is a well defined density matrix.

Proof: Exercise. äExercise 7.2.8. Show that D(H) is convex.

1We recall that the term subspace always means closed subspace.

/Users/dp/a/ens/mq/iq-stqlo.tex 91 lud on 17 February 2013

Page 101: 2_pdfsam_iq.pdf

7.3. Symmetries

Lemma 7.2.9. Let D be a density matrix defined as in lemma 7.2.7 and p :Λ→R

the mapping defined by Λ 3 M 7→ p(M) = tr(PM D). Then p ∈ S (Λ) and more-over it can be decomposed into p =∑

n∈N cn pξn .

Proof: First the superposition property follows from the linearity of the trace:for all M ∈Λ, we have p(M) = tr(PM D) =∑

n∈N cn tr(PM Dn) =∑n∈N cn pξn (M). It

is now obvious that p is a state: in fact, p(0) = p(0) = 0 and p(1) = p(H) = 1. ä

Conversely, if D is any density matrix, then the mapΛ 3 M 7→ p(M) = tr(DPM )is a state in S (Λ). States of this type are called tracial states. The natural ques-tion is whether every state in S (Λ) arises as a tracial state. The answer to thisquestion is one of the most profound results in the mathematical foundationsof quantum mechanics, the celebrated Gleason’s theorem:

Theorem 7.2.10 (Gleason). Let H be a complex separable Hilbert space with3 ≤ dimH≤ ℵ0, D(H) the convex set of density matrices on H, and Λ the logic ofsubspaces ofH. Then

1. the map D(H) 3 D 7→ ρD ∈S (Λ), defined by ρD (M) = tr(DPM ) for all M ∈Λ, is a convex isomorphism of D(H) on S (Λ),

2. a state p ∈S (Λ) is pure if and only if p = pξ for some ray ξ inH,

3. two pure states pξ and pζ are equal if and only if there exists a complexnumber c with |c| = 1 such that the rays ξ and ζ verify ξ= cζ.

The proof, lengthy and tricky, is omitted. It can be found, extending over 13pages (!), in [31], pages 147–160.

7.3 Symmetries

Definition 7.3.1. A linear map S :H→H is a symmetry if

1. S is bijective, and

2. for all f , g ∈H, the scalar product is preserved: ⟨S f |Sg ⟩ = ⟨ f |g ⟩.Exercise 7.3.2. Let α ∈ Aut(Λ) where Λ is the standard quantum logic associ-ated with a given Hilbert spaceH. Show that

/Users/dp/a/ens/mq/iq-stqlo.tex 92 lud on 17 February 2013

Page 102: 2_pdfsam_iq.pdf

1. there exists a symmetry S ∈B(H) such that for all M ∈Λ, α(M) = SM ,

2. if S′ is another symmetry corresponding to the same automorphism α,then there exists a complex number c, with |c| = 1 such that S′ = cS,

3. if S is any symmetry of H, the map Λ 3 M 7→ SM ∈Λ is an automorphismofΛ.

Notice that unitaries are obviously symmetries. It turns out that they are theonly symmetries encountered in elementary quantum systems2.

2In general, anti-unitaries may also occur as symmetries. They are not considered in thiscourse.

/Users/dp/a/ens/mq/iq-stqlo.tex 93 lud on 17 February 2013

Page 103: 2_pdfsam_iq.pdf

7.3. Symmetries

/Users/dp/a/ens/mq/iq-stqlo.tex 94 lud on 17 February 2013

Page 104: 2_pdfsam_iq.pdf

8States, effects, and the corresponding

quantum formalism

8.1 States and effects

8.2 Operations

8.3 General quantum transformations, complete pos-itivity, Kraus theorem

95

Page 105: 2_pdfsam_iq.pdf

8.3. General quantum transformations, complete positivity, Kraus theorem

/Users/dp/a/ens/mq/iq-proca.tex 96 lud on 12 January 2013

Page 106: 2_pdfsam_iq.pdf

9Quantum formalism based on the

informational approach

97

Page 107: 2_pdfsam_iq.pdf

Chapter 9

/Users/dp/a/ens/mq/iq-infor.tex 98 lud on 12 January 2013

Page 108: 2_pdfsam_iq.pdf

10Two illustrating examples

10.1 The harmonic oscillator

In chapter 6, a general formalism, covering both classical and quantum logics,has been introduced. Here we present a simple physical example, the harmonicoscillator, in its classical and quantum descriptions. Beyond providing a con-crete illustration of the formalism developed so far, this example has the ad-vantage of being completely solvable and illustrating the main similarities anddifferences between classical and quantum physics.

10.1.1 The classical harmonic oscillator

The system is described by a mass m attached to a spring of elastic constantk. The motion is assumed frictionless on the horizontal direction and the massoriginally equilibrates at point 0. The spring is originally elongated to positionq0 and the system evolves then freely under the equations of motion. The set-ting is described in figure 10.1. The system was already studied in chapter 2.

99

Page 109: 2_pdfsam_iq.pdf

10.1. The harmonic oscillator

0 q0

Figure 10.1: The experimental setting of the one-dimensional harmonic oscil-lator.

The equation of motion, giving the elongation q(t ) as a function of time t , is

mq(t ) = f (q(t )) =−kq(t )

q(0) = q0

q(0) = v0 = 0.

Introducing the new variable p = mq and transforming the second order differ-ential equation into a system of first order equations, we get the vector equation

d t(t ) = Aω(t ), (∗)

where

ω(t ) =(

q(t )p(t )

), with initial condition ω(0) =

(q0

p0

)and

A =(

0 1m

−k 0

).

The solution to equation (∗) is given by a flow on the phase space Ω=R2 givenby

ω(t ) = T tω(0),

where

T t = exp(t A) =(

cos(µt ) sin(µt )mµ

−kµ

sin(µt ) cos(µt )

),

and µ = pk/m. Since detT t = 1, it follows that the evolution is invertible and

(T t )−1 = T −t . The orbit of the initial condition ω(0) =(

q0

0

)under the flow reads

(T tω)t∈R, where ω(t ) = T tω=(

q0 cos(µt )−q0

sin(µt )

).

/Users/dp/a/ens/mq/iq-ilexa.tex 100 lud on 17 February 2013

Page 110: 2_pdfsam_iq.pdf

The system is classical, hence its logicΛ is a Boolean σ-algebra; the naturalchoice is Λ = B(R2). Now observables in O (Λ) are mappings x : B(R) → Λ ≡B(R2). Identify henceforth indicator functions with Borel sets in B(R2) (i.e.for any Borel set B ∈ B(R), instead of considering x(B) = F ∈ B(R2) we shallidentify x(B) = 1 F .)

Let now X : Ω → R be any measurable bounded mapping and chose asx(B) = 1 X −1(B) for all B ∈ B(R). Then, on defining X = ∫

λx(dλ), a bijectionis established between x and X . Now since (T tω)t∈R = (exp(t A)ω)t∈R is the or-bit of the initial conditionω0 inΩ, the value X (T tω) is well defined for all t ∈R;we denote by X t (ω) ≡ X (T tω). Then

d X t

d t(ω) = ∂1X (T tω)

d(T tω)1

d t+∂2X (T tω)

d(T tω)2

d t

= ∂1X (T tω)d q

d t(t )+∂2X (T tω)

d p

d t(t ),

provides the evolution of X under the flow (T t )t .

The Hamiltonian is a very particular measurable bounded map on the phasespace (hence an observable) H : Ω→ R, having the formula H(ω) = kω2

1/2+ω2

2/2m. It evolves also under the flow (T t )t : Then

d Ht

d t(ω) = kq(t )q(t )+ p(t )

mp(t )

= kq(t )q(t )+ q(t )(−kq(t ))

= 0.

Thus, the Hamiltonian is a constant of motion. Physically it represents the en-

ergy of the system. Initially, H(q0, p0) = kq20

2 = E and during the flow, the en-ergy always remains E , so that the energy takes arbitrary (but constant withrespect to the flow) values E ∈ R+. Moreover, ∂1H(T tω) = kq(t ) = −p(t ) and∂2H(T tω) = p(t )

m = q(t ). Hence we recover the Hamilton equations

d q

d t(t ) = ∂H

∂p= ∂2H

d p

d t(t ) = −∂H

∂q=−∂1H .

Therefore, d X td t = ∂1X∂2H + ∂2X (−∂1H) = LH X with LH = −(∂1H∂2 − ∂2H∂1).

Hence, denoting for every two function f , g ∈C 1(Ω) by f , g = ∂1 f ∂2g−∂2 f ∂1gthe Poisson’s bracket, we have for the flow of an observable, assuming integra-bility of the evolution equation, X t = exp(tLH )X . This means that the flow

/Users/dp/a/ens/mq/iq-ilexa.tex 101 lud on 17 February 2013

Page 111: 2_pdfsam_iq.pdf

10.1. The harmonic oscillator

(T tω)t ) on Ω induces a flow (exp(tLH )X )t on observables. Notice also thatX t = exp(tLH )X is a shorthand notation for

X t =∞∑

n=0

(−t )n

n!H , H , . . . H , X . . ..

Theorem 10.1.1 (Liouville’s theorem). Let µ be the Lebesgue measure on Ω, i.e.µ(dω1dω2) = dω1dω2. Then

1. the measure µ is invariant under T t , i.e. µ(T t B) = µ(B) for all B ∈ B(R2)and all t ∈R,

2. the operator LH is formally skew-adjoint on L2(Ω,F ,µ).

Proof:

1. µ(T t B) = ∫T t B dω1dω2. Now, if ω ∈ T t B ⇒ T −tω ∈ B . Hence, denoting

(x1, x2) = T −t (ω1,ω2), we have∫T t B

dω1dω2 =∫

B

∂(ω1,ω2)

∂(x1, x2)d x1d x2

=∫

Bd x1d x2 =µ(B),

because the Jacobian verifies

∂(ω1,ω2)

∂(x1, x2)= detexp(t A) = 1.

2. LH is not bounded on L2(Ω,F ,µ). It can be defined on dense subset ofL2(Ω,F ,µ), for instance the Schwartz space S(R2). For f , g ∈ S(R2), wehave

⟨ f |LH g ⟩ =∫

f (ω)LH g (ω)µ(dω)

= −∫

LH f (ω)g (ω)µ(dω)+ bdry terms.

Now the boundary terms vanish because f and g vanish at infinity. Hence,on S(R2), the operator is skew-adjoint L∗

H =−LH and hence formally skew-adjoint on L2(Ω,F ,µ).

/Users/dp/a/ens/mq/iq-ilexa.tex 102 lud on 17 February 2013

Page 112: 2_pdfsam_iq.pdf

ä

Notice that, as a consequence of the previous theorem, exp(tLH ) is formallyunitary on L2(Ω,F ,µ).

Any probability measure p onΛ is a state. We have for all B ∈B(R), ρx(B) =p(x(B)) = p(X −1(B)) whileρxt = p(xt (B)) = p(X −1

t (T −t B)) = p(x(T −t B)). Hencethe flow T t onΩ induces a convex automorphism α(p)(x(B)) = p(x((T −t B)) onstates.

10.1.2 Quantum harmonic oscillator

Standard quantum logicΛ coincides with the family of subspaces of an infinite-dimensional Hilbert spaceH. Since all separable Hilbert spaces are isomorphic,we can chose any of them. The Schrödinger’s choice for the one-dimensionalharmonic oscillator is H = L2(R). States are probability measures p :Λ→ [0,1]and thanks to Gleason’s theorem, we can limit ourselves to tracial states, i.e.

Λ 3 M 7→ p(M) = tr(PM D) = pD (M),

for some D ∈ D(H). Symmetries are implemented by unitary operators on H

(automorphisms on Λ.) Let U ∈ U(H). Then α : M 7→ α(M) = U M induces aprojection PU M =U∗PMU . Subsequently, the automorphism α induces a con-vex automorphism on S (Λ), given by

α(p)(M) = pD (α(M))

= tr(PU M D)

= tr(U∗PMU D)

= tr(PM D (U )),

with D (U ) =U DU∗. Physics remains invariant under time translations. Hencetime translation (evolution) must be a symmetry implemented by a unitaryoperator U (t ) acting on H. Define U (t ) = exp(−i t H/ħ) (this a definition ofH .) Then H is formally self-adjoint, hence an observable (a very particularone!) generating the Lie group of time translations. It will be shown belowthat H is time invariant. Now U (t ) acts on rays of H to give a flow. Denotingψ(t ) =U (t )ψ, we have the Schrödinger’s evolution equation in the Schrödinger’spicture:

iħdψ

d t(t ) = Hψ(t ).

/Users/dp/a/ens/mq/iq-ilexa.tex 103 lud on 17 February 2013

Page 113: 2_pdfsam_iq.pdf

10.1. The harmonic oscillator

Thanks to the spectral theorem (and, identifying for x ∈O (Λ) and B ∈B(R),x(B) with the projection-valued measure corresponding to the subspace x(B)),there is a bijection between x ∈ O (Λ) and self-adjoint operators on H throughX = ∫

λx(dλ). For every tracial states pD , we have EpD (X ) = ∫λtr(x(dλ)D) and

Eα(pD )(X ) =∫λtr(x(dλ)DU (t ))

=∫λtr(U∗(t )x(dλ)U (t )D)

= EpD (X t ),

where we defined X t =U∗(t )XU (t ). Hence the flow U (t )ψ on H induces a flowon observables satisfying

d X t

d t= i

ħ [H , X ] = LH X

with LH (·) = iħ [H , ·]. Notice incidentally that d Ht /d t = 0 proving the claim that

H is a constant of motion. Moreover, H has dimensions M · L2/T 2 (energy),therefore H is interpreted as the quantum Hamiltonian. If the flow is integrable,we have

X t = exp(tLH )X

=∞∑

n=0

(i t )n

ħnn![H , [H , . . . , [H , X ] . . .]].

Physics remains invariant also by space translations. Hence they must cor-respond to a symmetry implemented by a unitary transformation.

Lemma 10.1.2. The operator ∇x is formally skew-adjoint on L2(R).

Proof: For all f , g ∈ S(R) (dense in L2(R)), we have, ⟨ f |∇x g ⟩ = ∫f (x) d

d x g (x)d x =−∫ d

d x f (x)g (x)d x + f g |∞−∞. ä

Consequently, the operator exp(x ·∇x) is formally unitary and since exp(x ·∇x)ψ(y) =ψ(y+x), ∇x is the generator of space translations. If we write p = ħ

i ∇x

then p is formally self-adjoint, has dimensions L · M · (L/T 2) · (1/L) = M ·L/T(momentum), and exp(i x ·p/ħ) is unitary and implements space translations.

Define Hosc = p2/2m +kq2 as the formally self-adjoint operator on L2(R),with p = ħ

i ∇x and qψ(x) = xψ(x), the multiplication operator. Introduce µ =pk/m, Q =√

mµ/ħq , P = (1/√

mµħ)p, and H = (1/ħµ)Hosc. Then H = (1/2)(P 2+Q2) where P =−i∇ and Q is the multiplication operator; these two latter opera-tors are formally self-adjoint and verify the commutation relation [P,Q] =−i 1 .

/Users/dp/a/ens/mq/iq-ilexa.tex 104 lud on 17 February 2013

Page 114: 2_pdfsam_iq.pdf

Definition 10.1.3. (Creation and annihilation operators) Define the creationoperator A∗ = 1p

2(P + iQ) and the annihilation operator A = 1p

2(P − iQ).

Exercise 10.1.4. For the creation and annihilation operators, show

1. [A, A∗] = 1 ,

2. H = A∗A+1 /2,

3. [H , A] =−A,

4. [H , A∗] = A∗,

5. for n ∈N, [H , (A∗)n] = n(A∗)n .

Lemma 10.1.5. If ψ0 ∈ S(R) is a ray (in the L2 sense) satisfying Aψ0 = 0 then

1. ψ0(x) =π−1/4 exp(−x2/2),

2. Hψ0 =ψ0/2, and

3. H(A∗)nψ0 = (1/2+n)A∗nψ0, for all n ∈N.

Proof:

Aψ0 = 0 ⇒ 1p2

(P − iQ)ψ0

⇒ −id

d xψ0(x)− i xψ0(x) = 0

⇒ ψ0(x) = c exp(−x2/2),

and by normalisation, c =π−1/4. äLemma 10.1.6. Denote, for n ∈N, ψn = 1p

n!A∗nψ0. Then

1. (ψn)n∈N is an orthonormal sequence,

2. A∗ψn =pn +1ψn+1, for n ≥ 0,

3. Aψn =pnψn−1, for n ≥ 1, and

4. A∗Aψn = nψn , for n ≥ 0.

/Users/dp/a/ens/mq/iq-ilexa.tex 105 lud on 17 February 2013

Page 115: 2_pdfsam_iq.pdf

10.1. The harmonic oscillator

Proof: All the assertions can be shown by similar arguments. It is enough toshow the arguments leading to orthonormality:

⟨ψ0 | An A∗nψ0 ⟩ = ⟨ψ0 | An−1 A A∗A∗n−1ψ0 ⟩= ⟨ψ0 | An−1(1 + A∗A)A∗n−1ψ0 ⟩...

= n⟨ψ0 | An−1 A∗n−1ψ0 ⟩...

= n!⟨ψ0 |ψ0 ⟩.ä

Theorem 10.1.7. The sequence (ψn)n∈N is a complete orthonormal sequence inH.

The proof is based on an analogous result for Hermite polynomials that canbe shown using the two following lemmata.

Lemma 10.1.8. Let cn, j = n!(n−2 j )!2 j j !

, for n ∈N, and j ∈N such that 0 ≤ j ≤ n/2.

Then

cn, j = (1− 2 j

n +1)cn+1, j = 2( j +1)

(n +1)(n −2 j ))cn+1, j+1

and if

ηn(x) =[n/2]∑j=0

(−1) j cn, j xn−2 j ,

then

(x − d

d x)ηn(x) = ηn+1(x)

while xn =∑[n/2]j=0 cn, jηn−2 j (x).

Proof: Substitute and make induction. äLemma 10.1.9. (A∗nψ0)(x) = ηn(

p2x)ψ0(x).

Proof: True for n = 0. Conclude by induction. äCorollary 10.1.10. spec(H) = 1/2+N.

Therefore the energy is quantised in quantum mechanics i.e. it can take onlydiscrete values. It is this surprising phenomenon that gave its adjective quan-tum to the term quantum mechanics.

/Users/dp/a/ens/mq/iq-ilexa.tex 106 lud on 17 February 2013

Page 116: 2_pdfsam_iq.pdf

Exercise 10.1.11. Using Dirac’s notation |n ⟩ ≡ψn , for n ∈N,

1. H |n ⟩ = (1/2+n)|n ⟩,

2. A∗|n ⟩ =pn +1|n +1⟩,

3. A|n ⟩ =pn|n −1⟩, and

4. A∗A|n ⟩ = n|n ⟩.

10.1.3 Comparison of classical and quantum harmonic oscil-lators

Figure 10.2: Comparison of probability densities. In blue is depicted the prob-ability density of the classical oscillator. In red the corresponding density forthe quantum oscillator for n = 10 (left) and n = 60 (right).

/Users/dp/a/ens/mq/iq-ilexa.tex 107 lud on 17 February 2013

Page 117: 2_pdfsam_iq.pdf

10.2. Schrödinger’s equation in the general case, rigged Hilbert spaces

Figure 10.3: Comparison of distribution functions. In blue is depicted the dis-tribution of the classical oscillator. In red the corresponding distribution for thequantum oscillator for n = 1 (left), n = 10 (middle), and n = 30 (right). Alreadyfor n = 30, the classical and quantum distributions are almost indistinguish-able.

10.2 Schrödinger’s equation in the general case, riggedHilbert spaces

10.3 Potential barriers, tunnel effect

10.4 The hydrogen atom

/Users/dp/a/ens/mq/iq-ilexa.tex 108 lud on 17 February 2013

Page 118: 2_pdfsam_iq.pdf

11Quantifying information: classical

and quantum

11.1 Classical information, entropy, and irreversibil-ity

The information content of a message is a probabilistic notion. The less prob-able a message is, the more information it carries. Let X be a random variabledefined on (Ω,F ,P) taking values in the finite setX= x1, . . . , xn Let PVn = p ∈Rn+ :

∑ni=1 pi = 1. To each element p ∈ PVn corresponds a probability measure

PpX defined by Pp

X (xi ) = pi , for i = 1, . . . ,n. Ask about the information contentcarried by the random variable X is the same thing as trying to quantify the pre-dictive power of the law PX . The main idea is that the information content ofX is equal to the average information missing in order to decide the outcomevalue of X when the only thing we know is its lawPX . Some reasonable require-ments on the information content of X are given below:

• Suppose that all pi , i = 1, . . . ,n but one are 0 and p j = 1, for some j . ThenP

pX (x j ) = 1 and no information is missing, there is no uncertainty about

the possible outcome of X ;

109

Page 119: 2_pdfsam_iq.pdf

11.1. Classical information, entropy, and irreversibility

• Suppose on the contrary that pi = 1/n, i = 1, . . . ,n. Our perplexity is max-imal and this perplexity increases with n.

• If S is to be interpreted as a missing information associated with a prob-ability vector p ∈PVn , on denoting PV=∪n∈NPVn , the function S : PV→R+ and the first statement implies that S(1,0,0, . . . ,0) = 0 while S(1/n, . . . ,1/n)is an increasing function of n.

• The function S must be invariant under permutations of its argumentsi.e. S(pσ(1), . . . , pσ(n)) = S(p1, . . . , pn) for all the permutations σ ∈ Sn .

• If we split the possible outcome values into two sets, the function S mustverify the grouping property, i.e.

S(p1, . . . , pn ; pn+1, . . . , pN ) = S(qA, qB )

+qAS(p1

qA, . . . ,

pn

qA)

+qB S(pn+1

qB, . . . ,

pN

qB),

where qA = p1 + . . .+pn and qB = pn+1 + . . .+pN .

• Finally, we require S(p1, . . . , pn ;0, . . . ,0) = S(p1, . . . , pn).

Theorem 11.1.1. The only function S : PV → R+ satisfying the above require-ments is the function defined by

PV 3 (p1, . . . , pn) 7→ S(p1, . . . , pn) =−kn∑

i=1pi log pi ,

where k is an arbitrary non-negative constant and the convention 0log0 = 0 isused. The function S is called the (classical) entropy of the probability vector.

Proof: (To be filled in a later version.) ä

Entropy is closely related to irreversibility since the second principle of ther-modynamics states: Entropy of an isolated system is a non decreasing functionof time. It can remain constant only for reversible evolutions. For a system Aundergoing an irreversible transformation the entropy increases; however thesystem can be considered as part of a larger isolated composite system (A andenvironment), undergoing globally a reversible transformation. In that casethe total entropy (of the system A and of the environment) remains constantbut since the entropy of A must increase, the entropy of the environment mustdecrease1 hence the missing information decreases. In other words, when the

1Notice that this assertion is not in contradiction with the second principle of thermody-namics because the environment is not isolated.

/Users/dp/a/ens/mq/iq-qinfo.tex 110 lud on 12 January 2013

Page 120: 2_pdfsam_iq.pdf

system A undergoes an irreversible transformation, the environment gains in-formation.

This leads to the Landauer’s principle: When a computer erases a single bitof information, the environment gains at least k ln2 units of information, wherek > 0 is a constant.

/Users/dp/a/ens/mq/iq-qinfo.tex 111 lud on 12 January 2013

Page 121: 2_pdfsam_iq.pdf

11.1. Classical information, entropy, and irreversibility

/Users/dp/a/ens/mq/iq-qinfo.tex 112 lud on 12 January 2013

Page 122: 2_pdfsam_iq.pdf

12Turing machines, algorithms,

computing, and complexity classes

All computers, from Babbage’s never constructed project of analytical machine(1833) to the latest model of supercomputer, are based on the same principles.A universal computer uses some input (a sequence of bits) and a programme(a sequence of instructions) to produce an output (another sequence of bits.)Universal computers are modelled by Turing machines. Never forget howeverthat an abstract Turing machine never computed anything. We had to wait untilthe first ENIAC was physically constructed to obtain the first output of numbers.

12.1 Deterministic Turing machines

There are several variants of deterministic Turing machines; all of them areequivalent in the sense that a problem solvable by one variant is also solvableby any other variant within essentially the same amount of time (see below,definition ?? and section 12.3.) A Turing machine is a model of computation; itis to be thought as a finite state machine disposing of an infinite scratch space

113

Page 123: 2_pdfsam_iq.pdf

12.1. Deterministic Turing machines

(an external tape1.) The tape consists of a semi-infinite or infinite sequence ofsquares, each of which can hold a single symbol. A tape-head can read a symbolfrom the tape, write a symbol on the tape, and move one square in either direc-tion (for semi-infinite tape, the head cannot cross the origin.) More precisely, aTuring machine is defined as follows.

Definition 12.1.1. A deterministic Turing machine is a quadruple (A,S,u, s0)where

1. A is a finite set, the alphabet, containing a particular symbol called theblank symbol and denoted by ]; the alphabet deprived from its blank sym-bol, denoted Ab = A \ ], is assumed non-empty,

2. S is a finite non-empty set, the states of the machine, partitioned into theset Si of intermediate states and the set S f of final states,

3. D = L,R ≡ −1,1 is the displacement set,

4. u : A×S → A×S ×D is the transition function, and

5. s0 ∈ Si the initial state of the machine.

The set of deterministic Turing machines is denoted by DTM.

The machine is presented an input, i.e. a finite sequence of contiguous non-blank symbols, and either it stops by producing an output, i.e. another finitesequence of symbols, else the programme does never halt.

Example 12.1.2. (A very simple Turing machine) Let M ∈DTM with A = 0,1,],S = Si ∪S f where Si = go, S f = halt, and transition function u(a, s) = (a′, s′,d)defined by the following table:

a s a′ s′ d0 go 0 go L1 go 1 go L] go ] halt R

1Mind that during Turing’s times no computer was physically available. The external tapewas invented by Alan Turing — who was fascinated by typewritters — as an external storagedevice.

/Users/dp/a/ens/mq/iq-turin.tex 114 lud on 17 February 2013

Page 124: 2_pdfsam_iq.pdf

If the programme, described by this Turing machine, starts with the head overany non-blank symbol of the input string, it ends with the head over the left-most non-blank symbol while the string of symbols remains unchanged.

Other equivalent variants of the deterministic Turing machine may havedisplacement sets with a 0 (do not move) displacement, have their alphabet Apartitioned into external and internal alphabet, etc. The distinction into inter-nal and external alphabet is particularly useful in the case of semi-infinite tape,an internal character ∗, identified as “first symbol”, can be used to prevent thehead from going outside the tape. It is enough to define U (∗,go) = (∗,go,R).

Notation 12.1.3. If W is a finite set, we denote by W ∗ = ∪n∈Z+W n and W ∞ =∂W = W Z+ . Notice that Z+ = 0,1,2, . . . 6= N = 1,2, . . . and that W 0 = ;. El-ements of W ∗ are called words of finite length over the alphabet W . For everyw ∈ W ∗, there exists n ∈ Z+ such that w ∈ W n ; we denote then by |W | = n thelength of the word w .

For every α ∈ A∗b , we denote by α ∈ A∞ the completion of the word α by

blanks, namely α= (α1, . . . ,α|α|,],],], . . .).

Considering the example 12.1.2, we can, without loss of generality, alwaysassume that the machine starts at the first symbol of the input stringα=α ∈ A∗

b .Starting from (α, s0,h0 = 1), successive applications of the transition functionU induce a dynamical system on X = A∗×S ×Z. A configuration is an instan-taneous description of the word written on the tape, the internal state of themachine, and the position of the head, i.e. an element of X.

Let τα = infn ≥ 1 : sn ∈ S f . The programme starting from initial configura-tion (α, s0,h0 = 1) stops running if τα <∞, it never halts when τα =∞. While1 ≤ n < τα, the sequence (α(n), sn ,hn)n≤τα is defined by updates of single char-acters; if, for 0 ≤ n < τα, we have u(α(n)

hn, sn) = (a′, s′,d), then (α(n+1), sn+1,hn+1),

is defined by

sn+1 = s′

hn+1 = hn +d

α(n+1) = (α(n)1 , . . . ,α(n)

hn−1, a′,α(n)hn+1, . . . ,α(n)

|α(n)|).

If the machine halts at some finite instant, the output is obtained by readingthe tape from left to right until the first blank character. The sequence of words(α(n))n is called a computational path or computational history starting fromα.

/Users/dp/a/ens/mq/iq-turin.tex 115 lud on 17 February 2013

Page 125: 2_pdfsam_iq.pdf

12.2. Computable functions and decidable predicates

12.2 Computable functions and decidable predicates

Every M ∈DTM computes a particular partial functionφM : A∗b → A∗

b . Since thevalue of φM (α) remains undetermined when the programme M does not halt,the function φM is termed partial because in general Dom(φM ) ⊂ A∗

b .

Definition 12.2.1. A partial function f : A∗b → A∗

b is called computable if thereexists a M ∈ DTM such that φM = f . In that case, f is said to be computed bythe programme M .

Exercise 12.2.2. Show that there exist non-computable functions.

Definition 12.2.3. A predicate, P , is a function taking Boolean values 0 or 1. Alanguage, L, over an alphabet A is a subset of A∗

b .

Thus, for predicates P with Dom(P ) = A∗b , the set α ∈ A∗

b : P (α) is a lan-guage. Hence predicates are in bijection with languages.

Definition 12.2.4. A predicate P : A∗b → 0,1 is decidable, if the function P is

computable.

Let P be a predicate and L the corresponding language. The predicate isdecidable if there exists a M ∈DTM such that for every wordα, the programmehalts after a finite number of steps and

• if α ∈ L, then the machine halts returning 1, and

• if α 6∈ L, then the machine halts returning 0.

Definition 12.2.5. Let M ∈DTM and sM , tM :Z+ →R+ be given functions. If foreveryα ∈ A∗

b , the machine stops after having visited at most sM (|α|) cells, we saythat it works in computational space sM . We say that it works in computationaltime tM if τα ≤ tM (|α|).

12.3 Complexity classes

Computability of a function does not mean effective computability since thecomputing algorithm can require too much time or space. We say that r :N→R+ is of polynomial growth if there exist constants c,C > 0 such that r (n) ≤C nc ,for large n. We write symbolically r (n) = poly(n).

/Users/dp/a/ens/mq/iq-turin.tex 116 lud on 17 February 2013

Page 126: 2_pdfsam_iq.pdf

Henceforth, we shall assume Ab ≡A= 0,1.

Definition 12.3.1. The complexity class P consists of all languages L whosepredicates P are decidable in polynomial time, i.e. for every L in the class, thereexists a machine M ∈ DTM such that φM = P and tM (|α|) = poly(|α|) for allα ∈A∗.

Similarly, we can define the class PSPACE of languages whose predicates aredecidable in polynomial space. functions computable in polynomial space.

Other complexity classes will be determined in the subsequent sections.Obviously P ⊆ PSPACE.

Conjecture 12.3.2. P 6= PSPACE.

12.4 Non-deterministic Turing machines and the NP

class

Definition 12.4.1. A non-deterministic Turing machine is a quadruplet (A,S,u, s0)where A, S and s0 are as in definition 12.1.1; u is now a multivalued function,i.e. there are r different branches ui , i = 1, . . . ,r and ui : A ×S → A ×S ×D . Forevery pair (a, s) ∈ A×S there are different possible outputs (a′

i ,σ′i ,di )i=1,...,r , the

choice of a particular branch can be done in a non-deterministic way at eachmoment. All such choices are legal actions. The set of non-deterministic Turingmachines is denoted by NTM.

A computational path for a M ∈NTM is determined by a choice of one legaltransition at every step. Different steps are possible for the same input. Noticethat NTM do not serve as models of practical devices but rather as logical toolsfor the formulation of problems rather than their solution.

Definition 12.4.2. A language L (or its predicate P ) belongs to the NP class ifthere exists a M ∈NTM such that

• if α ∈ L (i.e. P (α) = 1) for some α ∈A∗, then there exists a computationalpath with τα ≤ poly(|α|) returning 1,

• ifα 6∈ L (i.e. P (α) = 0) for someα ∈A∗, then there exists no computationalpath with this property.

/Users/dp/a/ens/mq/iq-turin.tex 117 lud on 17 February 2013

Page 127: 2_pdfsam_iq.pdf

12.5. Probabilistic Turing machine and the bpp class

It is elementary to show that P ⊆ NP. Clay Institute offers you2 USD 1 000 000if you solve the following

Exercise 12.4.3. Is it true that P = NP?

12.5 Probabilistic Turing machine and the BPP class

Definition 12.5.1. Let R be the set of real numbers computable by a determin-istic Turing machine within accuracy 2−n in poly(n) time. A probabilistic Turingmachine is a quintuple (A,S,u,p, s0) where A, S, u, and s0 are as in definition12.4.1 while p = (p1, . . . , pr ) ∈ R+, with

∑ri=1 pi = 1 is a probability vector on the

set of branches of u. All branches correspond to legal actions; at each step, thebranch i is chosen with probability pi , independently of previous choices. Theset of probabilistic Turing machine is denoted by PTM.

Each α ∈A∗ generates a family of computational paths. The local probabil-ity structure on the transition functions induces a natural probability structureon the computational path space. The evolution of the machine is a Markovprocess with the state space A∗

b ×S ×Z and stochastic evolution kernel deter-mined by the local probability vector p. Hence, any input gives a set of possibleoutputs each of them being assigned a probability of occurrence. A machine inPTM is also called a Monte Carlo algorithm.

Definition 12.5.2. Let ε ∈]0,1/2[. A predicate P (hence a language L) belongsto the BPP class if there exists a M ∈ BPP such that for any α ∈A∗, τα ≤ poly(|α|)and

• if α ∈ L, then P(P (α) = 1) ≥ 1−ε, and

• if α 6∈ L, then P(P (α) = 1) ≤ ε.

Exercise 12.5.3. Show that the definition of the class does not depend on thechoice of ε provided it lies in ]0,1/2[.

Church-Turing thesis . . .

2http://www.claymath.org/millennium/

/Users/dp/a/ens/mq/iq-turin.tex 118 lud on 17 February 2013

Page 128: 2_pdfsam_iq.pdf

12.6 Boolean circuits

Notation 12.6.1. For b ∈N and Zb = 0, . . . ,b−1, we denote by x = ⟨xn1 · · ·x0 ⟩b

the mapping defined by

Znb 3 (x0, . . . , xn) 7→ x = ⟨xn · · ·x0 ⟩b =

n−1∑k=0

xk bk ∈Zbn .

Since conversely for every x ∈Zbn the sequence (x0, . . . , xn) ∈Znb is uniquely

determined, we identify x with the sequence of its digits. For b = 2 we omit thebasis subscript and we write simply ⟨ ·⟩.Definition 12.6.2. Let f : An → Am be a Boolean function of n entries and moutputs. Let B be a fixed set of Boolean functions of different arities. We callBoolean circuit of f in terms of the basis B a representation of f in terms offunctions from B.

Example 12.6.3. (Addition with carry of 2 binary 2-digit numbers) Let x =⟨x1x0 ⟩ and y = ⟨ y1 y0 ⟩. We wish to express z = x + y = ⟨z2z1z0 ⟩ in terms ofBoolean functions in B = XOR, AND = ⊕,∧. The truth table is given in table12.1. We verify immediately that:

z0 = x0 ⊕ y0

z1 = (x0 ∧ y0)⊕ (x1 ⊕ y1)

z2 = (x1 ∧ y1)⊕ [(x1 ⊕ y1)∧ (x0 ∧ y0)]

Consequently, the Boolean circuit is depicted in figure ??.

A basis B is complete if any Boolean function f can be constructed as a cir-cuit with gates from B.

Example 12.6.4. NOT, OR, AND is a complete but redundant basis; NOT, OR,NOT, AND, and AND, XOR are complete minimal bases.

Definition 12.6.5. The minimal number of gates from B needed to compute f ,denoted by cB ( f ), is circuit complexity of f in B.

The function implementing the addition with carry of table 12.1 over thebasis B = AND, XOR, has circuit complexity 7.

Any DTM can be implemented by circuits.

/Users/dp/a/ens/mq/iq-turin.tex 119 lud on 17 February 2013

Page 129: 2_pdfsam_iq.pdf

12.6. Boolean circuits

x1 x0 y1 y0 z2 z1 z0

0 0 0 0 0 0 00 1 0 0 0 0 11 0 0 0 0 1 01 1 0 0 0 1 10 0 0 1 0 0 10 1 0 1 0 1 01 0 0 1 0 1 11 1 0 1 1 0 00 0 1 0 0 1 00 1 1 0 0 1 11 0 1 0 1 0 01 1 1 0 1 0 10 0 1 1 0 1 10 1 1 1 1 0 01 0 1 1 1 0 11 1 1 1 1 1 0

Table 12.1: The truth table of the Boolean functionA4 →A3 implementing theaddition with carry of two binary 2-digit numbers.

Classical computers are based on gates XOR, AND for example. It is easilyshown that these gates are irreversible. Therefore it is intuitively clear why clas-sical computers can produce information. What is much less intuitively clearis how quantum processes can produce information since they are reversible(unitary).

In 1973, BENNETT predicted that it is possible to construct reversible uni-versal gates. In 1982, FREDKIN exemplifies such a reversible gate. Fredkin’s gateis a 3 inputs - 3 outputs gate, whose truth tableau is given in table ??. This gateproduces both AND (since inputs 0, x, y return outputs x ∧ y, x ∧ y, x) and NOT

gates (since inputs 1,0, x return outputs x, x, x.) The gates AND and NOT form-ing a complete basis for Boolean circuits, the universality of Freidkin’s gate isestablished.

In 1980, BENIOFF describes how to use quantum mechanics to implementa Turing machine, in 1982, FEYNMAN proves that there does not exist a Turingmachine (either deterministic or probabilistic) on which quantum phenom-ena can be efficiently simulated; only a quantum Turing machine could do so.Finally, in 1985, DEUTSCH constructs (on paper) a universal quantum Turing

/Users/dp/a/ens/mq/iq-turin.tex 120 lud on 17 February 2013

Page 130: 2_pdfsam_iq.pdf

Input Outputa b c a′ b′ c ′

0 0 0 0 0 00 0 1 0 0 10 1 0 0 1 00 1 1 1 0 11 0 0 1 0 01 0 1 0 1 11 1 0 1 1 01 1 1 1 1 1

Table 12.2: The truth table of Fredkin’s gate. We remark that c ′ = c and if c = 0then (a′ = a and b′ = b) else (a′ = b and b′ = a.)

machine.

12.7 Composite quantum systems, tensor products,and entanglement

Example 12.7.1. Let n = 2 andH1 =H2 =C2. A basis ofH⊗2 is given by (|00⟩, |01⟩, |10⟩, |11⟩).An arbitrary vector ψ ∈H⊗2 is decomposed as

ψ=ψ0|00⟩+ψ1|01⟩+ψ2|10⟩+ψ3|11⟩.

Ifψ2 =ψ3 = 0, thenψ=ψ0|00⟩+ψ1|01⟩ = |0⟩⊗ (ψ0|0⟩+ψ1|1⟩) and the state isnot entangled. Ifψ1 =ψ2 = 0 whileψ0ψ3 6= 0 then the state is entangled since itcannot be written as a tensor product.

12.8 Quantum Turing machines

Definition 12.8.1. Let C be the set of complex numbers whose real and imag-inary part can be computed by a deterministic algorithm with precision 2−n

within poly(n) time. A pre-quantum Turing machine is a quadruple (A,S,c, s0),where A,S, s0 are as for a deterministic machine and c : (A×S)2×D → C , whereD is the displacement set.

/Users/dp/a/ens/mq/iq-turin.tex 121 lud on 17 February 2013

Page 131: 2_pdfsam_iq.pdf

12.8. Quantum Turing machines

Any configuration x of the machine is represented by a triple x = (α, s,h) ∈A∗×S ×Z = X. The quantum configuration space H is decomposed into HT ⊗HS ⊗HH , where the indices T,S, H stand respectively for tape, internal states,and head. The spaceH is spanned by the orthonormal system (|ψ⟩)ψ∈X = (|αsh ⟩)α∈A∗,s∈S,h∈Z.

Define now onservables having (|α⟩)α∈A∗ , (| s ⟩)s∈S , and (|h ⟩)h∈Z as respec-tive eigenvectors. To do so, identify the sets A with 0, . . . , |A| − 1 and S with0, . . . , |S|−1. Denoty by T , S, and H the self-adjoint operators describing theseobservables, i.e.

S =|S|−1∑s=0

s| s ⟩⟨ s |

H = ∑h∈Z

h|h ⟩⟨h |

T = ⊗i∈ZTi where Ti =|A|−1∑a=0

a|a ⟩⟨a |.

Due to the linearity of quantum flows, it is enough to describe the flow onthe basis vectors ψ = |α, s,h ⟩; a ∈ AZ, s ∈ S,h ∈ Z. The machine is prepared atsome initial pure state ψ = |α, s,h ⟩, with α a string of contiguous non blanksymbols and we assume that the time is discretised:

|ψn ⟩ =U n |ψ⟩.Suppose that the displacement set D reads −1,0,1. Then for ψ= |α, s,h ⟩ andψ′ = |α′, s′,h′ ⟩

Uψ,ψ′ = ⟨α′, s′,h′ |Uα, s,h ⟩= [δh′,h+1c(αh , s,α′

h , s′,1)

+δh′,hc(αh , s,α′h , s′,0)

+δh′,h−1c(αh , s,α′h , s′,−1)]

∏j∈Z\h

δα j ,α′j.

Definition 12.8.2. A pre-quantum Turing machine is called a quantum Turingmachine if the function c is such that the operator U is unitary.

Exercise 12.8.3. Find the necessary and sufficient conditions on the functionc so that U is unitary.

Wavelets, Cuntz-Krieger algebras, Bratteli-Jørgensen . . .

To halt the machine, we can not perform intermediate measurements ofthe composite state because quantum mechanical measurement perturbs the

/Users/dp/a/ens/mq/iq-turin.tex 122 lud on 17 February 2013

Page 132: 2_pdfsam_iq.pdf

system. To proceed, suppose that S f = halt ≡ 0 and introduce a halting flagoperator F = |0⟩⟨0 |. Once the state s is set to 0, the function c is such that Udoes not any longer change either the state s or the result of the computation.

A predicate is a projection operator Pα = |α⟩⟨α |. Let the machine evolvefor some time n: it is at the state |ψn ⟩ = U n |ψ⟩. Perform the measurement⟨ψn |Pα⊗ F ⊗ Iψn ⟩ = p ∈ [0,1].

Definition 12.8.4. A language L belongs to the BQP complexity class if there isa machine M ∈QTM such that

• if α ∈ L, then the machine accepts with probability p > 2/3,

• if α 6∈ L, then the machine rejects with probability p > 2/3,

within a running time poly(|α|).

Theorem 12.8.5. P ⊆ BPP ⊆ BQP ⊆ PSPACE.

/Users/dp/a/ens/mq/iq-turin.tex 123 lud on 17 February 2013

Page 133: 2_pdfsam_iq.pdf

12.8. Quantum Turing machines

/Users/dp/a/ens/mq/iq-turin.tex 124 lud on 17 February 2013

Page 134: 2_pdfsam_iq.pdf

13Cryptology

Cryptology, grouping cryptography and cryptanalysis, is an old preoccupationof mankind because information is, as a matter of fact, a valuable resource.Nowadays classical technology allows secure ciphering of information that can-not be deciphered in real time. However, the cryptologic protocols used nowa-days are all based on the unproven conjecture that factoring large integers is ahard computational task. Should this conjecture be proved false, and an ef-ficient polynomial factorisation algorithm be discovered, the security of ourcommunication networks could become vulnerable. But even without any tech-nological breakthrough, the ciphered messages we exchange over public chan-nels (internet, commutated telephone network, fax, SMS, etc.) can be deci-phered by spending 8–10 months of computing time; hence our informationexchange is already vulnerable for transporting information that remains im-portant 10 months after its transmission.

Quantum information acquired an unprecedented impetus when Peter Shor[30] proved that on a quantum computer, factoring is a polynomially hard prob-lem. On the other hand, quantum communication can use the existing tech-nology to securely cipher information. It is therefore economically and strate-gically important to master the issues of advanced cryptography and to inventnew cryptologic methods.

125

Page 135: 2_pdfsam_iq.pdf

13.1. An old idea: the Vernam’s code

13.1 An old idea: the Vernam’s code

In 1917, Gilbert VERNAM proposed [34] the following ciphering scheme.1 LetA be a finite alphabet, identified with the set 0, . . . , |A| − 1 and m a messageof length N over the alphabet A, i.e. a word m ∈ AN . The Vernam’s cipheringalgorithm uses a ciphering key of same length as m, i.e. a word k ∈ AN andperforms character-wise addition as explained in the following

Algorithm 13.1.1. VernamsCipheringRequire: Original message m ∈ AN and UNIFRANDOMGENERATOR(AN )Ensure: Ciphered message c ∈ AN

Choose randomly ciphering key k ∈ AN

i ← 1repeat

Add character-wise ci = mi +ki mod |A|i ← i +1

until i > N

The recipient of the ciphered message c, knowing the ciphering key k per-forms the following

Algorithm 13.1.2. VernamsDecipheringRequire: Ciphered message c ∈ AN and ciphering key k ∈ AN

Ensure: Original message m ∈ AN

i ← 1repeat

Subtract character-wise mi = ci −ki mod |A|i ← i +1

until i > N

As far as the ciphering key is used only once and the key word has the samelength as the message, the Vernam’s algorithm is proved [28] to be perfectlysecure. The main problem of the algorithm is how to securely communicatethe key k?

1Appeared as a first patent US Patent 1310719 issued on 22 July 1919, and further improvedin a series of patents: US Patent 1416765, US Patent 1584749, and US Patent 1613686.

/Users/dp/a/ens/mq/iq-crypt.tex 126 lud on 17 February 2013

Page 136: 2_pdfsam_iq.pdf

13.2 The classical cryptologic scheme RSA

Theorem 13.2.1. (Fermat’s little theorem) Let p be a prime. Then

1. any integer a satisfies ap = a mod p,

2. any integer a, not divisible by p, satisfies ap−1 = 1 mod p.

Definition 13.2.2. The Euler’s function φ :N→N is defined by

φ(n) = card0 < a < n : gcd(a,n) = 1,n ∈N.

In particular, if p is prime, then φ(p) = p −1.

Theorem 13.2.3. (Euler’s) If gcd(a,m) = 1, then aφ(m) = 1 mod m.

Proposition 13.2.4. Let m be an integer, strictly bigger than 1, without squarefactors, and r a multiple of φ(m). Then

• ar = 1 mod m, for all integers a relatively prime with respect to m, and

• ar+1 = a mod m for all integers.

The proofs of all the previous results are straightforward but outside thescope of the present course; they can be found in pages 50–60 of [?].

The RSA protocols, named after its inventors Rivest, Shamir, and Adleman[24], involves two legal parties: Alice and Bob, and an eavesdropper, Eve. Bobproduces by the classical key distribution algorithm a private key d and a publickey π. Alice uses the public key of Bob to cipher the message and Bob uses hisprivate key to decipher it. Eve, even if she intercepts the ciphered message,cannot decipher it in real time.

Algorithm 13.2.5. ClassicalKeyDistributionRequire: Two primes p and qEnsure: Public, π, and private, d, keys of Bob

n ← pq (hence φ(n) = (p −1)(q −1))Choose any e < n, such that gcd(e,φ(n)) = 1d ← e−1 mod φ(n)π← (e,n)

Bob publishes his public key π on his internet page. Alice uses π to cipherthe message m using the following

/Users/dp/a/ens/mq/iq-crypt.tex 127 lud on 17 February 2013

Page 137: 2_pdfsam_iq.pdf

13.2. The classical cryptologic scheme RSA

Algorithm 13.2.6. CipheringRequire: Public key π= (e,n) and message m ∈N, with m < nEnsure: Ciphered text c ∈N

c ← me mod n

Alice transmits the ciphered text c through a vulnerable public channel toBob. He uses his private key to decipher by using the following

Algorithm 13.2.7. DecipheringRequire: Private key d and ciphered message c ∈NEnsure: Deciphered text µ ∈Nµ← cd mod n

Theorem 13.2.8. µ= m

Proof:

cd = med mod n

ed = 1+kφ(n), for some k ∈Nmed = m1+kφ(n),

and since n = pq has no square factors, by using proposition 13.2.4, we getm1+kφ(n) mod n = m mod n. ä

If Eve intercepts the message, to compute d she must know φ(n), hencethe factoring of n into primes. Security of the protocol is based on the conjec-ture that it is algorithmically hard to factor n. If we denote by N = logn, thenit is worth noticing that when the RSA protocol has been introduced, the bestknown algorithm of factor n run in exp(N ) time. The best 2 known algorithmnowadays [18] runs in exp(N 1/3(log N )2/3) time. This algorithmic improvement,combined with the increasing in the computational capabilities of computers,allows the factoring of a 1000 digits number in ca. 8 months instead of a timeexceeding the age of the universe at the moment the algorithm has been pro-posed. Until May 2007, the RSA company ran an international contest offeringseveral hundreds thousand dollars to whoever could factor multi-digit num-bers they provided on line. When the contest stopped the company gave theofficial reasons explained in RSA factoring challenge.

2See also [20] for an updated state of the art.

/Users/dp/a/ens/mq/iq-crypt.tex 128 lud on 17 February 2013

Page 138: 2_pdfsam_iq.pdf

13.3 Quantum key distribution

13.3.1 The BB84 protocol

Theorem 13.3.1. (No cloning theorem) Let |φ⟩ and |ψ⟩ be two rays in H suchthat ⟨φ |ψ⟩ 6= 0 and |φ⟩ 6= exp(iθ)|ψ⟩. Then there does not exist any quantumdevice allowing duplication of φ and ψ.

Proof: Suppose that such a device exists. Then, for some n ≥ 1, there exists aunitary U :H⊗(n+1) →H⊗(n+1) and some ancillary ray |α1 · · ·αn ⟩ ∈H⊗n such thatwe get

|φφβ1 · · ·βn−1 ⟩ = U |φα1 · · ·αn ⟩|ψψγ1 · · ·γn−1 ⟩ = U |ψα1 · · ·αn ⟩.

Then

⟨ψ |φ⟩ = ⟨ψα1 · · ·αn |U∗U |φα1 · · ·αn ⟩

= ⟨ψ |φ⟩2n−1∏i=1

⟨γi |βi ⟩.

Since ⟨φ |ψ⟩ 6= 0 we get ⟨ψ |φ⟩∏n−1i=1 ⟨γi |βi ⟩ = 1 and since |φ⟩ 6= exp(iθ)|psi ⟩,

it follows that 0 < |⟨ψ |φ⟩| < 1. Subsequently,∏n−1

i=1 |⟨γi |βi ⟩| > 1 but this is im-possible since for every i , |⟨γi |βi ⟩| ≤ 1. ä

This theorem is at the basis of the BB84 quantum key distribution proto-col [8]. Alice and Bob communicate through a quantum and a classical publicchannels; they agree publicly to use two different orthonormal bases of H=C2

(describing the photon polarisation):

B+ = ε+0 = |0⟩,ε+1 = |1⟩B× = ε×0 = |0⟩− |1⟩p

2,ε×1 = |0⟩+ |1⟩p

2.

The first element of each basis is associated with the bit 0, the second with thebit 1. Moreover Alice and Bob agree on some integer n = (4+δ)N with someδ > 0, where N is the length of the message they wish to exchange securely;it will be also the length of their key. Alice needs also to know the function

/Users/dp/a/ens/mq/iq-crypt.tex 129 lud on 17 February 2013

Page 139: 2_pdfsam_iq.pdf

13.3. Quantum key distribution

T : 0,12 →H defined by

T (x, y) =

ε+0 if (x, y) = (0,0)ε+1 if (x, y) = (0,1)ε×0 if (x, y) = (1,0)ε×1 if (x, y) = (1,1).

Algorithm 13.3.2. AlicesKeyGenerationRequire: UNIFRANDOMGENERATOR(0,1), T , nEnsure: Two strings of n random bits a,b ∈ 0,1n and a sequence of n qubits

(|ψi ⟩)i=1,...,n

Generate randomly a1, . . . , an

a ← (a1, . . . , an) ∈ 0,1n

Generate randomly b1, . . . ,bn

b ← (b1, . . . ,bn) ∈ 0,1n

i ← 1repeat|ψi ⟩← T (ai ,bi )Transmit |ψi ⟩ to Bob via public quantum channeli ← i +1

until i > n

On reception of the i th qubit, Bob performs a measurement of the projec-tion operator P ] = |ε]1 ⟩⟨ε]1 |, where ] ∈ +,×.

/Users/dp/a/ens/mq/iq-crypt.tex 130 lud on 17 February 2013

Page 140: 2_pdfsam_iq.pdf

Algorithm 13.3.3. BobsKeyGenerationRequire: UNIFRANDOMGENERATOR(0,1), n, sequence |ψi ⟩ for i = 1, . . . ,n, P ]

for ] ∈ +,×Ensure: Two strings of n bits ′,b′ ∈ 0,1n

Generate randomly b′1, . . . ,b′

nb′ ← (b′

1, . . . ,b′n) ∈ 0,1n

i ← 1repeat

if b′i = 0 then

ask whether P+ takes value 1else

ask whether P× takes value 1end ifif Counter triggered then

a′i ← 1

elsea′

i ← 0end ifi ← i +1

until i > na′ ← (a′

1, . . . , a′n) ∈ 0,1n

Transmit string b′ ∈ 0,1n to Alice via public classical channel

When Alice receives the string b, she performs the conciliation algorithmdescribed below.

Algorithm 13.3.4. ConciliationRequire: Strings b,b′ ∈ 0,1Ensure: Sequence (k1, . . . ,kL) with some L ≤ n of positions of coinciding bits

c ← b⊕b′

i ← 1k ← 1repeat

k ← min j : k ≤ j ≤ n such that c j = 0if k ≤ n then

ki ← ki ← i +1

end ifuntil k > nL ← i −1transmit (k1, . . . ,kL) to Bob via public classical channel

/Users/dp/a/ens/mq/iq-crypt.tex 131 lud on 17 February 2013

Page 141: 2_pdfsam_iq.pdf

13.3. Quantum key distribution

Theorem 13.3.5. If there is no eavesdropping on the quantum channel then

P((a′k1

, . . . , a′kL

) = (ak1 , . . . , akL )|a,b) = 1.

Proof: Compute ⟨ψi |P+ψi ⟩ and ⟨ψi |P×ψi ⟩ for all different possible choices ofψi ∈ B+∪B×. We observe that for those i ’s such that b′

i = bi we have P(a′i =

ai ) = 1. Hence on deciding to consider only the substrings of a and a′ definedon the locations where b and b′ coincide, we have the certainty of sharing thesame substrings, although a and a′ have never been exchanged. ä

Lemma 13.3.6. If there is no eavesdropping, for N large enough, L is of the order2N .

Proof: Elementary use of the law of large numbers. ä

If Eve is eavesdropping, since she cannot copy quantum states (no-cloningtheorem), she can measure with the same procedure as Bob and in order forthe leakage not to be apparent, she re-emits a sequence of qubits |ψi ⟩ to Bob.Now again L is of the order 2N but since Eve’s choice of the b’s is independent ofthe choices of Alice and Bob, the string a′ computed by Bob will coincide withAlice’s string a at only L/2 ' N positions.

Hence to securely communicate, Alice and Bob have to go through the eaves-dropping detection procedure and reconciliation.

Bob randomly chooses half of the bits of the substring (a′k1

, . . . , a′kL

), i.e. (a′r1

, . . . , a′rL/2

)with ri ∈ k1, . . . ,kL and ri 6= r j for i 6= j , and sends the randomly chosen po-sitions (r1, . . . ,rL/2) and the corresponding bit values (a′

r1, . . . , a′

rL/2) to Alice. If

(a′k1

, . . . , a′kL

) = (ak1 , . . . , akL ) (reconciliation) then Alice announces this fact to

Bob and they use the complementary substring of (a′k1

, . . . , a′kL

) (of length L/2 'N ) as their key to cipher with Vernam’s algorithm. Else, they restart BB84 pro-tocol.

Notice that Alice and Bob never exchanged the ultimate substring of N bitsthey use as key.

/Users/dp/a/ens/mq/iq-crypt.tex 132 lud on 17 February 2013

Page 142: 2_pdfsam_iq.pdf

13.3.2 Simple eavesdropping strategies, disturbance and infor-mational gain

13.3.3 Other cryptologic protocols

/Users/dp/a/ens/mq/iq-crypt.tex 133 lud on 17 February 2013

Page 143: 2_pdfsam_iq.pdf

13.3. Quantum key distribution

/Users/dp/a/ens/mq/iq-crypt.tex 134 lud on 17 February 2013

Page 144: 2_pdfsam_iq.pdf

14Elements of quantum computing

In this chapter, B denotes the set 0,1 and elements b ∈ B are called bits; Hwill denote C2 and rays |ψ⟩ ∈H are called qubits. Similarly, arrays of n bits aredenoted by b = (b1, . . . ,bn) ∈ B n ; arrays of n qubits by |ψ⟩ = |ψ1 · · ·ψn ⟩ ∈H⊗n .

14.1 Classical and quantum gates and circuits

A classical circuit implements a Boolean mapping f : B n → B n by using ele-mentary gates of small arities1, chosen from a family G ; A quantum circuit im-plements a unitary mapping U :H⊗n →H⊗n by using unitary elementary gatesof small arities2, chosen from a family G .

Definition 14.1.1. Let U : H⊗n → H⊗n for some n and G be a fixed family ofunitary operators of different arities. A quantum circuit over G is a product ofoperators from G acting on appropriate qubit entries.

It is usually assumed that G is closed under inversion.

1usually acting on O (1) bits.2usually acting on O (1) qubits.

135

Page 145: 2_pdfsam_iq.pdf

14.2. Approximate realisation

Definition 14.1.2. Let V : H⊗n → H⊗n be a unitary operator. This operator issaid to be realised by a unitary operator W : H⊗N → H⊗N , with N ≥ n entries,acting on n qubits and N −n ancillary qubits, if for all |ξ⟩ ∈H⊗n ,

W (|ξ⟩⊗ |0N−n ⟩) = (V |ξ⟩)⊗|0N−n ⟩.

Ancillary qubits correspond to some memory in a fixed initial state we bor-row for intermediate computations that is returned into the same state. Return-ing ancillary qubits into the same state can be relaxed. What cannot be relaxedis that ancilla must not be entangled with the n qubits (it must remain in tensorform); otherwise the anicllary subsystem could not be forgotten.

Quantum circuits are supposed to be more general than classical circuits.However, arbitrary Boolean circuits cannot be considered as classical counter-parts of quantum ones because the classical analogue of a unitary operator onH⊗n is an invertible map on B n , i.e. a permutationπ ∈ S2n . Since to any n-bit ar-ray ξ= (ξ1 · · ·ξn) ∈ B n corresponds a basis vector |ξ⟩ = |ξ1 · · ·ξn ⟩ ∈H⊗n , to everypermutation π ∈ S2n naturally corresponds a unitary operator π, defined by

π|ξ⟩ = |π(ξ)⟩,with π∗ = π−1 = π−1. Hence we can define:

Definition 14.1.3. Let G ⊆ S2n . A reversible circuit over G is a sequence of per-mutations from G .

An arbitrary Boolean function F : B m → B n can be extended to a functionF⊕ : B m+n → B m+n , defined by

F⊕(x, y) = (x, y ⊕F (x)),

where the symbol ⊕ in the right hand side stands for the bit-wise addition mod-ulo 2. It is easily checked that F⊕ is a permutation. Moreover F⊕(x,0) = (x,F (x)).

Notice that 2-bit permutation gates do not suffice the realise all functionsof the form F⊕. On the contrary G = NOT,Λ with Λ : B 3 → B 3 the Toffoli gate,defined byΛ(x, y, z) = (x, y, z ⊕ (x ∧ y)), is a basis.

14.2 Approximate realisation

There are uncountably many unitary operators U : H⊗n → H⊗n . Hence if aquantum computer is to be constructed, the notion of exact realisation of a

/Users/dp/a/ens/mq/iq-qcomp.tex 136 lud on 16th March 2004

Page 146: 2_pdfsam_iq.pdf

unitary operator must be weakened to an approximate realisation. The samerationale prevails also in classical computing, instead of all real functions (un-countably many), only Boolean functions are implemented.

Lemma 14.2.1. An arbitrary unitary operator U : Cm → Cm can be representedas a product V =∏m(m−1)/2

i=1 V (i ) of matrices of the form

1. . .

1 (a bc d

)1

. . .1

,

(a bc d

)∈ U(2).

Moreover, the sequence of matrices appearing in the product can be explicitlyconstructed in a running time O (m3)poly(log(1/δ)) where δ= ‖U −V ‖.

Proof: An exercise if one recalls that for all c1,c2 ∈C, there exists a unitary oper-ator W ∈ U(2) such that

W

(c1

c2

)=

(√|c1|2 +|c2|20

).

ä

Basic properties of the operator norm are recalled below:

‖X Y ‖ ≤ ‖X ‖‖Y ‖‖X ‖ = ‖X ‖‖U‖ = 1

‖X ⊗Y ‖ = ‖X ‖‖Y ‖,

where X and Y are arbitrary operators and U is a unitary.

Definition 14.2.2. A unitary operator U ′ approximates a unitary operator Uwithin δ if ‖U −U ′‖ ≤ δ.

Lemma 14.2.3. If a unitary U ′ approximates a unitary U within δ, then U ′−1

approximates U−1 within δ.

/Users/dp/a/ens/mq/iq-qcomp.tex 137 lud on 16th March 2004

Page 147: 2_pdfsam_iq.pdf

14.2. Approximate realisation

Proof: Since U ′−1(U ′−U )U−1 =U−1 −U ′−1, it follows that ‖U−1 −U ′−1‖ ≤ ‖U ′−U‖ ≤ δ. äLemma 14.2.4. If unitary operators (U ′

k )k=1,...,L approximate unitary operators(Uk )k=1,...,L within δk , then U ′ = U ′

L · · ·U ′1 approximates U = UL · · ·U1 within∑L

k=1δk .

Proof: ‖U ′2U ′

1 −U2U1‖ ≤ ‖U ′2(U ′

1 −U1)+ (U ′2 −U2)U1‖ ≤ δ1 +δ2. ä

Definition 14.2.5. A unitary operator U :H⊗n →H⊗n is approximated by a uni-tary operator U :H⊗N →H⊗N , with N ≥ n, within δ if for all |ξ⟩ ∈H⊗n

‖U ′(|ξ⟩⊗ |0N−n ⟩)−U |ξ⟩⊗ |0N−n ⟩‖ ≤ δ‖ξ‖.

Definition 14.2.6. For every unitary operator U :H⊗n →H⊗n there exists a uni-tary operator C (U ) : H⊗H⊗n → H⊗H⊗n , called the controlled-U operator, de-fined for all |ξ⟩ ∈H⊗n by

C (U )|ε⟩⊗ |ξ⟩ = |ε⟩⊗ |ξ⟩ if ε= 0

|ε⟩⊗U |ξ⟩ if ε= 1

Similarly, multiply controlled-U C k (U ) :H⊗k ⊗H⊗n →H⊗n ⊗H⊗n , is defined by

C k (U )|ε1 · · ·εk ⟩⊗ |ξ⟩ = |ε1 · · ·εk ⟩⊗ |ξ⟩ if ε1 · · ·εk = 0

|ε1 · · ·εk ⟩⊗U |ξ⟩ if ε1 · · ·εk = 1

Example 14.2.7. Let σ1 =(0 11 0

)be the unitary operator corresponding to the

classical NOT gate. Then C 2(σ1) = Λ, whereΛ is the Toffoli gate.

Definition 14.2.8. The set

G = H ,K ,K −1,C (σ1),C 2(σ1),

with H = 1p2

(1 11 −1

)(Hadamard gate) and K =

(1 00 −i

)(phase gate), is called

the standard computational basis.

Theorem 14.2.9. Any unitary operator U : H⊗n → H⊗n can be approximatedwithin δ by a poly(log(1/δ))-size circuit over the standard basis using ancillaryqubits. There is a poly(n)-time algorithm describing the construction of the ap-proximating circuit.

Proof: An exercise, once you have solved the exercise 14.2.10 below. ä

/Users/dp/a/ens/mq/iq-qcomp.tex 138 lud on 16th March 2004

Page 148: 2_pdfsam_iq.pdf

Exercise 14.2.10. Let σ0,...,3 be the 3 Pauli matrices augmented by the identity

matrix, H the Hadamard gate, andΦ(φ) =(1 00 exp(2iφ)

).

1. Show that if A ∈M2(C) with A2 = 1 and φ ∈R, then

exp(iφA) = cosφσ0 + i sinφA.

2. Let R j (θ) = exp(−i θ2σ j ), for j = 1,2,3 and Rn(θ) = exp(−i θ2 n ·~σ), wheren = (N1,n2,n3) with n2

1 +n22 +n2

3 = 1 and ~σ = (σ1,σ2,σ3). Express R j (θ)and Rn(θ) on the basis σ0, . . . ,σ3.

3. Show that H = exp(iφ)R1(α)R3(β), for some φ,α,β to be determined.

4. If |ξ⟩ ∈ C2 is a ray represented by a vector of the Bloch sphere S2 = x ∈R3 : ‖x‖2 = 1, show that

Rn(θ)|ξ⟩ = |Tn(θ)x ⟩where Tn(θ)x is the rotation of x around n by an angle θ.

5. Show that every U ∈ U(2) can be written as

U = exp(iα)Rn(θ)

for some α,θ ∈R.

6. Show that every U ∈ U(2) can be written as

U = exp(iα)R3(β)R2(γ)R3(δ)

for some α,β,γ,δ ∈R.

7. Suppose that m and n are two not parallel vectors of S2. Show that everyU ∈ U(2) can be written as

U = exp(iα)Rn(β1)Rm(γ1)Rn(β2)Rm(γ2) · · · .

8. Establish identities

Hσ1H = σ3

Hσ2H = −σ2

Hσ3H = σ1

HΦ(π

8)H = exp(iα)R1(

π

4)

for some α.

/Users/dp/a/ens/mq/iq-qcomp.tex 139 lud on 16th March 2004

Page 149: 2_pdfsam_iq.pdf

14.3. Examples of quantum gates

14.3 Examples of quantum gates

14.3.1 The Hadamard gate

H = 1p2

(1 11 −1

).

H |ε⟩ = 1p2

((−1)ε|ε⟩+ |1−ε⟩),ε ∈ B.

H⊗3|000⟩ = 1p8

7∑x=0

|x ⟩.

14.3.2 The phase gate

Φ(φ) =(1 00 exp(2iφ)

).

Φ(φ)|ε⟩ = exp(2iεφ)|ε⟩

Φ(π

4+ φ

2)HΦ(θ)H |0⟩ = cosθ|0⟩+exp(iφ)sinθ|1⟩.

14.3.3 Controlled-NOT gate

C (σ1) =

1 0 0 00 1 0 00 0 0 10 0 1 0

.

For any x ∈ B , C (σ1)|x0⟩ = |xx ⟩, but for arbitrary |ψ⟩ =α|0⟩+β|1⟩,

C (σ1)|ψ0⟩ =α|00⟩+β|11⟩ 6= |ψψ⟩.

/Users/dp/a/ens/mq/iq-qcomp.tex 140 lud on 16th March 2004

Page 150: 2_pdfsam_iq.pdf

14.3.4 Controlled-phase gate

C (Φ(φ)) =

1 0 0 00 1 0 00 0 1 00 0 0 exp(2iφ)

.

For x, y ∈ B ,C (Φ(φ))|x y ⟩ = exp(2iφx y)|x y ⟩.

14.3.5 The quantum Toffoli gate

For all x, y, z ∈ B ,C 2(σ3)|x y z ⟩ = |x, y, (x ∧ y)⊕ z ⟩.

Suppose that f : B m → B n is a Boolean function, implemented by the uni-tary operator U f :H⊗(n+m) →H⊗(n+m). If |ψ⟩ = 1

2m/2

∑ε1,...,εm∈B |ε1, . . . ,εm ⟩ then

U f |ψ⟩⊗ |0n ⟩ = 1

2m/2

2m−1∑x=0

|x, f (x)⟩.

Hence computing simultaneously all values of f over its domain of definitionrequires the same computational effort as computing the value over a singletonof the domain.

/Users/dp/a/ens/mq/iq-qcomp.tex 141 lud on 16th March 2004

Page 151: 2_pdfsam_iq.pdf

14.3. Examples of quantum gates

/Users/dp/a/ens/mq/iq-qcomp.tex 142 lud on 16th March 2004

Page 152: 2_pdfsam_iq.pdf

15The Shor’s factoring algorithm

143

Page 153: 2_pdfsam_iq.pdf

Chapter 15

/Users/dp/a/ens/mq/iq-shora.tex 144 lud on 16th March 2004

Page 154: 2_pdfsam_iq.pdf

16Error correcting codes, classical and

quantum

145

Page 155: 2_pdfsam_iq.pdf

References

/Users/dp/a/ens/mq/iq-proca.tex 146 lud on 12 January 2013

Page 156: 2_pdfsam_iq.pdf

Bibliography

[1] N. I. Akhiezer and I. M. Glazman. Theory of linear operators in Hilbertspace. Dover Publications Inc., New York, 1993. Translated from the Rus-sian and with a preface by Merlynd Nestell, Reprint of the 1961 and 1963translations, Two volumes bound as one. 31

[2] William Arveson. A short course on spectral theory, volume 209 of GraduateTexts in Mathematics. Springer-Verlag, New York, 2002. 48, 52, 59

[3] Alain Aspect, Jean Dalibard, and Gérard Roger. Experimental test of5brell’s inequalities using time-varying analyzers. Phys. Rev. Lett., 49:1804–1807, Dec 1982. 16, 20

[4] Alain Aspect, Philippe Grangier, and Gérard Roger. Experimental tests ofrealistic local theories via Bell’s theorem. Phys. Rev. Lett., 47:460–463, Aug1981. 16

[5] Alain Aspect, Philippe Grangier, and Gérard Roger. Experimental realiza-tion of Einstein-Podolsky-Rosen-Bohm Gedankenexperiment: A new vio-lation of Bell’s inequalities. Phys. Rev. Lett., 49:91–94, Jul 1982. 16, 19

[6] William Aspray. John von Neumann and the origins of modern computing.MIT Press, Cambrdige, MA, 1990. 23

[7] John S. Bell. On the problem of hidden variables in quantum mechanics.Rev. Modern Phys., 38:447–452, 1966. 16

[8] C. H. Bennett and G. Brassard. Quantum public key distribution system.IBM Technical disclosure bulletin, 28:3153–3163, 1985. 129

[9] David Bohm. A suggested interpretation of the quantum theory in termsof "hidden" variables. I. Phys. Rev., 85:166–179, Jan 1952a. 16

[10] David Bohm. A suggested interpretation of the quantum theory in termsof "hidden" variables. II. Phys. Rev., 85:180–193, Jan 1952b. 16

147

Page 157: 2_pdfsam_iq.pdf

References

[11] Giulio Chiribella, Giacomo Mauro D’Ariano, and Paolo Perinotti. Theoret-ical framework for quantum networks. Phys. Rev. A (3), 80(2):022339, 20,2009. 23

[12] Giulio Chiribella, Giacomo Mauro D’Ariano, and Paolo Perinotti. Infor-mational derivation of quantum theory. Phys. Rev. A, 84:012311, Jul 2011.23

[13] A. Einstein, B. Podolsky, and N. Rosen. Can quantum-mechanical descrip-tion of physical reality be considered complete? Phys. Rev., 47:777–780,May 1935. 15

[14] Nicolas Gisin, Grégoire Ribordy, Wolfgang Tittel, and Hugo Zbinden.Quantum cryptography. Rev. Mod. Phys., 74:145–195, Mar 2002. 28

[15] Anthony J.G. Hey. Feynman and computation. Perseus Books Publishing,1998. 22

[16] John K. Hunter and Bruno Nachtergaele. Applied analysis. World ScientificPublishing Co. Inc., River Edge, NJ, 2001. 31, 33

[17] Richard V. Kadison and John R. Ringrose. Fundamentals of the theory ofoperator algebras. Vol. I, volume 15 of Graduate Studies in Mathematics.American Mathematical Society, Providence, RI, 1997a. Elementary the-ory, Reprint of the 1983 original. 36, 50

[18] A. K. Lenstra and H. W. Lenstra, Jr. Algorithms in number theory. In Hand-book of theoretical computer science, Vol. A, pages 673–715. Elsevier, Ams-terdam, 1990. 4, 128

[19] A. K. Lenstra and H. W. Lenstra, Jr., editors. The development of the numberfield sieve, volume 1554 of Lecture Notes in Mathematics. Springer-Verlag,Berlin, 1993. 6

[20] Arjen K. Lenstra. Integer factoring. Des. Codes Cryptogr., 19(2-3):101–128,2000. Towards a quarter-century of public key cryptography. 128

[21] H. Maassen. Quantum probability and quantum information theory. InQuantum information, computation and cryptography, volume 808 of Lec-ture Notes in Phys., pages 65–108. Springer, Berlin, 2010. 16, 20

[22] Dimitri Petritis. Markov chains on measurable spaces, 2012. Lecture notesof the University of Rennes 1. 21

/Users/dp/a/ens/mq/iq-proca.tex 148 lud on 12 January 2013

Page 158: 2_pdfsam_iq.pdf

BIBLIOGRAPHY

[23] Michael Reed and Barry Simon. Methods of modern mathematical physics.I. Functional analysis. Academic Press, New York, 1972. 31, 36

[24] R. L. Rivest, A. Shamir, and L. Adleman. A method for obtaining digital sig-natures and public-key cryptosystems. Comm. ACM, 21(2):120–126, 1978.6, 127

[25] Walter Rudin. Real and complex analysis. McGraw-Hill Book Co., NewYork, third edition, 1987. 31

[26] Walter Rudin. Functional analysis. International Series in Pure and Ap-plied Mathematics. McGraw-Hill Inc., New York, second edition, 1991. 48

[27] Raymond A. Ryan. Introduction to tensor products of Banach spaces.Springer Monographs in Mathematics. Springer-Verlag London Ltd., Lon-don, 2002. 36

[28] Claude Shannon. Communication theory of secrecy systems. Bell SystemTechnical Journal, 28:656–715, 1949. 126

[29] Albert Nikolaevich Shiryayev. Probability, volume 95 of Graduate Texts inMathematics. Springer-Verlag, New York, 1984. Translated from the Rus-sian by R. P. Boas. 11

[30] Peter W. Shor. Polynomial-time algorithms for prime factorization anddiscrete logarithms on a quantum computer. SIAM J. Comput., 26(5):1484–1509, 1997. 4, 6, 125

[31] V. S. Varadarajan. Geometry of quantum theory. Vol. I. D. Van Nostrand Co.,Inc., Princeton, N.J.-Toronto, Ont.-London, 1968. The University Series inHigher Mathematics. 92

[32] V. S. Varadarajan. Geometry of quantum theory. Springer-Verlag, New York,second edition, 1985. 23

[33] V. S. Varadarajan. Geometry of quantum theory. Springer-Verlag, New York,second edition, 1985. 73

[34] Gilbert S. Vernam. Cipher printing telegraph systems for secret wire andradio telegraphic communications, volume 55. 1926. 126

[35] Johann von Neumann. Mathematische Grundlagen der Quanten-mechanik. Unveränderter Nachdruck der ersten Auflage von 1932. DieGrundlehren der mathematischen Wissenschaften, Band 38. Springer-Verlag, Berlin, 1968. 22

/Users/dp/a/ens/mq/iq-proca.tex 149 lud on 12 January 2013

Page 159: 2_pdfsam_iq.pdf

Index

[36] John von Neumann. Collected works. Vol. I: Logic, theory of sets and quan-tum mechanics. General editor: A. H. Taub. Pergamon Press, New York,1961a. 23

[37] John von Neumann. Collected works. Vol. II: Operators, ergodic theory andalmost periodic functions in a group. General editor: A. H. Taub. PergamonPress, New York, 1961b. 23

[38] John von Neumann. Collected works. Vol. III: Rings of operators. Generaleditor: A. H. Taub. Pergamon Press, New York, 1961c. 23

[39] John von Neumann. Collected works. Vol. IV: Continuous geometry andother topics. General editor: A. H. Taub. Pergamon Press, Oxford, 1962. 23

[40] John von Neumann. Collected works. Vol. V: Design of computers, theory ofautomata and numerical analysis. General editor: A. H. Taub. A PergamonPress Book. The Macmillan Co., New York, 1963a. 23

[41] John von Neumann. Collected works. Vol. VI: Theory of games, astro-physics, hydrodynamics and meteorology. General editor: A. H. Taub. APergamon Press Book. The Macmillan Co., New York, 1963b. 23

[42] Joachim Weidmann. Linear operators in Hilbert spaces, volume 68 of Grad-uate Texts in Mathematics. Springer-Verlag, New York, 1980. Translatedfrom the German by Joseph Szücs. 36

/Users/dp/a/ens/mq/iq-proca.tex 150 lud on 12 January 2013

Page 160: 2_pdfsam_iq.pdf

Index

algebra, 42Banach, 43Boolean, 72Boolean σ-, 72commutative, 42C∗, 44normal element of an, 43normed, 43self-adjoint element of an, 43unital, 42unitary element of an, 43

algebraicadjoint, 43isometry, 43tensor product, 24

algorithmMonte Carlo, 118

automorphismconvex, 84

Banach algebrageneral linear group of invertible

elements of, 56invertible element of, 56

Boolean circuit, 119

complement, 72completion, 32complexity class, 116

BPP, 118NP, 117P, 117PSPACE, 117

computable function, 116

computationalspace, 116time, 116

computational history, 115computational path, 115conditional expectation

classical, 34consistency, 10

density matrix, 91determinant, 52direct sum, 34

eigenvalue, 55entanglement, 25entropy, 110equation

Hamilton, 13Schroedinger, 103

Euler function, 127expected value, 79

function-valued measure, 61

gateFreidkin, 120

harmonic oscillatorclassical, 99quantum, 103

Hasse diagram, 71hatling flag, 123Hilbert

norm, 32

inequality

151

Page 161: 2_pdfsam_iq.pdf

Index

Cauchy-Schwarz-Buniakovski, 32involution, 42isometry, 47

partial, 48

joint probability distribution, 84

language, 116lattice, 70

atom (in a), 72atomic, 72automorphism, 72complemented, 72Dilworth, 72distributive, 71isomorphism, 72modular, 72orthocomplemented, 74orthomodular, 72latticeσ-complete, 72

lattice of propositions, 70logic, 74

classical, 75probability measure on, 78standard quantum, 75state on, 78

measurement, 7classical, 14quantum, 23

momentum operator, 67

observable, 7associated with a logic, 76bounded, 76classical, 14, 73constant, 76discrete, 76quantum, 23spectral value of, 76spectrum of, 76

strict value of an, 76operator

adjoint, 66annihilation, 105bounded, 33closable, 66closed, 66closure, 66creation, 105densely defined, 66density, 91diagonalisable, 59extension of an, 66graph of, 66Hermitean, 46invertible, 54, 66isometric, 47norm of —, 33normal, 49orhoprojection, 46positive, 46projection, 46restriction of an, 66self-adjoint, 46trace-class, 89unbounded, 66unitary, 47von Neumann, 91

orhogonalsubsets, 34vectors, 34

orthocomplementation, 74orthogonal

complement, 34orthoprojection, 35, 46othogonal

orthoprojections, 46

partial isometry, 48partially ordered set, 70phase space

/Users/dp/a/ens/mq/iq-proca.tex 152 lud on 12 January 2013

Page 162: 2_pdfsam_iq.pdf

INDEX

classical, 13quantum, 23

Poisson bracket, 101polynomial growth, 116poset, 70position operator, 66predicate, 116predicate:decidable, 116probability distribution, 79projection, 35, 46proposition

simultaneously verifiable, 82

quantum circuit, 135question, 78

associated with proposition, 78

random variable, 9law of, 9

ray, 89representation

faithful, 50of locally compact group, 85space, 50∗, 49unitarily equivalent, 50

resolventset, 54

scalar product, 32sequence

Cauchy, 32fundamental, 32

simultaneous observability, 82space

Banach, 32complete, 32Hilbert, 32metric, 32normed, 32pre-Hilbert, 32

spectral measure, 62

spectral radius, 58spectrum, 54, 57

continuous, 55point, 55residual, 55

∗-algebra, 43∗-homomorphism, 43state

classical, 13entagled, 25entangled, 25pure, 80superposition, 80tracial, 92

state function, 77stochastic process, 10symmetry, 92

tensor product, 24theorem

Gel’fand-Naïmark, 50Gleason, 92Kolmogorov’s existence, 11Liouville, 102of spectral decomposition, 64

time evolutionclassical, 14quantum, 23

trace, 89Turing machine

deterministic, 114non-deterministic, 117pre-quantum, 121probabilistic, 118quantum, 122

variance, 79

yes-no question, 14

/Users/dp/a/ens/mq/iq-proca.tex 153 lud on 12 January 2013