Quantum Field Theory: An Introduction Ryan Reece1 1ryan.reece@cern.ch, reece.scipp.ucsc.edu, Santa Cruz Institute for Particle Physics, University of California, 1156 High St., Santa Cruz, CA 95064, USA December 23, 2007 Abstract This document is a set of notes I took on QFT as a graduate student at the University of Pennsylvania, mainly inspired in lectures by Burt Ovrut, but also working through Peskin and Schroeder (1995), as well as David Tong's lecture notes available online. They take a slow pedagogical approach to introducing classical field theory, Noether's theorem, the principles of quantum mechanics, scattering theory, and culminating in the derivation of Feynman diagrams. Contents 1 Preliminaries 3 1.1 Overview of Special Relativity . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.1.1 Lorentz Boosts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.1.2 Length Contraction and Time Dilation . . . . . . . . . . . . . . . . . 3 1.1.3 Four-vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1.1.4 Momentum and Energy . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.2 Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 1.2.1 Natural Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 1.2.2 Barns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 1.2.3 Electromagnetism . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 1.3 Relativistic Kinematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 1.3.1 Lorentz Invariant Phase Space . . . . . . . . . . . . . . . . . . . . . 8 1.3.2 Mandelstam Variables . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Quantum Field Theory: An Introduction 2 Variation of Fields 9 2.1 The Field Worldview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.2 Variation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.3 The Principle of Least Action . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.4 Noether's Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 2.5 Spacetime Translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.6 Lorentz Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 2.7 Internal Symmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 3 The Free Real Scalar Field 19 3.1 The Classical Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 3.2 Principles of Quantum Mechanics . . . . . . . . . . . . . . . . . . . . . . . . 23 3.2.1 States . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 3.2.2 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 3.2.3 The Fundamental Postulate of Quantum Mech. . . . . . . . . . . . . 26 3.2.4 The Poincaré Algebra . . . . . . . . . . . . . . . . . . . . . . . . . . 29 3.2.5 Canonical Quantization . . . . . . . . . . . . . . . . . . . . . . . . . 29 3.3 The Quantum Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 3.3.1 Equal Time Commutation Relations . . . . . . . . . . . . . . . . . . 30 3.3.2 Creation and Annihilation Operators . . . . . . . . . . . . . . . . . . 36 3.3.3 Energy Eigenstates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 3.3.4 Normal Ordering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 3.3.5 Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 3.3.6 Internal Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 3.3.7 Spin-statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 3.3.8 The Feynman Propagator . . . . . . . . . . . . . . . . . . . . . . . . 47 4 The Interacting Real Scalar Field 51 4.1 Correlation Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 4.2 Wick's Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 4.3 Feynman Diagrams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 Acknowledgments 71 References 71 2 R. Reece 1 Preliminaries 1.1 Overview of Special Relativity 1.1.1 Lorentz Boosts Searches in the later part 19th century for the coordinate transformation that left the form of Maxwell's equations and the wave equation invariant lead to the discovery of the Lorentz Transformations. The "boost" transformation from one (unprimed) inertial frame to another (primed) inertial frame moving with dimensionless velocity ~β = ~v/c, respect to the former frame, is given by( c t′ x′ ) = ( γ −γβ −γβ γ )( c t x ) Because a boost along one of the spacial dimensions leaves the other two unchanged, we can suppress the those two spacial dimensions and let β = |~β |. γ is the Lorentz Factor, defined by γ ≡ 1√ 1− β2 (1) γ ranges from 1 to ∞ monotonicly in the nonrelativistic (β → 0) and relativistic (β → 1) limits, respectively. It is useful to remember that γ ≥ 1. Note that being the magnitude of a vector, β has a lower limit at 0. β also has an upper limit at 1 because γ diverges as β approaches 1 and becomes unphysically imaginary for values of β > 1. This immediately reveals that β = 1, or v = c, is Nature's natural speed limit. The inverse transformation is given by( c t x ) = ( γ γβ γβ γ )( c t′ x′ ) 1.1.2 Length Contraction and Time Dilation The differences between two points in spacetime follow from the transformations: c∆t′ = γ c∆t− γβ ∆x ∆x′ = −γβ c∆t+ γ ∆x c∆t = γ c∆t′ + γβ ∆x′ ∆x = γβ c∆t′ + γ ∆x′ Consider a clock sitting at rest in the unprimed frame (∆x = 0). The first of the four above equations and the fact that γ ≥ 1, imply that the time interval is dilated in the 3 Quantum Field Theory: An Introduction primed frame. ∆t′ = γ ∆t Now consider a rod of length ∆x in the unprimed frame. A measurement of the length in the primmed frame corresponds to determining the coordinates of the endpoints simultaneously in the unprimed frame (∆t′ = 0). Then the fourth equation implies that length is contracted in the primed frame. ∆x′ = ∆x γ We call time intervals and lengths "proper" if they are measured in the frame where the subject is at rest (in this case, the unprimed frame). In summary, proper times and lengths are the shortest and longest possible, respectively. 1.1.3 Four-vectors Knowing that lengths and times transform from one reference frame to another, one can wonder if there is anything that is invariant. Consider the following, using the last two of the four equations for the differences between two spacetime points. (c∆t)2 − (∆x)2 = ( γ c∆t′ + γ β ∆x′ )2 − (γβ c∆t′ + γ ∆x′)2 = γ2 [ (c∆t′)2 +(((( ((2β c∆t′∆x′ + β2 (∆x′)2 −β2 (c∆t′)2 −((((( ( 2β c∆t′∆x′ − (∆x′)2 ] = γ2 (1− β2)} {{ } γ−2 [ (c∆t′)2 − (∆x′)2 ] = (c∆t′)2 − (∆x′)2 ≡ (∆τ)2 Which shows that ∆τ has the same value in any frames related by Lorentz Transformations. ∆τ is called the "invariant length." Note that it is equal to the proper time interval. This motivates us to think of (t, ~x ) as a four-vector that transforms according to the Lorentz transformations, in a "spacetime vector space," and there should be some kind of "inner product," or contraction, of these vectors that leaves ∆τ a scalar. This can be done by defining the Minkowski metric tensor as follows. ημν ≡  1 0 0 0 0 −1 0 0 0 0 −1 0 0 0 0 −1  μν (2) Four-vectors are indexed by a Greek index, xμ = (c t, ~x )μ, μ ranging from 0 to 3 (x0 = c t, x1 = x, x2 = y, x3 = z). The contraction of a spacetime four-vector with itself, its square, 4 R. Reece is give by xμ xμ ≡ xμ ημν xν = (c t)2 − ~x * ~x = (∆τ)2 (3) giving the square of the invariant length between xμ and the origin. In equation (3), we have defined that the lowering of a four-vector index is done by multiplication by the metric tensor. Explicit matrix multiplication will show that ημν is the inverse Minkowski metric and has the same components as ημν . ημλ η λν = δνμ (4) Raising the indices of the metric confirms that the components of the metric and inverse metric are equal. ημν = ημλ ηνσ ηλσ = η μλ ηλσ (η T)σν Anything that transforms according to the Lorentz Transformations, like (c t, ~x ), is a four-vector. Another example of a four-vector is four-velocity, defined by uμ ≡ γ (c,~v )μ (5) One can show that the square of uμ is invariant as required. uμ uμ = γ 2 (c2 − v2) = 1 1− β2 (1− β2) c2 = c2 which is obviously invariant. Any equation where all of the factors are scalars (with no indices or contracting indices), or are four-vectors/tensors, with matching indices on the other side of the equal sign, is called "manifestly invariant." 1.1.4 Momentum and Energy The Classically conserved definitions of momentum and energy, being dependent on the coordinate frame, will not be conserved in other frames. We are motivated to consider the effect of defining momentum with the four-velocity instead of the classical velocity. The mass of a particle, m, being an intrinsic property of the particle, must be a Lorentz scalar. Therefore, the following definition of the four-momentum is manifestly a four-vector. pμ ≡ m uμ = γ m (c,~v )μ (6) The square of which is pμ pμ = m 2 uμ uμ = m 2 c2 (7) 5 Quantum Field Theory: An Introduction Now let's give some interpretation to the components of the four-momentum. To consider the nonrelativistic limit, let us expand γ in the β → 0 limit. γ ' 1 + 1 2 β2 + 3 8 β4 + * * * Then the leading order term of the space-like components of the four-momentum is just the Classical momentum. ~p = m~v + * * * We therefore, interpret the space-like components of the four-momentum as the relativistic momentum. ~p = γ m ~v (8) The expansion of the time-like term gives m c2 + 1 2 m v2 + * * * We can now recognize the second term as the Classical kinetic energy. The first term is evidently the "mass energy," energy present even when v = 0. Higher order terms give relativistic corrections. E = γ m c2 (9) We can therefore write the four-momentum in terms of the relativistic energy, E, and relativistic momentum, p. pμ = (E, ~p )μ (10) The four-momentum is the combination of momentum and energy necessary to transform according to Lorentz Transformations. Both E and ~p are conserved quantities in any given frame, but they are not invariant ; they transform when going to another frame. Scalar quantities, like mass, are invariant but are not necessarily conserved. Mass can be exchanged for kinetic energy and vice versa. Charge is an example of a scalar quantity that is also conserved. Looking at the square of the four-momentum with this energy-momentum interpretation of its components gives the very important relationship between energy, momentum, and mass. pμ pμ = E 2 − |~p |2 c2 = m2 c4 (11) Taking the ratio of equations (8) and (9) gives the following interesting relation. ~p E = ~v c2 (12) 6 R. Reece which leads to ~p c E = ~β (13) Note from equation (11), in the case that m = 0, we have that E = |~p | c, which implies that β = 1. Therefore, massless particles must travel at the speed of light. In which case (13) agrees that the following is the energy-momentum relation for massless particles. E = |~p | c (14) Also note the relationship to Einstein's equation for the energy of a photon, E = ~ω ⇒ |~p | = ~ ωc = ~ k, consistent with de Broglie's relation. 1.2 Units 1.2.1 Natural Units Factors of c were explicit in the above review of special relativity. From now on, we will use a form of natural units, where certain natural constants are set to one by using units derived from the God-given scales in Nature. ~ = c = ε0 = 1 (15) From ~ = 6.58 × 10−25 GeV * s = 1, it follows that if we choose to measure energy in units GeV, then time can be measured in units GeV−1. 1 GeV−1 = 6.58× 10−25 s (16) From c = 3× 108 m/s = 1, it follows that 1 = ( 3× 108m ) ( 6.58× 10−25 GeV ) = 1.97× 10−16 m *GeV (17) ⇒ 1 GeV−1 = 1.97× 10−16 m (18) Summarizing the dimensionality: time = length = 1 energy (19) 1.2.2 Barns When calculating cross sections, the conventional unit of area in particle physics is a barn. 1 barn ≡ (10 fm)2 = 10−24 cm2 (20) 7 Quantum Field Theory: An Introduction 1 mb ≡ 10−3 barns = 10−27 cm2 (21) From (18), it can be shown that 1 GeV−2 = 0.389 mb (22) 1.2.3 Electromagnetism Finally, from ε0 = 1 and c = 1 c = 1 √ μ0 ε0 ⇒ μ0 = 1 (23) giving Maxwell's equations the following form. Field Tensor: Fμν ≡ ∂μAν − ∂νAμ (24) Homogeneous: ∂μFνλ + ∂νFλμ + ∂λFμν = 0 (25) Inhomogeneous: ∂νF μν = Jμ (26) 1.3 Relativistic Kinematics 1.3.1 Lorentz Invariant Phase Space 1.3.2 Mandelstam Variables 8 R. Reece 2 Variation of Fields 2.1 The Field Worldview We will see that the dynamical variable in quantum field theory is the field itself. A field is a mathematical concept that has a number or a construction of numbers (a complex number, vector, spinor, tensor . . . ) defined at every point in spacetime. We assume that the fields are smoothly varying, such that derivatives are well defined. To simplify notation, in these first few sections, we will denote a general field by φa(x), where a indexes all of the components of φ, being components of vectors, spinors, etc. or some direct product of them. 2.2 Variation By studying the variation of fields we will discover the effect the Principle of Least Action has on dynamics, giving the Euler-Lagrange equation. Then we will discover the effects the symmetries of spacetime have on dynamics, giving Noether's Theorem. But first, we need to derive some properties of variation of functions in general. We define the following. δxμ ≡ x′μ − xμ δoφ(x) ≡ φ′(x)− φ(x) δφ(x) ≡ φ′(x′)− φ(x) δxμ is the difference in the values of coordinates referring to the same spacetime point but in different coordinate frames. δoφ(x) is the value of the field φ(x) subtracted from the value of a field with a slightly different functional form, φ′(x), evaluated at the same spacetime point, in the same coordinates. δφ(x) is the difference in the functions φ′(x′) and φ(x) evaluated in in different coordinate systems. Using the above definitions we can derive the following. δφ(x) = φ′(x+ δx)− φ(x) = φ′(x) + δxμ ∂μφ ′(x) + . . .− φ(x) ' δoφ(x) + δxμ ∂μφ′(x) = δoφ(x) + δx μ ∂μ ( φ(x) + δoφ(x) ) = δoφ(x) + δx μ ∂μφ(x) + . . . We have used that the variations are small and that field is smooth by Taylor expanding 9 Quantum Field Theory: An Introduction φ′(x). Therefore, to leading order in the variation, we have δφ(x) = δoφ(x) + δx μ ∂μφ(x) (27) Similarly for a function of several variables, we have δof(x, y) ≡ f ′(x, y)− f(x, y) δf(x, y) ≡ f ′(x′, y′)− f(x, y) δf(x, y) = f ′(x+ δx, y + δy)− f(x, y) = f ′(x, y) + δxμ ∂ ∂xμ f ′(x, y) + δyμ ∂ ∂yμ f ′(x, y) + . . .− f(x, y) ' δof(x, y) + δxμ ∂ ∂xμ f ′(x, y) + δyμ ∂ ∂yμ f ′(x, y) = δof(x, y) + δx μ ∂ ∂xμ ( f(x, y) + δof(x, y) ) + δyμ ∂ ∂yμ ( f(x, y) + δof(x, y) ) = δof(x, y) + δx μ ∂ ∂xμ f(x, y) + δyμ ∂ ∂yμ f(x, y) + . . . ∴ δf(x, y) = δof(x, y) + δx μ ∂ ∂xμ f(x, y) + δyμ ∂ ∂yμ f(x, y) (28) One can begin to see that variation, δ, follows similar operational rules as that of the differential operator, d. Indeed, we can derive a product rule for variation as follows. Let f(x, y) ≡ g(x) h(y) ⇒ δf(x, y) = δo ( g(x) h(y) ) + h(y) δxμ ∂ ∂xμ g(x) + g(x) δyμ ∂ ∂yμ h(y) δo ( g(x) h(y) ) = g′(x) h′(y)− g(x) h(y) = ( g(x) + δog(x) )( h(y) + δoh(y) ) − g(x) h(y) =  g(x) h(y) + ( δog(x) ) h(y) + g(x) δoh(y) +   : O[δ2]( δog(x) )( δoh(y) ) − g(x) h(y) = ( δog(x) ) h(y) + g(x) δoh(y) ⇒ δf(x, y) = h(y) ( δog(x) + δx μ ∂ ∂xμ g(x) ) + g(x) ( δoh(y) + δy μ ∂ ∂yμ h(y) ) ∴ δ ( g(x) h(y) ) = h(y) δg(x) + g(x) δh(y) (29) 10 R. Reece One can show that δo commutes with partial derivatives as follows. ∂μδoφ(x) = ∂μ ( φ′(x)− φ(x) ) = ∂μφ ′(x)− ∂μφ(x) = δo∂μφ(x) ∴ δo∂μφ(x) = ∂μδoφ(x) (30) 2.3 The Principle of Least Action The Classical Mechanics of a field can be described by introducing a Lagrangian density (often just called a Lagrangian), L, a function of the field and its first derivatives1. L(x) = L ( φa(x), ∂μφa(x) ) (31) The Principle of Least Action from Classical Mechanics states that the dynamics of a system obeying the physics described by some Lagrangian is such that the action functional is minimized. The action functional is defined by S[φa, ∂μφa] ≡ ∫ d4x L ( φa(x), ∂μφa(x) ) (32) Let us consider variation by perturbing the field, but leaving the coordinates alone. δoφa(x) = φ ′ a(x)− φa(x) Using the properties of variation that we have derived, we have δoS = ∫ d4x δoL = ∫ d4x [ ∂L ∂φa δoφa + ∂L ∂(∂μφa) δo(∂μφa)} {{ } ∂μ(δoφa) ] = ∫ d4x [ ∂L ∂φa δoφa − ∂μ ( ∂L ∂(∂μφa) ) δoφa + ∂μ ( ∂L ∂(∂μφa) δoφa )] = ∫ d4x [ ∂L ∂φa − ∂μ ( ∂L ∂(∂μφa) )] δoφa + ∮ dσμ ∂L ∂(∂μφa) δoφa} {{ } 0 Note that terms with repeated a have an implied sum over a. We assume that we know the boundary conditions of the field, and that we are trying to derive an equation of motion for 1It can be shown that L cannot depend on higher derivatives of the field if the theory is to remain causally consistent. 11 Quantum Field Theory: An Introduction the field within the boundary. Therefore, the surface integral above is zero because δoφa = 0 because we are not varying the field on the boundary. The action is minimized at a critical point. δoS = 0 Minimizing the action for arbitrary δoφa gives the Euler-Lagrange Equation. 0 = ∫ d4x [ ∂L ∂φa − ∂μ ( ∂L ∂(∂μφa) )] δoφa ⇒ ∂L ∂φa − ∂μ ( ∂L ∂(∂μφa) ) = 0 (33) Given a Lagrangian, the Euler-Lagrange Equation can be used to derive the equation of motion for the field. 2.4 Noether's Theorem Now let us allow for variations where the coordinates transform infinitesimally, called a diffeomorphism. δxμ = x′μ − xμ Then the Lagrangian varies like any other function of the coordinates. δL = δoL+ δxμ ∂μL = ∂L ∂φa δoφa + ∂L ∂(∂μφa) δo(∂μφa) + δx μ ∂μL = [ ∂L ∂φa − ∂μ ( ∂L ∂(∂μφa) )] } {{ } 0 δoφa + ∂μ ( ∂L ∂(∂μφa) δoφa ) + δxμ ∂μL The above term is zero because φa(x) that is the physical solution satisfies the EulerLagrange Equation. Therefore, the change in a Lagrangian under a general diffeomorphism is given by the following. δL = ∂μ ( ∂L ∂(∂μφa) δoφa ) + δxμ ∂μL (34) Let us consider only those diffeomorphisms that are symmetries of physics. That is, those transformations that leave the equations of motion invariant. This is guaranteed only if the diffeomorphism leaves the action invariant. Note that because a diffeomorphism is a 12 R. Reece change in coordinates, in general, the volume element, d4x, also varies. δS = ∫ ( δ ( d4x ) L+ d4x δL ) (35) The change in the volume element is given by the following. δ ( d4x ) = (∂μδx μ) d4x (36) Therefore δS = ∫ d4x [ (∂μδx μ)L+ ∂μ ( ∂L ∂(∂μφa) δoφa ) + δxμ ∂μL ] = ∫ d4x ∂μ [ L δxμ + ∂L ∂(∂μφa) δoφa ] using equation 27, δoφa = δφa − δxμ ∂μφa = ∫ d4x ∂μ [ L δxμ + ∂L ∂(∂μφa) (δφa − δxμ ∂μφa) ] = ∫ d4x ∂μ [ ∂L ∂(∂μφa) δφa + ( L ημν − ∂L ∂(∂μφa) ∂νφa ) δxν ] Let us define the energy-momentum tensor, whose convenience will become apparent, as Tμν ≡ ∂L ∂(∂μφa) ∂νφa − L ημν (37) Then δS = ∫ d4x ∂μ [ ∂L ∂(∂μφa) δφa − Tμν δxν ] Now define the Noether current as J μ ≡ ∂L ∂(∂μφa) δφa − Tμν δxν (38) Because physical symmetries leave the action invariant for any region of spacetime, we can derive a conservation law as follows. δS = ∫ d4x ∂μJ μ = 0 ⇒ ∂μJ μ = 0 (39) Therefore, for each diffeomorphism that is a symmetry of physics, there exist a conserved Noether current. 13 Quantum Field Theory: An Introduction Expanding the Einstein sum in the conservation law gives ∂μJ μ = ∂0J 0 − ~∇ * ~J = 0 We define the Noether charge as Q ≡ ∫ d3x J 0 (40) Consider the time derivative of the Noether charge. Using Stokes' Theorem, this can be converted into a surface integral. For large enough regions, on physical grounds, we expect the flux of the spacial Noether current to be zero across the boundary. dQ dt = ∫ d3x ∂0J 0 = ∫ d3x ~∇ * ~J = ∮ d~σ * ~J = 0 Therefore the Noether charges are conserved in time. dQ dt = 0 (41) They are a formal expression for the Classically conserved quantities in physics like energy and momentum. Noether's theorem links each of these conserved quantities to a physical symmetry. 2.5 Spacetime Translation Consider a diffeomorphism that does not mix components of the spacetime coordinates. All it does is shift each of the coordinates by a constant, cμ. x′μ = xμ + cμ (42) ⇒ δxμ = cμ (43) After the transformation, the value of the field is the same for the same spacetime point. δφa = 0 (44) Then, the definition of the Noether current, equation 38, gives J μ = − Tμν δxν = − Tμν cν (45) 14 R. Reece Because translating spacetime coordinates by an arbitrary constant is an observed symmetry in Nature, Noether's theorem says that the corresponding Noether currents are conserved. ∂μJ μ = 0 The arbitrariness of cν implies ∂μT μν = 0 (46) We define the total four-momentum of the field φa as the corresponding Noether charges. P ν ≡ ∫ d3x T 0ν (47) The four-momentum density, P ν , is defined by T 0ν , such that P ν ≡ ∫ d3x P ν (48) Using the definition of the energy-momentum tensor, equation 37, we have P ν = ∂L ∂(∂0φa) ∂νφa − L η0ν (49) We define the conjugate momentum of a field as follows. πa ≡ ∂L ∂φa (50) Then, in this notation, we have P ν = πa ∂νφa − L η0ν (51) The temporal component of the four-momentum density is the Hamiltonian density, denoted H ≡ P 0. The temporal component of the four-momentum is the Hamiltonian, representing the total energy of the field. H = P 0 = ∫ d3x ( πa φa − L ) = ∫ d3x H (52) The remaining spacial components are the three components of the total momentum of the field. ~P = − ∫ d3x πa ~∇φa (53) Energy and momentum conservation are a consequence of the translation symmetry of spacetime. 15 Quantum Field Theory: An Introduction 2.6 Lorentz Transformations Now consider a diffeomorphism that mixes components of the spacetime coordinates. We know that the Lorentz transformations do this, and that physics is invariant under such a transformation. The Lorentz transformations have a unitary representation, where Λμν are the components of a real antisymmetric matrix. x′μ = eΛ μ ν xν (54) Therefore, for infinitesimal Λμν , the transformation can be written as x′μ ' ( δμν + Λ μ ν ) xν (55) ⇒ δxμ = Λμν xν (56) There exists a set of spin matrices, ( Σαβ ) ab , labeled by four-vector indices, α and β, but the indices of a single matrix are a and b. The spin matrix relates the transformation of the spacetime four-vector coordinates to the transformation of the components of the field under a Lorentz transformation. φ′a(x ′) = ( δab + 1 2 ( Σαβ ) ab Λαβ ) φb(x) (57) Note that there is implied summation over all paired indices: b, α, and β. ⇒ δφa(x) = 1 2 ( Σαβ ) ab Λαβ φb(x) (58) Plugging in equations 56 and 58 into equation 38 for the Noether current gives J μ = 1 2 ∂L ∂(∂μφa) ( Σαβ ) ab Λαβ φb − Tμα Λαβ xβ Using the antisymmetry of Λαβ, we have J μ = 1 2 ∂L ∂(∂μφa) ( Σαβ ) ab Λαβ φb − 1 2 ( Tμα xβ − Tμβ xα ) Λαβ Now factoring out the arbitrary infinitesimal Λαβ and the factor of 1 2 gives J μ = 1 2 [ ∂L ∂(∂μφa) ( Σαβ ) ab φb − ( Tμα xβ − Tμβ xα )] Λαβ Let Mμαβ ≡ ∂L ∂(∂μφa) ( Σαβ ) ab φb − ( Tμα xβ − Tμβ xα ) (59) 16 R. Reece Then J μ = 1 2 Mμαβ Λαβ (60) Because the Lorentz transformations are an observed symmetry in Nature, Noether's theorem says that ∂μJ μ = 0. The arbitrariness of Λαβ means that there is a conserved current for each α and β. ∂μM μαβ = 0 (61) The corresponding Noether charges are then given by Mαβ ≡ ∫ d3xM0αβ (62) Plugging in equations 51, 50, and 59 gives Mαβ = ∫ d3x [ πa ( Σαβ ) ab φb + ( xα Pβ − xβ Pα )] (63) Like all Noether charges, each of these are conserved in time. The asymmetry of Λαβ implies that both Σαβ and Mαβ are asymmetric as well. Upon multiplication by 12 and a Levi-Civita symbol, the spacial terms with the fourmomentum density would give a cross product, ~x× ~P . This motivates us to interpret them as the orbital angular momentum. With this in mind, notice that the Σ term mixes the components of the field, a transformation in a space internal to the field as opposed to spacetime. This leads one to recognize this term as a component of angular momentum that is internal to the field, which will later be interpreted as spin of particles when the field is quantized. Mαβ = ∫ d3x [ πa ( Σαβ ) ab φb} {{ } spin + ( xα Pβ − xβ Pα ) } {{ } orbital angular momentum ] (64) The angular momentum is given by Jk = 1 2 εijkMij (65) Angular momentum conservation is a consequence of the rotation symmetry of spacetime, part of Lorentz invariance. The M0i charges, derived from the transformations mixing space and time components, correspond to Lorentz boosts. They too are conserved, but are not as physically apparent as angular momentum. 17 Quantum Field Theory: An Introduction 2.7 Internal Symmetries In addition to the symmetries of spacetime, some fields have symmetries relating their internal degrees of freedom. These types of transformations have the following unitary representation φ′a(x) = e i (Gr)ab θr φb(x) (66) where Gr are hermitian matrices, such that the transformation is unitary. (Note that summation is implied over r, a, and b.) Gr are called the "generators" of the transformation, and θr are the corresponding parameters. Note that this type of transformation has nothing to do with spacetime. Instead, it directly mixes the components of the field. If this type of transformation leaves the equations of motion invariant, and is therefore a symmetry, then physics is unaffected by whether phenomena are described by φa(x), or a field related to φa(x) by such a transformation. Fields related to one another by such a transformation are called different "gauges" of the same field, and this type of symmetry is known as "gauge invariance." For infinitesimal θr, the transformation can be expanded as φ′a(x) ' ( δab + i (Gr)ab θr ) φb(x) (67) ⇒ δφa(x) = i (Gr)ab θr φb(x) (68) Plugging this into equation 38 gives the Noether currents. J μr = i ∂L ∂(∂μφa) (Gr)ab θr φb(x) (69) If the Lagrangian is gauge invariant, then those Noether currents are conserved, and the corresponding Noether charges (derived using equations 40 and 50) are conserved in time. Qr = ∫ d3x i πa (Gr)ab θr φb (70) 18 R. Reece 3 The Free Real Scalar Field We will investigate the simplest example of a field, a real scalar field. That is, one that has a single component that is a real number and therefore transforms trivially under the Lorentz transformations. After studying the Classical properties of the real scalar field, we will summarize the principles of quantum mechanics, and finally quantize the real scalar field. We will use this simple example to explain the canonical quantization procedure that can then be applied to any other type of field. 3.1 The Classical Theory The Classical theory of a field has two inputs: the type of field, and the Lagrangian that describes its dynamics. The field we are now studying is the real scalar field that we will denote φ(x). The Lagrangian2 describing the free relativistic dynamics of this field is L = 1 2 ( (∂μφ) (∂ μφ)−m2 φ2 ) (71) where m is a real constant. The first step in studying the Classical theory of a field is to calculate its equation of motion and its Noether charges. The equation of motion, calculated by plugging the Lagrangian into the Euler-Lagrange equation 33, is ( ∂2 +m2 ) φ = 0 (72) This is called the Klein-Gordon equation. Equation 50 says that the conjugate momentum for the real scalar field is π(x) = φ(x) (73) The Hamiltonian and momentum are given by equations 52 and 53. H = ∫ d3x 1 2 ( π2 + (~∇φ)2 +m2 φ2 ) (74) ~P = − ∫ d3x π ~∇φ (75) This implies that the four-momentum density is given by P 0 = H = 1 2 ( π2 + (~∇φ)2 +m2 φ2 ) (76) 2The appropriateness of this Lagrangian will become apparent when we see that it insures that the field is relativistic by satisfying equation 11. 19 Quantum Field Theory: An Introduction and ~P = − π ~∇φ (77) which is used in equation 63 to calculate Mμν . Mμν = ∫ d3x (xμ P ν − xν Pμ) (78) Note that the spin term in Mμν is zero because the real scalar field does not change under the Lorentz transformation and therefore equation 57 implies that the spin matrix, Σ, is zero. We next turn our attention studying the solutions of the Klein-Gordon equation. Consider the plane-waves ei k*x of various k. Plugging this into the Klein-Gordon equation gives (∂2 +m2) ei k*x = ( (i k)2 +m2 ) ei k*x = (−k2 +m2) ei k*x = 0 . ⇒ k2 = m2 This shows that ei k*x is a solution if and only if we constrain one of the four degrees of freedom in the four-vector k such that k2 = ω2 − ~k 2 = m2. We do this by setting k0 = ωk ≡ √ ~k 2 +m2 . (79) As one knows from Fourier analysis, the functions { ei k*x } for various k form a complete basis for a complex function space. We can decompose any complex function φ(x) satisfying the Klein-Gordon equation by φ(x) = ∫ d3k (2π)3 b(k) ( a(k)e−i k*x + a∗(k)ei k*x ) . We are integrating over the three-momentum ~k instead of all four spacetime components because we constrained k by equation 79 to guaranty that φ(x) is a solution of the KleinGordon equation. The three-momentum integral is not manifestly invariant like an integral over d4k, so we have inserted the function b(k) which we will use to constrain the rest of the integrand to be Lorentz invariant. The a(k) are the coefficients of the Fourier expansion. We have added the complex conjugate to ensure that phi(x) is real. Note that three-momentum varies under Lorentz boosts, so we need to find b(k) such that d3k b(k) is the Lorentz invariant three-momentum integration measure. ∫ d4k L(k) is manifestly invariant if L(k) is a Lorentz invariant scalar. How can we make an integral over d3k Lorentz invariant? We can take a Lorentz invariant integral over d4k and constrain one of the degrees of freedom, while preserving the Lorentz invariance by forcing k2 to always 20 R. Reece remain m2 with a delta function.∫ d4k (2π)4 2π δ(k2 −m2) θ(k0) L(k) The factor of 2π is just a convention because we are going to integrate over one degree Figure 1: The hyperboloid over which Lorentz invariant three-momentum integration is performed. of freedom and we want to be left with a factor of 1/(2π) for each remaining momentum integral. Obviously it has no effect on the Lorentz invariance of the result. The step function limits the integration to the region where k0 ≥ 0 because only four-vectors with non-negative energy are reasonable. Visually, this is an integral over the hyperboloid where k2 = m2, or k0 = √ m2 + ~k 2 , shown in Figure 1. One can simplify this integral with the following identity for delta functions δ(g(x)) = ∑ i δ(x− xi) |g′(xi)| , (80) where {xi} are the zeros of g(x) and g′(x) denotes the derivative of g(x). Note that d dk0 (k2 −m2) = 2k0 = 2ωk , 21 Quantum Field Theory: An Introduction therefore ∫ d4k (2π)4 2π δ(k2 −m2) θ(k0) L(k) = ∫ d3k (2π)3 ∫ dk0 ( δ(k0 − ωk) 2 ωk + δ(k0 + ωk) 2 ωk ) θ(k0) L(k) = ∫ d3k (2π)3 1 2ωk L(k) . Therefore, d 3k (2π)3 2ωk is the Lorentz invariant three-momentum measure, also called the Lorentz invariant phase space element. Evidently, b(k) = 1/(2ωk) and our Fourier decomposition of φ(x) should be written as φ(x) = ∫ d3k (2π)3 2ωk ( a(k)e−i k*x + a∗(k)ei k*x ) . (81) We can project out the value a(k) for a specific k, but first we need to derive a couple completeness orthogonality relations for this basis of functions { ei k*x } . These relations use an inner product that involves the following operation, which takes the time derivative to the right first and then subtracts the time derivative acting to the left. a ↔ ∂0 b ≡ a ∂0b− (∂0a) b (82) The completeness relation is derived by the following.∫ d3x ei k ′*x i ↔ ∂0 e −i k*x = i ∫ d3x [ ei k ′*x ∂0e −i k*x − ( ∂0e i k′*x ) e−i k*x ] = (ωk + ωk′) ∫ d3x ei(k ′−k)*x = (ωk + ωk′) e i(ωk′−ωk)t} {{ } e0=1 ∫ d3x e−i( ~k′−~k)*~x} {{ } (2π)3 δ3(~k−~k′) = 2ωk (2π) 3 δ3 ( ~k − ~k′ ) . (83) 22 R. Reece Similarly, the orthogonality relation is derived by∫ d3x ei k ′*x i ↔ ∂0 e i k*x = i ∫ d3x [ ei k ′*x ∂0e i k*x − ( ∂0e i k′*x ) ei k*x ] = (ωk − ωk′) ∫ d3x ei(k ′+k)*x = :0(ωk − ωk′) ei(ωk′+ωk)t ∫ d3x e−i( ~k′+~k)*~x} {{ } (2π)3 δ3(~k+~k′) = 0 . (84) Using the completeness and orthogonality relations, we can invert this expansion to find an expression for the expansion coefficients a(k).∫ d3x ei k*x i ↔ ∂0 φ(x) = ∫ d3k (2π)3 [ a(k) ∫ d3x ei k*x i ↔ ∂0 e −i k*x} {{ } (2π)3 δ3(~k−~k) + a∗(k) ∫ d3x ei k*x i ↔ ∂0 e i k*x} {{ } 0 ] ∴ a(k) = i ∫ d3x ei k*x ↔ ∂0 φ(x) . (85) We can write this in terms of the conjugate momentum by letting the ↔ ∂0 act on φ. a(k) = i ∫ d3x [ ei k*x ∂0φ(x)− φ(x) ∂0ei *x ] = i ∫ d3x [ ei k*x π(x)− φ(x) (i ωk) ei k*x ] ∴ a(k) = ∫ d3x ei k*x ( ωk φ(x) + i π(x) ) (86) 3.2 Principles of Quantum Mechanics It is now appropriate to pause our study of the real scalar field and to outline the canonical quantization procedure for how one goes from studying a classical field to a quantum field in general. First we will summarize what quantum mechanics means. 3.2.1 States Quantum mechanics fundamentally concerns the study of the dynamics of states and operators in a Hilbert space. A Hilbert space is a complex vector space that often has an 23 Quantum Field Theory: An Introduction infinite number dimensions, where a vector, |α〉, denotes the state of a quantum system. In general, a vector can be expanded in terms of a complete orthonormal basis {|bi〉}. |α〉 = ∑ i ai |bi〉 In the case that we use a basis that is parametrized by a real number, x, instead of integers, the sum becomes an integral. |α〉 = ∫ dx a(x) |x〉 There exists a dual vector space that is the adjoint of the first. 〈α| ≡ |α〉† = ∑ i a∗i 〈bi| A Hilbert space has an binary operation between a vector and a dual vector called an inner product. The inner product of two vectors |α〉 and |β〉 denoted as 〈α|β〉. Two vectors are orthogonal if there inner product is zero. 〈α|β〉 = 0 A set of vectors, {|bi〉}, is an orthonormal basis if every vector is orthogonal with every other vector and the inner product of every vector with itself is one. 〈bi|bj〉 = δij The basis is complete if they span the space and therefore the following sum is unity.∑ i |bi〉 〈bi| = 1 Using an orthonormal basis, one can decompose any vector into a sum over the basis vectors, whose coefficients are given by the inner products of the basis vectors and the vector being decomposed. |α〉 = ∑ i |bi〉 〈bi|α〉} {{ } ai = ∑ i ai |bi〉 We can see why we choose the dual vectors be the adjoint of vectors because it conve24 R. Reece niently forces the inner product of a vector with itself, its norm, to be real and nonnegative. 〈α|α〉 = (∑ i a∗i 〈bi| )∑ j aj |bj〉  = ∑ i a∗i ai :1〈bi|bi〉 = ∑ i a∗i ai ≥ 0 This is because a∗i ai is real and nonnegative for any complex number ai. We normalize state vectors by constraining the above sum to be one. 〈α|α〉 = 1 3.2.2 Operators Like any vector space, one can define linear operators, denoted with hâts, to act on a vector and return another vector. Ô |ψ〉 = |ψ′〉 Linear means that Ô ( c1 |α〉+ c2 |β〉 ) = c1 Ô |α〉+ c2 Ô |β〉 where c1 and c2 are some complex numbers. Physical quantities like four-momentum, charge, etc. are represented by operators. When a state is an eigenstate of an operator, we interpret it as having a well defined value for that physical quantity. Bi |bi〉 = bi |bi〉 In the above equation, the state |bi〉 is interpreted as being a state with a well defined value bi for the physical quantity represented by the operator Bi. Operators representing physical quantities must always be hermitian such that they real eigenvalues. If one chooses operators that have eigenstates that form a complete orthonormal basis, one can decompose a general vector into a sum over eigenstates. |ψ〉 = ∑ i ai |bi〉 Such a state with nonzero values for more than one ai is said to be in a quantum mechanical superposition of the corresponding eigenstates. For example, the following state has no well 25 Quantum Field Theory: An Introduction defined value for the quantity represented by B. |ψ〉 = 1√ 2 ( |b1〉+ |b2〉 ) Instead, it has some probability to have the value b1 and some probability to have the value b2. The probabilities are determined by calculating the magnitude squared of the coefficients in the expansion. In this case, (1/ √ 2)2 = 0.5, so each value has a 50% probability. In general, an inner product can be thought of as selecting out the coefficient corresponding to a basis vector in the expansion of some vector. 〈bj |ψ〉 = 〈bj | ∑ i ai |bi〉 = ∑ i ai 〈bj |bi〉 = aj An inner product of a state vector with an eigenvector for some observable is known as the quantum mechanical amplitude for the state to have that value for that observable, and the probability for it is the magnitude squared of the amplitude, |〈bj |ψ〉|2. An amplitude, 〈β|ψ〉, is also referred to as the quantum mechanical overlap between the states |β〉 and |ψ〉. 3.2.3 The Fundamental Postulate of Quantum Mech. When one quantizes a classical field theory, the field φ(x), and its canonical momentum π(x) become the operators, φ(x) and π(x) in a Hilbert space. Classically, φ(x) and π(x), are determined by the initial conditions of the field (like initial position and momentum for point particle mechanics) and contain all the information of the system. Similarly, all operators in the Hilbert space can be written in terms of the operators φ(x) and π(x). For example, we will see that the Noether's charges Pμ and Mμν become the operators Pμ and Mμν , written in terms of φ(x) and π(x). This will be explained more concretely when we discuss the specific example of the quantum theory of the real scalar field. Since we are going to represent the state of a physical system by a vector in the Hilbert space, we need to understand how state vectors and operators change when the physical system changes. How do we represent transformations like spacetime translation and Lorentz transformations in the Hilbert space? For each type of transformation, there must be some operator, Û(θa), that acts on a state vector, |α〉, and returns a state vector, |α′〉, corresponding to the state transformed by some amount parametrized by the parameters θa. |α′〉 = Û(θa) |α〉 (87) Because we want to keep state vectors normalized after such an operation, the operators representing transformations must be unitary. 〈α′|α′〉 = 〈α| Û †(θa) Û(θa) |α〉 = 〈α|α〉 = 1 26 R. Reece ⇒ Û †(θa) Û(θa) = 1 Any unitary operator can be written as Û(θa) = e −i Ĝa θa where Ĝa are hermitian, insuring that Û(θa) is unitary. In the limit θa go to zero, the transformation operator becomes the identity, as one would expect. Û(θa) = 1− i Ĝa θa + . . . Similar to the generators discussed in the Section 2.7 on internal symmetries, Ĝa are know as the "generators" of some transformation. We represent spacetime translations in the Hilbert space by Û(xμ) = e−i Tμ x μ where xμ the spacetime four-vector by which the system was translated. And we represent the Lorentz transformations in the Hilbert space by Û(θμν) = e−i Lμν θ μν What could these generators be? What quantities are fundamentally associated with the spacetime translation and the Lorentz transformations? Noether's theorem points out that the Noether charges are fundamentally linked to the symmetry of a transformation. Following this hint, we state what this text calls The Fundamental Postulate of Quantum Mechanics:3 The generators of the representation of a transformation in the Hilbert space are the operators representing the classical Noether's Charges that are conserved under that transformation. In other words, Tμ = Pμ and Lμν = 1 2Mμν (the 1 2 is just a convention). Therefore, the operation of translating a system by xμ in spacetime is represented by the following in the Hilbert space. Û(xμ) = e−i Pμ x μ (88) Similarly, a Lorentz transformation where θμν parametrizes a combination of boosts and rotations, is represented by Û(θμν) = e−i 1 2 Mμν θμν (89) 3 TODO: Note this is actually a rephrasing of Wigner's theorem as the cornerstone of quantum mechanics. 27 Quantum Field Theory: An Introduction in the Hilbert space. You may have learned in a quantum mechanics class that the Hamiltonian is the generator of time translation. Û(t) = e−i Ĥ t Equations 88 and 89 are a generalization of that fact. In equation 87, we introduced the representations of transformations as fixed operators that act on and transform state vectors. This is called the "Schrödinger picture." But there is an alternate picture, called the "Heisenberg picture," where the state vectors are fixed and the operators are dynamic. We will more often use the Heisenberg picture, where the operators φ(x) and π(x) are dynamic in the quantum theory like the fields φ(x) and π(x) were dynamic in the Classical theory. From now on in this text, we will use Heisenberg Operators exclusively unless otherwise noted. What is important is that the amplitudes and expectation values are the same in either picture the Heisenberg or Schrödinger picture. Consider a state translated in spacetime by εμ. |α+ ε〉 = e−i Pμ εμ |α〉 The expectation value of some operator, Ô(x), for this state is 〈α+ ε|Ô(x)|α+ ε〉 = 〈α| ei Pμ εμ Ô(x) e−i Pμ εμ |α〉 Here we see that as we translate the system by εμ, we can either think of transforming the vector or the operator. Evidently, Ô(x+ ε) = ei Pμ ε μ Ô(x) e−i Pμ ε μ (90) is how Heisenberg Operators4 transform under translations in spacetime. Consider an infinitesimal transformation. Ô(x) + εμ ∂μÔ(x) = (1 + i Pμ ε μ)Ô(x)(1− i Pμ εμ) +O[ε2] = Ô(x) + i Pμ Ô(x) ε μ − i Pμ Ô(x) εμ +O[ε2] = Ô(x) + i [Pμ, Ô(x)] ε μ ∴ ∂μÔ(x) = i[Pμ, Ô(x)] (91) This is the Heisenberg Equation of Motion. It determines the equations of motion for operators much like the Euler-Lagrange equation determines the equations of motion for Classical fields. 4operators in the Heisenberg Picture 28 R. Reece 3.2.4 The Poincaré Algebra Spacetime is observed to have certain symmetries, namely spacetime translation invariance, and Lorentz invariance, including boosts and rotations. Together, the group of these transformations is collectively called the Poincaré group. If we are going to represent physical systems in spacetime by states in a Hilbert space, then those states should observe those same symmetries. This means that the generators of those transformations in the Hilbert space must satisfy the Poincaré algebra: [ Pμ, Pν ] = 0 (92)[ Mμν , Pλ ] = −i ημλ Pν + i ηνλ Pμ (93)[ Mμν , Mλσ ] = i ηνλ Mμσ − i ημλ Mνσ − i ηνσ Mμλ + i ημσ Mνλ (94) Plugging in Pν for Ô in the Heisenberg Equation of Motion, and using the Poincaré algebra, shows that the operator Pν is conserved just like its classical Noether's charge. Similarly, Mμν is conserved. ∂0Pν = 0 ∂0Mμν = 0 3.2.5 Canonical Quantization We will now outline the procedure for canonical quantization, a method for turning a classical theory into a quantum theory by representing it in a Hilbert space. First, the dynamical variables of the classical theory become operators in the Hilbert space. For a field theory, the dynamical variables are the field, φ(x), and its conjugate momentum, π(x). The classical Noether's charges become operators as well. The operators Pμ and Mμν are defined in terms of the same classical relations as the functions Pμ andMμν , with φ(x) and π(x) changed to their corresponding operators. Following the fundamental postulate of quantum mechanics, we choose the operators representing the corresponding Noether charges to generate the Poincaré transformations. Therefore we must demand that those operators satisfy the Poincaré algebra. One should now ask what effect the Poincaré algebra has on the field operators φ(x) and π(x). We will see when we study the quantum theory of the real scaler field as an example, that demanding that Pμ and Mμν satisfy the Poincaré algebra imposes canonical commutation relations on φ(x) and π(x). These commutation relations are what generate all the hallmarks quantum mechanics including uncertainty relations and the spin-statistics theorem. To clarify the details of this procedure, we now turn our attention back to the real scalar field to quantize it. 29 Quantum Field Theory: An Introduction 3.3 The Quantum Theory The dynamical variables φ(x) and π(x) are now operators in a Hilbert space. Equations 74, 75, and 78 with the dynamical variables turned into operators, give expressions for the operators Pμ and Mμν . We will now use these expressions and the Poincaré algebra to derive the canonical commutation relations, also known as the "equal time commutation relations." 3.3.1 Equal Time Commutation Relations Equation 75 implies that Pj(x 0) = ∫ d3x π(x) ∂jφ(x) (95) We require that this satisfy equation 92 from the Poincaré algebra. [ Pj(x 0), Pk(y 0) ] = 0 Evaluating this commutator in terms of φ(x) and π(x), we have [ Pj(x 0), Pk(y 0) ] = ∫ d3x d3y [ π(x) ∂xj φ(x), π(y) ∂yk φ(y) ] = ∫ d3x d3y ( π(x) ∂xj φ(x) π(y) ∂yk φ(y) − π(y) ∂yk φ(y) π(x)} {{ } commute ∂xj φ(x) ) We will commute φ and π several times in this derivation asuming they have some commutation relation. Note that for any operators Â and B, Â B = [ Â, B ] ± ∓ B Â where [ Â, B ] − is a commutator and [ Â, B ] + is an anticommutator. [ Â, B ] − ≡ [ Â, B ] ≡ Â B − B Â [ Â, B ] + ≡ { Â, B } ≡ Â B + B Â 30 R. Reece Using this to successively commute operators, we get [ Pj(x 0), Pk(y 0) ] = ∫ d3x d3y ( π(x) ∂xj φ(x) π(y) ∂yk φ(y)− π(y) ∂yk [ φ(y), π(x) ] ± ∂xj φ(x) ± π(y) π(x)} {{ } commute ∂yk φ(y) ∂xj φ(x) ) = ∫ d3x d3y ( π(x) ∂xj φ(x) π(y) ∂yk φ(y)− π(y) ∂yk [ φ(y), π(x) ] ± ∂xj φ(x) ± [ π(y), π(x) ] ± ∂yk φ(y) ∂xj φ(x)− π(x) π(y) ∂yk φ(y) ∂xj φ(x)} {{ } commute ) = ∫ d3x d3y ( π(x) ∂xj φ(x) π(y) ∂yk φ(y)− π(y) ∂yk [ φ(y), π(x) ] ± ∂xj φ(x) ± [ π(y), π(x) ] ± ∂yk φ(y) ∂xj φ(x)− π(x) π(y) ∂yk∂xj [ φ(y), φ(x) ] ± ± π(x) π(y) ∂xj φ(x)} {{ } commute ∂yk φ(y) ) = ∫ d3x d3y ( ((( (((( ((( ((( π(x) ∂xj φ(x) π(y) ∂yk φ(y)− x↔y{ }} { π(y) ∂yk [ φ(y), π(x) ] ± ∂xj φ(x) ± [ π(y), π(x) ] ± ∂yk φ(y) ∂xj φ(x)− π(x) π(y) ∂yk∂xj [ φ(y), φ(x) ] ± ± π(x) ∂xj [ π(y), φ(x) ] ± ∂yk φ(y)− ((((( (((( (((( π(x) ∂xj φ(x) π(y) ∂yk φ(y) ) Note that in the second term, we are allowed to interchange the four-vectors x and y with no effect on [ Pj(x 0), Pk(y 0) ] only if it is an equal time commutator, x0 = y0. The three-vector components of x and y are irrelevant because they are integrated over. Therefore all the commutators in this derivation are evaluated at equal times. In that same term, we also flip the commutator. [ Pj , Pk ] = ∫ d3x d3y ( ∓ π(y) ∂yk [ π(y), φ(x) ] ± ∂xj φ(x) ± [ π(y), π(x) ] ± ∂yk φ(y) ∂xj φ(x) − π(x) π(y) ∂yk∂xj [ φ(y), φ(x) ] ± ± π(x) ∂xj [ π(y), φ(x) ] ± ∂yk φ(y) ) 31 Quantum Field Theory: An Introduction Now we integrate by parts to get the derivatives off of the commutators. Note that all the surface terms are zero because the field dies off at infinity. Watch the signs. [ Pj , Pk ] = ∫ d3x d3y ( ± ( ∂yk π(y) ) [ π(y), φ(x) ] ± ∂xj φ(x) ± [ π(y), π(x) ] ± ∂yk φ(y) ∂xj φ(x) − ( ∂xj π(x) ) ( ∂yk π(y) ) [ φ(y), φ(x) ] ± ∓ ( ∂xj π(x) ) [ π(y), φ(x) ] ± ∂yk φ(y) ) = 0 This must equal zero to satisfy the Poincaré algebra. Note that for general φ(x) and π(x), there is no way for the [ φ(y), φ(x) ] and [ π(y), π(x) ] terms to cancel in general. Therefore, it must be that [ φ(y), φ(x) ] ± = [ π(y), π(x) ] ± = 0 (96) The [ π(y), φ(x) ] terms, on the other hand, will cancel if we hypothesize that [ π(y), φ(x) ] ± = C± δ 3(~x− ~y) 1 (97) where C± is some complex constant. This is easily demonstrated by integrating out the δ-functions, and integrating by parts again. [ Pj , Pk ] = ∫ d3x d3y ( ± ( ∂yk π(y) ) [ π(y), φ(x) ] ± ∂xj φ(x) ∓ ( ∂xj π(x) ) [ π(y), φ(x) ] ± ∂yk φ(y) ) = C± ∫ d3x d3y δ3(~x− ~y) ( ± ( ∂yk π(y) ) ∂xj φ(x) ∓ ( ∂xj π(x) ) ∂yk φ(y) ) = C± ∫ d3x ( ± ( ∂kπ(x) ) ∂jφ(x) ∓ ( ∂j π(x) ) ∂kφ(x) ) = C± ∫ d3x ( ∓((((( ((((∂j∂kπ(x)) φ(x) ± (((((((((∂k∂j π(x)) φ(x)) = 0 We have derived the equal time commutation relations for the real scalar quantum field, except that we do not know the constant C± and it is ambiguous whether the commutation relations should involve commutators or anticommutators. We can discover the value of the constant C± by examining another part of the Poincaré algebra. Equation 93 from the Poincaré algebra says [ Mjk, P` ] = −i ηj` Pk + i ηk` Pj Mjk is given by equations 75 and 78. Mjk(x 0) = ∫ d3x ( xj π(x) ∂kφ(x)− xk π(x) ∂jφ(x) ) 32 R. Reece We will calculate the following equal time commutator (x0 = y0) in terms of φ and π. [ Mjk(x 0), P`(y 0) ] = ∫ d3x d3y [ xj π(x) ∂xk φ(x)− xk π(x) ∂xj φ(x), π(y) ∂y` φ(y) ] = ∫ d3x d3y [ xj π(x) ∂xk φ(x), π(y) ∂y` φ(y) ] − ( j ↔ k ) Consider the first term. Note that xj is not an operator and can therefore be commuted past everything. ∫ d3x d3y [ xj π(x) ∂xk φ(x), π(y) ∂y` φ(y) ] = ∫ d3x d3y xj ( π(x) ∂xk φ(x) π(y) ∂y` φ(y)− π(y) ∂y` φ(y) π(x)} {{ } commute ∂xk φ(x) ) = ∫ d3x d3y xj ( π(x) ∂xk φ(x) π(y) ∂y` φ(y)− π(y) ∂y` [ φ(y), π(x) ] ± ∂xk φ(x) ± π(y) π(x)} {{ } commute ∂y` φ(y) ∂xk φ(x)} {{ } commute ) Note that we have already concluded that [ φ(y), φ(x) ] ± = [ π(y), π(x) ] ± = 0, from equation 96. = ∫ d3x d3y xj ( π(x) ∂xk φ(x) π(y) ∂y` φ(y)− π(y) ∂y` [ φ(y), π(x) ] ± ∂xk φ(x) ± π(x) π(y) ∂xk φ(x)} {{ } commute ∂y` φ(y) ) = ∫ d3x d3y xj ( (((( ((( (((( (( π(x) ∂xk φ(x) π(y) ∂y` φ(y)− π(y) ∂y` [ φ(y), π(x) ] ± ∂xk φ(x) ± π(x) ∂xk [ π(y), φ(x) ] ± ∂y` φ(y)−((((( (((( (((( π(x) ∂xk φ(x) π(y) ∂y` φ(y) ) Flip the first commutator and integrate the remaining terms by parts. = ∫ d3x d3y xj ( ∓ π(y) ∂y` [ π(x), φ(y) ] ± ∂xk φ(x) ± π(x) ∂xk [ π(y), φ(x) ] ± ∂y` φ(y) ) = ∫ d3x d3y ( ± xj ( ∂y` π(y) ) [ π(x), φ(y) ] ± ∂xk φ(x) ∓ ∂xk ( xj π(x) ) [ π(y), φ(x) ] ± ∂y` φ(y) ) = C± ∫ d3x d3y δ3(~x− ~y) ( ± xj ( ∂y` π(y) ) ∂xk φ(x) ∓ ∂xk ( xj π(x) ) ∂y` φ(y) ) = C± ∫ d3x ( ± xj ( ∂`π(x) ) ∂kφ(x) ∓ ∂k ( xj π(x) ) ∂`φ(x) ) 33 Quantum Field Theory: An Introduction ∂k ( xj π(x) ) = ∂xj ∂xk}{{} ηjk π(x) + xj ∂kπ(x) = C± ∫ d3x ( ± xj ( ∂`π(x) ) ∂kφ(x)∓ ηjk π(x) ∂`φ(x)∓ xj ∂kπ(x)∂`φ(x) ) Now, integrate the first and third terms by parts. = C± ∫ d3x ( ∓ π(x) ∂` ( xj ∂kφ(x) ) ∓ ηjk π(x) ∂`φ(x)± π(x) ∂k ( xj∂`φ(x) )) = C± ∫ d3x ( ∓ π(x) ηj` ∂kφ(x)∓((((( (((xj π x) ∂`∂kφ(x) ∓((((( ((( ηjk π(x) ∂`φ(x)±((((( ((( π(x) ηjk ∂`φ(x)±    xj π(x)∂k∂`φ(x) ) We see that we are left with the definition of Pk.∫ d3x d3y [ xj π(x) ∂xk φ(x), π(y) ∂y` φ(y) ] = ∓ C± ηj` ∫ d3x π(x) ∂kφ(x)} {{ } Pk = ∓ C± ηj` Pk Similarly for the (j ↔ k) term:∫ d3x d3y [ xk π(x) ∂xj φ(x), π(y) ∂y` φ(y) ] = ∓ C± ηk` Pj Putting these two terms together gives [ Mjk, P` ] . [ Mjk, P` ] = ∓ C± ( ηj` Pk − ηk` Pj ) This agrees with equation 93 from the Poincaré algebra only if C± = ±i (98) We are still left wondering whether we should use commutators or anticommutators. At this point, there is no way to resolve this ambiguity. For the sake of further discussion, we will correctly choose to use commutators for the real scalar field. We will revisit how this ambiguity is resolved when we discuss the spin-statistics theorem in section 3.3.7. Summarizing our conclusions, the equal time commuation relations for the real scalar field are 34 R. Reece [ φ(x), φ(y) ] = [ π(x), π(y) ] = 0 (99)[ π(x), φ(y) ] = −i δ3(~x− ~y) 1 (100) Let us now investigate some of the consequences of these equal time commutation relations. Plugging φ(x) into the Heisenberg equation of motion, equation 91 gives ∂0φ(x) = i [ Ĥ, φ(x) ] = i 2 [ ∫ d3y ( π2(y) + ( ∇yφ(y) )2 +m2φ2(y) ) , φ(x) ] = i 2 ∫ d3y [ π2(y), φ(x) ] = i 2 ∫ d3y ( π(y) π(y) φ(x)− φ(x) π(y) π(y) ) = i 2 ∫ d3y ( ((( ((((π(y) φ(x) π(y) + π(y) [ π(y), φ(x) ] −((((( (( π(y) φ(x) π(y) + [ π(y), φ(x) ] π(y) ) = i ∫ d3y π(y) (−i) δ3(~x− ~y) = π(x) ∴ π(x) =  φ(x) (101) Therefore, the conjugate momentum has the same expression we derived in equation 73 for the classical case. Now plug π(x) into the Heisenberg equation of motion. ∂0π(x) = i 2 [ ∫ d3y (  *0 π2(y) + ( ∇yφ(y) )2 +m2φ2(y) ) , π(x) ] Integrate the second term by parts once. ∂0π(x) = i 2 ∫ d3y [ φ(y) (−∇2y +m2) φ(y), π(x) ] = i 2 ∫ d3y ( φ(y) (−∇2y +m2) φ(y) π(x)− π(x) φ(y) (−∇2y +m2) φ(y) ) = i 2 ∫ d3y ( (((( (((( ((( (( φ(y) (−∇2y +m2) π(x) φ(y)− φ(y) (−∇2y +m2) [ π(x), φ(y) ] − ((( (((( (((( (( φ(y) π(x) (−∇2y +m2) φ(y)− [ π(x), φ(y) ] (−∇2y +m2) φ(y) ) Integrate the second term by parts twice. ∂0π(x) = −i ∫ d3y [ π(x), φ(y) ] (−∇2y +m2) φ(y) = − ∫ d3y δ3(~x− ~y) (−∇2y +m2) φ(y)   φ(x) = −(−∇2 +m2) φ(x) 35 Quantum Field Theory: An Introduction ⇒  φ(x)−∇2φ(x) +m2φ(x) = 0 ∴ ( ∂2 +m2 ) φ = 0 (102) Which is the Klein-Gordon equation, the same equation of motion we derived for the classical real scalar field. We have just seen an example of the general fact that quantum operators satisfy the classical equations of motion. 3.3.2 Creation and Annihilation Operators Similar to equation 85, we expand φ(x) in terms of plane-waves, where the "coefficients" in the expansion are now themselves operators. φ(x) = ∫ d3k (2π)3 ( â(k) fk(x) + â †(k) f∗k (x) ) (103) Similar to equation 85, we can project out â(k). â(k) = i ∫ d3x ei k*x ↔ ∂0 φ(x) (104) â(k) = ∫ d3x ei k*x ( ωk φ(x) + i π(x) ) (105) We would now like to derive the algebra for the â(k) operators. Consider [ â(k), â(k′) ] = ∫ d3x d3y f∗k (x) f ∗ k′(y) [ ωk φ(x) + i π(x), ωk′ φ(y) + i π(y) ] = ∫ d3x d3y f∗k (x) f ∗ k′(y) ( ωk ωk′  :0[ φ(x), φ(y) ] −  :0[ π(x), π(y) ] + i ωk [ φ(x), π(y) ]} {{ } i δ3(~x−~y) 1 + i ωk′ [ π(x), φ(y) ]} {{ } −i δ3(~x−~y) 1 ) = ∫ d3x d3y f∗k (x) f ∗ k′(x) (ωk′ − ωk) 1 = ωk′ − ωk 2 √ ωk ωk′ ∫ d3x ei(k+k ′)*x 1 = ωk′ − ωk 2 √ ωk ωk′ ei(ωk+ωk′ )x 0 ∫ d3x e−i( ~k+~k′)*x} {{ } (2π)3 δ3(~k+~k′) 1 = 0 Because ~k = −~k′ ⇒ ωk = ωk′ . ⇒ [ â(k), â(k′) ] = [ â†(k), â†(k′) ] = 0 (106) 36 R. Reece Now consider [ â(k), â†(k′) ] = ∫ d3x d3y f∗k (x) fk′(y) [ ωk φ(x) + i π(x), ωk′ φ(y)− i π(y) ] = ∫ d3x d3y f∗k (x) fk′(y) ( ωk ωk′  :0[ φ(x), φ(y) ] +  :0[ π(x), π(y) ] − i ωk [ φ(x), π(y) ]} {{ } i δ3(~x−~y)1 + i ωk′ [ π(x), φ(y) ]} {{ } −i δ3(~x−~y) 1 ) = ∫ d3x d3y f∗k (x) fk′(x) (ωk′ + ωk) 1 = ωk′ + ωk 2 √ ωk ωk′ ei(ωk−ωk′ )x 0 ∫ d3x e−i( ~k−~k′)*x} {{ } (2π)3 δ3(~k−~k′) 1 = 2 ωk 2 ωk e0 (2π)3 δ3 ( ~k − ~k′ ) 1 ∴ [ â(k), â†(k′) ] = (2π)3 δ3 ( ~k − ~k′ ) 1 (107) We will come to see why â†(k) and â(k) are known as creation and annihilation operators respectively. But first, we define the number operators, N(k) as N(k) ≡ â†(k) â(k) (108) Note that N(k) are hermitian. N †(k) = N(k) For the sake of clarity in the following discussion, let us discretize the four-momentum k, to have enumerable values kj . Our conclusions will be true in the continuous k limit. In this case, the creation and annihilation operators have the following algebra. [ â(ki), â †(kj) ] = δij 1 Let |nj〉 denote the eigenstate of the N(kj) operator, with eigenvalue nj . N(kj) |nj〉 = nj |nj〉 Let |ñj〉 ≡ â(kj) |nj〉 37 Quantum Field Theory: An Introduction Then N(kj) |ñj〉 = â†(kj) â(kj) â(kj) |nj〉 = ( â(kj) â †(kj)− [ â(kj), â †(kj) ]) â(kj) |nj〉 = ( â(kj) â †(kj)− 1 ) â(kj) |nj〉 = â(kj) ( â(kj) â †(kj)− 1 ) |nj〉 = â(kj) (nj − 1) |nj〉 = (nj − 1) |ñj〉 Therefore, â(kj) |nj〉 is an eigenstate of N(kj), with eigenvalue one less than that of |nj〉. ⇒ â(kj) |nj〉 ∝ |nj − 1〉 We say that "â(kj) lowers eigenstates". An analogous argument shows that â †(kj) raises eigenstates. â†(kj) |nj〉 ∝ |nj + 1〉 Assuming that |nj〉 are normalized such that 〈nj |nj〉 = 1, the following argument shows that the eigenvalues nj must be nonnegative. nj = 〈nj |N(kj)|nj〉 = 〈nj |â†(kj) â(kj)|nj〉 = 〈ñj |ñj〉 ≥ 0 We hypothesize that there is a lowest eigenstate, |0〉, with eigenvalue zero5. â(kj) |0〉 = 0 ∀ kj (109) We call this state the "vacuum." We will now find the normalization constant needed to keep raised and lowered states normalized. Let â(kj) |nj〉 = C |nj − 1〉 Then nj = 〈nj |N(kj)|nj〉 = 〈nj |â†(kj) â(kj)|nj〉 = C2 〈nj − 1|nj − 1〉 = C2 ⇒ C = √nj ∴ â(kj) |nj〉 = √ nj |nj − 1〉 (110) 5 Note that |0〉 is not the zero vector. |ψ〉+ |0〉 6= |ψ〉 38 R. Reece Similarly, one can show that â†(kj) |nj〉 = √ nj + 1 |nj + 1〉 (111) Using this, we can build up all the states from the vacuum by raising. |nj〉 ∝ ( â†(kj) )n |0〉 ( â†(kj) )n |0〉 = (â†(kj))n−1 √1 |1j〉 = (â†(kj))n−2 √2 |2j〉 = . . . = √n! |nj〉 ⇒ |nj〉 = 1√ n! ( â†(kj) )n |0〉 (112) Since each raising increases the eigenvalue nj by one, and the vacuum has eigenvalue zero, we know that the eigenvalues of the number operators are all nonnegative integers. The fact that [ â(ki), â(kj) ] = 0 implies that the N(kj) commute as well.[ N(ki), N(kj) ] = 0 Which implies that N(kj) have simultaneous eigenstates. We can label the states by the eigenvalues of each N(kj). |n1, n2, . . .〉 = 1√ n1! ( â†(k1) )n1 1√ n2! ( â†(k2) )n2 . . . |0〉 = ∏ j 1√ nj ! ( â†(kj) )nj |0〉 In the limit of continuous k, we can label a state by a single integer-valued function, n(k). |n(k)〉 = ∏ k 1√ n(k)! ( â†(k) )n(k) |0〉 (113) So far, we have not discussed the interpretation of these states |n(k)〉. We will come to see that |n(k)〉 is a state with n(k) particles with momentum k, and the number operators, N(k), count the number of particles with momentum k. In order to motivate this interpretation, we should first investigate the total energy and momentum of these states. 3.3.3 Energy Eigenstates Recall that the Klein-Gordon Hamiltonian (equation 74) is Ĥ = 1 2 ∫ d3x ( π2(x) + (~∇φ(x))2 +m2 φ2(x) ) 39 Quantum Field Theory: An Introduction Also, recall our plane-wave expansion for φ(x) (equation 103). φ(x) = ∫ d3k (2π)3 ( â(k) fk(x) + â †(k) f∗k (x) ) We want to express Ĥ in terms of φ(x) and π(x), but first we need to calculate π2(x), (~∇φ(x))2, and φ2(x). π(x) =   φ(x) = i ∫ d3k (2π)3 ωk ( − â(k) fk(x) + â†(k) f∗k (x) ) ~∇φ(x) = i ∫ d3k (2π)3 ~k ( â(k) fk(x)− â†(k) f∗k (x) ) π2(x) = − ∫ d3k (2π)3 d3k′ (2π)3 ωk ωk′ ( − â(k) fk(x) + â†(k) f∗k (x) )( − â(k′) fk′(x) + â†(k′) f∗k′(x) ) = − ∫ d3k (2π)3 d3k′ (2π)3 ωk ωk′ ( â(k) â(k′) fk(x) fk′(x) + â †(k) â†(k′) f∗k (x) f ∗ k′(x) − â†(k) â(k′) f∗k (x) fk′(x)− â(k) â†(k′) fk(x) f∗k′(x) ) Similarly |~∇φ(x)|2 = − ∫ d3k (2π)3 d3k′ (2π)3 ~k * ~k′ ( â(k) â(k′) fk(x) fk′(x) + â †(k) â†(k′) f∗k (x) f ∗ k′(x) − â†(k) â(k′) f∗k (x) fk′(x)− â(k) â†(k′) fk(x) f∗k′(x) ) and6 m2 φ2(x) = ∫ d3k (2π)3 d3k′ (2π)3 ~k * ~k′ ( â(k) â(k′) fk(x) fk′(x) + â †(k) â†(k′) f∗k (x) f ∗ k′(x) + â†(k) â(k′) f∗k (x) fk′(x) + â(k) â †(k′) fk(x) f ∗ k′(x) ) Therefore, putting these together gives Ĥ = 1 2 ∫ d3k (2π)3 d3k′ (2π)3 d3x [ ( − ωk ωk′ − ~k * ~k′ +m2 )( â(k) â(k′) fk(x) fk′(x) + â †(k) â†(k′) f∗k (x) f ∗ k′(x) ) + ( + ωk ωk′ + ~k * ~k′ +m2 )( â†(k) â(k′) f∗k (x) fk′(x) + â(k) â †(k′) fk(x) f ∗ k′(x) )] 6 Note the signs of the terms of π2(x) and |~∇φ(x)|2 are the same, but m2 φ2(x) has all positive terms. 40 R. Reece We then do the following integrals∫ d3x fk(x) f ∗ k′(x) = e−i(ωk−ωk′ )x 0 2 √ ωk ωk′ ∫ d3x ei( ~k−~k′)*~x = 1 2 ωk (2π)3 δ3 ( ~k − ~k′ ) ∫ d3x fk(x) fk′(x) = e−i(ωk+ωk′ )x 0 2 √ ωk ωk′ ∫ d3x ei( ~k+~k′)*~x = e−i 2 ωk x 0 2 ωk (2π)3 δ3 ( ~k + ~k′ ) This leaves Ĥ = 1 2 ∫ d3k (2π)3 d3k′  (2π)3 [ ( − ωk ωk′ − ~k * ~k′ +m2 )( â(k) â(k′) e−i 2 ωk x 0 + â†(k) â†(k′) e+i 2 ωk x 0 )(2π)3 δ3 (~k + ~k′) 2 ωk + ( + ωk ωk′ + ~k * ~k′ +m2 )( â†(k) â(k′) + â(k) â†(k′) )(2π)3 δ3 (~k − ~k′) 2 ωk ] = 1 2 ∫ d3k (2π)3 [ ( − ω2k + ~k2 +m2 ) } {{ } 0 ( â(k) â(−k) e−i 2 ωk x0 + â†(k) â†(−k) e+i 2 ωk x0 ) 1 2 ωk + ( + ω2k + ~k2 +m2 ) } {{ } 2 ω2k ( â†(k) â(k) + â(k) â†(k) ) 1 2 ωk ] Therefore Ĥ = 1 2 ∫ d3k (2π)3 ωk ( â†(k) â(k) + â(k) â†(k) ) Note that we have not yet used a commutation relation to commute operators. We have left them in order. We will now use our commutation relation for the creation and annihilation operators (equation 107) that we derived as a consequence of the equal time commutation 41 Quantum Field Theory: An Introduction relation for φ and π (equation 100). Ĥ = ∫ d3k (2π)3 ωk ( â†(k) â(k) + 1 2 [ â(k), â†(k) ]) = ∫ d3k (2π)3 ωk ( â†(k) â(k) + 1 2 δ3 ( ~k − ~k ) 1 ) = ∫ d3k (2π)3 ωk ( N(k) + 1 2 δ3k(0) 1 ) (114) The Hamiltonian commutes with the number operators, because the number operators commute with themselves. Therefore, the Hamiltonian and the number operators have simultaneous eigenstates. Similarly, using equations 75, 99, 100, 103, 106, and 107, one can show a similar expression for the three-momentum. ~P = ∫ d3k (2π)3 ~k ( N(k) + 1 2 δ3k(0) 1 ) (115) Therefore Pμ = ∫ d3k (2π)3 kμ ( N(k) + 1 2 δ3k(0) 1 ) (116) From equation 114, we see that for any given ~k, the energy steps in integer units of ωk because we know the number operators have nonnegative integer eigenvalues. The second term involves a δ-function7 always evaluated at its singular point. This term is an infinite constant energy that is there even when all the number operators have eigenvalue zero, called the "zero point energy." Evidently, any dynamics of the system will take the system to states with energies of various amounts of ωk always added on top of this uninteresting infinite zero point energy. We can shift our energy scale so that the vacuum has zero energy, but we have to find a way to properly subtract the infinity. In order to do this, we introduce the concept of normal ordering. 3.3.4 Normal Ordering Let us break Pμ into its finite part and the infinite zero point energy. Let P ′μ ≡ ∫ d3k (2π)3 kμ N(k) and Cμ ≡ 1 2 ∫ d3k (2π)3 kμ δ 3 k(0) Then Pμ = P ′μ + Cμ 1 7 The subscript k on the δ-function is to remind us that the δ-function is in momentum space. 42 R. Reece 〈0|Pμ|0〉 = :0〈0|P ′μ|0〉+ Cμ 〈0|0〉 = Cμ because 〈0|N(k)|0〉 = 0 ⇒ P ′μ = Pμ − 〈0|Pμ|0〉 Therefore, P ′μ is the total four-momentum without the infinite zero point energy. If we define M ′μν as in equation 78, only with P ′μ instead of Pμ, then P ′μ and M ′μν still satisfy the Poincaré algebra because P ′μ is the same operator as Pμ shifted by a constant. The Heisenberg equation of motion is also unchanged because ∂μφ(x) = i[Pμ, φ(x)] = i[P ′μ, φ(x)] ∂μπ(x) = i[Pμ, π(x)] = i[P ′μ, π(x)] which implies that the equations of motion are unchanged by using P ′μ and M ′μν as the generators of the Poincaré transformations, instead of Pμ and Mμν . Now we should formalize what we mean by P ′μ, other than Pμ minus infinity. We will do this with normal ordering. Consider a real scalar field, φ(x), with the plane-wave expansion given by equation 103. Let φ(x) = φ+(x) + φ−(x) (117) where8 φ+(x) ≡ ∫ d3k (2π)3 â(k) fk(x) φ−(x) ≡ ∫ d3k (2π)3 â†(k) f∗k (x) Then φ(x) φ(y) = ( φ+(x) + φ−(x) )( φ+(y) + φ−(y) ) = φ+(x) φ+(y) + φ+(x) φ−(y) + φ−(x) φ+(y) + φ−(x) φ−(y) = φ+(x) φ+(y) + φ−(y) φ+(x) + φ−(x) φ+(y) + φ−(x) φ−(y) + [ φ+(x), φ−(y) ] In the last step we have commuted one term of operators so that all the φ+ operators are on the right and all the φ− operators are on the left. We define this ordering of operators 8 Note that creation operators, â†(k), are in φ−, while the annihilation operators, â(k), are in φ+. This notation is confusing, but it is a convention used by many, so we will conform to it. 43 Quantum Field Theory: An Introduction as the normal ordered product: N [ φ(x) φ(y) ] ≡ φ+(x) φ+(y) + φ−(y) φ+(x) + φ−(x) φ+(y) + φ−(x) φ−(y) (118) The normal ordering of operators has the convenient property that it has zero vacuum expectation value because of the ordering of creation and annihilation operators acting on the vacuum state. 〈0|N [ φ(x) φ(y) ] |0〉 = 0 (119) Therefore φ(x) φ(y) = N [ φ(x) φ(y) ] + [ φ+(x), φ−(y) ] (120) 〈0|φ(x) φ(y)|0〉 =    :0 〈0|N [ φ(x) φ(y) ] |0〉+ 〈0| [ φ+(x), φ−(y) ] |0〉 Note that because of commutation relation 107, [ φ+(x), φ−(y) ] ∝ [ â(k), â†(k′) ] ∝ 1 ⇒ 〈0| [ φ+(x), φ−(y) ] |0〉 = [ φ+(x), φ−(y) ] ⇒ 〈0|φ(x) φ(y)|0〉 = [ φ+(x), φ−(y) ] Combining this with 120 gives N [ φ(x) φ(y) ] = φ(x) φ(y)− 〈0|φ(x) φ(y)|0〉 (121) Therefore, the normal ordering of operators subtracts the vacuum expectation value of those operators. Therefore, P ′μ is the normal ordering of Pμ. P ′μ = N [ Pμ ] We will use the operators representing the classical Noether's Charges normal ordered as the generators of the Poincaré transformations. Requiring that these operators satisfy the Poincaré algebra will imply the same equal time commutation relations in equations 99 and 100 and these operators will have zero vacuum expectation values. From now on we will drop the primes from P ′μ and M ′μν and let Pμ and Mμν denote normal ordered operators. All this really amounts to is equating Pμ to Pμ = ∫ d3k (2π)3 kμ N(k) (122) One could argue that we could have just ignored the term creating the zero point energy from the start, but it was worth it to introduce the concept of normal ordering because we 44 R. Reece will use it in our discussion of Wick's theorem in section 4.2. 3.3.5 Interpretation We are now in a position to give an interpretation to the eigenstates of the number operators given in equation 113. Consider an example for n(k) given by |n(k)〉 = 1√ 3! â†(k1) â †(k2) 3 |0〉 (123) Recall that n(k) is an integer valued function of the continuous four-vector k. It denotes the number of times the creation operators â†(k) need to act on the vacuum to give the state |n(k)〉. Figure 2 illustrates how you might visualize the above example n(k). Equation 122 Figure 2: An example n(k) plotted with three of the four-momentum components suppressed. implies that the four-momentum of this example is Pμ = kμ1 + 3 k μ 2 We interpret the state |n(k)〉 as a state with one particle with four-momentum kμ1 and three particles with four-momentum kμ2 . The eigenvalues n(k) of the N(k) operators are the number of particles with four-momentum kμ. As one might have guessed, the creation operator â†(k) creates a particle with four-momentum kμ, and the annihilation operator â(k) destroys a particle with four-momentum kμ. If no such particle exists in the state, then â(k′) |n(k)〉 = 0 in accordance with equation 109. 45 Quantum Field Theory: An Introduction 3.3.6 Internal Charge 3.3.7 Spin-statistics Now we will reface the issue of ambiguity between commutators and anticommutators we encountered in section 3.3.1. One can show that if we had chosen anticommutation relations for the field back in equations 99 and 100, then we would have also derived similar anticommutation relations for the creation and annihilation operators in equations 106 and 107. In particular, we would have shown that { â†(k), â†(k′) } = 0 (124) A field that satisfies anticommutation relations, like above, is a fermion field. The particles of such a field are fermions. On the other hand, a field that satisfies relations with commutators, like those we derived for the real scalar field, is a boson field. The particles this field are bosons. Note that the state in equation 123 is one example of any integer combinations of creation operators. That is, a state can have any integer number of bosons with same four-momentum. For fermions on the other hand, the anticommutation relation given by equation 124 has an interesting consequence. Note that if the same four-momentum is used as the argument each creation operator, then { â†(k), â†(k) } = â†(k) â†(k) + â†(k) â†(k) = 0 ⇒ â†(k)2 = 0 Therefore, one cannot create two of the same fermion with the same four-momentum. This is the explanation of the Pauli exclusion principle that says that no two identical fermions may occupy the same quantum state simultaneously. This feature of fermions is the cause of the stability of matter. Without it, electrons and nuclei would not stack into the orbitals and energy levels that make chemistry possible. Instead, they would all fall into the same ground state like bosons in a Bose-Einstein condensate. Whether or not two identical particles can occupy the same quantum state affects the statistics behind counting the possible states for a system of many particles. This point is of central importance in the study of statistical mechanics. It turns out that whether a particle is a fermion or boson is directly correlated with its spin. Recall that the real scalar field, a boson field, has spin zero, as we discussed after seeing equation 78. Later on, we will see that the Dirac field has spin 1/2. It will satisfy anticommutation relations, and is therefore a fermion field. This motivates the spinstatistics theorem, which states that particles with integer spin are bosons and particles with half integer spin are fermions. 46 R. Reece 3.3.8 The Feynman Propagator Recall the plane-wave expansion, equation 103, and let it operate on the vacuum. φ(x) |0〉 = ∫ d3k (2π)3 ( â(k) fk(x) + â †(k) f∗k (x) ) |0〉 = ∫ d3k (2π)3 ei k*x√ 2ωk â†(k) |0〉 Note that the state φ(x) |0〉 is the sum of one particle momentum eigenstates. We interpret φ(x) |0〉, not as a state with determined momentum, but as a state with one particle at a definite point in spacetime. Similarly, 〈0| φ(y) = ∫ d3k′ (2π)3 e−i k ′*y√ 2ω′k 〈0| â(k′) Consider the following inner product. 〈0|φ(y) φ(x)|0〉 We interpret this as the quantum mechanical amplitude for a particle to be created at spacetime point x and then destroyed at point y, called the "propagator". The problem is that this inner product is not Lorentz invariant because it relies on a specific time ordering of events. In order to be physically sensible, the particle must be created at point x before it is destroyed at point y. But for spacelike separated points x and y, some observer sees events at x happen after y. In order to remedy this, we use the step function defined as θ(t) ≡ 0 if t < 01 if t > 0 (125) Then we define the time ordered product as T [ φ(y) φ(x) ] ≡ φ(y) φ(x) θ(y0 − x0) + φ(x) φ(y) θ(x0 − y0) (126) Basically, the time ordered product orders operators such that those evaluated at later times are to the left. We will see that the correct form of the propagator that is Lorentz invariant is given by 〈0|T [ φ(y) φ(x) ] |0〉 47 Quantum Field Theory: An Introduction First consider one term in the time ordering. 〈0|φ(y) φ(x)|0〉 = ∫ d3k (2π)3 d3k′ (2π)3 e−ik*(y−x)√ 4 ωk ωk′ 〈0|â(k) â†(k′)|0〉 = ∫ d3k (2π)3 d3k′ (2π)3 e−ik*(y−x)√ 4 ωk ωk′ 〈0| [ â(k), â†(k′) ] |0〉 = ∫ d3k (2π)3 d3k′ (2π)3 e−ik*(y−x)√ 4 ωk ωk′ (2π)3 δ3 ( ~k − ~k′ ) = ∫ d3k (2π)3 e−ik*(y−x) 2 ωk Therefore, 〈0|T [ φ(y) φ(x) ] |0〉 = ∫ d3k (2π)3 2 ωk ( e−ik*(y−x) θ(y0 − x0) + e−ik*(x−y) θ(x0 − y0) ) The step function has the following integral representation θ(t) = −1 2πi ∫ ∞ −∞ dω e−iωt ω + iε (127) where ε is an infinitesimal, positive, real number taken to zero at the end of a calculation. Plugging this in gives, 〈0|T [ φ(y) φ(x) ] |0〉 = i (2π)4 ∫ d3k dω 2 ωk ( e−ik*(y−x)−iω(y 0−x0) ω + iε + e+ik*(y−x)+iω(y 0−x0) ω + iε ) = i (2π)4 ∫ d3k dω 2 ωk ( e+i ~k*(~y−~x)−i(ωk+ω)(y0−x0) ω + iε + e−i ~k*(~y−~x)+i(ωk+ω)(y0−x0) ω + iε ) In the second term, take ~k → −~k. = i (2π)4 ∫ d3k dω 2 ωk e+i ~k*(~y−~x) ( e−i(ωk+ω)(y 0−x0) ω + iε + e+i(ωk+ω)(y 0−x0) ω + iε ) Now in the first term, take ω → ω − ωk. In the second term, take ω → −ω − ωk. = i (2π)4 ∫ d3k dω 2 ωk e+i ~k*(~y−~x) e−iω(y 0−x0) ( 1 ω − ωk + iε + 1 −ω − ωk + iε ) = i (2π)4 ∫ d4k 2 ωk e−ik*(y−x) ( 1 ω − ωk + iε − 1 +ω + ωk − iε ) In the last step, we have combined the variables ω and ~k into a single four-vector, k. Note that their components are unrelated. ωk is the energy corresponding to momentum ~k given by ωk = √ ~k2 +m2, but omega is free to range over all real values. Cross multiplying the 48 R. Reece terms gives = i (2π)4 ∫ d4k 2 ωk e−ik*(y−x) ( ω + ωk − iε−ω + ωk − iε( ω − (ωk − iε) )( ω − (−ωk + iε) )) We ignore the ε terms subtracted from one in the numerator because they are infinitesimal and the only purpose of the ε in this expression is to indicate how we should push the poles in the denominator. See Figure ??. = i ∫ d4k (2π)4 e−ik*(y−x)( ω − (ωk − iε) )( ω − (−ωk + iε) ) (128) How we should integrate around these poles was completely dictated by the step function Figure 3: The integration of ω in equation 128 shown in the complex ω plane. in the time ordering. We arrived at equation 128 in its form so that the orientation of the poles would become evident. In order to simply, we multiple terms back out. = i ∫ d4k (2π)4 e−ik*(y−x) ω2 − ω2k + i(−ω + ωk + ω + ωk)ε+O[ε2] = i ∫ d4k (2π)4 e−ik*(y−x) ω2 − ~k2 −m2 + i 2 ωk ε In the last step we have used that ω2k = ~k2 +m2. Now take 2 ωk ε→ ε. Therefore, 〈0|T [ φ(y) φ(x) ] |0〉 = i ∫ d4k (2π)4 e−ik*(y−x) k2 −m2 + i ε We divide out the i factor and define the Feynman propagator as ∆F (x− y) ≡ −i 〈0|T [ φ(y) φ(x) ] |0〉 (129) 49 Quantum Field Theory: An Introduction ∆F (x− y) = ∫ d4k (2π)4 eik*(x−y) k2 −m2 + i ε (130) While it may not seem obvious at the moment, the propagator is an essential concept in field theory. We will refer back to it numerous times. We will see that it is often preferable to Fourier transform to get the propagator in momentum space. Evidently, it is given by ∆F (k) = ∫ d4x e−ik*x ∆F (x) = ∫ d4x ∫ d4k′ (2π)4 eix*(k ′−k) 1 k′2 −m2 + i ε = ∫ d4k′ δ(k′ − k) 1 k′2 −m2 + i ε ∆F (k) = 1 k2 −m2 + i ε (131) 50 R. Reece 4 The Interacting Real Scalar Field In the free field case, the Lagrangian was simple enough that the equations of motion could be solved and a general solution could be expressed as an expansion in plane-waves. Quantum mechanically, each plane-wave represents a single particle with a certain momentum. In a field configuration that is the sum of several plane-waves, each plane-wave satisfied the equations of motion on its own and therefore evolved in time independent of the other plane-waves. This means that the particles described by the free field do not interact with each other. We will now add terms to the Lagrangian that will represent interactions. This complication will have the result that the equations of motion can not be solved exactly. We will consider the special case of scattering, where particles travel virtually free and only interact briefly in a collision. We will see that the amplitude for a particle to go from x to y can be written as a power series that will converge if the interaction terms are small enough to be thought of as perturbation on the free field Lagrangian. We will introduce the notation of Feynman diagrams which can be used to pictorally represent the terms of the perturbation series. Finally, we will show that the terms in this perturbation series can be grouped and factored in a way that simplifies calculations immensely. 4.1 Correlation Functions A Lagrangian for an interacting real scalar field is of the form L = L0 − V where L0 is the free field Lagrangian given in equation 71, and V , known as the "interaction potential", is a function of the field φ(x). For example, the first interaction potential we will consider is V (x) = λ4! φ 4(x), where λ is a real constant. The presence of V in the Lagrangian density results in the same additional terms in the Hamiltonian density (with the signs flipped9). H = H0 + V So the Hamiltonian has the additional terms Hint, defined as follows. H = H0 +Hint Hint ≡ ∫ d3x V 9recall equation 52 51 Quantum Field Theory: An Introduction We will come to see that a primary interest in quantum field theory is the calculation of correlation functions, also called Green's functions. The n-point correlation function, or just the n-point function for short, is defined by G(n)(x1, x2, . . . , xn) ≡ 〈Ω|T [ φ(x1) φ(x2) * * * φ(xn) ] |Ω〉 (132) where |Ω〉 is the vacuum state of the interacting field. It will be our goal to calculate the two point Green's function, 〈Ω|T [ φ(x1) φ(x2) ] |Ω〉, which analogous to the Feynman propagator of the free field theory, represents the amplitude for a particle to propagate from x1 to x2. The interaction will affect this calculation in two ways: the interaction terms in the Hamiltonian, and the distinction of |Ω〉 from the vacuum of free field, |0〉. Like any Heisenberg operator, the field evolves as φ(t, ~x) = eiĤ (t−t0) φ(t0, ~x) e −iĤ (t−t0) As we turn off the interaction, V → 0 and H → H0, then φ(t, ~x)→ φI(t, ~x) ≡ eiĤ0 (t−t0) φ(t0, ~x) e−iĤ0 (t−t0) (133) φI is called the "interaction picture field," or just the "interaction field." The interaction picture is an intermediate picture between the Schrödinger picture and the Heisenberg picture, where the time dependence of the free Hamiltonian is carried by the operators, but the time dependence due to the interaction part of the Hamiltonian is carried by the state vectors. Since φI(t, ~x) = φ(t, ~x) ∣∣∣ V=0 , then φI satisfies the same equation of motion as the free real scalar field, the Klein-Gordon equation 102. Inverting the definition of the interaction field, equation 133, implies that φ(t0, ~x) = e −iĤ0 (t−t0) φI(t, ~x) e iĤ0 (t−t0) (134) So for general times φ(t, ~x) = e+iĤ (t−t0) e−iĤ0 (t−t0) φI(t, ~x) e +iĤ0 (t−t0) e−iĤ (t−t0) = Û †(t, t0) φI(t, ~x) Û(t, t0) (135) where we have defined the "interaction picture time evolution operator," Û(t, t0), as Û(t1, t2) ≡ e+iĤ0 (t1−t2) e−iĤ (t1−t2) 52 R. Reece Explicitly taking the time derivative of Û(t, t0) gives ∂ ∂t Û(t, t0) = e +iĤ0 (t−t0)(iĤ0 − iĤ)e−iĤ (t−t0) = −i e+iĤ0 (t−t0) Ĥint e−iĤ (t−t0) = −i e+iĤ0 (t−t0) Ĥint e−iĤ0 (t−t0) e+iĤ0 (t−t0) e−iĤ (t−t0) In this last step we have inserted a factor of 1 = e−iĤ0 (t−t0) e+iĤ0 (t−t0). Now we define the interaction picture interaction Hamiltonian: ĤI(t) ≡ e+iĤ0 (t−t0) Ĥint(t) e−iĤ0 (t−t0) Then we see that Û(t, t0) satisfies a Schrödinger-like equation: i ∂ ∂t Û(t, t0) = ĤI(t) Û(t, t0) (136) If Û(t, t0) and ĤI(t) were just functions and not operators, then we could divide both sides by U and integrate getting the simple solution of U(t, t0) = exp ( −i ∫ t t0 dt′ H(t′) ) . But because Û(t, t0) is an operator, we do this more carefully, treating Û(t, t0) as an operator and noting its ordering. Integrating the differential equation 136, from t0 to t gives. Û(t1, t0) ∣∣∣t t1=t0 = −i ∫ t t0 dt1 ĤI(t1) Û(t1, t0) We now use the initial condition that Û(t0, t0) = 1 because we know from equations 135 that when t = t0, it should be that φ(t, ~x) = φI(t, ~x). Û(t, t0) = 1− i ∫ t t0 dt1 ĤI(t1) Û(t1, t0) This "solution" only gives Û(t, t0) in terms of an integral of itself. Iteratively plugging this expression into itself gives the solution known as the "Dyson series." Û(t, t0) = 1 + (−i) ∫ t t0 dt1 ĤI(t1) + (−i)2 ∫ t t0 dt1 ∫ t1 t0 dt2 ĤI(t1) ĤI(t2) +(−i)3 ∫ t t0 dt1 ∫ t1 t0 dt2 ∫ t2 t0 dt3 ĤI(t1) ĤI(t2) ĤI(t3) + * * * (137) Note that the integrals are nested in the sense that the limits of the inner integrals depend on the integration variables of the outer integrals. Also note that because the limits of integration enforce that t1 ≥ t2 ≥ * * * ≥ tn−1 ≥ tn ≥ t0, the ĤI operators are all time ordered. We can un-nest these integrals with the following trick. Consider the second order 53 Quantum Field Theory: An Introduction (a) equation 138 (b) equation 139 Figure 4: Un-nesting integrals term for example. I ≡ ∫ t t0 dt1 ∫ t1 t0 dt2 ĤI(t1) ĤI(t2) Now consider the following expression without nested integrals, I ′ ≡ ∫ t t0 dt1 ∫ t t0 dt2 T [ ĤI(t1) ĤI(t2) ] which we will relate to the previous integrals by the following I ′ = ∫ t t0 dt1 ∫ t1 t0 dt2 ĤI(t1) ĤI(t2) + ∫ t t0 dt1 ∫ t t1 dt2 ĤI(t2) ĤI(t1) (138) = I + ∫ t t0 dt2 ∫ t2 t0 dt1 ĤI(t2) ĤI(t1) (139) = I + ∫ t t0 dt1 ∫ t1 t0 dt2 ĤI(t1) ĤI(t2) (140) = 2 I In equation 139, we have changed the limits of integration from those depicted in figure 4(a) to those depicted in 4(b), while still integrating over the same region. In equation 140, we have simply switched the names of the dummy variables t1 ↔ t2. Therefore,∫ t t0 dt1 ∫ t1 t0 dt2 ĤI(t1) ĤI(t2) = 1 2 ∫ t t0 dt1 ∫ t t0 dt2 T [ ĤI(t1) ĤI(t2) ] 54 R. Reece and in general,∫ t t0 dt1 ∫ t1 t0 dt2 * * * ∫ tn−1 t0 dtn ĤI(t1) * * * ĤI(tn) = 1 n! ∫ t t0 dt1 * * * dtn T [ ĤI(t1) * * * ĤI(tn) ] Plugging this result into the Dyson series, equation 137, gives Û(t, t0) = 1 + (−i) 1! ∫ t t0 dt1 ĤI(t1) + (−i)2 2! ∫ t t0 dt1 dt2 T [ ĤI(t1) ĤI(t2) ] + (−i)3 3! ∫ t t0 dt1 dt2 dt3 T [ ĤI(t1) ĤI(t2) ĤI(t3) ] + * * * which we will write compactly as Û(t, t0) = T [ exp ( −i ∫ t t0 dt′ ĤI(t ′) )] (141) From the properties of limits of integrals it is now evident that adjacent Û operators, with the appropriate arguments, can be combined into one10. Û(t1, t2) Û(t2, t3) = T [ exp ( −i ∫ t1 t2 dt′ ĤI(t ′)− i ∫ t2 t3 dt′′ ĤI(t ′′) )] = T [ exp ( −i ∫ t1 t3 dt′ ĤI(t ′) )] Û(t1, t2) Û(t2, t3) = Û(t1, t3) (142) Û †(t2, t1) = T [ exp ( −i ∫ t2 t1 dt′ ĤI(t ′) )†] = T [ exp ( +i ∫ t2 t1 dt′ ĤI(t ′) )] = T [ exp ( −i ∫ t1 t2 dt′ ĤI(t ′) )] Û †(t2, t1) = Û(t1, t2) (143) Therefore, Û(t1, t2) Û †(t3, t2) = Û(t1, t3) (144) We are almost ready to derive what will be a very useful relation for correlation functions, 10Note that in combining the exponentials, we are not complicated by the Baker-Campbell-Hausdorff formula (shown below) because thankfully, ĤI(t ′) commutes with ĤI(t ′′) for all times, a fact most readily seen because we have required that the Poincaré algebra be satisfied (see equation 92). In general, one has to be careful when combining exponentials of operators. e(Â+B) = eÂ eB e− 1 2 [ Â,B ] e 1 6 ( 2 [ B, [ Â,B ]] + [ Â, [ Â,B ]]) * * * 55 Quantum Field Theory: An Introduction but first we turn our attention to the vacuum, |Ω〉. We will relate the vacuum state of the interacting field, |Ω〉, to the vacuum of a free field, |0〉, by expanding |0〉 in terms of the eigenstates of the complete Hamiltonian with interaction terms, and then evolving it in time with the complete Hamiltonian. |0〉 = ∑ n |n〉 〈n|0〉 where |n〉 are the eigenstates of Ĥ. Ĥ |n〉 = En |n〉 Now evolve it in time, but with the complete interacting Hamiltonian. e−i Ĥ t |0〉 = ∑ n e−i En t |n〉 〈n|0〉 Since the ground state (vacuum) is a term in this sum, we can pull it out explicitly, e−i Ĥ t |0〉 = e−i E0 t |Ω〉 〈Ω|0〉+ ∑ n6=0 e−i En t |n〉 〈n|0〉 (145) where E0 = 〈Ω|Ĥ|Ω〉 is the ground state energy of the interacting theory. We define zero energy to be the energy of the ground state of the noninteracting theory, Ĥ0 |0〉 = 0. Now, in order to kill off all the other terms in the sum, we take time to infinity in a direction slightly rotated towards the negative imaginary. lim t→∞(1−i ε) e−i Ĥ t |0〉 = e−i E0 t |Ω〉 〈Ω|0〉+ ∑ n 6=0  :0 e−i En t |n〉 〈n|0〉 These terms are subdominant to the term with E0 because E0 is the smallest of all En. Inverting this expression for |Ω〉 gives |Ω〉 = lim t→∞(1−i ε) ( e−i E0 t 〈Ω|0〉 )−1 e−i Ĥ t |0〉 Since t is now very large, we can shift it by a finite constant, t0. |Ω〉 = lim t→∞(1−i ε) ( e−i E0 (t+t0) 〈Ω|0〉 )−1 e−i Ĥ (t+t0) |0〉 56 R. Reece And we can insert a factor of e−i Ĥ0(−t−t0) because recall that Ĥ0 |0〉 = 0. |Ω〉 = lim t→∞(1−i ε) ( e−i E0 (t+t0) 〈Ω|0〉 )−1 e+i Ĥ ((−t)−t0) e−i Ĥ0((−t)−t0)} {{ } Û†(−t,t0) |0〉 = lim t→∞(1−i ε) ( e−i E0 (t+t0) 〈Ω|0〉 )−1 Û(t0,−t) |0〉 (146) Note that this indicates that one can get from the free vacuum state to the interacting vacuum by evolving it with the interaction picture time evolution operator from the asymtotically infinite past to the reference time t0. The rest is just a normalization factor. Similar to equation 145, we can express 〈Ω| as follows. 〈Ω| e−i Ĥ t = 〈0|Ω〉 〈Ω| e−i E0 t + ∑ n6=0 〈0|n〉 〈n| e−i En t Following the same reasoning we have 〈Ω| = lim t→∞(1−i ε) ( e−i E0 t 〈0|Ω〉 )−1 〈0| e−i Ĥ t Shifting t→ t− t0 and inserting a factor of e+i Ĥ0 (t−t0) gives 〈Ω| = lim t→∞(1−i ε) ( e−i E0 (t−t0) 〈0|Ω〉 )−1 〈0| e+i Ĥ0 (t−t0)e−i Ĥ (t−t0) = lim t→∞(1−i ε) ( e−i E0 (t−t0) 〈0|Ω〉 )−1 〈0| Û(t, t0) (147) Combining our results, we can now write the 2-point function in terms of the free vacuum and interaction fields. For the moment, assume x01 > x 0 2 > t0 so that we can explicitly write out the time ordering. Then we use equations 132, 134, 146, and 147. 〈Ω|T [ φ(x1) φ(x2) ] |Ω〉 = 〈Ω|φ(x1) φ(x2)|Ω〉 = lim t→∞(1−i ε) ( e−i 2 E0 t) |〈0|Ω〉|2 )−1 〈0|Û(t, t0) Û †(x01, t0) φI(x1) Û(x01, t0) Û †(x02, t0) φI(x2) Û(x02, t0) Û(t0,−t)|0〉 = lim t→∞(1−i ε) ( e−i 2 E0 t) |〈0|Ω〉|2 )−1 〈0|Û(t, x01) φI(x1) Û(x01, x02) φI(x2) Û(x02,−t)|0〉 We have used the property of Û operators that allows one to combine them, equation 144. 57 Quantum Field Theory: An Introduction We can get rid of the awkward factor in front by dividing by a convenient form of 1: 1 = 〈Ω|Ω〉 = lim t→∞(1−i ε) ( e−i 2 E0 t) |〈0|Ω〉|2 )−1 〈0|Û(t, t0) Û(t0,−t)|0〉 = lim t→∞(1−i ε) ( e−i 2 E0 t) |〈0|Ω〉|2 )−1 〈0|Û(t,−t)|0〉 Therefore, 〈Ω|T [ φ(x2) φ(x1) ] |Ω〉 = lim t→∞(1−i ε) 〈0|Û(t, x01) φI(x1) Û(x01, x02) φI(x2) Û(x02,−t)|0〉 〈0|Û(t,−t)|0〉 = lim t→∞(1−i ε) 〈0|T [ φI(x1) φI(x2) Û(t, x 0 1) Û(x 0 1, x 0 2) Û(x 0 2,−t) ] |0〉 〈0|Û(t,−t)|0〉 = lim t→∞(1−i ε) 〈0|T [ φI(x1) φI(x2) Û(t,−t) ] |0〉 〈0|Û(t,−t)|0〉 And finally, plugging in our integral solution for Û(t,−t), equation 141 gives our formula for the 2-point correlation function. 〈Ω|T [ φ(x1) φ(x2) ] |Ω〉 = lim t→∞(1−i ε) 〈0|T [ φI(x1) φI(x2) exp ( −i ∫ t −t dt ′ ĤI(t ′) )] |0〉 〈0|T [ exp ( −i ∫ t −t dt ′ ĤI(t′) )] |0〉 (148) It may not seem like we have made much progress, but we actually have. We will see that the expression on the right is one that we can actually calculate because it is in terms of the free vacuum and the interaction picture fields, which have the same equation of motion as a free field. Because of this, we will see that we can calculate these expectation values much in the same way we were able to calculate the expectation value called the Feynman propagator for free field theory in Section 3.3.8. Also, we will see that the fact that this expression is the ratio of expectation values will allow many terms to factor and cancel. For the general n-point function, our derivation of the general formula analogous to equation 148 would be exactly the same except we would have had more φI operators to write down and more Û operators to lump together. The general formula is 〈Ω|T [ φ(x1) φ(x2) * * * φ(xn) ] |Ω〉 = lim t→∞(1−i ε) 〈0|T [ φI(x1) φI(x2) * * * φI(xn) exp ( −i ∫ t −t dt ′ ĤI(t ′) )] |0〉 〈0|T [ exp ( −i ∫ t −t dt ′ ĤI(t′) )] |0〉 (149) To make things more concrete let's choose a specfic ĤI to plug into equation 148. It is 58 R. Reece called the φ4 theory. Let V (x) = λ 4! φ4(x) where λ is a small positive real number. Then ĤI = ∫ d3x λ 4! φ4I(x) Equation 148 is suited for doing purturbative calculations because since λ is small, the Taylor expansion of the exponential is a purturbation series in λ. The first few terms of the expansion of the numerator in equation 148 are 〈0|T [ φI(x1) φI(x2) exp ( −i ∫ t −t dt′ ĤI(t ′) )] |0〉 = 〈0|T [ φI(x1) φI(x2) ] |0〉 + 1 1! ( −i λ 4! )∫ d4y1 〈0|T [ φI(x1) φI(x2) φ 4 I(y1) ] |0〉 + 1 2! ( −i λ 4! )2 ∫ d4y1 d 4y2 〈0|T [ φI(x1) φI(x2) φ 4 I(y1) φ 4 I(y2) ] |0〉 + * * * (150) We have combined the time integral from equation 148 with the space integral from ĤI into an integral overall all spacetime with the caviet that the time integral is infinitesmially rotated towards the imaginary by the t → ∞(1 − i ε) limit. Each successive term in this series is supressed by higher powers of λ, therefore we only need to calculate as many terms as our precision requires or until the calculation gets unbearably complicated. 4.2 Wick's Theorem What we have really accomplished in equation 148 is to relate expressions like 〈Ω|T [ φ(x2) φ(x1) ] |Ω〉 to a series of expressions like 〈0|T [ φI(x1) φI(x2) φI(x3) * * * ] |0〉. In this section, we will derive a theorem called Wick's theorem that allows us to write any expression of the form of 〈0|T [ φI(x1) φI(x2) φI(x3) * * * ] |0〉 as a product of 2-point functions like 〈0|T [ φI(x1) φI(x2) ] |0〉. Consider the following time-ordered product, where we know y0 > x0. T [ φ(y) φ(x) ] = φ(y) φ(x) = ( φ+(y) + φ−(y) )( φ+(x) + φ−(x) ) = φ+(y) φ+(x) + φ+(y) φ−(x) + φ−(y) φ+(x) + φ−(y) φ−(x) = φ+(y) φ+(x) + φ−(x) φ+(y) + φ−(y) φ+(x) + φ−(y) φ−(x) + [ φ+(y), φ−(x) ] = N [ φ(y) φ(x) ] + [ φ+(y), φ−(x) ] 59 Quantum Field Theory: An Introduction We are using the φ± notation we introduced in equation 117. In the last step we have commuted one term of operators so that all the φ+ operators are on the right and all the φ− operators are on the left. This is what we defined as the normal-ordered product in equation 118. For general times we have T [ φ(y) φ(x) ] = N [ φ(y) φ(x) ] +  [ φ+(y), φ−(x) ] if y0 > x0[ φ+(x), φ−(y) ] if x0 > y0 (151) We will define the Wick contraction of two field operators as follows to denote this difference between time ordering and normal ordering. φ(y)φ(x) ≡ T [ φ(y) φ(x) ] −N [ φ(y) φ(x) ] (152) Now we state an identity, known as Wick's theorem, for converting time order products of multiple operators to a sum of normal ordered products and contractions. T [ φ(x1) φ(x2) * * * φ(xn) ] = N [ φ(x1) φ(x2) * * * φ(xn) ] + all possible contractions (153) To see explicitly what is meant by "all possible contractions," consider the case of n = 4. For brevity, let φi denote φ(xi). T [ φ1 φ2 φ3 φ4 ] = N [ φ1 φ2 φ3 φ4 ] + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 The right-hand side of this identity is known as the "Wick expansion." When a term has uncontracted fields, our notation means that those uncontracted fields are normal ordered. φ1 φ2 φ3 φ4 ≡ φ1 φ3 N [ φ3 φ4 ] To prove Wick's theorem, one uses induction. Wick's theorem is true for n = 2 by definition. We assume it is true for n − 1 and prove that it is true for n. Let us label xi 60 R. Reece such that x01 ≥ x02 ≥ * * * ≥ x0n. T [ φ1 φ2 * * *φn ] = φ1 φ2 * * *φn = φ1 ( N [ φ2 * * *φn ] + ( all contractions not involving φ1 )) = (φ+1 + φ − 1 ) ( N [ φ2 * * *φn ] + ( all contractions not involving φ1 )) In the step before last, we used Wick's theorem for φ2 * * *φn. To make this last step look like Wick's theorem, we want to pull the φ1 into the normal-ordered product. The φ − 1 is already normal-ordered so we can just bring it in, but the φ+1 must be commuted passed all the other operators. φ+1 N [ φ2 * * *φn ] = N [ φ2 * * *φn ] φ+1 + [ φ+1 ,N [ φ2 * * *φn ]] = N [ φ+1 φ2 * * *φn ] + [ φ+1 , φ − 2 ] N [ φ3 * * *φn ] + [ φ+1 , φ − 3 ] N [ φ2 φ3 * * *φn ] + * * * = N [ φ+1 φ2 * * *φn ] + [ φ+1 , φ − 2 ] N [ φ3 * * *φn ] + [ φ+1 , φ − 3 ] N [ φ2 φ3 * * *φn ] + * * * = N [ φ+1 φ2 φ3 φ4 ] + φ1 φ2 φ3 φ4 + φ1 φ2 φ3 φ4 + * * * = N [ φ+1 φ2 φ3 φ4 ] + ( all contractions involving φ1 ) Combining this with the term multiplied by φ−1 gives back the form of Wick's theorem. T [ φ1 φ2 * * *φn ] = N [ φ+1 φ2 * * *φ4 ] + ( all contractions involving φ1 ) + φ−1 ( N [ φ2 * * *φn ] + ( all contractions not involving φ1 )) = N [ φ1 φ2 * * *φn ] + all possible contractions This completes our proof of Wick's theorem. Now recall that what we are really interested in calculating is the free vacuum expectation value of the time ordered product of interaction picture fields, 〈0|T [ φI(x1) φI(x2) φI(x3) * * * ] |0〉. Using Wick's theorem, we can make this the free vacuum expectation value of the Wick expansion. But recall that the vacuum expectation value of any normal-ordered product is zero11. Therefore, any term with uncontracted fields is zero. 〈0|T [ φI(x1) φI(x2) φI(x3) * * * ] |0〉 = 〈0| ( all complete contractions ) |0〉 11See equation 119 61 Quantum Field Theory: An Introduction As before, for brevity, let φi denote φI(xi). Note that now we are also dropping the I to denote interaction picture field. When doing contractions we will always be dealing with interaction picture fields. For example, 〈0|T [ φ1 φ2 φ3 φ4 ] |0〉 = 〈0|φ1 φ2 φ3 φ4|0〉+ 〈0|φ1 φ2 φ3 φ4|0〉+ 〈0|φ1 φ2 φ3 φ4|0〉 So what can we do with these completely contracted terms? From the definition of Wick contraction, equation 152, and knowing that the vacuum expectation value of any normalordered product is zero, we see that the vacuum expectation value of a Wick contraction is actually a Feynman propagator. 〈0|φ1 φ2|0〉 = 〈0|T [ φ1 φ2 ] |0〉 −   :0〈0|N [ φ1 φ2 ] |0〉 = i∆F (x1 − x2) Note from equations 151 and 107, that because a Wick contraction is ultimately a commutator between creation and annihilation operators, as an operator, it is proportional to unity. φ1 φ2 ∝ 1 Therefore, 〈0|φ1 φ2 φ3 φ4|0〉 = 〈0|φ1 φ2|0〉 〈0|φ3 φ4|0〉 and 〈0|T [ φ1 φ2 φ3 φ4 ] |0〉 = 〈0|φ1 φ2|0〉 〈0|φ3 φ4|0〉 + 〈0|φ1 φ3|0〉 〈0|φ2 φ4|0〉 + 〈0|φ1 φ4|0〉 〈0|φ2 φ3|0〉 =−∆F (x1 − x2) ∆F (x3 − x4) −∆F (x1 − x3) ∆F (x2 − x4) −∆F (x1 − x4) ∆F (x2 − x3) Note that the vacuum expectation value of an odd number of field operators will always be zero because it will always leave an uncontracted field. 4.3 Feynman Diagrams So far, in this chapter, we have derived a formula for relating correlation functions to a perturbation series where each term is a free vacuum expectation value of a time-ordered product of interaction picture fields. Then we showed how these vacuum expectation values 62 R. Reece could be converted to products of Feynman propagators using Wick's theorem. Let us go back to considering the numerator of the 2-point function, equation 148, whose first few terms we expanded previously in equation 150. Let us consider writing these terms as Wick contractions yielding Feynman propagators. The first term is already a Feynman propagator. For the next term, we need to write out the Wick expansion. 〈0|T [ φI(x1) φI(x2) φ 4 I(y1) ] |0〉 = 〈0|φI(x2) φI(x1) φI(y1) φI(y1) φI(y1) φI(y1)|0〉 + 〈0|φI(x1) φI(x2) φI(y1) φI(y1) φI(y1) φI(y1)|0〉 + all other complete contractions (154) Note that only a subset of all the complete contractions involve unique combinations of spacetime points. For example, the following contractions are equivalent. 〈0|φI(x1) φI(x2) φI(y1) φI(y1) φI(y1) φI(y1)|0〉 = 〈0|φI(x1) φI(x2) φI(y1) φI(y1) φI(y1) φI(y1)|0〉 To help organize things, we will now introduce a diagrammatic notation, called Feynman diagrams, for keeping track of the possible unique contractions. To denote the contraction of two fields, we draw a line connecting two points representing the spacetime points at which the fields are evaluated. x y ≡ 〈0|φI(y) φI(x)|0〉 = i∆F (x− y) This line equivalently represents a Feynman propagator from one spacetime point to the other. Stitching together these lines, we can summarize the Wick expansions. Using this notation, the first-order term in equation 154 reads as follows. 1 1! ( −i λ 4! )∫ d4y1 〈0|T [ φI(x1) φI(x2) φ 4 I(y1) ] |0〉 = 3 1! 4! x1 x2 y1 + 12 1! 4! x1 x2y1 (155) Each internal vertex has an implied factor of (−i λ) and is integrated over all spacetime. For now, the rest of the numerical factors are left out front. There are only two diagrams because there are only two unique topologies for pairing the operators in contractions. The 63 Quantum Field Theory: An Introduction factors of 3 and 12 count the number of Wick contractions that generate the same diagram. The first term in equation 154 is an example of a contraction contributing to the first diagram, and the second term in equation 154 is an example of a contraction contributing to the second. Moving on to the next (second-order) term in the numerator of the 2-point function, equation 150, we have the following diagrams. 1 2! ( −i λ 4! )2 ∫ d4y1 d 4y2 〈0|T [ φI(x1) φI(x2) φ 4 I(y1) φ 4 I(y2) ] |0〉 = 9 2! (4!)2 + 72 2! (4!)2 + 24 2! (4!)2 + 72 2! (4!)2 + 288 2! (4!)2 + 192 2! (4!)2 + 288 2! (4!)2 (156) You are probably daunted by the combinatoric coefficients in front of these diagrams, but the following trick makes calculating them much more simple. • In a diagram with n vertices, if each vertex is topologically unique, then there are n! combinatoric ways for exchanging the labels of each vertex which give the same diagram. This n! cancels the 1n! in the Taylor series. • If each line coming from a vertex connects that vertex to a unique spacetime point, then there are 4! ways of exchanging the lines which give the same diagram. This 4! cancels the 14! in the λ 4! coupling. This was the reason for putting this factor in the coupling coefficient. If we assume these rules but a diagram has symmetries that violate these rules, then we would have over counted the Wick contractions contributing to that diagram by some symmetry factor. This symmetry factor is the number of ways of interchanging components of the diagram without changing the topology of the diagram. Therefore the overall coefficient of a diagram is the reciprocal of its symmetry factor. Examples of symmetry factors are as 64 R. Reece follows. S = 2 flipping the loop S = 2 * 2 * 2 = 8 flipping each loop and flipping over the whole diagram S = 3! = 6 exchanging the internal lines joining the vertices Combining all these terms up to second-order, we have the following for the numerator of the 2-point function, equation 150. 〈0|T [ φI(x1) φI(x2) exp ( −i λ 4! ∫ d4y φ4I(y) )] |0〉 = + 1 8 + 1 2 + 1 2! 82 + 1 2 * 8 + 1 48 + 1 16 + 1 4 + 1 6 + 1 4 + * * * (157) Looking at this expression carefully, notice that the parts of diagrams that are not connected 65 Quantum Field Theory: An Introduction to any external spacetime points, called vacuum bubbles, can be factored as follows. = [ + 1 2 + 1 4 + 1 6 + 1 4 + * * * ] × [ 1 + 1 8 + 1 16 + 1 48 + * * * + 1 2! ( 1 8 + * * * )2 + * * * ] (158) This motivates the amazing fact that the factor of vacuum bubbles forms an exponential. = [ + 1 2 + 1 4 + 1 6 + 1 4 + * * * ] × exp ( 1 + 1 8 + 1 16 + 1 48 + * * * ) = ∑( external diagrams ) exp (∑( vacuum bubbles )) (159) Note that the 1/2! symmetry factor from exchanging the vacuum vertices in the term with the double figure-eight bubbles in equation 158, factored out to given the proper coefficient for the exponential. Also note that after pulling out this 1/2!, the figure-eight bubble was left with the reciprocal of its own symmetry factor as a coefficient. This shows that even in the exponent, the coefficient of any diagram is just the reciprocal of its symmetry factor. 66 R. Reece Therefore, people often write these sums as just the sum of the possible diagrams, leaving the symmetry factor implied because it can be determined directly from the diagram. From now on, we leave the symmetry factors of individual connected diagrams implied. We prove that the vacuum bubbles factor into an exponential as follows. In general, a diagram has a part with external points and the product of some set of vacuum bubbles. For example Since the set of possible vacuum bubbles is enumerable, albeit infinite, we can label the i-th vacuum bubble by Vi, including its symmetry factor. Vi ∈  , , , , * * *  Then the value of a diagram with a given external part of a diagram and a given set of vacuum bubbles is( a given external part with {ni} bubbles ) = (external part) (∏ i 1 ni! V nii ) (160) where {ni} is an ordred set of integers indicating that there are ni coppies of the Vi bubble in the total diagram. That external part can be can be paired with every possible combination of vacuum bubbles, so we sum over every possible ordered set {ni}.( a given external part with all possible bubbles ) = (external part) ∑ {ni} (∏ i 1 ni! V nii ) (161) Finally, we have to sum over every possible external diagram, each term having a sum over all sets of vacuum bubbles, so we factor the vacuum bubbles out. 〈0|T [ φI(x1) φI(x2) exp ( −i λ 4! ∫ d4y φ4I(y) )] |0〉 = ∑( external diagrams ) ∑ {ni} (∏ i 1 ni! V nii ) This sum over all possibles sets {ni} can be written as a product of sums over all values ni, 67 Quantum Field Theory: An Introduction the cross terms giving every possible combination. = ∑( external diagrams ) (∑ n1 1 n1! V n11 )(∑ n2 1 n2! V n22 )(∑ n3 1 n3! V n33 ) * * * = ∑( external diagrams ) ∏ i (∑ ni 1 ni! V nii ) = ∑( external diagrams ) ∏ i exp(Vi) = ∑( external diagrams ) exp (∑ i Vi ) Therefore we have verified equation 159. We have simplified the numerator of the 2-point function as much as possible. Now we turn our attention to the denominator. Using the same argument we used on the numerator: expanding out the exponential, writing the Wick contractions for each term, converting each Wick contraction to a Feynman diagram, and regrouping the terms into an exponential; we would find that the denominator is simply the exponential of the sum of all vacuum bubbles. 〈0|T [ exp ( −i λ 4! ∫ d4y φ4I(y) )] |0〉 = exp ( + + + * * * ) (162) Therefore12, it cancels the exponential in the numerator. To summarize, for the 2-point 12One may be tempted to look at equation 162 and think that on the left side one can bring the time ordering and expectation value into the exponential, and conclude that the arguments of the exponentials themselves are equal, but this is not so. Recall that the left exponential is actually an abuse of notation, a mnemonic to denote the entire Dyson series, equation 141. Each successive term in the "exponential" on the left has an additional integration over a new spacetime point. Without expanding the series to reveal all the integrals, we would never get additional vertices for the higher order vacuum bubbles. The exponential on the right, however, is an exponential in the true sense. Recall that the Feynman diagrams really represent a product of Feynman propagators, which is just a scalar function. So the argument of this exponential is just a number, even though doing the integrals shows that it is actually infinite. 68 R. Reece function we have. G(2)(x1, x2) ≡〈Ω|φ(x1) φ(x2)|Ω〉 = ∑( external diagrams ) = + + + + + * * * (163) This factoring and canceling of the vacuum bubbles works the same way for all n-point functions, so in general, the n-point function is given by G(n)(x1, x2, . . . , xn) ≡〈Ω|φ(x1)φ(x2) * * * φ(xn)|Ω〉 = ∑( diagrams with n external points ) (164) This sum is over all possible diagrams where every connected part of the diagram has an external point, but these connected parts can be disconnected from one another. Examples 69 Quantum Field Theory: An Introduction of terms like this are in the 4-point function, whose series is as follows. G(4) = + + + + + * * * + + + * * * + * * * 70 R. Reece Acknowledgments I gratefully acknowledge the help and perspective of many others in preparing this document. All the remaining errors and defficiencies in clarity are my fault. Burt Ovrut, Peskin and Schroeder (1995), David Tong. References