7. Quantum Electrodynamics#
We saw in The Canonical Formalism how Lagrangian (densities) may be guessed, passed to the interaction picture, and verified against the free fields derived from Lorentz invariance and causality in Quantum Fields and Antiparticles. There were, however, cases we couldn’t handle back in Massless Fields, which include, in particular, massless vector fields. It is this difficulty we will try to handle in this section, which, as it turns out, leads to the most successful quantum field theory to date, namely, the quantum electrodynamics (QED).
The plan to quantize electrodynamics is the following. First, we’ll show that the failed attempt to write down a free massless helicity-\(1\) vector field as Eq.4.7.12 can be remedied by the introduction of a gauge symmetry. The introduction of gauge symmetry turns out to also introduce constraints on the canonical variables, which we’ll try to fix using Dirac’s method discussed in Constraints and Dirac Brackets. Once the constraints have been dealt with, we pass to the interaction picture and write down the Hamiltonian as before. With the propagators calculated using the appropriate commutation relations, the machinery of Feynman diagrams may be applied to calculate S-matrix elements. This is done first theoretically by writing down a recipe, and then applied explicitly to one example – the famous Compton scattering.
7.1. Gauge Invariance and Action#
Recall from The failure for vector fields that we’ve tried to write down a quantum vector field \(a_{\mu}(x)\) for massless helicity-\(1\) particle in Eq.4.7.12. The issue, which stems from the obviously non-Lorentz-invariant gauge fixing condition Eq.4.7.14, is that \(a_{\mu}(x)\) fails to be covariant in light of Eq.4.7.18.
From perspective of Lagrangian formalism, however, it’s completely fine that the field itself is not covariant, as long as the action is Lorentz invariant. This consideration motivates the introduction of the co-called mass action \(I_M\), which is required to be invariant under the gauge transformation
Here we switch to the uppercase \(A_{\mu}\) to indicate that it is considered to be a Heisenberg-picture field, i.e., a field that knows everything about the interactions. Note also that an explicit formula for \(I_M\) will only be worked out much later, after we formally apply all sorts of symmetry principles.
Now the variation of \(I_M\) with respect to \(A_{\mu}(x)\) can be written as follows
It follows from integration-by-parts that in order for \(\delta I_M = 0\), one must have
Note that this is trivially true if \(I_M\) is a functional of
and its derivatives.
To construct an \(I_M\) that satisfies Eq.7.1.2, we need to couple \(A_{\mu}(x)\) to a conserved \(4\)-current \(J^{\mu}(x)\) in the sense that \(\p_{\mu} J^{\mu} = 0\). This will guarantee
and hence Eq.7.1.2. Here we’ve secretly absorbed a potential proportional constant into the definition of \(J^{\mu}\), which, in turn, will be absorbed into the definition of charges will be discussed below.
Now recall from Global symmetries that such conserved \(4\)-current can be obtained by assuming symmetries on \(I_M\) in the form of Eq.6.3.3. In QED, we’re specifically interested in variations of the following form
which is diagonalized into charges \(q_{\ell}\). Following discussions in Lagrangian density preserving symmetry, if we assume
such that \(\Lscr_M\) is invariant under \(\Psi_{\ell} \to \Psi_{\ell} + \delta \Psi_{\ell}\) with a constant \(\epsilon(x) = \epsilon\). Then the conserved current, according to Eq.6.3.9, takes the following form
Let
be the conserved charge operator. Then it follows from Eq.6.3.11 (modulo an extremely confusing shuffle of notations)
where \(\Psi_{\ell}\) is now officially one of the canonical variables.
To summarize, we have concluded that the matter action \(I_M\) is invariant under the joint (local) transformation
on \(A_{\mu}(x)\) and \(\Psi_{\ell}(x)\). Symmetries like this are known as gauge symmetries.
It turns out that in QED, besides the matter action, we also need a light action which takes the following form
It’s hard to argue logically for why such a term is needed, but it turns out to work, and it appears also in the Lagrangian density Eq.6.4.6 of massive spin-\(1\) vector fields.
Combining the mass and light actions together, we see that the field equation is given by the variational principle as follows
which is recognized as the inhomogeneous Maxwell equations. The homogeneous Maxwell equations
follows readily from the definition Eq.7.1.3.
To recap our (or rather, Weinberg’s) approach to QED, it starts with a partially successful construction of a massless helicity-\(1\) vector field \(a_{\mu}(x)\) from Eq.4.7.12. Its transformation law Eq.4.7.18, abstracted into Eq.7.1.1, necessitates a conserved coupling field \(J^{\mu}\), which then comes out of a variational principle. In the end, we’ve derived the gauge symmetry Eq.7.1.7 as a consequence of the need of a Lorentz invariant Lagrangian density.
It’s very interesting that the above argument can be totally reverted, as is done in most textbooks. We outline this reversed argument in the dropdown below.
7.2. Constraints and Gauge Conditions#
Recall from the previous section that the QED Lagrangian density splits up into the matter part and the light parts as follows
where
and \(\Lscr_M\) involves both \(\Psi\)-field and \(A\)-field. Moreover, in light of Eq.7.1.9 and Eq.7.1.10, there is no involvement of derivatives of \(A\)-field in \(\Lscr_M\). It follows that if we define the field conjugate to \(A_{\mu}\) as follows
then just as in the theory of (massive) spin-\(1\) vector fields, we get the following primary constraint
Moreover we use the Euler-Lagrange equation to find the secondary constraint as follows
where the middle equality is one of the Euler-Lagrange equations Eq.6.2.5 with the time-derivative term dropped out due to the primary constraint, the equality on the left is the definition Eq.7.2.2, and the equality on the right follows from Eq.7.1.4.
Using Eq.7.1.5 and letting \(Q \coloneqq \Psi\) (not to be confused with the charge operator Eq.7.1.6 defined in the previous section) and \(P \coloneqq \delta \Lscr / \delta \dot{\Psi}\) be the conjugate canonical variables, we can write the charge density \(J^0\) as follows
Together with Eq.7.2.3 and Eq.7.2.4, we have obtained two constraints
Obviously the Poisson bracket \([\chi_1, \chi_2]_P = 0\). Hence the constraints are of first class as discussed in First class constraints. Moreover, one cannot hope to solve explicitly for \(A_0\), the conjugate of \(\Pi^0\), in terms of the other canonical variables, as we did in the massive case Eq.6.4.9, due to the presence of gauge symmetry Eq.7.1.7.
One way, which is unfortunately not Lorentz invariant, to solve this problem is to “fix the gauge” by imposing an artificial condition on the \(A\)-field so that \(A_0\) may be solved explicitly. [1] It turns out that there are many such conditions that find their use cases under various circumstances. Some of the most common gauge-fixing conditions are listed below
Name |
Condition |
---|---|
Lorenz gauge [2] |
\(\p_{\mu} A^{\mu} = 0\) |
Coulomb gauge |
\(\nabla \cdot \Abf = 0\) |
Temporal gauge |
\(A^0 = 0\) |
Axial gauge |
\(A^3 = 0\) |
Important
We will use the Coulomb gauge unless otherwise specified throughout this chapter. The reason, according to J. Schwinger, is that it is in this gauge that photons come out as helicity-\(1\) particles.
Two things remain to be settled. One is to argue that the \(A\)-field can always be put into the Coulomb gauge, and the other is to solve for \(A^0\) in terms of the other fields.
To address the first point, it suffices, according to Eq.7.1.7, to show that for any \(A = (A_0, \Abf)\), there exists a function \(\lambda(\xbf)\) such that
The equation on the right is known as Poisson’s equation and can be solved using Green’s function. In fact, the same equation shows up again when solving for \(A^0\), which we now explain.
Combining the secondary constraint Eq.7.2.4 (cf. Eq.7.2.1) with the Coulomb gauge condition, we have
This equation can be solved explicitly [3] by
where the charge density \(J^0\) can be further written in the form of Eq.7.2.5.
Now Eq.7.2.3 and Eq.7.2.7 allow us to get rid of the constrained canonical variables \(A_0\) and \(\Pi^0\), which, as we will show in the next section, removes the first class constraints.
7.3. Quantization in Coulomb Gauge#
In the previous section, we imposed a Coulomb gauge condition \(\nabla \cdot \Abf = 0\) and used it to eliminate the constrained variables \(A_0\) and \(\Pi^0\). Now we’re facing a new pair (of families parametrized by spatial coordinates \(\xbf\)) of constraints, listed as follows
where we also remember that \(J^0\) can be further expressed in terms of canonical variables as in Eq.7.2.5.
The \(C\) matrix as defined by Eq.6.5.4 is given in this case by
where, for example, the upper-right entry \(C_{1\xbf, 2\ybf}\) may be calculated as follows
Clearly \(C\) is non-singular, and therefore the constraints Eq.7.3.1 are of second class. To apply Dirac’s method, we need the inverse matrix \(C^{-1}\) given by
Indeed, one easily verifies that \(C C^{-1} = 1\) by, for example, the following calculation
where the last equality follows again from the solution to Poisson’s equation.
Now we apply Dirac’s recipe Eq.6.5.6 and Eq.6.5.5 to compute the commutators as follows
It’s straightforward to check that they are indeed compatible with the constraints Eq.7.3.1.
Next let’s bring in the matter fields introduced in Eq.7.2.5. Due to the appearance of \(J^0\) in the constraints Eq.7.3.1, one can check that while \(\Abf\) commutes with the matter fields, \(\bm{\Pi}\) doesn’t. Indeed, if \(F\) is any function of the matter fields \(Q_n\) and \(P^n\), then we can calculate the Dirac bracket as follows
where we’ve used Eq.7.2.6 in the third equality.
This calculation motivates the following definition a new field
where the last equality follows from Eq.7.3.3, such that
for any function \(F\) of matter fields. Moreover, one can verify that \(\bm{\Pi}_{\bot}\) satisfies exactly the same commutation relations with \(\Abf\) just as \(\bm{\Pi}\) which we repeat as follows
Finally, note that the second constraint in Eq.7.3.1 can now be written as follows
With all the preparations above, let’s first write down the Hamiltonian in a rather general form
despite the existence of constraints. For the rest of this chapter, we’ll be considering Lagrangian density of the following form
where \(J^{\mu}\) is the charge density as before, but \(\Lscr_{\text{matter}}\) is different from \(\Lscr_M\) considered in Eq.7.1.10 since it contains only the terms that don’t interact with \(A\), i.e., only the \(Q\) and \(P\) fields.
Note
Both \(J^{\mu}\) and \(\Lscr_{\text{matter}}\) will be further specified later for a theory specific to spin-\(1/2\) fermions, e.g., electrons. In fact, according to [Wei95] (page 349), the QED for spinless particles would require a more complicated Lagrangian density than Eq.7.3.8, but it wasn’t mentioned in the book which physical particle(s) such theory would describe.
Plugging Eq.7.3.8 into Eq.7.3.7, and making use of Eq.7.3.4 and Eq.7.3.3, we can write the Hamiltonian, again with a separation between matter and light, as follows
where
Expanding out \((\bm{\Pi}_{\bot} + \nabla A^0)^2\) in Eq.7.3.9 and using integration-by-parts together with Eq.7.3.6 and Eq.7.2.6, we can further rewrite Eq.7.3.9 as follows
Note
The term \(\tfrac{1}{2} J^0 A^0\) in Eq.7.3.10 gives nothing but the Coulomb energy as the following calculation (cf. Eq.7.2.7) shows
where we’ve suppressed the \(t\)-dependence as usual in the Lagrangian formalism.
7.4. Electrodynamics in the Interaction Picture#
As before, let’s split the Hamiltonian Eq.7.3.10 into the free part and the interaction part as follows
where \(V_{\text{Coul}}\) is defined by Eq.7.3.11 and \(H_{\text{matter}} = H_{\text{matter}, 0} + V_{\text{matter}}\) is the splitting of the matter Hamiltonian into the free and interaction parts.
Before passing to the interaction picture, let’s make one more (potentially confusing) change of notation:
Warning
For the rest of this chapter, we’ll simply write \(\bm{\Pi}\) in place of \(\bm{\Pi}_{\bot}\), which is not the original \(\bm{\Pi}\) as in Eq.7.3.4. Since \(\bm{\Pi}\) and \(\bm{\Pi}_{\bot}\) satisfy the same commutation relations with the other fields such as \(\Abf\) and the matter fields, the only thing we need to keep in mind is that we should use the constraint Eq.7.3.6 rather than the second equation in Eq.7.3.1.
Finally we can introduce the interaction-picture fields \(\abf, \bm{\pi}, q, p\) corresponding to the Heisenberg-picture fields \(\Abf, \bm{\Pi}, Q, P\), respectively. We’ll focus in this section on the \(\abf\) and \(\bm{\pi}\) fields. In fact, the letters \(q, p\) will soon be occupied by something completely different, namely, the momentum-space coordinates. It turns out in the theory for spin-\(1/2\) particles, which will be completely specified in the next section, the matters fields will be named by another letter, so we’ll not run into conflicts of notations.
The interaction-picture free Hamiltonian \(H_0\) takes the following form
The commutation relations between \(\abf\) and \(\bm{\pi}\) follow directly from Eq.7.3.5 and the definition of interaction-picture fields Eq.2.5.7
Moreover, they satisfy the following constraints
due to the Coulomb gauge condition and Eq.7.3.6, respectively.
For later reference, let’s note that the Coulomb interaction \(V_{\text{Coul}}\) defined by Eq.7.3.11 becomes
in the interaction picture, and the interaction part of the full Hamiltonian becomes
To derive the field equations, we resort to Hamilton’s equations Eq.6.1.7 as follows
These two equations combined together give the familiar wave equation
To upgrade \(\abf\) to a \(4\)-vector field, we must set
due to the assumption that \(a_0\) shouldn’t depend on the charge density, and it must vanish when charge density vanishes according to Eq.7.2.7.
The general solutions to Eq.7.4.6 and Eq.7.4.7, under the constraint Eq.7.4.3, can be written as follows
where \(p_0 = |\pbf|\) and \(e_{\mu}(\pbf, \pm 1)\) are two independent “polarization vectors” satisfying the following familiar conditions (cf. Eq.4.7.14)
It follows that \(\ebf(\pbf, \sigma)\) can be normalized as follows
Without working out the calculations, we claim, following [Wei95] page 352 –353, that the commutation relations Eq.7.4.2 are satisfied if the operators \(a(\pbf, \sigma)\) and \(a^{\dagger}(\pbf, \sigma)\) satisfy
Moreover, the free-photon Hamiltonian, i.e., Eq.7.4.1 without the matter term, can be written as follows
where the last expression contains one of the infinities in QED.
7.5. The Photon Propagator#
As explained in The Feynman Rules, to calculate the S-matrix using Feynman diagrams, one must calculate the propagators defined in Eq.5.1.9. Using Eq.7.4.8 and the commutation relations Eq.7.4.10, the photon propagator can be calculated as follows
Since the spinor sum
doesn’t depend on \(p_0\), it follows from Eq.5.2.7 and Eq.5.2.13 and the massless condition that
where \(q^2 = q_0^2 - |\qbf|^2\) as usual but \(q\) is not constrained to the mass shell.
Using the momentum-space Feynman rules derived in Feynman rules in momentum space, one gets a contribution of
for each internal photon line in a Feynman diagram.
Note that the photon line contribution given by Eq.7.5.3 is not Lorentz covariant, which is ultimately a consequence of the \(A\)-field \(a_{\mu}\) (cf. Eq.7.4.8) not being Lorentz covariant. It turns out, rather miraculously, that such Lorentz non-covariance can be countered by yet another Lorentz non-covariant term in the interaction Hamiltonian, namely, the Coulomb interaction Eq.7.4.4
The heuristic for such cancellation starts by rewriting the spinor sum Eq.7.5.1 as follows
where \(n = (1, 0, 0, 0)\). Here \(q_0\) is completely arbitrary, but its value will be fixed in Feynman diagram evaluations, to be discussed in the next section, by momentum conservation at each vertex.
The point is that the last three terms in the nominator are proportional to \(q_{\mu}\) or \(q_{\nu}\), which means that they will not contribute to the S-matrix due to the coupling \(j^{\mu} a_{\mu}\) in the interaction Eq.7.4.5 and the conservation law \(\p_{\mu} j^{\mu} = 0\). The remaining \(-q^2 n_{\mu} n_{\nu} / |\qbf|^2\), which is non-vanishing only if \(\mu = \nu = 0\), can be plugged into Eq.7.5.2, and then follow the Feynman rules to produce the following contribution to the S-matrix
which is countered by \(V_{\text{Coul}}\) (cf. Eq.7.4.4).
It means that effectively, one can replace \(\Delta_{\mu \nu}(x - y)\) with the following
as long as one forgets about the (instantaneous) Coulomb interaction term \(V_{\text{Coul}}(t)\) in Eq.7.4.5. The corresponding photon line contribution Eq.7.5.3 then takes the following form
7.6. Feynman Rules for QED#
Having settled the photon field in the previous section, we’re now ready to specify the matter field in the Lagrangian density Eq.7.3.8. We’ll be considering the theory describing the interaction between a massive spin-\(1/2\) fermion with charge \(q = -e\), [4] e.g., an electron, and photon. In this case, the Lagrangian density takes the following form
It follows from Eq.7.1.4 that the charge density \(J^{\mu}\) takes the following form
The interaction Eq.7.4.5 now takes the following form
where we remember that \(V_{\text{Coul}}\) becomes irrelevant if we take the photon line contribution to the Feynman diagram to be Eq.7.5.4.
Following the general recipe described in Feynman rules in momentum space, let’s spell out the key points, while ignoring routines, in constructing and evaluating Feynman diagrams in QED as follows.
First of all, a Feynman diagram consists of electron lines, photon lines, and \(3\)-valent vertices, each of which has one incoming electron line, one outgoing electron line, and one photon line attached. Electron lines that flow backward in time may also be called positron lines. To each internal line an off-mass-shell \(4\)-momentum is attached. To each external line an on-mass-shell \(4\)-momentum as well as a spin \(z\)-component or helicity, depending on whether it’s an electron line or a photon line, are attached.
Next, to evaluate a Feynman diagram, we associate factors to each vertex, external line, and internal line as follows.
- Vertices
To each vertex, we assign a \(\gamma\)-matrix (cf. Eq.4.4.7) index \(\alpha\) to the incoming electron line, and a similar index \(\beta\) to the outgoing electron line, and a spacetime index \(\mu\) to the photon line. Moreover, such a vertex contributes the following factor
\[(2\pi)^4 e (\gamma^{\mu})_{\beta \alpha} \delta^4(k - k' \pm q)\]where \(k, k'\) are the \(4\)-momentum of the incoming and outgoing electrons, respectively, and \(\pm q\) is the \(4\)-momentum of the incoming/outgoing photon.
- External lines
In what follows, we’ll use \(\alpha\) (for incoming) and \(\beta\) (for outgoing) to index Dirac spinors as in Eq.4.4.31, and \(\mu\) to index photon spinors as in Eq.7.4.8.
For each in-state electron line, include a factor \((2\pi)^{-3/2} u_{\alpha}(\pbf, \sigma)\).
For each in-state positron line, include a factor \((2\pi)^{-3/2} \bar{v}_{\beta}(\pbf, \sigma)\).
For each out-state electron line, include a factor \((2\pi)^{-3/2} \bar{u}_{\beta}(\pbf, \sigma)\).
For each out-state positron line, include a factor \((2\pi)^{-3/2} v_{\alpha}(\pbf, \sigma)\).
For each in-state photon line, include a factor \((2\pi)^{-3/2} (2p_0)^{-1/2} e_{\mu}(\pbf, \sigma)\).
For each out-state photon line, include a factor \((2\pi)^{-3/2} (2p_0)^{-1/2} e_{\mu}^{\ast}(\pbf, \sigma)\).
- Internal lines
In what follows, the same indexing convention as for external lines is used. In fact, to index multiple photon spinors, we’ll use \(\mu\) and \(\nu\).
For each internal electron line from \(\beta\) to \(\alpha\), include a factor (cf. Eq.5.2.5 and Eq.5.2.15)
\[\frac{-\ifrak}{(2\pi)^4} \frac{\left( -\ifrak \kslash + m \right)_{\alpha \beta}}{k^2 + m^2 - \ifrak \epsilon}\]where the \(\beta\) matrix in Eq.5.2.5 is dropped because of the use of \(\bar{\psi}\), rather than \(\psi^{\dagger}\), in Eq.7.6.2. Moreover, the Feynman slash notation
\[\kslash \coloneqq \gamma^{\mu} k_{\mu}\]is adopted here.
For each internal photon line between \(\mu\) and \(\nu\), include a factor (cf. Eq.7.5.4)
\[\frac{-\ifrak}{(2\pi)^4} \frac{\eta_{\mu \nu}}{q^2 - \ifrak \epsilon}\]
The rest of the calculation is the same as any general evaluation of scattering amplitudes using Feynman diagrams, and was discussed in Spacetime Feynman Rules.
Although the full S-matrix calculation can be done in principle by summing up all Feynman diagrams, it quickly becomes computationally infeasible as the number of vertices in the diagram increases. Therefore the calculation is only practically possible if the contribution of a Feynman diagram also diminishes as the complexity grows. Fortunately, this is indeed the case as we now demonstrate.
Consider a connected Feynman diagram [5] with \(V\) vertices, \(I\) internal lines, \(E\) external lines, and \(L\) (independent) loops. Then the following relations hold
Now the goal is to calculate the constant coefficient involving \(e\) (electric charge) and \(\pi\) (mathematical constant) for any given diagram. According to the explicit Feynman rules described above
Each vertex contributes a factor \((2\pi)^4 e\).
Each internal line contributes a factor \((2\pi)^{-4}\).
Each loop contributes a momentum-space integral (cf. Eq.5.2.16), which, in turn, contributes a factor \(\pi^2\) (cf. Volume of an n-ball).
Note that we don’t include contributions from external lines since they are fixed by the in- and out-states. Multiplying the contributions above all together (and eliminating \(I\) and \(V\) using Eq.7.6.3), the constant coefficient of a Feynman diagram is given as follows
It follows that the Feynman diagrams may be organized by an increasing number of (independent) loops so that the higher order terms are suppressed by a power of
Finally, note that the Feynman rules developed in this section apply readily to the case where the in and out photon states don’t have definite helicity. More precisely, for photon states with definite helicity, the polarization vectors may be taken to be of the following form (cf. Eq.4.7.11)
where \(R(\hat{\pbf})\) rotates \((0, 0, 1)\) to \(\hat{\pbf}\). Now a general polarization vector may be written as
where \(|\alpha_+|^2 + |\alpha_-|^2 = 1\). The polarization is called circular if \(\alpha_+ = 0\) or \(\alpha_- = 0\), and linear if \(|\alpha_+| = |\alpha_=| = 1/\sqrt{2}\), just as in Polarization of photons. Note, in particular, that
7.7. Compton Scattering#
In this section, we’ll go through some really exciting calculations, utilizing many tools we’ve learned so far, on the scattering amplitudes between a photon and an electron, culminating in the famous estimation of the electron radius.
The in-state consists of a on-mass-shell electron labeled by \(4\)-momentum \(p_{\mu}\) and spin \(z\)-component \(\sigma\) and a on-mass-shell photon labeled by \(4\)-momentum \(k_{\mu}\) and polarization vector \(e_{\mu}\). The out-state consists similarly of a on-mass-shell electron labeled by \(p'_{\mu}\) and \(\sigma'\) and a on-mass-shell photon labeled by \(k'_{\mu}\) and \(e'_{\mu}\). In the lowest order of the electric charge \(e\), there are just two Feynman diagrams, shown as follows
Fig. 7.7.1 The two lowest-order Feynman diagrams for the electron-photon scattering. The solid lines are electron lines, and the wavy lines are photon lines.#
The corresponding S-matrix element is given as follows
Evaluating the (trivial) integral, using obvious shorthand notations, and with matrix multiplication rules understood, the amplitude \(S = S(\pbf', \sigma'; \kbf', e',~\pbf, \sigma; \kbf, e)\) may be written as follows
Using the mass-shell conditions \(p^2 = m^2\) and \(k^2 = 0\), the denominators in the big parenthesis may be simplified as follows
Under the assumption that the out-state is different from the in-state, we can split the amplitude \(S\) according to Eq.2.4.4 as follows
where
Now the differential cross-section \(d\sigma\) (not to be confused with spin \(z\)-component \(\sigma\)) according to Eq.2.4.10 is
where \(u\) denotes the relative velocity between the in-state electron and photon which, according to Eq.2.4.11, is given by
Here remember that \(u\) is not physical, albeit being called velocity, as it may exceed \(1\). It turns out to be convenient to work in the so-called “laboratory frame” where the initial electron is at rest, i.e.,
In this frame Eq.7.7.3 becomes simply
Next, to handle \(\delta^4(p'+k'-p-k)\), it’ll be convenient to introduce the following variables that represent photon energies
Splitting the energy-momentum delta function into the energy part and the momentum part
and using the laboratory frame Eq.7.7.4, we see that the momentum delta function
serves the purpose of equating \(\pbf' = \kbf - \kbf'\) and eliminating \(d^3 p'\) from \(d\sigma\) in Eq.7.7.2. It follows that we can write
where \(\theta\) denotes the angle between \(\kbf\) and \(\kbf'\), and the energy delta function
where \(\Omega\) is the solid angle into which the out-state photon is scattered and the solution to
is given by
Plugging our evaluation of \(\delta^4(p'+k'-p-k)\) into Eq.7.7.2, we get
where \(p'_0 = m + \omega - \omega'\) and \(\omega'\) given by the right-hand-side of Eq.7.7.6.
Now let’s evaluate \(|M|^2\) where \(M\) is given by Eq.7.7.1. In fact, we’ll sum over the spin \(z\)-component as in the spin sum evaluations. Besides the fact that such summation makes the calculation easier, it can be justified by noting that spin \(z\)-component is usually not measured in experiments. Instead of working with the specific \(M\) in Eq.7.7.1, let’s first compute more generally with an arbitrary matrix \(A\), using the Dirac field spin sum formula Eq.4.4.33, the following quantity
Applying this calculation to Eq.7.7.1, and using Eq.7.7.5 and Eq.4.4.12, we get
A few tricks can be applied to simplify such a complicated expression. First, recall that the Coulomb gauge condition Eq.7.4.9 implies \(e_0 = 0\), and the laboratory frame condition Eq.7.7.4 implies \(\pbf = 0\). It follows that
This, together with the Clifford algebra relation Eq.4.4.3, in turn, implies the following
and likewise for \(\cancel{e^{\ast}}, \cancel{e'},\) and \(\cancel{{e'}^{\ast}}\). This allows the following simplification
where the last equality uses the fact that the trace of a product of odd number of gamma matrices vanishes.
Further simplifications can be achieved by assuming, on top of previous assumptions including Eq.7.7.8 and Eq.7.6.5, that incoming and outgoing photons are linearly polarized, i.e.,
in which case one can, modulo several rather tedious calculations, arrive at the following simple form
Using Eq.7.7.5, Eq.7.7.6, and the fact that \(\theta\) is the angle between \(\kbf\) and \(\kbf'\), one have the following identities
which can then be plugged into Eq.7.7.9 and be combined with Eq.7.7.7 to obtain the following averaged differential cross-section for linearly polarized photons
This formula is known as the Klein-Nishina formula for linearly polarized photons.
Now if the incoming photon is prepared without any particular polarization (which usually is the case in experiments/observations), then one should further average over the two independent polarization vectors to get the following
Indeed, if we write \(e_1, e_2\) for the two orthogonal, linearly polarized, polarization vectors of the incoming photon, then using Eq.7.5.1 we have
Note
It follows from Eq.7.7.11 that for randomly polarized incoming photon, the scattered photon is preferably polarized in the direction perpendicular to \(\hat{\kbf}\) – the direction of the incoming photon. Since the polarization of the scattered photon is also perpendicular to the direction of the outgoing photon, it must be in the direction perpendicular to the plane of scattering. According to [Wei95] page 368, such phenomenon is indeed observed from eclipsing binary stars. See Chandrasekhar polarization.
Finally let’s average also over the polarizations of the outgoing photon using the following identity
where we recall that \(\theta\) is the angle between \(\kbf\) and \(\kbf'\). Plugging this into Eq.7.7.10, we get
Note that it’s customary to multiply both sides of the above equality by \(2\) since the polarizations \(e\) and \(e'\) are correlated, as observed above.
In the low-energy limit \(\omega \ll m\), we have \(\omega \approx \omega'\) by Eq.7.7.6. It follows that
Integrating over the solid angle \(d\Omega\), we get the so-called Thomson cross-section
Footnotes