2. Scattering Theory#

Physics would have been rather boring if nothing interacts, like the free particles that we have been studying so far. On the flip side, physics would have been impossible if we try to know exactly what happens in the interactions. The middle ground, where we assume that the particles are non-interacting long before and after the interaction, and something mysterious happened in between, is called scattering theory – a place where theories meet experiments.

2.1. Non-Interacting Many-Particles State#

We shall, as always, start from the easiest part of the theory, which is clearly the non-interacting parts. Recall our grand formulae for the Lorentz transformation on one-particle state Eq.1.3.1 and Eq.1.3.10. For a non-interacting many-particles system, it’s conceivable to assume that the Lorentz transformation law is simply a direct product of the individual particles. Since \(U(\Lambda, a) = U(1, a) U(\Lambda, 0)\), we have the following

(2.1.1)#\[\begin{split}U(\Lambda, a) \Psi_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} = &~ \exp(-\ifrak a^{\mu} ((\Lambda p_1)_{\mu} + (\Lambda p_2)_{\mu} + \cdots)) \\ &\times \sqrt{\frac{(\Lambda p_1)_0 (\Lambda p_2)_0 \cdots}{(p_1)_0 (p_2)_0 \cdots}} \\ &\times \sum_{\sigma'_1 \sigma'_2 \cdots} D_{\sigma'_1 \sigma_1}(W_1(\Lambda, p_1)) D_{\sigma'_2 \sigma_2}(W_2(\Lambda, p_2)) \cdots \\ &\times \Psi_{\Lambda p_1, \sigma'_1, n_1; ~\Lambda p_2, \sigma'_2, n_2; ~\cdots}\end{split}\]

where the first component is the translation transformation Eq.1.3.1, the second component is the normalization factor, and the third component is the little group representation (cf. Eq.1.3.5), and the \(\sigma\)’s are either the spin \(z\)-component for massive particles or the helicity for massless particles, and the \(n\)’s are additional (discrete) labels such as mass, charge, spin, etc.

Notice that by writing a many-particles state as \(\Psi_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots}\), we have given the particles an order, which is rather random. Hence the normalization of these states must take permutations of the particles into account. More precisely, we have

(2.1.2)#\[\begin{split}\left( \Psi_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots}, \Psi_{p'_1, \sigma'_1, n'_1; ~p'_2, \sigma'_2, n'_2; ~\cdots} \right) &= \delta^3(\pbf_1 - \pbf'_1) \delta_{\sigma_1 \sigma'_1} \delta_{n_1 n'_1} \delta^3(\pbf_2 - \pbf'_2) \delta_{\sigma_2 \sigma'_2} \delta_{n_2 n'_2} \\ &\quad \pm \text{permutations}\end{split}\]

The sign in front of the permutations has to do with the species of the particles, which will be discussed later. Note that although there are many terms in Eq.2.1.2, there is at most one nonzero term, which happens exactly when the two states differ by a permutation.

To suppress the annoyingly many sub-indexes in the states, we shall use letters such as \(\alpha, \beta, \cdots\) to denote the compound index such as \((p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots)\), so that, for example, Eq.2.1.2 can be simplified as

\[\left( \Psi_{\alpha}, \Psi_{\alpha'} \right) = \delta(\alpha - \alpha')\]

where the integral volume element reads

\[\int d\alpha \cdots = \sum_{\sigma_1, n_1; ~\sigma_2, n_2; ~\cdots} \int d^3 \pbf_1 d^3 \pbf_2 \cdots\]

We have postulated that the transformation law Eq.2.1.1 works for non-interacting particles, but in fact, it’s also only possible for non-interacting particles. One way to see this is through an energy calculation by letting \(\Lambda = 1\) and \(a = (\tau, 0, 0, 0)\) in Eq.2.1.1 to see that

\[\exp(\ifrak \tau E_{\alpha}) \Psi_{\alpha} = \exp(\ifrak \tau H) \Psi_{\alpha} = \exp(\ifrak \tau (E_1 + E_2 + \cdots)) \Psi_{\alpha} \implies E_\alpha = E_1 + E_2 + \cdots\]

where \(E_i \coloneqq (p_i)_0\) is the energy of the \(i\)-th particle. There is obviously no energy left for any interaction.

2.2. In- and Out-states#

As mentioned earlier, scattering theory is concerned with a scenario where interactions happen within a finite time period, long before and after which the system can be regarded as non-interacting. We can therefore define the in-state \(\Psi_{\alpha}^-\) and the out-state \(\Psi_{\alpha}^+\), where \(\alpha\) is the compound index as defined in the previous section, such that the states appear to be non-interacting with the prescribed particle states when observed at \(t \to \mp \infty\), respectively. [1]

Now it’s time to bring forward an implicit assumption on the quantum states that we’ve been studying so far: they’re defined in one chosen inertial frame. Indeed, the Lorentz transformation law Eq.2.1.1 tells us exactly how to transform the state to any other frame. States of this sort are called Heisenberg picture states: they contain the entire history/future of the system and are not dynamical in time as opposed to the so-called Schrödinger picture states.

Back to the scattering scenario, let’s imagine a reference observer \(\Ocal\), who at \(t = 0\) observes that the system is in a state \(\Psi\). Then imagine another observer \(\Ocal'\) at rest with respect to \(\Ocal\), who sets his clock \(t' = 0\) when \(t = \tau\), in other words \(t' = t - \tau\). Then from the viewpoint of \(\Ocal'\), the time-\(0\) state should look like \(\exp(-\ifrak \tau H) \Psi\). It follows that the state \(\Psi\), viewed long before and long after the reference \(t = 0\), should look like \(\exp(-\ifrak \tau H) \Psi\) for \(\tau \to \mp\infty\), respectively.

It follows that energy eigenstates such as \(\Psi_{\alpha}\) will look the same at all time since

\[\exp(-\ifrak \tau H) \Psi_{\alpha} = \exp(-\ifrak \tau E_{\alpha}) \Psi_{\alpha}\]

creates merely an inconsequential phase factor. This is one form of the uncertainty principle: if the energy is definitely known, then the time is completely unknown. Therefore we must consider a localized packet (or superposition) of states as follows

(2.2.1)#\[\int d\alpha ~g(\alpha) \Psi_{\alpha}\]

where \(g(\alpha)\) is a reasonably smooth function (e.g. without poles) which is non-vanishing within a finite range of energies. We can then demand that the time limits

\[\exp(-\ifrak \tau H) \int d\alpha ~g(\alpha) \Psi_{\alpha}^{\pm} = \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Psi_{\alpha}^{\pm}\]

as \(\tau \to \pm\infty\), respectively, approach the corresponding superpositions of non-interacting particle states.

To be more precise, let’s split the Hamiltonian into the free part and the interaction part as follows

(2.2.2)#\[H = H_0 + V\]

such that the energy eigenstates \(\Phi_{\alpha}\) of \(H_0\) (in the same frame as \(\Psi_{\alpha}^{\pm}\)) transform according to Eq.2.1.1. Then the asymptotic freeness translates into the following conditions

(2.2.3)#\[\lim_{\tau \to \pm\infty} \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Psi_{\alpha}^{\pm} = \lim_{\tau \to \pm\infty} \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Phi_{\alpha}\]

or equivalently in terms of the Hamiltonians

(2.2.4)#\[\lim_{\tau \to \pm\infty} \exp(-\ifrak \tau H) \int d\alpha ~g(\alpha) \Psi_{\alpha}^{\pm} = \lim_{\tau \to \pm\infty} \exp(-\ifrak \tau H_0) \int d\alpha ~g(\alpha) \Phi_{\alpha}\]

This motivates the following definition

(2.2.5)#\[\Omega(\tau) \coloneqq \exp(\ifrak \tau H) \exp(-\ifrak \tau H_0)\]

so that \(\Psi_{\alpha}^{\pm} = \Omega(\pm\infty) \Phi_{\alpha}\), at least formally. Moreover, since \(\Omega\) is unitary, the in- and out-states \(\Psi_{\alpha}^{\pm}\) are normalized as long as \(\Phi_{\alpha}\) are normalized.

In practice it will be assumed that the interaction term \(V\) in Eq.2.2.2 is relatively small so that a formal solution as power series in \(V\) may be meaningful. As the first step, let’s try to apply Eq.2.2.2 to \(\Psi_{\alpha}^{\pm}\) as follows

\[E_{\alpha} \Psi_{\alpha}^{\pm} = H \Psi_{\alpha}^{\pm} = (H_0 + V) \Psi_{\alpha}^{\pm} \implies (E_{\alpha} - H_0) \Psi_{\alpha}^{\pm} = V \Psi_{\alpha}^{\pm}\]

Note that \(\Phi_{\alpha}\) is also annihilated by \(E_{\alpha} - H_0\). Considering the asymptotic Eq.2.2.3 or Eq.2.2.4, it’s reasonable to guess the following formal solution

(2.2.6)#\[\Psi_{\alpha}^{\pm} = \Phi_{\alpha} + (E_{\alpha} - H_0 \mp \ifrak \epsilon)^{-1} V \Psi_{\alpha}^{\pm}\]

where the infinitesimal \(\mp \ifrak \epsilon\) is a mathematical trick added to avoid division by zero, and the signs will be justified momentarily. One can obviously apply Eq.2.2.6 recursively to get an expansion of \(\Psi_{\alpha}^{\pm}\) as a power series in \(V\), and we shall come back to this point later. In order to express \(\Psi_{\alpha}^{\pm}\) in terms of \(\Phi_{\alpha}\), let’s expand the right-hand-side of Eq.2.2.6 as follows

(2.2.7)#\[\Psi_{\alpha}^{\pm} = \Phi_{\alpha} + \int d\beta ~\frac{(\Phi_{\beta}, V \Psi_{\alpha}^{\pm}) \Phi_{\beta}}{E_{\alpha} - E_{\beta} \mp \ifrak \epsilon}\]

Both Eq.2.2.6 and Eq.2.2.7 are known as the Lippmann-Schwinger equation.

Now let’s justify the term \(\pm \ifrak \epsilon\) by showing that Eq.2.2.7 indeed satisfies the asymptotic condition Eq.2.2.3 as follows

(2.2.8)#\[\begin{split}\int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Psi_{\alpha}^{\pm} &= \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Phi_{\alpha} \\ &\quad + \int d\alpha d\beta ~\frac{\exp(-\ifrak \tau E_{\alpha}) g(\alpha) (\Phi_{\beta}, V \Psi_{\alpha}^{\pm}) \Phi_{\beta}}{E_{\alpha} - E_{\beta} \mp \ifrak \epsilon} \\ &= \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Phi_{\alpha} \\ &\quad + \int d\beta ~\Phi_{\beta} \blue{\int d\alpha ~\frac{\exp(-\ifrak \tau E_{\alpha}) g(\alpha) (\Phi_{\beta}, V \Psi_{\alpha}^{\pm})}{E_{\alpha} - E_{\beta} \mp \ifrak \epsilon}}\end{split}\]

Now the integral colored in blue can be integrated over \(E_{\alpha}\) by a contour that runs from \(-\infty\) to \(+\infty\), followed by a semicircle at infinity, in the upper-half-plane in the case of \(\Psi_{\alpha}^-\) and the lower-half-plane in the case of \(\Psi_{\alpha}^+\), back to \(-\infty\). In either case, the sign in \(\mp \ifrak \epsilon\) is chosen so that the integrant has no poles with infinitesimally small imaginary part, though both \(g(\alpha)\) and \((\Phi_{\beta}, V \Psi_{\alpha}^{\pm})\), viewed as complex functions, may have poles with finite imaginary parts. It follows then from the residual theorem and the damping factor \(\exp(-\ifrak \tau E_{\alpha})\) as \(\tau \to \pm\infty\) that the integral in blue vanishes, as desired.

2.3. S-matrix and its Symmetry#

The S-matrix defined by

(2.3.1)#\[S_{\beta \alpha} \coloneqq \left( \Psi_{\beta}^+, \Psi_{\alpha}^- \right)\]

records the probability amplitude of finding the out-state \(\Psi_{\beta}^+\) given the in-state \(\Psi_{\alpha}^-\). Note that since the in- and out-states both form an orthonormal basis of the same Hilbert space, the S-matrix is unitary. However, the way \(S\) is defined in Eq.2.3.1 disqualifies it as an operator on the Hilbert space. Therefore it’ll be convenient to convert both in- and out-states to the free states and define the S-operator by

(2.3.2)#\[(\Phi_{\beta}, S \Phi_{\alpha}) \coloneqq S_{\beta \alpha}\]

Using Eq.2.2.5 we see that

(2.3.3)#\[\begin{split}& \phantom{\implies} S_{\beta \alpha} = \left( \Omega(\infty) \Phi_{\beta}, \Omega(-\infty) \Phi_{\alpha} \right) = \left( \Phi_{\beta}, \Omega^{\dagger}(\infty) \Omega(-\infty) \Phi_{\alpha} \right) \\ & \implies S = \Omega^{\dagger}(\infty) \Omega(-\infty) \eqqcolon U(\infty, -\infty)\end{split}\]

where

(2.3.4)#\[U(\tau_1, \tau_0) = \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)\]

The most straightforward way to calculate \(S_{\beta \alpha}\) is probably to use Eq.2.2.7 directly. However. this turns out to be rather involved, and doesn’t lead to a simple result. The issue is that we don’t really want to convert both the in- and out-states to the non-interacting states, but rather to push, say, the in-states from the far past to the far future and compare with the out-states. To spell out the details, let’s first calculate the asymptotic of the in-packet as \(\tau \to \infty\) (but omitting the \(\lim_{\tau \to \infty}\) symbol) using Eq.2.2.8

(2.3.5)#\[\begin{split}& \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Psi_{\alpha}^- \\ &\quad = \int d\beta ~\exp(-\ifrak \tau E_{\beta}) g(\beta) \Phi_{\beta} + \int d\beta ~\Phi_{\beta} \int d\alpha \frac{\exp(-\ifrak \tau E_{\alpha}) g(\alpha) (\Phi_{\beta}, V \Psi_{\alpha}^-)}{E_{\alpha} - E_{\beta} + \ifrak \epsilon} \\ &\quad = \int d\beta ~\exp(-\ifrak \tau E_{\beta}) g(\beta) \Phi_{\beta} \\ &\qquad - 2\pi\ifrak \int d\beta ~\Phi_{\beta} \int d\alpha ~\delta(E_{\alpha} - E_{\beta}) \exp(-\ifrak \tau E_{\beta}) g(\alpha) (\Phi_{\beta}, V \Psi_{\alpha}^-) \\ &\quad = \int d\beta ~\exp(-\ifrak \tau E_{\beta}) \Phi_{\beta} \left( g(\beta) - 2\pi\ifrak \int d\alpha ~\delta(E_{\alpha} - E_{\beta}) g(\alpha) (\Phi_{\beta}, V \Psi_{\alpha}^-) \right) \\ &\quad = \int d\beta ~\exp(-\ifrak \tau E_{\beta}) \Phi_{\beta} \int d\alpha ~g(\alpha) \left( \blue{\delta(\alpha - \beta) - 2\pi\ifrak \delta(E_{\alpha} - E_{\beta}) (\Phi_{\beta}, V \Psi_{\alpha}^-)} \right)\end{split}\]

where we’ve used the residue theorem again in the second equality. Next expand the left-hand-side of the equation in terms of the out-states and then let \(\tau \to \infty\)

(2.3.6)#\[\begin{split}\int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \Psi_{\alpha}^- &= \int d\alpha ~\exp(-\ifrak \tau E_{\alpha}) g(\alpha) \int d\beta ~(\Psi_{\beta}^+, \Psi_{\alpha}^-) \Psi_{\beta}^+ \\ &= \int d\beta ~\exp(-\ifrak \tau E_{\beta}) \Psi_{\beta}^+ \int d\alpha ~g(\alpha) S_{\beta \alpha} \\ &= \int d\beta ~\exp(-\ifrak \tau E_{\beta}) \Phi_{\beta} \int d\alpha ~g(\alpha) \blue{S_{\beta \alpha}}\end{split}\]

where we’ve used the fact that the S-matrix contains a \(\delta(E_{\alpha} - E_{\beta})\) factor by energy conservation in the second equality, and the defining property Eq.2.2.3 of the out-state in the third equality.

Equating the blue terms from Eq.2.3.5 and Eq.2.3.6, we’ve derived the following formula

(2.3.7)#\[S_{\beta \alpha} = \delta(\beta - \alpha) - 2\pi\ifrak \delta(E_{\beta} - E_{\alpha}) (\Phi_\beta, V \Psi_{\alpha}^-)\]

Up to the first order in \(V\), one can replace \(\Psi_{\alpha}^-\) on the right-hand-side by \(\Phi_{\alpha}\) and arrive at the so-called Born approximation of the S-matrix.

2.3.1. Lorentz symmetry#

Recall that in Eq.2.1.1, or really in Lorentz symmetry of one-particle states, we understood how Lorentz transformations act on particle states. Now we’d like to understand how they act on the S-matrix. Of course, since \(U(\Lambda, a)\) is unitary, we always have

\[S_{\beta \alpha} = (\Psi_{\beta}^+, \Psi_{\alpha}^-) = \left( U(\Lambda, a) \Psi_{\beta}^+, U(\Lambda, a) \Psi_{\alpha}^- \right)\]

but this is not what we mean by Lorentz symmetry. What we do want to know is, just like in Eq.2.1.1, how Lorentz transformation acts on the particle states, i.e., the (compound) indexes \(\alpha\) and \(\beta\). Now although Eq.2.1.1 doesn’t work for general (interacting) states, it does work for, say, \(\Psi_{\alpha}^-\) in the \(\tau \to -\infty\) limit because of the asymptotic freeness. By Lorentz we mean that \(U(\Lambda, a)\) acts the same way on both in- and out-states. In other words, we’ll be looking for some \(U(\Lambda, a)\) such that the following general formula holds.

(2.3.8)#\[\begin{split}& S_{p'_1, \sigma'_1, n'_1; ~p'_2, \sigma'_2, n'_2; ~\cdots, ~~p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} \\ &\quad = \blue{\exp\left( \ifrak a^{\mu} {\Lambda_{\mu}}^{\nu} \left( (p'_1)_{\nu} + (p'_2)_{\nu} + \cdots - (p_1)_{\nu} - (p_2)_{\nu} - \cdots \right) \right)} \\ &\qquad \times \sqrt{\frac{(\Lambda p'_1)_0 (\Lambda p'_2)_0 \cdots (\Lambda p_1)_0 (\Lambda p_2)_0 \cdots}{(p'_1)_0 (p'_2)_0 \cdots (p_1)_0 (p_2)_0 \cdots}} \\ &\qquad \times \sum_{\bar{\sigma}'_1 \bar{\sigma}'_2 \cdots} D^{\ast}_{\bar{\sigma}'_1 \sigma'_1} (W(\Lambda, p'_1)) D^{\ast}_{\bar{\sigma}'_2 \sigma'_2} (W(\Lambda, p'_2)) \cdots \\ &\qquad \times \sum_{\bar{\sigma}_1 \bar{\sigma}_2 \cdots} D_{\bar{\sigma}_1 \sigma_1} (W(\Lambda, p_1)) D_{\bar{\sigma}_2 \sigma_2} (W(\Lambda, p_2)) \cdots \\ &\qquad \times S_{\Lambda p'_1, \bar{\sigma}'_1, n'_1; ~\Lambda p'_2, \bar{\sigma}'_2, n'_2; ~\cdots, ~~\Lambda p_1, \bar{\sigma}_1, n_1; ~\Lambda p_2, \bar{\sigma}_2, n_2, ~\cdots}\end{split}\]

where we’ve used primes to distinguish between labels from in- and out-states, and bars to distinguish between labels, specifically the spin-\(z\) or helicity, before and after the Lorentz transformation.

Since the left-hand-side doesn’t depend on the translation parameter \(a\), the blue term on the right-hand-side must be \(1\). In other words,

\[p_1 + p_2 + \cdots = p'_1 + p'_2 + \cdots\]

which is nothing but the conservation of (total) momentum. Note that a special case, which is the energy conservation, has already been used in the derivation of Eq.2.3.6 from the previous section.

As a consequence, we can now extract a delta function from the S-matrix as follows

(2.3.9)#\[S_{\beta \alpha} \eqqcolon \delta(\beta - \alpha) - 2\pi\ifrak M_{\beta \alpha} \delta^4 (p_{\beta} - p_{\alpha})\]

which should be compared with Eq.2.3.7.

Back to the core question of this section, how in the world can one engineer a magic \(U(\Lambda, a)\) to satisfy the monstrous Eq.2.3.8? One cannot. But remember that Eq.2.3.8 is readily satisfied for non-interacting particles. It follows that if we consider instead the S-operator defined by Eq.2.3.2, and let \(U_0(\Lambda, a)\) be the Lorentz transformation on free particles defined by Eq.2.1.1, then Eq.2.3.8 would be satisfied if \(U_0(\Lambda, a)\) commutes with \(S\). Indeed, using shorthand notations, we have

\[S_{\beta \alpha} = \left( \Phi_{\beta}, S \Phi_{\alpha} \right) \ = \left( U_0 \Phi_{\beta}, U_0 S \Phi_{\alpha} \right) \ = \left( U_0 \Phi_{\beta}, S U_0 \Phi_{\alpha} \right)\]

where the last quantity is nothing but the right-hand-side in Eq.2.3.8, as desired.

Now in order for \(S\) to commute with \(U_0(\Lambda, a)\), it suffices that it commutes with the infinitesimal generators of \(U_0(\Lambda, a)\), namely,

(2.3.10)#\[\begin{split}\begin{alignat*}{2} &[H_0, S] &&= 0 \\ &[\Pbf_0, S] &&= 0 \\ &[\Jbf_0, S] &&= 0 \\ &[\Kbf_0, S] &&= 0 \end{alignat*}\end{split}\]

where \(H_0, \Pbf_0, \Jbf_0, \Kbf_0\) are discussed in Quantum Lorentz symmetry and satisfy the commutation relations Eq.1.2.22.

This shall be done in three steps, where the commutation between \(S\) and \(P_0, J_0\) will be handled first, followed by \(K_0\), and finally \(H_0\).

Step 1.

Recall from Eq.2.3.3 and Eq.2.3.4 that the S-operator can be understood as a composition of time translations governed by \(H\) and \(H_0\). It’s therefore necessary to understand how the free infinitesimal Lorentz transformations commute with \(H\). To this end, let’s consider the in-states at \(\tau \to -\infty\), which is approximately free. There we can similarly define infinitesimal operators \(\Pbf, \Jbf, \Kbf\) that together with \(H\) satisfy the same commutation relations Eq.1.2.22.

Now comes the crucial part, which is to make assumptions about \(H\) so that Eq.2.3.10 are satisfied. Recall from Eq.2.2.2 that \(H = H_0 + V\) where \(V\) describes the interactions. The first assumption we’ll make is the following

Assumption on \(H\) for Lorentz invariance of S-matrix #1

The interaction \(V\) affects neither the momentum \(\Pbf\) nor the angular momentum \(\Jbf\). In other words, we assume that

(2.3.11)#\[\Pbf = \Pbf_0, ~~\Jbf = \Jbf_0, ~\text{ and }~ [V, \Pbf_0] = [V, \Jbf_0] = 0\]

It follows readily from this assumption that \(H\), and hence \(S\), commutes with \(\Pbf_0\) and \(\Jbf_0\).

Step 2.

Next we turn to \(\Kbf\). This time we cannot “cheat” by assuming that \(\Kbf = \Kbf_0\) because it would led to the undesirable consequence \(H = H_0\) by Eq.1.2.22. So instead, let’s write

(2.3.12)#\[\Kbf = \Kbf_0 + \Wbf\]

where \(\Wbf\) denotes the perturbation term. Let’s calculate

\[\begin{split}[\Kbf_0, S] &= \lim_{\substack{\tau_0 \to -\infty \\ \tau_1 \to \infty\phantom{-}}} [\Kbf_0, U(\tau_1, \tau_0)] \\ &= \lim_{\substack{\tau_0 \to -\infty \\ \tau_1 \to \infty\phantom{-}}} [\Kbf_0, \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)]\end{split}\]

as follow. First using Eq.1.2.22 again, we have

\[\begin{split}\begin{alignat*}{2} [\Kbf_0, \exp(\ifrak \tau H_0)] &= [\Kbf_0, \ifrak \tau H_0] \exp(\ifrak \tau H_0) &&= \tau \Pbf_0 \exp(\ifrak \tau H_0) \\ [\Kbf, \exp(\ifrak \tau H)] &= [\Kbf, \ifrak \tau H] \exp(\ifrak \tau H) &&= \tau \Pbf \exp(\ifrak \tau H) =\tau \Pbf_0 \exp(\ifrak \tau H) \end{alignat*}\end{split}\]

from which we can calculate

(2.3.13)#\[\begin{split}[\Kbf_0, U(\tau_1, \tau_0)] &= [\Kbf_0, \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)] \\ &= [\Kbf_0, \exp(\ifrak \tau_1 H_0)] \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0) \\ &\quad + \exp(\ifrak \tau_1 H_0) [\Kbf - \Wbf, \exp(\ifrak (\tau_0 - \tau_1) H)] \exp(-\ifrak \tau_0 H_0) \\ &\quad + \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) [\Kbf_0, \exp(-\ifrak \tau_0 H_0)] \\ &= \blue{\tau_1 \Pbf_0 \exp(\ifrak \tau H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)} \\ &\quad \blue{+ (\tau_0 - \tau_1) \Pbf_0 \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)} \\ &\quad \blue{- \tau_0 \Pbf_0 \exp(\ifrak \tau_1 H_0) \exp(\ifrak (\tau_0 - \tau_1) H) \exp(-\ifrak \tau_0 H_0)} \\ &\quad - \exp(\ifrak \tau_1 H_0) [\Wbf, \exp(\ifrak (\tau_0 - \tau_1) H)] \exp(-\ifrak \tau_0 H_0) \\ &= -\Wbf(\tau_1) U(\tau_1, \tau_0) + U(\tau_1, \tau_0) \Wbf(\tau_0)\end{split}\]

where \(\Wbf(\tau) \coloneqq \exp(\ifrak \tau H_0) \Wbf \exp(-\ifrak \tau H_0)\). Note that the three blue terms cancel out.

We see that Eq.2.3.13 would vanish as \(\tau \to \pm\infty\) if \(W(\tau) \to 0\). The latter, in turn, would follow from the following assumption

Assumption on \(H\) for Lorentz invariance of S-matrix #2

The matrix elements of \(W\) with respect to the eigenstates \(\Phi_{\alpha}\) of \(H_0\) is smooth, so that \(W(\tau)\) vanishes on any local packet of \(\Phi_{\alpha}\) as in Eq.2.2.1 as \(\tau \to \pm\infty\).

This assumption is line with Eq.2.2.3 and Eq.2.2.4, which are fundamental to the validity of S-matrix theory.

Step 3.

Finally let’s handle the commutation between \(H_0\) and \(S\). Recall from Eq.2.3.3 that \(S = \Omega^{\dagger}(\infty) \Omega(-\infty)\). Hence the idea is to work out how \(H\) and \(H_0\) intertwine with \(\Omega(\pm\infty)\). To this end, let’s use Eq.2.3.13 by setting \(\tau_1 = 0\) and \(\tau_0 = \mp\infty\) as follows

\[[\Kbf_0, \Omega(\mp \infty)] = -\Wbf \Omega(\mp \infty) \implies \Kbf \Omega(\mp\infty) = \Omega(\mp\infty) \Kbf_0\]

Moreover, by Eq.2.3.11, we have also \(\Pbf \Omega(\mp\infty) = \Omega(\mp\infty) \Pbf_0\). Finally, using the commutation relations Eq.1.2.22 again we conclude that

\[H \Omega(\mp\infty) = \Omega(\mp\infty) H_0\]

which completes the proof of Eq.2.3.10.

Note

Besides showing that Eq.2.3.10 hold, our calculations actually establish the following intertwining identities

\[\begin{split}H \Omega(\pm\infty) &= \Omega(\pm\infty) H_0 \\ \Pbf \Omega(\pm\infty) &= \Omega(\pm\infty) \Pbf_0 \\ \Jbf \Omega(\pm\infty) &= \Omega(\pm\infty) \Jbf_0\end{split}\]

which imply, in particular, that the standard commutation relations Eq.1.2.22 also hold in a frame where \(\tau \to \infty\), as expected.

2.3.2. Internal symmetry#

As we noticed in Eq.2.1.1, the Lorentz symmetry doesn’t act on the labels \(n\). An internal symmetry, on the other hand, is a symmetry that leaves \(p\) and \(\sigma\) invariant and acts on the other labels such as charge, spin, and so on. We can write the general form of an internal symmetry on in- and out-states as follows

(2.3.14)#\[U(T) \Psi^{\pm}_{p_1, \sigma_1, n_1;~p_2, \sigma_2, n_2;~\cdots} = \sum_{n'_1, n'_2, \cdots} \Dscr_{n'_1 n_1}(T) \Dscr_{n'_2 n_2}(T) \cdots \Psi^{\pm}_{p_1, \sigma_1, n'_1;~p_2, \sigma_2, n'_2;~\cdots}\]

where \(U(T)\) is the unitary operator associated with the symmetry transformation \(T\), and the \(\Dscr\)’s are analogs of the little group representations from Eq.1.3.5.

Similar to Eq.2.3.8, we can formulate the internal symmetry of S-matrix as follows

(2.3.15)#\[\begin{split}S_{n'_1, n'_2, \cdots, ~n_1, n_2, \cdots} = \sum_{\bar{n}'_1, \bar{n}'_2, \cdots, \bar{n}_1, \bar{n}_2, \cdots} & \Dscr^{\ast}_{\bar{n}'_1 n'_1}(T) \Dscr^{\ast}_{\bar{n}'_2 n'_2}(T) \cdots \\ & \times \Dscr_{\bar{n}_1 n_1}(T) \Dscr_{\bar{n}_2 n_2}(T) \cdots S_{\bar{n}'_1, \bar{n}'_2, \cdots, ~\bar{n}_1, \bar{n}_2, \cdots}\end{split}\]

where we have suppressed the irrelevant \(p\) and \(\sigma\) labels.

For what kind of Hamiltonian \(H\) does there exist an internal symmetry \(U(T)\) that acts like Eq.2.3.14? The answer is similar to the case of Lorentz symmetry. Namely, if we can split \(H = H_0 + V\) into the free and perturbation terms, such that the free symmetry transformation \(U_0(T)\), which satisfies Eq.2.3.14 with \(\Phi\) in place of \(\Psi^{\pm}\), commutes with both \(H_0\) and \(V\).

Similar to the translations in Lorentz symmetry, let’s consider a symmetry \(T(\theta)\) parametrized by a real number. It follows from Eq.1.2.4 that we can write

(2.3.16)#\[U(T(\theta)) = \exp(\ifrak \theta Q)\]

where \(Q\) is a Hermitian operator called the charge. Probably the best known example of it is the electric charge. In this case, we can also write

\[\Dscr_{n n'}(T(\theta)) = \delta_{n n'} \exp(\ifrak \theta q_n)\]

The general formula Eq.2.3.15 then translates into

\[q_1 + q_2 + \cdots = q'_1 + q'_2 + \cdots\]

which is nothing about the conservation of charges. Besides the electric charge, there exist also other similar conserved, or at least approximately conserved, quantities, such as baryon number and lepton number.

2.3.3. Parity symmetry#

Our considerations on Lorentz symmetry has so far been restricted to proper orthochronous Lorentz transformations. Let’s consider the effect of the spatial inversion on the S-matrix now. Recall from Space inversion for massive particles that for non-interacting massive particles

(2.3.17)#\[U(\Pcal) \Psi^{\pm}_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} = \eta_{n_1} \eta_{n_2} \cdots \Psi^{\pm}_{U(\Pcal)p_1, \sigma_1, n_1; ~U(\Pcal)p_2, \sigma_2, n_2; ~\cdots}\]

where \(\eta_n\) denotes the intrinsic parity of particle \(n\). The S-matrix version of the parity symmetry is as follows

(2.3.18)#\[\begin{split}& S_{p'_1, \sigma'_1, n'_1; ~p'_2, \sigma'_2, n'_2; ~\cdots, ~~p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} \\ & \quad = \eta^{\ast}_{n'_1} \eta^{\ast}_{n'_2} \cdots \eta_{n_1} \eta_{n_2} \cdots S_{U(\Pcal)p'_1, \sigma'_1, n'_1; ~U(\Pcal)p'_2, \sigma'_2, n'_2; ~\cdots, ~~U(\Pcal)p_1, \sigma_1, n_1; ~U(\Pcal)p_2, \sigma_2, n_2; ~\cdots}\end{split}\]

While the space inversion operator \(\Pcal\) is defined explicitly in Eq.1.2.11, the parity operator \(U(\Pcal)\) is only characterized by Eq.2.3.17 and Eq.2.3.18. In particular, it’s not uniquely determined if the particle species under question possesses internal symmetries as discussed in the previous section, because their composition with \(\Pcal\) will also satisfy Eq.2.3.17 and Eq.2.3.18, and therefore may equally well be called a parity operator.

Since \(\Pcal^2 = 1\), it’s an obvious question to ask whether \(U(\Pcal)^2 = 1\) necessarily. This would have been the case if \(U\) furnishes a genuine representation, but it doesn’t have to. In general, we have

\[U(\Pcal)^2 \Psi^{\pm}_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} = \eta_{n_1}^2 \eta_{n_2}^2 \cdots \Psi^{\pm}_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots}\]

which looks just like an internal symmetry. Now if \(U(\Pcal)^2\) belongs to a continuous family of internal symmetries, then it may be redefined, by suitably composing with internal symmetries, so that all \(\eta^2 = 1\). Examples of this kind include notably protons and neutrons. On the other hand, non-examples, i.e., those whose intrinsic parity cannot be reduced to \(\pm 1\), include only those hypothetical Majorana fermions.

Todo

Revise this part after I’ve learned more…

Parities of elementary particles

We shall in this section assume familiarity with angular momentum as discussed in Clebsch-Gordan coefficients.

Can parity be other than \(\pm 1\)?

Aside from electric charges, there exist other quantities that are (at least approximately) conserved by internal symmetries, for example, baryon numbers \(B\) and lepton numbers \(L\). Examples of baryons include protons and neutrons. Examples of leptons include electrons, muons and neutrinos. The internal symmetry operator generalizes Eq.2.3.16 in a straightforward way as follows

(2.3.19)#\[U(T(\alpha, \beta, \gamma)) = \exp(\ifrak(\alpha B + \beta L + \gamma Q))\]

so that \(T\) is isomorphic to \(\Rbb^3\) instead of \(\Rbb\). This will be the most general internal symmetry that will be considered here.

By the conservation of angular momentum, the parity of the number of half-integer spin particles, which we denote by \((-1)^F\), is conserved. Here \(F\) stands for fermion. For all known (to Weinberg at least) particles, the following equality of parities holds

(2.3.20)#\[(-1)^F = (-1)^{B + L}\]

In particular, the above mentioned protons, neutrons, electrons, neutrinos are all spin-\(1/2\) particles.

If, for whatever reason, the following holds

\[\orange{U(\Pcal)^2 = (-1)^F}\]

and in addition Eq.2.3.20 holds, then \(U(\Pcal)^2\) is part of a continuous symmetry Eq.2.3.19 and hence can be set to one. A hypothetical example that breaks Eq.2.3.20 is the so-called Majorana fermions that are their own antiparticles, which implies \(B = L = 0\). For these particles, we have \(U(\Pcal)^4 = 1\), and hence the intrinsic parity may be \(\pm 1\) or \(\pm \ifrak\).

Can parity be \(-1\)?

The following reaction is observed experimentally

(2.3.21)#\[\pi^- + d \to n + n\]

where a negative pion is absorbed by a deuteron to produce two neutrons. Moreover, the reaction assumes that the initial state, i.e., the left-hand-side of Eq.2.3.21 has orbital angular momentum \(\ell = 0\) and total angular momentum \(j = 1\). Note that the spin of pion and deuteron is \(0\) and \(1\), respectively.

The conservation of angular momentum demands that the total angular momentum of the right-hand-side must also be \(1\), and this can be achieved, a priori, in a number of ways. Since neutrons have spin \(1/2\), the total spin \(\sfrak\) of \(n + n\) may be either \(0\) or \(1\) by Eq.1.3.26. But since neutrons are fermions and therefore the state \(n + n\) must be anti-symmetric, we conclude that \(\sfrak = 0\) by Eq.1.3.24. [2] Then it follows again from Eq.1.3.26 that the orbital angular momentum of the right-hand-side of Eq.2.3.21 must be \(1\). We are left with only one choice.

Now since the orbital angular momentum changes from \(0\) in the initial state to \(1\) in the final state, the S-matrix elements flip sign by the action of \(U(\Pcal)\) (WHY? I guess I’m missing knowledge about how orbital angular momentum enters the S-matrix.). It follows from Eq.2.3.18 that

\[\eta_{\pi^-} \eta_d = -\eta_n^2\]

Deuteron is a nucleus consisting of a proton and a neutron. By the previous discussions about internal symmetries, one can arrange so that they have the same intrinsic parity and hence \(\eta_d = \eta_n^2\). It follows that \(\eta_{\pi^-} = -1\) and the pion \(\pi^-\) is what we set out to look for. Indeed, all its companions \(\pi^0\) and \(\pi^+\) also have parity \(-1\) due to the isospin symmetry.

The fact that \(\eta_{\pi} = -1\) had led to a profound consequence because it was discovered through experiments that there are two spin-\(0\) particles, now known as \(K\)-mesons, one of which decays into two pions and the other into three pions. By rotational invariance one can exclude the effects of orbital angular momentum and conclude, assuming parity conservation, that they must have opposite intrinsic parities. However, as more experimental evidence pointing towards the fact that the two \(K\)-mesons look alike, walk alike and quack alike, it was finally suggested by T. D. Lee and C. N. Yang that they’re really the same particle and it’s the parity conservation that fails to hold in these reactions, now known as the weak interactions. This suggestion was later verified more directly by an experiment of C. S. Wu.

2.3.4. Time inversion symmetry#

Recall from Time inversion for massive particles that for a single massive particle

\[U(\Tcal) \Psi_{p, \sigma, n} = \zeta (-1)^{\jfrak - \sigma} \Psi_{\Pcal p, -\sigma, n}\]

To generalize this to the in- and out-states, we need to remember that the time inversion also interchanges the very frame with respect to which the in- and out-states are defined. The result is as follows

(2.3.22)#\[U(\Tcal) \Psi^{\pm}_{p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} = \ \zeta_{n_1} (-1)^{\jfrak_1 - \sigma_1} \zeta_{n_2} (-1)^{\jfrak_2 - \sigma_2} \cdots \ \Psi^{\mp}_{\Pcal p_1, -\sigma_1, n_1; ~\Pcal p_2, -\sigma_2, n_2; ~\cdots}\]

The invariance of S-matrix can then be formulated as follows

(2.3.23)#\[\begin{split}& S_{p'_1, \sigma'_1, n'_1; ~p'_2, \sigma'_2, n'_2; ~\cdots, ~p_1, \sigma_1, n_1; ~p_2, \sigma_2, n_2; ~\cdots} \\ & \quad = \zeta_{n'_1} (-1)^{\jfrak'_1 - \sigma'_1} \zeta_{n'_2} (-1)^{\jfrak'_2 - \sigma'_2} \cdots \ \zeta^{\ast}_{n_1} (-1)^{\jfrak_1 - \sigma_1} \zeta^{\ast}_{n_2} (-1)^{\jfrak_2 - \sigma_2} \\ & \qquad \times S_{\Pcal p_1, -\sigma_1, n_1; ~\Pcal p_2, -\sigma_2, n_2; ~\cdots; ~\Pcal p'_1, -\sigma'_1, n'_1; ~\Pcal p'_2, -\sigma'_2, n'_2; ~\cdots}\end{split}\]

Since we’ll be mainly concerned with the rate of interactions in this section, the phase factors in front of \(\Psi\) play little role. So let’s simplify the notations in Eq.2.3.22 and Eq.2.3.23 using compound indexes as follows

(2.3.24)#\[\begin{split}U(\Tcal) \Psi^{\pm}_{\alpha} &= \Psi^{\mp}_{\Tcal \alpha} \\ S_{\beta, \alpha} &= S_{\Tcal\alpha, \Tcal\beta}\end{split}\]

where the phase factors have been “absorbed” in the right-hand-side by a re-definition of \(\Tcal \alpha\) and \(\Tcal \beta\).

Unlike the space inversions discussed in the previous section, time inversions don’t directly lead to implications on reaction rates because, after all, we cannot turn time around in any experiment. However, under certain circumstances, one can use a trick to draw experimentally verifiable conclusions, which we now present.

The main assumption here is that one can split the S-operator as follows

(2.3.25)#\[S_{\beta \alpha} = S^{(0)}_{\beta \alpha} + S^{(1)}_{\beta \alpha}\]

such that \(S^{(0)}\) is also unitary, i.e., \({S^{(0)}}^{\dagger} S^{(0)} = 1\), and is much larger than \(S^{(1)}\). Now the unitarity of \(S\) can be written as follows

\[1 = S^{\dagger} S = {S^{(0)}}^{\dagger} S^{(0)} + {S^{(0)}}^{\dagger} S^{(1)} + {S^{(1)}}^{\dagger} S^{(0)}\]

which then implies

\[S^{(1)} = -S^{(0)} {S^{(1)}}^{\dagger} S^{(0)}\]

Assuming \(S^{(0)}\) and \(S^{(1)}\) both satisfy Eq.2.3.24, the above equality can be rewritten as follows

(2.3.26)#\[S^{(1)}_{\beta \alpha} = -\int d\gamma' \int d\gamma ~S^{(0)}_{\beta \gamma'} ~S^{(1)~\ast}_{\Tcal \gamma' ~\Tcal \gamma} ~S^{(0)}_{\gamma \alpha}\]

where we recall that the adjoint \(\dagger\) is the composition of the (complex) conjugation and transpose. Together with the unitarity of \(S^{(0)}\), we see that the rate of reaction \(\left| S^{(1)}_{\beta \alpha} \right|^2\), when summed up against a complete set of \(S^{(0)}\)-eigenstates, remains the same after applying \(\Tcal\) to both initial and final states.

The simplest case where Eq.2.3.26 becomes applicable is when both \(\alpha\) and \(\beta\) are eigenstates of \(S^{(0)}\), with eigenvalues, say, \(\exp(\ifrak \theta_{\alpha})\) and \(\exp(\ifrak \theta_{\beta})\), respectively. In this case Eq.2.3.26 becomes

\[S^{(1)}_{\beta \alpha} = -\exp(\ifrak(\theta_{\alpha} + \theta_{\beta})) S^{(1)~\ast}_{\Tcal \beta ~\Tcal \alpha} \implies \left| S^{(1)}_{\beta \alpha} \right|^2 = \left| S^{(1)}_{\Tcal \beta ~\Tcal \alpha} \right|^2\]

This is to say that under the assumption that Eq.2.3.25 is valid, at least approximately, the rate of reaction \(S^{(1)}_{\beta \alpha}\) should be invariant under a flip of the \(3\)-momentum as well as the spin \(z\)-component. This is not contradicted by Wu’s experiment which disproved the parity conservation.

2.4. Rates and Cross-Sections#

As we already mentioned, the S-matrix entries \(S_{\beta \alpha}\) can be interpreted as probability amplitudes of a reaction that turns an in-state \(\Psi^-_{\alpha}\) into an out-state \(\Psi^+_{\beta}\). In other words, the probability \(P(\Psi^-_{\alpha} \to \Psi^+_{\beta}) = \left| S_{\beta \alpha} \right|^2\). It is, however, not completely straightforward to square S-matrix entries because, as we’ve seen in Eq.2.3.9, they contain Dirac delta functions.

2.4.1. Derivation in box model#

One trick that is often used in physics to deal with integration over an infinite space is to restrict the space to a (large) box, often with additional periodic boundary conditions, and hope that the final results will not depend on the size of the box, as long as it’s large enough. This is exactly what we shall do.

Consider a cubic box whose sides have length \(L\) and has volume \(V = L^3\). Imposing the periodic boundary condition on the cube, the \(3\)-momentum is discretized as follows

(2.4.1)#\[\pbf = \frac{2\pi}{L} (n_1, n_2, n_3)\]

where \(n_1, n_2, n_3\) are nonnegative integers. Of course, the higher the \(n\), the shorter the wave length if we interpret it as wave mechanics. By analogy with the continuous case, we can define the Dirac delta function as follows

(2.4.2)#\[\delta^3_V (\pbf - \pbf') \coloneqq \frac{1}{(2\pi)^3} \int_V d^3 \xbf ~\exp(\ifrak (\pbf - \pbf') \cdot \xbf) = \frac{V}{(2\pi)^3} \delta_{\pbf \pbf'}\]

where \(\delta_{\pbf \pbf'}\) is the usual Kronecker delta. With this setup, the states inner product Eq.2.1.2 will produce, from the Dirac deltas, an overall factor of \(\left( V/(2\pi)^3 \right)^N\) where \(N\) denotes the number of particles in the box. In order for the amplitudes to be independent of the size of the box, let’s normalize the states as follows

\[\Psi_{\alpha}^{\square} \coloneqq \left( \frac{(2\pi)^3}{V} \right)^{N/2} \Psi_{\alpha}\]

such that \(\left( \Psi^{\square}_{\beta}, \Psi^{\square}_{\alpha} \right) = \delta_{\beta \alpha}\) is properly normalized. Correspondingly, we can express the S-matrix with respect to the box-normalized states as follows

\[S^{\square}_{\beta \alpha} = \left( \frac{(2\pi)^3}{V} \right)^{(N_{\alpha} + N_{\beta})/2} S_{\beta \alpha}\]

where \(N_{\alpha}, N_{\beta}\) are the numbers of particles in the in- and out-states, respectively.

Now the transition probability in the box model takes the following form

\[P(\alpha \to \beta) = \left| S^{\square}_{\beta \alpha} \right|^2 = \left( \frac{(2\pi)^3}{V} \right)^{N_{\alpha} + N_{\beta}} \left| S_{\beta \alpha} \right|^2\]

which we can further turn to a differential form as follows

(2.4.3)#\[\begin{split}dP(\alpha \to \beta) &= P(\alpha \to \beta) d\Nscr_{\beta} \\ &= P(\alpha \to \beta) \left( \frac{V}{(2\pi)^3} \right)^{N_{\beta}} d\beta \\ &= \left( \frac{(2\pi)^3}{V} \right)^{N_{\alpha}} \left| S_{\beta \alpha} \right|^2 d\beta\end{split}\]

where \(d\beta\) denotes an infinitesimal volume element around the state \(\beta\), or more precisely, a product of \(d^3 \pbf\), one for each particle. Then \(\Nscr_{\beta}\) counts the number of states within the infinitesimal \(d\beta\), which can be readily calculated from Eq.2.4.1.

Back to our core problem, which is to define \(\left| S_{\beta \alpha} \right|^2\) as calculated by Eq.2.3.9. The first assumption we will make, at least for now, is a genericity condition

Genericity assumption on the S-matrix

No subset of particles in the state \(\beta\) have exactly the same (total) \(4\)-momentum as some subset in the state \(\alpha\).

Under this assumption, we can remove the term \(\delta(\beta - \alpha)\) from Eq.2.3.9 and write

(2.4.4)#\[S_{\beta \alpha} = -2 \pi \ifrak \delta^4(p_{\beta} - p_{\alpha}) M_{\beta \alpha}\]

and moreover, ensure that \(M_{\beta \alpha}\) contains no more delta functions. Now the question becomes how to define \(\left| \delta^4(p_{\beta} - p_{\alpha}) \right|^2\). In fact, to align with the main theme of using in- and out-states to calculate the S-matrix, the interaction must only be turned on for a finite period of time, say, \(T\). Hence the timed delta function can be written as

(2.4.5)#\[\delta_T(E_{\beta} - E_{\alpha}) \coloneqq \frac{1}{2 \pi} \int_{-T/2}^{T/2} dt ~\exp(\ifrak (E_{\beta} - E_{\alpha}) t)\]

We can then modify Eq.2.4.4 in a “timed box” as follows

(2.4.6)#\[S_{\beta \alpha} = -2\pi\ifrak \delta^3_V (\pbf_{\beta} - \pbf_{\alpha}) \delta_T(E_{\beta} - E_{\alpha}) M_{\beta \alpha}\]

Now using Eq.2.4.2 and Eq.2.4.5, we can calculate the squares as follows

\[\begin{split}\begin{alignat*}{2} \left( \delta^3_V(\pbf_{\beta} - \pbf_{\alpha}) \right)^2 &= \delta^3_V(\pbf_{\beta} - \pbf_{\alpha}) \delta^3_V(0) &&= \delta^3_V(\pbf_{\beta} - \pbf_{\alpha}) V/(2\pi)^3 \\ \left( \delta_T(E_{\beta} - E_{\alpha}) \right)^2 &= \delta_T(E_{\beta} - E_{\alpha}) \delta_T(0) &&= \delta_T(E_{\beta} - E_{\alpha}) T/(2\pi) \end{alignat*}\end{split}\]

All together, we can now rewrite Eq.2.4.3 as follows

\[\begin{split}dP(\alpha \to \beta) &= \left( \frac{(2\pi)^3}{V} \right)^{N_{\alpha}} \left| S_{\beta \alpha} \right|^2 d\beta \\ &= (2\pi)^2 \left( \frac{(2\pi)^3}{V} \right)^{N_{\alpha} - 1} \frac{T}{2\pi} \delta^3_V(\pbf_{\beta} - \pbf_{\alpha}) \delta_T(E_{\beta} - E_{\alpha}) \left| M_{\beta \alpha} \right|^2 d\beta \\ &= (2\pi)^{3N_{\alpha} - 2} V^{1 - N_{\alpha}} T \delta^4(p_{\beta} - p_{\alpha}) \left| M_{\beta \alpha} \right|^2 d\beta\end{split}\]

where we have restored \(\delta^4(p_{\beta} - p_{\alpha})\) by taking the large \(V\) and \(T\) limits. Dividing by time \(T\), the differential rate of transition can be defined as follows

(2.4.7)#\[d\Gamma(\alpha \to \beta) \coloneqq dP(\alpha \to \beta) / T = (2\pi)^{3N_{\alpha}-2} V^{1-N_{\alpha}} \delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2 d\beta\]

As will be explained in more detail in the example of the decay of one particle below, it should be kept in mind that Eq.2.4.7 is valid only when large \(T\) limit can be justified. Nonetheless, such rates are closely related to what actual experiments report.

2.4.2. Examples with few initial particles#

One special case of interest is when \(N_{\alpha} = 1\), or in other words, processes where one particle decays into multi-particles. In this case Eq.2.4.7 becomes

(2.4.8)#\[d\Gamma(\alpha \to \beta) = 2\pi \delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2 d\beta\]

which becomes independent of the volume of the box. This is reasonable because the decay rate of one particle shouldn’t care about the size of the containing box. However, the \(T \to \infty\) limit in \(\delta^4(p_{\beta} - p_{\alpha})\) is no longer valid. In fact, it cannot be longer than the (mean) lifetime \(\tau_{\alpha}\) of the particle \(\alpha\), because the interaction wouldn’t make sense if the particle itself already disintegrates. In this case, in order for Eq.2.4.5 to still approximate a delta function, we must assume that any characteristic energy of the interaction satisfies

\[|E_{\beta} - E_{\alpha}| \ll 1/\tau_{\alpha}\]

where the right-hand-side is known as the total decay rate.

Another case of interest is when \(N_{\alpha} = 2\). In this case Eq.2.4.7 takes the following form

(2.4.9)#\[d\Gamma(\alpha \to \beta) = (2\pi)^4 V^{-1} \delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2 d\beta\]

It turns out that in the world of experimentalists, it’s more common to use, instead of the transition rate, something called cross-section, or equivalently, rate per flux, where the flux is defined as [3]

\begin{equation*} \Phi_{\alpha} \coloneqq u_{\alpha} / V \end{equation*}

and \(u_{\alpha}\) is the (relativistic) relative velocity between the two particles, to be discussed in more detail in the next section by considering Lorentz symmetry. We can then rewrite Eq.2.4.9 in terms of the cross-section as follows

(2.4.10)#\[d\sigma(\alpha \to \beta) \coloneqq d\Gamma(\alpha \to \beta) / \Phi_{\alpha} = (2\pi)^4 u_{\alpha}^{-1} \delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2 d\beta\]

Note that \(d\sigma\) has the dimension of an area.

2.4.3. Lorentz symmetry of rates and cross-sections#

We can investigate the Lorentz symmetry on the rates and cross-sections as follows. Squaring Eq.2.3.8, and using the fact that the little group representations are unitary, we see that the following quantity

\[R_{\beta \alpha} \coloneqq \sum_{\text{spins}} |M_{\beta \alpha}|^2 \prod_{\beta} E \prod_{\alpha} E\]

is Lorentz invariant, where \(E = p_0 = \sqrt{\pbf^2 + m^2}\) for each particle in \(\alpha\) and \(\beta\), respectively.

It follows that in the one-particle case, Eq.2.4.8 gives

\[\sum_{\text{spins}} d\Gamma(\alpha \to \beta) = 2\pi E_{\alpha}^{-1} R_{\beta \alpha} \delta^4(p_{\beta} - p_{\alpha}) \frac{d\beta}{\prod_{\beta} E}\]

In particular, we recognize \(d\beta / \prod_{\beta} E\) as a product of the Lorentz invariant \(3\)-momentum volume elements constructed in Eq.1.3.9. Hence the only factor in the right-hand-side which is not Lorentz invariant is \(E_{\alpha}^{-1}\). It follows that the decay rate of a particle, summed up over all spins, is inverse proportional to its energy, or in other words, a faster moving particle decays slower, which is consistent with the special theory of relativity and experimentally observed slow decay rates of high energy particles coming from cosmic rays.

Next, let’s turn to the two-particles case. In this case Eq.2.4.10 gives

\[\sum_{\text{spins}} d\sigma(\alpha \to \beta) = (2\pi)^4 u_{\alpha}^{-1} E_1^{-1} E_2^{-1} R_{\beta \alpha} \delta^4(p_{\beta} - p_{\alpha}) \frac{d\beta}{\prod_{\beta} E}\]

where \(E_1, E_2\) are the energies of the two particles in state \(\alpha\). As in the one-particle case, in order for the cross-section to be Lorentz invariant, we must define the relative velocity \(u_{\alpha}\) such that the product \(u_{\alpha} E_1 E_2\) is Lorentz invariant. Indeed, such a quantity is uniquely determined by the requirement that when one of the particles stays still, then \(u_{\alpha}\) should be the velocity of the other particle, and it takes the following form

(2.4.11)#\[u_{\alpha} = \frac{\sqrt{(p_1 \cdot p_2)^2 - m_1^2 m_2^2}}{E_1 E_2}\]

For later use, let’s rewrite \(u_{\alpha}\) in the center-of-mass frame as follows. In the center-of-mass frame, the total momentum vanishes, and therefore we can write \(p_1 = (E_1, \pbf)\) and \(p_2 = (E_2, -\pbf)\). It follows that

(2.4.12)#\[\begin{split}u_{\alpha} &= \frac{\sqrt{(E_1 E_2 + \pbf^2)^2 - m_1^2 m_2^2}}{E_1 E_2} \\ &= \frac{\sqrt{(E_1 E_2 + \pbf^2)^2 - (E_1^2 - \pbf^2)(E_2^2 - \pbf^2)}}{E_1 E_2} \\ &= \frac{|\pbf| (E_1 + E_2)}{E_1 E_2} \\ &= \left| \frac{\pbf_1}{E_1} - \frac{\pbf_2}{E_2} \right|\end{split}\]

which indeed looks more like a relative velocity. Note, however, that this is not a physical velocity because its value may approach \(2\) (i.e., faster than the speed of light) in relativistic limit.

2.4.4. The phase-space factor#

By phase-space factor we mean the factor \(\delta^4(p_{\beta} - p_{\alpha}) d\beta\) that appears in transition probabilities, rates and cross-sections discussed above. The goal of this section is to calculate it, particularly in the scenario where the final state consists of two particles. We’ll use the center-of-mass frame with respect to the initial state so that \(\pbf_{\alpha} = 0\). Then the phase-space factors can be written as follows

\[\delta^4(p_{\beta} - p_{\alpha}) d\beta = \delta^3(\pbf'_1 + \pbf'_2 + \cdots) \delta(E'_1 + E'_2 + \cdots - E_{\alpha}) d^3 \pbf'_1 d^3 \pbf'_2 \cdots\]

where we recall that the primes indicate that the quantities are taken from state \(\beta\), and \(E_{\alpha}\) denotes the total energy of state \(\alpha\). In the case where the final state consists of exactly two particles, the phase-space factor can be further simplified as follows

(2.4.13)#\[\begin{split}\delta^4(p_{\beta} - p_{\alpha}) d\beta &= \delta(E'_1 + E'_2 - E_{\alpha}) d^3 \pbf'_1 \\ &= \delta \left( \sqrt{|\pbf'_1|^2 + {m'_1}^2} + \sqrt{|\pbf'_1|^2 + {m'_2}^2} - E_{\alpha} \right) |\pbf'_1|^2 d|\pbf'_1| d\Omega\end{split}\]

where \(\Omega\) is the solid angle in \(\pbf'_1\)-space, if in the integration we replace every occurrence of \(\pbf'_2\) with \(-\pbf'_1\).

To further simply the delta function in Eq.2.4.13, we recall the following identity, which is an incarnation of integration by substitution,

\[\delta(f(x)) = \delta(x - x_0) / f'(x_0)\]

where \(x_0\) is a simple zero of \(f\). In the case of Eq.2.4.13, we let

\[f(|\pbf'_1|) = \sqrt{|\pbf'_1|^2 + {m'_1}^2} + \sqrt{|\pbf'_1|^2 + {m'_2}^2} - E_{\alpha}\]

so that the simple zero is found at

(2.4.14)#\[k' = \frac{1}{2E_{\alpha}} \sqrt{\left( E_{\alpha}^2 - {m'_1}^2 - {m'_2}^2 \right)^2 - 4 {m'_1}^2 {m'_2}^2}\]

Now differentiating \(f\) at \(k'\) we get

\[f'(k') = \frac{k'}{E'_1} + \frac{k'}{E'_2} = \frac{k' E_{\alpha}}{E_1 E_2}\]

where

(2.4.15)#\[\begin{split}E'_1 &= \sqrt{{k'}^2 + {m'_1}^2} = \frac{E_{\alpha}^2 + {m'_1}^2 - {m'_2}^2}{2E_{\alpha}} \\ E'_2 &= \sqrt{{k'}^2 + {m'_2}^2} = \frac{E_{\alpha}^2 - {m'_1}^2 + {m'_2}^2}{2E_{\alpha}}\end{split}\]

Putting all together, we can further simplify Eq.2.4.13 as follows

(2.4.16)#\[\delta^4(p_{\beta} - p_{\alpha}) d\beta = \frac{k' E'_1 E'_2}{E_{\alpha}} d\Omega\]

where \(k', E'_1\) and \(E'_2\) are defined by Eq.2.4.14 and Eq.2.4.15, respectively.

Substituting Eq.2.4.16 into Eq.2.4.8, we see that in the case of one particle decaying into two particles

\begin{equation*} \frac{d\Gamma(\alpha \to \beta)}{d\Omega} = \frac{2\pi k' E'_1 E'_2}{E_{\alpha}} |M_{\beta \alpha}|^2 \end{equation*}

and in the case of a two-body scattering \(1~2 \to 1'~2'\), we have, using also Eq.2.4.10 and Eq.2.4.12 (and remembering \(E_{\alpha} = E_1 + E_2\)), the following

(2.4.17)#\[\frac{d\sigma(\alpha \to \beta)}{d\Omega} = \frac{(2\pi)^4 k' E'_1 E'_2}{u_{\alpha} E_{\alpha}} |M_{\beta \alpha}|^2 \ = \frac{(2\pi)^4 k' E'_1 E'_2 E_1 E_2}{k E_{\alpha}^2} |M_{\beta \alpha}|^2\]

where \(k \coloneqq |\pbf_1| = |\pbf_2|\). These calculations will be used in the next section to get some insights into the scattering process.

2.4.5. Implications of the unitarity of S-matrix#

In this section we’ll no longer assume the Genericity of the S-matrix. This means that we’ll get back to use Eq.2.3.9, instead of Eq.2.4.4, which we recall as follows

\[S_{\beta \alpha} = \delta(\beta - \alpha) - 2\pi \ifrak \delta^4(p_{\beta} - p_{\alpha}) M_{\beta \alpha}\]

However, all the calculations from the previous sections can still be used here because we’ll be caring about, for example, the total rates, which are integrations over all possible final states, and the degenerate ones will not contribute to such integrals.

First, let’s spell out the consequence of the unitarity of the S-matrix, or more precisely \(S^{\dagger} S = 1\), as follows

(2.4.18)#\[\begin{split}\delta(\gamma - \alpha) &= \int d\beta ~S^{\ast}_{\beta \gamma} S_{\beta \alpha} \\ &= \int d\beta \left( \delta(\beta - \gamma) + 2\pi \ifrak \delta^4(p_{\beta} - p_{\gamma}) M^{\ast}_{\beta \gamma} \right) \left( \delta(\beta - \alpha) - 2\pi \ifrak \delta^4(p_{\beta} - p_{\alpha}) M_{\beta \alpha} \right) \\ &= \delta(\gamma - \alpha) + 2\pi \ifrak \delta^4(p_{\alpha} - p_{\gamma}) M^{\ast}_{\alpha \gamma} - 2\pi \ifrak \delta^4(p_{\gamma} - p_{\alpha}) M_{\gamma \alpha} \\ &\quad + 4\pi^2 \delta^4(p_{\gamma} - p_{\alpha}) \int d\beta ~\delta^4(p_{\beta} - p_{\alpha}) M^{\ast}_{\beta \gamma} M_{\beta \alpha}\end{split}\]

which implies

(2.4.19)#\[\ifrak M^{\ast}_{\alpha \gamma} - \ifrak M_{\gamma \alpha} + 2\pi \int d\beta ~\delta^4(p_{\beta} - p_{\alpha}) M^{\ast}_{\beta \gamma} M_{\beta \alpha} = 0\]

In the special case where \(\alpha = \gamma\), Eq.2.4.19 gives the following key identity, known as the generalized optical theorem

(2.4.20)#\[\op{Im} M_{\alpha \alpha} = -\pi \int d\beta ~\delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2\]

As an application we can calculate the total rate of all transitions produced by the initial state \(\alpha\) using Eq.2.4.7 as follows

\[\begin{split}\Gamma_{\alpha} &\coloneqq \int d\beta ~\frac{d\Gamma(\alpha \to \beta)}{d\beta} \\ &~= (2\pi)^{3N_{\alpha} - 2} V^{1 - N_{\alpha}} \int d\beta ~\delta^4(p_{\beta} - p_{\alpha}) |M_{\beta \alpha}|^2 \\ &~= -\frac{1}{\pi} (2\pi)^{3N_{\alpha} - 2} V^{1 - N_{\alpha}} \op{Im} M_{\alpha \alpha}\end{split}\]

Another application of the unitary of the S-matrix is along the lines of statistical mechanics. Applying the same calculation in Eq.2.4.18 to \(S S^{\dagger} = 1\), we get the counterpart to Eq.2.4.20

\[\op{Im} M_{\alpha \alpha} = -\pi \int d\beta ~\delta^4(p_{\beta} - p_{\alpha}) |M_{\alpha \beta}|^2\]

Combining with the master equation Eq.2.4.7 we have

(2.4.23)#\[\int d\beta ~c_{\alpha} \frac{d\Gamma(\alpha \to \beta)}{d\beta} = \int d\beta ~c_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha}\]

where \(c_{\alpha} \coloneqq \left( V / (2\pi)^3 \right)^{N_{\alpha}}\).

We shall carry out an equilibrium analysis for state \(\alpha\). To this end, let \(P_{\alpha} d\alpha\) be the infinitesimal probability of finding the system in state \(\alpha\). Then we have

\[\frac{dP_{\alpha}}{dt} = \int d\beta ~P_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha} - P_{\alpha} \int d\beta ~\frac{d\Gamma(\alpha \to \beta)}{d\beta}\]

where the first term calculates the total rate that other states transit into \(\alpha\), and the second term calculates the total rate that the state \(\alpha\) transits into other states. Recall that the entropy of the system is defined to be

\[\Scal \coloneqq -\int d\alpha ~P_{\alpha} \ln(P_{\alpha} / c_{\alpha})\]

Its rate of change can be estimated as follows

\[\begin{split}\frac{d\Scal}{dt} \ &= -\int d\alpha ~(\ln(P_{\alpha} / c_{\alpha}) + 1) \frac{dP_{\alpha}}{dt} \\ &= -\int d\alpha \int d\beta ~(\ln(P_{\alpha} / c_{\alpha}) + 1) \left( P_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha} \ - P_{\alpha} \frac{d\Gamma(\alpha \to \beta)}{d\beta} \right) \\ &= \int d\alpha \int d\beta ~P_{\beta} \ln\left( \frac{P_{\beta} c_{\alpha}}{P_{\alpha} c_{\beta}} \right) \frac{d\Gamma(\beta \to \alpha)}{d\alpha} \\ &\geq \int d\alpha \int d\beta ~\left( \frac{P_{\beta}}{c_{\beta}} - \frac{P_{\alpha}}{c_{\alpha}} \right) c_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha} \\ &= \int d\alpha \int d\beta ~\frac{P_{\beta}}{c_{\beta}} \left( c_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha} - c_{\alpha} \frac{d\Gamma(\alpha \to \beta)}{d\beta} \right) \\ &= \int d\alpha ~\frac{P_{\alpha}}{c_{\alpha}} \int d\beta \left( c_{\alpha} \frac{d\Gamma(\alpha \to \beta)}{d\beta} - c_{\beta} \frac{d\Gamma(\beta \to \alpha)}{d\alpha} \right) = 0\end{split}\]

where the fourth inequality follows from the general inequality \(\ln(x) \geq 1 - 1 / x\) for any \(x > 0\), and the last follows from Eq.2.4.23. This is nothing but the famous slogan: entropy never decreases! And we see that as a consequence of the unitarity of the S-matrix.

2.5. Perturbation Theory of S-matrix#

Rather than being the epilogue of Scattering Theory, this section is more like a prelude to what comes next. In particular, we will work out a candidate Hamiltonian that satisfies the Lorentz invariance condition discussed in S-matrix and its Symmetry.

One possible starting point of the perturbation theory is Eq.2.3.7 together with the Lippmann-Schwinger formula Eq.2.2.7 which we recollect as follows

(2.5.1)#\[S_{\beta \alpha} = \delta(\beta - \alpha) - 2\pi \ifrak \delta(E_{\beta} - E_{\alpha}) (\Phi_{\beta}, V\Psi_{\alpha}^-)\]

where

(2.5.2)#\[\Psi_{\alpha}^- = \Phi_{\alpha} + \int d\beta ~\frac{(\Phi_{\beta}, V\Psi_{\alpha}^-) \Phi_{\beta}}{E_{\alpha} - E_{\beta} + \ifrak \epsilon}\]

Applying \(V\) to Eq.2.5.2 and taking scalar product with \(\Phi_{\beta}\), we get

(2.5.3)#\[\left( \Phi_{\beta}, V\Psi_{\alpha}^- \right) = V_{\beta \alpha} + \int d\beta ~\frac{\left( \Phi_{\beta}, V\Psi_{\alpha}^- \right) V_{\beta \alpha}}{E_{\alpha} - E_{\beta} + \ifrak \epsilon}\]

where \(V_{\beta \alpha} \coloneqq \left( \Phi_{\beta}, V\Phi_{\alpha} \right)\). One can now apply Eq.2.5.3 iteratively to get the following power series expansion

(2.5.4)#\[\begin{split}\left( \Phi_{\beta}, V\Psi_{\alpha}^- \right) &= V_{\beta \alpha} + \int d\gamma ~\frac{V_{\beta \gamma} V_{\gamma \alpha}}{E_{\alpha} - E_{\gamma} + \ifrak \epsilon} \\ &\quad + \int d\gamma \int d\gamma' ~\frac{V_{\beta \gamma} V_{\gamma \gamma'} V_{\gamma' \alpha}}{(E_{\alpha} - E_{\gamma} + \ifrak \epsilon)(E_{\alpha} - E_{\gamma'} + \ifrak \epsilon)} + \cdots\end{split}\]

and therefore a power series expansion in \(V\) of \(S_{\beta \alpha}\) in view of Eq.2.5.1.

One obvious drawback of the expansion Eq.2.5.4 is that it obscures the Lorentz symmetry of the S-matrix because the denominators consist of only the energy terms. To overcome this, we shall use instead the other interpretation of the S-matrix in terms of the Hamiltonians given by Eq.2.3.3 and Eq.2.3.4, which we recall as follows

\[S = U(\infty, -\infty)\]

where

(2.5.5)#\[U(\tau, \tau_0) = \exp(\ifrak H_0 \tau) \exp(-\ifrak H (\tau - \tau_0)) \exp(-\ifrak H_0 \tau_0)\]

Now differentiating Eq.2.5.5 with respect to \(\tau\) gives

(2.5.6)#\[\begin{split}\ifrak \frac{d}{d\tau} U(\tau, \tau_0) &= -H_0 \exp(\ifrak H_0 \tau) \exp(-\ifrak H (\tau - \tau_0)) \exp(-\ifrak H_0 \tau_0) \\ &\quad + \exp(\ifrak H_0 \tau) H \exp(-\ifrak H (\tau - \tau_0)) \exp(-\ifrak H_0 \tau_0) \\ &= \exp(\ifrak H_0 \tau) (H - H_0) \exp(-\ifrak H (\tau - \tau_0)) \exp(-\ifrak H_0 \tau_0) \\ &= \exp(\ifrak H_0 \tau) V \exp(-\ifrak H_0 \tau) U(\tau, \tau_0) \\ &\eqqcolon V(\tau) U(\tau, \tau_0)\end{split}\]

Here

(2.5.7)#\[V(\tau) \coloneqq \exp(\ifrak H_0 \tau) V \exp(-\ifrak H_0 \tau)\]

is a time-dependent operator in the so-called interaction picture, to be distinguished from the Heisenberg picture operator where the true Hamiltonian \(H\) should be used in place of \(H_0\). The differential equation Eq.2.5.6 can be easily solved as follows

\[U(\tau, \tau_0) = 1 - \ifrak \int_{\tau_0}^{\tau} dt ~V(t) U(t, \tau_0)\]

which can then be iterated to give the following

\[\begin{split}U(\tau, \tau_0) &= 1 - \ifrak \int_{\tau_0}^{\tau} dt_1 ~V(t_1) + (-\ifrak)^2 \int_{\tau_0}^{\tau} dt_1 \int_{\tau_0}^{t_1} dt_2 ~V(t_1) V(t_2) \\ &\quad + (-\ifrak)^3 \int_{\tau_0}^{\tau} dt_1 \int_{\tau_0}^{t_1} dt_2 \int_{\tau_0}^{t_2} dt_3 ~V(t_1) V(t_2) V(t_3) + \cdots\end{split}\]

Letting \(\tau \to \infty\) and \(\tau_0 \to -\infty\) we get another power series expansion of \(S\) in \(V\) as follows

(2.5.8)#\[\begin{split}S &= 1 - \ifrak \int_{-\infty}^{\infty} dt_1 ~V(t_1) + (-\ifrak)^2 \int_{-\infty}^{\infty} dt_1 \int_{-\infty}^{t_1} dt_2 ~V(t_1) V(t_2) \\ &\quad + (-\ifrak)^3 \int_{-\infty}^{\infty} dt_1 \int_{-\infty}^{t_1} dt_2 \int_{-\infty}^{t_2} dt_3 ~V(t_1) V(t_2) V(t_3) + \cdots\end{split}\]

It’s somewhat inconvenient that the integral limits in Eq.2.5.8 ruins the permutation symmetry of the products of \(V\). But this can be fixed by introducing a time-ordered product as follows

(2.5.9)#\[\begin{split}T\{ V(t) \} &\coloneqq V(t) \\ T\{ V(t_1) V(t_2) \} &\coloneqq \theta(t_1 - t_2) V(t_1) V(t_2) + \theta(t_2 - t_1) V(t_2) V(t_1) \\ T\{ V(t_1) V(t_2) V(t_3) \} &\coloneqq \theta(t_1 - t_2) \theta(t_2 - t_3) V(t_1) V(t_2) V(t_3) + \cdots \\ &\cdots\end{split}\]

where \(\theta(\tau)\) is the step function which equals \(1\) for \(\tau > 0\) and \(0\) for \(\tau < 0\), and it doesn’t matter what the value at \(\tau = 0\) is because it doesn’t contribute to the integrals in Eq.2.5.8 anyway. With this definition, we can rewrite Eq.2.5.8 as follows

(2.5.10)#\[S = 1 + \sum_{n=1}^{\infty} \frac{(-\ifrak)^n}{n!} \int_{-\infty}^{\infty} dt_1 dt_2 \cdots dt_n ~T\{ V(t_1) V(t_2) \cdots V(t_n) \}\]

where the division by \(n!\) is to account for the duplicated integrals introduced by the time-ordered product. Note that this power series looks much like the Taylor series of an exponential function. Indeed, in the unlikely event where \(V(t)\) at different times all commute, one can remove the time-ordering and write Eq.2.5.10 as an exponential function.

One great benefit of writing \(S\) as in the form of Eq.2.5.10 is that we can reformulate the condition of \(S\) being Lorentz symmetric in terms of some condition on \(V\). Recall from S-matrix and its Symmetry that a sufficient condition for a Lorentz invariant S-matrix is that the S-operator commutes with \(U_0(\Lambda, a)\), or equivalently in infinitesimal terms Eq.2.3.10 are satisfied. Now the main postulation is to express \(V\) using a density function as follows

(2.5.11)#\[V(t) = \int d^3 x ~\Hscr(t, \xbf)\]

such that \(\Hscr(x)\) is a scalar in the sense that

(2.5.12)#\[U_0(\Lambda, a) \Hscr(x) U^{-1}_0(\Lambda, a) = \Hscr(\Lambda x + a)\]

Under these assumptions, we can further rewrite Eq.2.5.10 in terms of \(\Hscr(x)\) as follows

(2.5.13)#\[S = 1 + \sum_{n=1}^{\infty} \frac{(-\ifrak)^n}{n!} \int d^4 x_1 \cdots d^4 x_n ~T\{ \Hscr(x_1) \cdots \Hscr(x_n) \}\]

This expression of \(S\) is manifestly Lorentz invariant, except for the time-ordering part. In fact, the time-ordering between two spacetime points \(x_1, x_2\) are Lorentz invariant if and only if \(x_1 - x_2\) is time-like, namely, \((x_1 - x_2)^2 \geq 0\). This is consistent with intuition because events with time-like (or light-like) separations may be observed by one observer, who definitely should know which event happened first. Therefore we obtain a sufficient condition for the Lorentz invariance of \(S\) as follows

(2.5.14)#\[[\Hscr(x_1), \Hscr(x_2)] = 0, \quad\forall ~(x_1 - x_2)^2 \leq 0\]

where we’ve also included the light-like case for technical reasons that will only become clear later. This condition is also referred to as the causality condition as it may be interpreted as saying that interactions happening at space-like separations should not be correlated.

At last we’ve finally climbed the highest peak in scattering theory, namely Eq.2.5.14. It is specific to the relativistic theory because time-ordering is always preserved in Galilean symmetry. It is also this restriction that eventually leads us to a quantum field theory. In the words of the author

“It is this condition that makes the combination of Lorentz invariance and quantum mechanics so restrictive.” [4]

—S. Weinberg

Footnotes