8. Path-Integral Methods#
In this chapter we’ll learn about a new way to build quantum field theory that complements the canonical quantization. This is done via a path-integral formalism invented by Feynman. The advantage of the path-integral formalism over the canonical formalism, as we’ll see, is the manifest Lorentz invariance in deriving the propagators, as opposed to the miraculous cancellation of local interactions, such as the Coulomb interaction Eq.7.4.4 discussed in the previous chapter. The disadvantage, on the other hand, is that the verification of the Lorentz invariance of the S-matrix becomes obscure.
Note
The gear is shifted a bit in this chapter in the following two aspects. First, we’ll adopt Dirac’s bra-ket notation, and second, it’s the Schrödinger picture rather than the interaction picture that will be considered alongside the Heisenberg picture.
8.1. Path Integrals for Bosons#
Similar to the canonical formalism Eq.6.1.1, let’s consider Hermitian operators \(Q_a\) and their conjugates \(P_a\) satisfying the following canonical commutation relations
We’ll think of \(Q_a\) as (spacetime) coordinates and \(P_a\) as momenta. Moreover, they are considered to be time-independent Schrödinger-picture operators. Note that the commutators are used here since the particles under consideration are bosons. The corresponding theory for fermions will be taken up in the next section.
Since all \(Q_a\)s commute each other, they can be simultaneously diagonalized so that there exist eigenstates \(\ket{q}\) satisfying
Moreover, the eigenstates satisfy the following orthogonality condition
and the completeness condition
Similar eigenstates exist for \(P_a\)s as well.
It’s important to note that \(\ket{q}\) and \(\ket{p}\) live in the same Hilbert space. Indeed, let’s compute \(\braket{q | p}\) as follows. First, it follows from Eq.8.1.1 that \(P_a\) acts on wave functions in \(q\)-basis as
Indeed we have
in agreement with Eq.8.1.1. Next, using the completeness condition, one can write
It follows then from Eq.8.1.5 and the fact that the \(\ket{q}\)s form a basis that
for any \(b\). It follows that
where the factor \((2\pi)^{-1/2}\) is determined by the normalizing condition \(\braket{p' | p} = \delta(p' - p)\).
8.1.1. The general path integral formula#
To derive the general path integral formula, we need to pass to the Heisenberg picture as follows
where the Hamiltonian \(H\) is given as a function of \(P\) and \(Q\). Their eigenstates
are obviously given by
Warning
The eigenstates \(\ket{t, q}\) and \(\ket{t, p}\) given by Eq.8.1.9 are not time-\(t\) evolutions of \(\ket{q}\) and \(\ket{p}\) which, according to Schrödinger’s equation, would be \(e^{-\ifrak Ht} \ket{q}\) and \(e^{-\ifrak Ht} \ket{p}\), respectively.
The time-independent eigenstates satisfy similar orthogonality and completeness conditions as follows
Moreover Eq.8.1.6 also carries over
Now the key idea in deriving the path integral formula is to evaluate how the eigenstates evolve in infinitesimal time steps \(\tau \to \tau + d\tau\) as follows
In light of Eq.8.1.8, it’ll be convenient to rewrite \(H = H(Q, P)\) in terms of \(Q(t)\) and \(P(t)\) defined by Eq.8.1.7. This is done by the following calculation
Using the canonical commutation relations Eq.8.1.1, we can make the following assumption without losing any generality.
Assumption
All the \(Q\) operators in \(H\) lie to the left of the \(P\) operators.
Under this assumption, one can expand Eq.8.1.12 for infinitesimal \(d\tau\) using Eq.8.1.8 and Eq.8.1.11 as follows
Note that the third equality holds only for infinitesimal \(d\tau\), which allows us to pretend that \(e^{-\ifrak H d\tau}\) is linear in \(H\).
Important
The function \(H(q', p)\) in the last expression, or written simply as \(H(q, p)\), is an ordinary function of scalars. In particular, it makes no difference however \(q\) and \(p\) are ordered. It should therefore be remembered that when this process is reversed, i.e., the quantization of a classical Hamiltonian, the quantized Hamiltonian must have all the \(Q\) operators lying to the left of the \(P\) operators.
Now given two time \(t < t'\) with a finite separation, one can divide the time-interval into \(N\) steps
where
As \(N \to \infty\), one can apply Eq.8.1.14 to each sub-interval to calculate the transition amplitude
with the understanding that \(q_0 = q\) and \(q_{N+1} = q'\). It’s in the last equality where the limit \(N \to \infty\), or equivalently \(d\tau \to 0\), is taken. The integral is taken over all paths from state \(\ket{q}\) at time \(t\) to state \(\ket{q'}\) at time \(t'\), and hence the name – path integral.
It turns out that the same recipe for deriving the general path integral formula above can also be applied to calculate matrix elements of an operator \(\Oscr(P(t), Q(t))\), or more generally a time-ordered product of such operators. Note that in contrast to the Hamiltonian (cf. Eq.8.1.13), we’ve swapped the order of arguments \(Q, P\) in \(\Oscr\). This is, for reasons which will become clear momentarily, due to the following arrangement.
Assumption
All the \(P\) operators in \(\Oscr\) lie to the left of the \(Q\) operators.
As before, let’s first calculate the infinitesimal matrix element as follows
Consider a time-ordered sequence of operators
such that \(t_A > t_B > \cdots\). We can calculate the matrix element of the product of the operators at a finite time difference by dividing the time-interval in the same way as in Eq.8.1.15 and pay attention to the sub-intervals that contains \(t_A, t_B, \cdots\), as follows
Since the right-hand-side doesn’t rely on the time-ordering, we may replace the product of operators on the left-hand-side with the timed-ordered product as follows
as long as \(t_A, t_B, \cdots\) are all distinct.
8.1.2. Transition to the S-matrix#
From now on, we will restrict the discussion to quantum field theories where the index \(a\) from the previous section becomes \((\xbf, m)\), where \(\xbf\) is the spatial coordinates and \(m\) denotes other quantum labels such as spin. In this case we rewrite Eq.8.1.16 as follows
Recall that the S-matrix involves matrix elements between in- and out-states, which are states are time \(t = \mp\infty\), respectively. Hence if we write \(\ket{\alpha, \op{in}}\) for the in-state and \(\ket{\beta, \op{out}}\) for the out-state, then the S-matrix element can be written as follows
where the path integral now has essentially no boundary conditions.
The goal now is to calculate the wave functions \(\braket{\beta, \op{out} | q(\infty), \infty}\) and \(\braket{q(-\infty), -\infty | \alpha, \op{in}}\), if we choose a specific basis for the in- and out-states. It turns out, following discussions in External edges off the mass shell, that it suffices to consider the vacuum state \(\ket{\VAC}\). Moreover we’ll not distinguish between \(\ket{\VAC, \op{in}}\) and \(\ket{\VAC, \op{out}}\) since the calculations will mostly be the same.
The vacuum state, being a state with no particles, can be characterized by
where \(a(\pbf, \sigma, n)\) is the operator that annihilates a particle with momentum \(\pbf\), spin \(z\)-component \(\sigma\), and other quantum numbers \(n\).
For simplicity, we’ll focus on the real scalar field given by Eq.4.2.8 and turned into canonical variables following Eq.6.1.2 as follows
where \(E = p_0 = \sqrt{\pbf^2 + m^2}\) on the on-mass-shell energy. From these one can solve for \(a(\pbf)\) as follows
where we’ve pulled out the time-dependency since in order to apply it to in- and out-state, we need to take the limits \(t \to \mp\infty\), respectively. More explicitly, one can write
It turns out that the time limits are not really relevant in calculating the wave functions since \(e^{\ifrak Et}\) is never zero. Hence we’ll continue to just use \(a(\pbf)\) in calculations. In the same vein, define Schrödinger-picture operators
In places where specifying \(t = \mp\infty\) doesn’t matter, we’ll also simply write \(\phi(\xbf)\) and \(\pi(\xbf)\).
Using Eq.8.1.18, one finds a differential equation that the wave functions \(\braket{\mp\infty, \phi(\mp\infty, \xbf) | \VAC}\) must satisfy as follows
where we have also used the interpretation of \(\pi(\xbf)\) as variational derivative \(-\ifrak \delta/\delta \phi(\xbf)\) (cf. Eq.8.1.5). Based on the experience of solving an analogous ODE by exponential function, it’s quite natural to postulate a Gaussian solution
where \(\Nscr\) is a constant. Indeed Eq.8.1.19 becomes equivalent to
For the right-hand-side to vanish for any \(\phi\), the quantity in the parenthesis must vanish. An inverse Fourier transform then gives
where we recall once again that \(E(\pbf) = \sqrt{\pbf^2 + m^2}\). This solves Eq.8.1.20 up to an unknown field-independent constant \(\Nscr\), which turns out to be insignificant. Indeed, the same constant \(\Nscr\) also appears in \(\braket{\VAC, \op{out} | \VAC, \op{in}}\) and hence can be eliminated by normalization. More details about this will be discussed in the next section.
We can continue the calculation Eq.8.1.17 in the case of vacuum expectation values for real scalar fields as follows
and therefore
Without working out the details, we claim that the only difference in the calculation for general fields is the term after \(\ifrak\epsilon/2\), whose exact form turns out to be insignificant. For later references, the final result is recorded as follows
where the \(\ifrak\epsilon\) terms depend only on \(q\)s.
8.1.3. Lagrangian version of the path integral#
So far the path integral formalism has been developed using the Hamiltonian. Now we’ll develop a version based on the Lagrangian. In fact, the integrand in the exponential power in Eq.8.1.24, leaving alone the \(\ifrak\epsilon\) terms, looks just like the corresponding Lagrangian (cf. Eq.6.1.11). However, there is an important difference, namely, the \(q\) and \(p\) variables in Eq.8.1.24 are independent variables, while in the Lagrangian formalism, they are related by Eq.6.1.14. As we’ll see, it turns out that when the Hamiltonian \(H\) is quadratic in \(p\) and the (timed-ordered) operators \(\Oscr_A, \Oscr_B, \cdots\), are independent of the \(P\)s, one can explicitly evaluate the integral in \(p\) in Eq.8.1.24, which will then produce the Lagrangian version of the path integral.
To spell out the details, let’s write down the (Heisenberg-picture) Hamiltonian in the most general form as follows
where \(A\) is a real, symmetric, positive matrix. Moreover \(H\) is written in the way that all the \(Q\) operators lie to the left of the \(P\) operators.
Now we can write the power in the exponential in Eq.8.1.24 without the \(\ifrak\epsilon\) terms as follows
where it’s organized so that the first summand on the right-hand-side is quadratic in \(p\), the second is linear, and the third is independent of \(p\). The reason to arrange the power in this form is because of the following (finite-dimensional) Gaussian integral formula.
Gaussian Integral Formula
(8.1.27)#\[\begin{split}&\int_{-\infty}^{\infty} \prod_s d\xi_s \exp\left( -\ifrak \left( \frac{1}{2} \sum_{s, r} \Ascr_{sr} \xi_s \xi_r + \sum_s \Bscr_s \xi_s + \Cscr_s \right) \right) \\ &\quad = \left( \det(\ifrak \Ascr / 2\pi) \right)^{-1/2} \exp\left( -\ifrak \left( \sum_{s, r} \Ascr_{sr} \bar{\xi}_s \bar{\xi}_r + \sum_s \Bscr_s \bar{\xi}_s + \Cscr_s \right) \right)\end{split}\]where \(\bar{\xi}\) is the (unique) stationary point of the quadratic power given explicitly by
\[\bar{\xi}_s = -\sum_r (\Ascr^{-1})_{sr} \Bscr_r\]
Note
In more general cases where \(H\) is not quadratic in \(P\), approximation techniques such as the stationary phase approximation may be applied.
To figure out the stationary point of the power in Eq.8.1.24 with respect to \(p\), let’s calculate the following variational derivative assuming the \(\ifrak\epsilon\) terms are independent of the \(p\)s
It follows that \(\bar{p}\) is stationary if it satisfies Hamilton’s equation
Assuming, in addition, that the (timed-ordered) operators \(\Oscr_A, \Oscr_B, \cdots\), are independent of the \(P\)s, we can evaluate the \(p\)-integral in Eq.8.1.24 using the (infinite-dimensional) Gaussian integral formula Eq.8.1.27 as follows
where \(L\) is the Lagrangian defined by
with \(\bar{p}\) satisfying Eq.8.1.28 and
is given by Eq.8.1.26.
Finally, we can write down the Lagrangian version of Eq.8.1.24 as follows
The rest of this section is devoted to the determination of \(\Ascr(q)\) in various examples.
- Scalar fields with non-derivative coupling
Following Eq.6.4.1, consider the following Lagrangian density of a set of (massless) scalar fields \(\Phi_n\) that have only non-derivative interaction \(V\) and are coupled to external currents \(J_n\)
\[\Lscr = -\sum_n \left( \frac{1}{2} \p_{\mu} \Phi_n \p^{\mu} \Phi_n + J_n^{\mu} \p_{\mu} \Phi_n \right) - V(\Phi)\]The canonical adjoint \(\Pi_n\) is, according to Eq.6.2.1, given by
\[\Pi_n = \frac{\delta \Lscr}{\delta \dot{\Phi}_n} = \dot{\Phi}_n - J_n^0\]and hence the Hamiltonian is, according to Eq.6.2.6, given by
\[\begin{split}H &= \int d^3x \left( \sum_n \Pi_n \dot{\Phi}_n - \Lscr \right) \\ &= \int d^3x \sum_n \left( \Pi_n (\Pi_n + J_n^0) - \frac{1}{2} (\Pi_n + J_n^0)^2 + \frac{1}{2} (\nabla \Phi_n)^2 + J_n^0 (\Pi_n + J_n^0) + \Jbf_n \cdot \nabla \Phi_n \right) \\ &\quad + \int d^3x~V(\Phi) \\ &= \int d^3x \sum_n \left( \frac{1}{2} (\Pi_n + J^0_n)^2 + \frac{1}{2} (\nabla \Phi_n)^2 + \Jbf_n \cdot \nabla \Phi_n \right) + \int d^3x~V(\Phi)\end{split}\]Comparing with Eq.8.1.25 and following Eq.8.1.30, we see that
\[\Ascr_{x n, x' n'} = \delta^4(x-x') \delta_{nn'}\]which is field independent, and therefore can be eliminated in the same way that \(\Nscr\) can be eliminated (cf. Eq.8.1.31).
- Nonlinear \(\sigma\)-model
The so-called nonlinear \(\sigma\)-model is described by the following Lagrangian density
\[\Lscr = -\frac{1}{2} \sum_{n, m} \p_{\mu} \Phi_n \p^{\mu} \Phi_m (\delta_{nm} + U_{nm}(\Phi)) - V(\Phi)\]where the nonlinearity is carried by \(U_{nm}(\Phi)\).
In this case the canonical adjoint \(\Pi_n\) is given by
\[\Pi_n = \frac{\delta \Lscr}{\delta \dot{\Phi}_n} = \sum_m \dot{\Phi}_m (\delta_{nm} + U_{nm}(\Phi))\]and can be solved in matrix notation as follows
\[\dot{\Phi}_n = \sum_m (1+U(\Phi))^{-1}_{nm} \Pi_m\]hence the Hamiltonian
\[\begin{split}H &= \int d^3x \left( \sum_n \Pi_n \dot{\Phi}_n - \Lscr \right) \\ &= \int d^3x \sum_{n, m} \left( \frac{1}{2} \Pi_n (1+U(\Phi))^{-1}_{nm} \Pi_m + \frac{1}{2} \nabla \Phi_n \cdot \nabla \Phi_m (1+U(\Phi))^{-1}_{nm} \right) + \int d^3x~V(\Phi)\end{split}\]In follows that
(8.1.32)#\[\Ascr_{xn, x'n'} = (1+U(\Phi))^{-1}_{nn'} \delta^4(x-x')\]which obviously depend on \(\Phi\), and therefore cannot be eliminated by the division by the vacuum expectation value. The idea then is to absorb it into the Lagrangian (density) which we now explain.
Looking at Eq.8.1.31, we note the following general identity
(8.1.33)#\[\det\Ascr = \exp \Tr \ln \Ascr\]for any real symmetric positive \(\Ascr\). To evaluate the logarithm, it’s convenient to discretize the Dirac delta function in Eq.8.1.32 as follows
\[\delta^4(x-x') = \Omega^{-1} \delta_{xx'}\]where \(\Omega\) denotes an infinitesimal volume in spacetime. It follows that
(8.1.34)#\[(\ln \Ascr)_{xn, x'n'} = \delta_{xx'} \left( -\ln(1+U(\Phi)) - \ln\Omega \right)_{nn'}\]where \(\ln\Omega\) is understood as a constant multiple as the identity matrix. Next note that the trace of \(\delta_{xx'}\) can be evaluated by
\[\Tr~\delta_{xx'} \cdots = \Omega^{-1} \int d^4x \cdots\]It follows that
\[\det\Ascr \propto \exp\left( -\Omega^{-1} \int d^4x~\Tr\ln(1+U(\Phi)) \right)\]where the proportionality constant, coming from the constant \(-\ln\Omega\) in Eq.8.1.34, is field-independent. Plugging into Eq.8.1.31, we see that Lagrangian density receives a correction term
\[\Delta\Lscr = -\frac{\ifrak}{2} \Omega^{-1} \Tr\ln(1+U(\Phi))\]which unfortunately contains a diverging term \(\Omega^{-1}\). This is known as an ultraviolet divergence since it comes from the infinitesimal spacetime volume. We’ll not address how it may be handled here.
- Vector fields
The two examples considered so far admit a Lagrangian without auxiliary fields (cf. Eq.6.2.8). To cover this case, consider the following Lagrangian for a set of non-interacting vector fields (cf. Eq.6.4.6)
(8.1.35)#\[\Lscr = -\sum_n \left( \frac{1}{4} F_{n \mu\nu} F_n^{\mu\nu} + \frac{1}{2} M^2 A_{n \mu} A_n^{\mu} + J_n^{\mu} A_{n\mu} \right)\]According to Eq.6.4.10, the corresponding Hamiltonian is given by
(8.1.36)#\[\begin{split}H &= \int d^3x \sum_n \left( \frac{1}{2} \bm{\Pi}_n^2 + \frac{1}{2M_n^2} (\nabla \cdot \bm{\Pi}_n)^2 + \frac{1}{M_n^2} J_n^0 \nabla \cdot \bm{\Pi}_n \right. \\ &\qquad \left. + \frac{1}{2} (\nabla \times \Abf_n)^2 + \frac{1}{2} M_n^2 \Abf_n^2 + \frac{1}{2M_n^2} (J_n^0)^2 - \Jbf_n \cdot \Abf_n \right)\end{split}\]where the terms are ordered in descending power of \(\bm{\Pi}\). Using the following calculation
\[\begin{split}\int d^3x~d^3y~\nabla_i \nabla_j \delta^3(\xbf-\ybf) \bm{\Pi}_n^i(x) \bm{\Pi}_n^j(y) &= -\int d^3x~d^3y~\nabla_j \delta^3(\xbf-\ybf) \p_i \bm{\Pi}_n^i(x) \bm{\Pi}_n^j(y) \\ &= -\int d^3x~d^3y~\delta^3(\xbf-\ybf) \p_i \bm{\Pi}_n^i(x) \p_j \bm{\Pi}_n^j(y) \\ &= -\int d^3x~\left( \nabla \cdot \bm{\Pi}_n \right)^2\end{split}\]we conclude that
\[\Ascr_{x i n, y j m} = \delta_{nm} \left( \delta_{ij}\delta^4(x-y) - \frac{1}{M_n^2} \nabla_i\nabla_j\delta^4(x-y) \right)\]which is field-independent. As before, it means that the term \(\det(2\pi\ifrak \Ascr(q))^{-1/2}\) in Eq.8.1.31 plays no role. Nonetheless, the Lagrangian defined by Eq.8.1.29 cannot be the same the original Eq.8.1.35 since the former doesn’t involve the time-component \(A_0\). As a consequence, the Lorentz invariance of Eq.8.1.31 is far from obvious.
To restore the manifest Lorentz invariance, let’s introduce, according to Eq.6.4.9, a correction term to the Hamiltonian \(H \to H + \Delta H\) where
(8.1.37)#\[\Delta H = -\frac{1}{2} \sum_n M_n^2 \int d^3x \left( A_n^0 - M_n^{-2} \nabla \cdot \bm{\Pi}_n - M_n^{-2} J_n^0 \right)^2\]Moreover, in addition to the integration of \(\Abf_n\) and \(\bm{\Pi}_n\) in Eq.8.1.24, we also integrate over \(A_n^0\). This addition doesn’t really make a difference to the physics since the integrant in \(\Delta H\) being a perfect square means that the integration over \(A_n^0\) will only introduce an insignificant field-independent factor to the matrix element.
Combining Eq.8.1.36 and Eq.8.1.37 together, we have
\[H + \Delta H = \int d^3x \sum_n \left( \frac{1}{2} \bm{\Pi}_n^2 + A_n^0 \nabla \cdot \bm{\Pi}_n + \frac{1}{2} (\nabla \times \Abf_n)^2 + \frac{1}{2} M_n^2 A_n^2 - J_n \cdot A_n \right)\]We see that the integrand is still quadratic in \(\bm{\Pi}\), whose integration, according to the Gaussian integral formula, can be done by replacing \(\bm{\Pi}_n\) with the solution to Eq.8.1.28, which reads
\[\dot{\Abf}_n = \bm{\Pi}_n - \nabla A_n^0 \iff \bm{\Pi}_n = \dot{\Abf}_n + \nabla A_n^0\]One can then verify that the Legendre transformed quantity
\[-H - \Delta H + \sum_n \int d^3x~\dot{\Abf}_n \cdot \bm{\Pi}_n\]indeed recovers the original Lagrangian density Eq.8.1.35.
We see from the above examples that it’s far from obvious to choose the correct Lagrangian in Eq.8.1.31. Moreover, the choice of canonical fields may not be the initial \(q\)s. Examples of this kind include the vector fields discussed above as well as QED which will be discussed later. Under these considerations, let’s rewrite Eq.8.1.31 as follows
where the field-independent constants \(|\Nscr|^2\) and the part of \(\det(2\pi\ifrak\Ascr)^{-1/2}\) are suppressed into the proportionality, and the field-dependent part of \(\det(2\pi\ifrak\Ascr)^{-1/2}\) is absorbed into the Lagrangian. In addition, the dependence of the \(\psi\)-fields on the right-hand-side on \(A, B, \cdots\), is suppressed into the index \(n\) in the product measure.
8.1.4. Path integral derivation of Feynman rules#
The vacuum expectation value of a time-ordered product of operators given by Eq.8.1.38 can be evaluated by Feynman diagrams, assuming the propagators have been worked out. However these diagrams may not all be connected. In particular, there is a set of \(2\)-component diagrams: one of them consists of vertices from only the timed-ordered operators, and the other consists of vertices from the interaction density. Such diagrams can be gotten rid of by considering the following normalized vacuum expectation value
Here a few notations have changed from the previous sections. Firstly, we’ve used \(\ell_A, \ell_B, \cdots\), instead of \(A, B, \cdots\), to label the time-ordered operators, which will allow us to unify the labels by \(\ell\). Secondly, the argument of the fields has changed from time such as \(t_A\) to spacetime coordinates \(x_A\), which of courses contain \(t_A\) as its time-component.
Now if the Hamiltonian is quadratic in the \(P\)-operators as discussed in Lagrangian version of the path integral, then Eq.8.1.38 implies that Eq.8.1.39 can be rewritten as
where
is the action.
Suppose, in the same vein as discussed in Perturbation Theory of S-matrix and specifically Eq.2.5.11, the Lagrangian is given by a density \(\Lscr\). Then following the philosophy of perturbation theory, let’s write it as the sum of a free part \(\Lscr_0\) and an interacting part \(\Lscr_1\). In other words
which, in turn, implies that
Such decomposition then allows us to write the exponential term in Eq.8.1.40 in the following form
where we’ve also expanded the second exponential of the interaction action. The reason to do so, or rather, to keep the first exponential of the free action, is that \(I_0[\psi]\) is typically, and will be assumed to be, quadratic. Indeed, an explicit example was worked out for scalar field in Eq.8.1.23, as long as we ignore the term \(e^{-\epsilon |\tau|}\) which spoils the quadraticity only in higher orders of \(\epsilon\).
If we write
then according to Eq.8.1.42, both the denominator and the numerator of Eq.8.1.40 are sums of integrals of the following form
where \(I_0[\psi]\) is quadratic. In the case of finite-dimensional integrals, this is a well-known extension of the Gaussian Integral Formula discussed above by integration-by-parts. More formally, this is known as Wick’s theorem which we recall as follows
Wick’s theorem
\[\begin{split}&\int \prod_r d\xi_s~\xi_{s_1} \xi_{s_2} \cdots \xi_{s_{2N}} \exp\left( -\frac{\ifrak}{2} \sum_{s, r} \Dscr_{sr} \xi_s \xi_r \right) \\ &\quad = \left( \det(\ifrak\Dscr / 2\pi) \right)^{-1/2} \sum_{\substack{\text{pairings} \\ \text{of } s_1, \cdots, s_{2N}}} \prod_{\text{pairs}} \left(-\ifrak\Dscr^{-1}\right)_{\text{paired indices}}\end{split}\]where \(\Dscr\) is a real, symmetric, positive matrix.
Applying Wick’s theorem to Eq.8.1.44 we get
where \(\Dscr\) is given by Eq.8.1.43. Observe that this evaluation, besides the unimportant field-independent factor \((\det(\ifrak\Dscr / 2\pi))^{-1/2}\), can be thought of as a sum over Feynman diagrams where the edges are paired fields that come from either the expansion of \(e^{\ifrak I_1[\psi]}\) or the timed-ordered operators in the denominator of Eq.8.1.40. Moreover, the “propagator” \(-\ifrak\Delta\) can be defined as follows
To invert \(\Dscr\) in spacetime coordinates, let’s rewrite Eq.8.1.45 as an integral equation as follows
Assuming translation-invariance of the theory, it follows that \(\Dscr\) can be written as a Fourier transform as follows
which, in turn, implies
We conclude the discussion with an example.
- Scalar field
Recall from Eq.6.1.12 that the free Lagrangian density takes the following form
\[\Lscr_0 = -\frac{1}{2} \p_{\mu} \phi \p^{\mu} \phi - \frac{1}{2} m^2 \phi^2\]It follows then from Eq.8.1.23 and Eq.8.1.41 that the free action \(I_0[\phi]\), up to the first order of \(\epsilon\), takes the following form
\[\begin{split}&I_0[\phi] \\ &= -\frac{1}{2} \int d^4x \left( \p_{\mu} \phi \p^{\mu} \phi + m^2 \phi^2 \right) + \frac{1}{2} \ifrak\epsilon \int dt \int d^3x~d^3x'~\Escr(\xbf, \xbf') \phi(t, \xbf) \phi(t, \xbf') \\ &= -\frac{1}{2} \int d^4x~d^4x' \left( \delta^4(x-x') (\p_{\mu} \phi \p^{\mu} \phi + m^2 \phi^2) - \ifrak\epsilon \delta(t-t') \Escr(\xbf, \xbf') \phi(x) \phi(x') \right) \\ &= -\frac{1}{2} \int d^4x~d^4x' \left( \frac{\p^2}{\p x^{\mu} \p x'_{\mu}} \delta^4(x-x') + m^2 \delta^4(x-x') -\ifrak\epsilon \delta(t-t') \Escr(\xbf, \xbf') \right) \phi(x) \phi(x') \\ &= -\frac{1}{2} \int d^4x~d^4x' \left( (2\pi)^{-4} \int d^4p~e^{\ifrak p \cdot (x-x')} \left( p^2 + m^2 - \ifrak\epsilon E(\pbf) \right) \right) \phi(x) \phi(x')\end{split}\]where in the last equality we’ve also used Eq.8.1.21.
Comparing with Eq.8.1.43 and Eq.8.1.46, we find
\[\Dscr(p) = p^2 + m^2 - \ifrak \epsilon E(\pbf)\]and therefore the propagator
\[\Delta(x, y) = (2\pi)^{-4} \int d^4p~e^{\ifrak p \cdot (x-y)} \left( p^2 + m^2 - \ifrak\epsilon E(\pbf) \right)^{-1}\]according to Eq.8.1.47. This recovers the Feynman propagator defined by Eq.5.2.8 and evaluated in Eq.5.2.11.
8.2. Path Integrals for Fermions#
We’ll develop the path integral formalism for fermions in parallel to the theory for bosons. The starting point is the commutation relations between Schrödinger-picture canonical variables
where the curly bracket denotes the anti-commutator. This is to be compared with the bosonic commutation relations Eq.8.1.1. As in the bosonic case, the indices \(a, b\) will be replaced by spacetime coordinates as we transit specifically to quantum field theory. A key difference, which will be discussed in more detail in Fermionic zero states, is that, unlike the bosonic canonical variables, the fermionic \(Q\) and \(P\) operators are not Hermitian.
8.2.1. Fermionic zero states#
It follows from Eq.8.2.1 that
Hence there must exist a ket-state \(\ket{0}\) and a bra-state \(\bra{0}\) such that
Indeed they can be explicitly constructed as follows
where \(\ket{f}\) and \(\bra{g}\) can be any states that makes the right-hand-sides nonzero. In particular the zero states are not in general unique. It turns out that in the absence of bosonic degrees of freedom, the zero states are unique up to a scalar, which can be chosen to satisfy the following normalization property
Note
The condition Eq.8.2.2 may seem a bit strange given that the fermionic \(Q\) and \(P\) operators are completely interchangeable in light of Eq.8.2.1. However, it cannot be the case that \(Q_a \ket{0} = \bra{0} Q_a = 0\) since it would imply \(\braket{0 | \{Q_a, P_b\} | 0} = 0\) in contradiction with Eq.8.2.3.
Indeed, the relationship between \(Q\) and \(P\) operators may vary. In Dirac’s theory of spin-\(1/2\) particles, we have \(Q_a^{\dagger} = -\ifrak P_a\) in light of Eq.6.4.18 and Eq.4.4.45 (cf. Eq.4.4.7 and Eq.4.4.10). In the theory of ghost field, on the other hand, the \(Q\) and \(P\) operators are not related at all.
8.2.2. Fermionic eigenstates#
In light of Eq.8.2.2, we can think \(P\) as the creation operators and \(Q\) as the annihilation operators. A complete basis of the states can then be obtained from \(\ket{0}\) by applying an arbitrary number of \(P\) operators as follows
Note that the basis state is anti-symmetric in the following sense
It follows from Eq.8.2.2 that
where \(\hat{a}_k\) means that it’s removed from the sequence.
Similarly, the dual basis can be obtained from \(\bra{0}\) as follows
The reason to define the dual vector this way is to realize the following normalization condition
The cases when \(\{ b_1, b_2, \cdots, b_M \}\) is a permutation of \(\{ a_1, a_2, \cdots, a_N \}\) can be covered using Eq.8.2.5.
The issue with the ket and bra-states defined by Eq.8.2.4 and Eq.8.2.6, respectively, is that they are not eigenstates of \(Q\) or \(P\). In fact, in sharp contrast to the bosonic case (cf. Eq.8.1.2), there cannot be any eigenstate of, say, all \(Q\) operators with nonzero (numeric) eigenvalues in the following sense
Indeed, the fermionic commutation relation Eq.8.2.1 would demand
which cannot be satisfied if \(q_a, q_b\) are nonzero complex numbers. It turns out that the solution to this difficulty, which may seem to be artificial, is to introduce a new set of “numbers” \(q_a\), known as Grassmann numbers which satisfy the following anti-commutation relations
Now the fermionic eigenstate equation Eq.8.2.8 as well as its dual can be solved by the following
where the exponential is defined using its Taylor expansion.
Warning
In the definition of \(\bra{q}\) there is a sign ambiguity depending on the ordering of the product of the \(Q\) operators.
The ket-state \(\ket{q}\) (e.g. \(\ket{0}\)) is not necessarily the adjoint of the corresponding bra-state \(\bra{q}\) (e.g. \(\bra{0}\)) since \(Q\) is not Hermitian.
Moreover the scalar product of the ket and bra \(Q\)-eigenstates can be evaluated as follow
where in the last step, we’ve used Eq.8.2.1 to move the \(Q\) operators to the right of the \(P\) operators. Though not obvious at the moment, the right-hand-side will work as a delta function in fermionic integrals (cf. the bosonic case Eq.8.1.3).
The eigenstate of the \(P\) operators satisfying
can be constructed in a way similar to Eq.8.2.9 as follows
where the order of the product \(\prod_a P_a\) is, by convention, the same as the one in Eq.8.2.9.
The scalar product between the \(P\)-eigenstates can be similarly evaluated to the following
In analogy to Eq.8.1.6, let’s calculate the scalar products between \(Q\) and \(P\)-eigenstates as follows
where Eq.8.2.6, Eq.8.2.7, and Eq.8.2.3 are used in the last equality. Here \(N\) is the number of \(Q_a\)s, which is the same as the number of \(P_a\)s. Similarly, but more simply, we have
We end this section with the note that the states \(\ket{q}\) are complete in the following sense. If we expand \(\ket{q}\) in Eq.8.2.9 as a power series in products of the \(q_a\)s, then the coefficients span the whole space of states defined by Eq.8.2.4.
8.2.3. Fermionic calculus#
Since the eigenvalues of fermionic eigenstates are Grassmann numbers rather than ordinary (complex) numbers, we need a framework to do calculus, in particular integration, for functions of Grassmann variables. It turns out that the fermionic integration can be formalized as the so-called Berezin integration, which can be determined by just two rules. Writing \(\xi\) for generic Grassmann variables, the first rule consists of the evaluation of the integral on a single monomial
and the second rule states that the integral is linear in both summation and multiplication by ordinary numbers. Here the ordering of the “differentials” \(d\xi_i\) in Eq.8.2.15 is made so that the integral can be evaluated in steps as follows
It turns out to be very convenient to introduce yet another anti-commutativity relation as follows
so we can move the integrand to the left of the “volume element” \(\prod_n d\xi_n\) at the cost of a sign. Under this convention, it’s straightforward to show that given an arbitrary function \(g(\xi')\) of Grassmann variables that are not integrated, the following two formulae hold
Another important formula in Berezin integration is to describe how the integral transforms under a (linear) change of variables. Consider the following transformation
where \(\Sscr = \left( \Sscr_{nm} \right)\) is a non-singular matrix of ordinary numbers. It follows that
and henceforth
As an application of this formalism, we’ll establish a fermionic analog of the completeness condition Eq.8.1.4. First, note that any state \(\ket{f}\) can be written as an integral
where \(f(q)\) is a polynomial in the Grassmann variables \(q\).
It follows from Eq.8.2.10 and Eq.8.2.16 that
where the easiest way to justify the second-to-last equality is to write \(f(q) = f(q' + (q - q'))\) and expand it in powers of \(q-q'\), so that only the zeroth order term survive due to the product \(\prod_n \left( q_n - q'_n \right)\) to the left.
Plugging Eq.8.2.19 into Eq.8.2.18 we have
which is the fermionic version of Eq.8.1.4. The same calculation can be done to the \(P\)-eigenstates to get the following
8.2.4. The general path integral formula and transition to S-matrix#
Let \(H\) be the (full) Hamiltonian. Then just as in the bosonic case (cf. Eq.8.1.7), we define Heisenberg-picture operators
with right and left-eigenstates defined as follows
Here we recall that \(H\) must contain an even number of fermionic fields and therefore commute with any Grassmann numbers. Just as in the bosonic case Eq.8.1.10, the time-dependent eigenstates satisfy the obviously analogous time-independent scalar product formulae Eq.8.2.10, Eq.8.2.12, Eq.8.2.13, Eq.8.2.14 and orthogonality conditions Eq.8.2.20, Eq.8.2.21.
Assuming \(H = H(P, Q)\) is arranged so that all the \(P\) operators lie to the left of the \(Q\) operators, we can calculate the infinitesimal transition amplitude in parallel to the bosonic case Eq.8.1.14 (except for the ordering \(Q\) and \(P\) which is merely a matter of convenience) as follows
where in the last quantity we’ve thrown away an insignificant field-independent phase factor (coming from Eq.8.2.13), and hence the proportionality is used instead of equality.
Now given a sequence of time-ordered operators, the matrix element analogous to the bosonic Eq.8.1.16 is given by
Note that, unlike the bosonic case, an extra sign is added to each permutation of the fermionic operators demanded by the time-ordering operator \(T\).
Transitioning to quantum field theory, we get the fermionic analog of Eq.8.1.24 as follows
where we’ve, once again, omitted the details of the \(\ifrak\epsilon\) terms. As in the bosonic case, they come from the vacuum wave functions (cf. Eq.8.1.23).
The next step in the bosonic case, as discussed in Lagrangian version of the path integral, is to assume that the Hamiltonian \(H(p, q)\) is quadratic in \(p\) and hence the integral in \(p\)-fields can be evaluated using the Gaussian integral formula (cf. Eq.8.1.31). This is not the case for fermions. Unlike the bosonic case (cf. Eq.8.1.28), the canonical conjugate \(p\) is unrelated to \(\dot{q}\). In fact, for each fermion that carries a nonzero quantum number, there are equal number of \(p\)s and \(q\)s. In particular, the free Hamiltonian \(H_0\) is bilinear in \(p\) and \(q\) so that
where \(\Dscr\) is a matrix in ordinary numbers.
Expanding both the time-ordered operators and the interaction Hamiltonian \(V = H - H_0\) in a power series in \(p\)s and \(q\)s, the right-hand-side of Eq.8.2.22 becomes a sum of Gaussian-like integrals and can be evaluated as follows
where \(\op{Perm}_N\) denotes the permutation group of \(N\) symbols. This reproduces the Feynman rules if we regard \(\left(\Dscr^{-1}\right)_{xm, yn}\) as the propagator between \(p_m(x)\) and \(q_n(y)\).
As an example, consider the free-particle action of spin-\(1/2\) particles as follows (cf. Eq.6.4.17 and Eq.6.4.19)
The canonical variables are given by (cf. Eq.6.4.18)
Comparing with Eq.8.2.23 we have
where we’ve omitted the details about the \(\ifrak\epsilon\) term since it’s not important here and can be worked out as the vacuum wave function in the same way as for (bosonic) scalar field (cf. Eq.8.1.22). It follows that the propagator can be written as follows
which recovers Eq.5.2.13 and Eq.5.2.5, except for that we’ve calculated here the propagator between \(\psi\) and \(-\bar{\psi} \gamma^0\), rather than \(\psi^{\dagger}\).
To illustrate how path-integral method works in this case, let’s consider the following interacting Lagrangian density
where \(\Gamma(x)\) represents the interaction of the fermion with an external field. Applying Eq.8.2.22, we can calculate the vacuum persistence amplitude (in matrix notation) as follows
where
Observe that the following change of variables
will make the integral in Eq.8.2.28 independent of \(\Gamma\). Hence it follows from Eq.8.2.17 that
To see that this calculation is consistent with the Feynman rules derived in Momentum Space Feynman Rules, rewrite Eq.8.2.29 as follows
where
It follows then (cf. Eq.8.1.33)
which is exactly what one would expect from a Feynman diagram calculation. Indeed, the Feynman rules demand that each vertex contributes a factor \(-\ifrak\Gscr(\Gamma)\) and each (internal) edge contributes a factor \(-\ifrak\Dscr\). In this example all connected Feynman diagram are simply loops. A loop with \(n\) vertices then contributes a factor \(1/n\) accounting for the cyclic symmetry. Finally the extra sign in \((-1)^{n+1}\) accounts for the fermionic loop.
It turns out that path integral calculations like Eq.8.2.30 do much more than just recovering perturbative calculations. But we’ll only come back to this much later.
8.2.5. Path-integral formulation of QED#
In Quantum Electrodynamics, we derived the photon propagator Eq.7.5.4 in a rather cumbersome way by working with non-Lorentz-invariant \(A\)-fields (cf. Eq.7.4.7 and Eq.7.4.8). In this section, we will re-derive Eq.7.5.4 in a cleaner manner using path integrals.
Recall from Eq.7.3.10 that the QED Hamiltonian can be written as follows
where the Coulomb potential \(V_{\op{Coul}}\) is given by Eq.7.3.11, under the Coulomb gauge condition
and the constraint (cf. Eq.7.3.6)
The general path integral calculation of the vacuum expectation value of a timed-ordered product of operators, whether the bosonic Eq.8.1.24 or the fermionic Eq.8.2.22 can be applied to the QED Hamiltonian to give the following
where the lowercase fields represent simply variables of integration, rather than interaction-picture fields as in Quantum Electrodynamics, the index \(i\) runs through the spatial \(1,2,3\) as usual, the matter field variables are represented by \(\psi_{\ell}\), and the last two delta functions are introduced to enforce the constraints Eq.8.2.31 and Eq.8.2.32.
First thing to observe is that since \(\Lscr_M\) doesn’t involve \(\bm{\pi}\) (cf. Eq.7.6.1 and Eq.6.4.17) and the power of the exponential is quadratic in \(\bm{\pi}\), it can be integrated out by the Gaussian integral formula Eq.8.1.27. More explicitly, it amounts to substitute \(\bm{\pi}\) with the solution to the stationary point of the quadratic power, which is \(\bm{\pi} = \dot{\abf}\), so we can rewrite Eq.8.2.33 as follows
Next we’d like to restore manifest Lorentz invariance by integrating also over \(a_0\). The trick is to consider the following quantity
whose exponential’s (path) integral over \(a_0(x)\) can be done by setting \(a_0(x)\) to the stationary point. In other words \(a_0(x)\) should solve the following differential equation
This is a rather familiar equation (cf. Eq.7.2.6), whose solution, given by Eq.7.2.7, is reproduced here
Plugging it back to Eq.8.2.35, we get nothing but the Coulomb action. It follows that the integral in the exponential in Eq.8.2.34 can be rewritten as follows (cf. Eq.7.3.3)