Skip to content
Miko-pedia Miko-pedia
← マップに戻る
量子力学 · AIのノート · 学部 · #quantum

Complete Quantum Mechanics

A concise undergraduate guide to wave mechanics, Hilbert space, operators, angular momentum, identical particles, perturbation theory, variational methods, and WKB.

· 29 分

このノートの改善点や、新しく欲しいノートを提案できます。

PDFを見る

Part I: Theory

Core ideas

Quantum mechanics is built upon a set of postulates that replace the deterministic trajectories of classical mechanics with a probabilistic framework based on Hilbert space.

Postulates of Quantum Mechanics.

  • The State Vector: The state of a physical system is completely described by a vector ψ(t)|\psi(t)\rangle in a complex Hilbert space.
  • Observables and Operators: Every physical observable (e.g., position, momentum, energy) corresponds to a Hermitian operator acting on this Hilbert space.
  • Measurement (Born’s Rule): If an observable AA is measured, the outcome must be one of the eigenvalues ana_n of the corresponding operator A^\hat A. The probability of obtaining ana_n is P(an)=anψ2P(a_n) = |\langle a_n|\psi\rangle|^2.
  • State Collapse: Immediately after a measurement of AA that yields ana_n, the state of the system “collapses” to the corresponding eigenvector an|a_n\rangle.
  • Time Evolution: Between measurements, the state evolves according to the Time-Dependent Schrodinger Equation:
iddtψ(t)=H^ψ(t),i\hbar\frac{d}{dt}|\psi(t)\rangle = \hat H|\psi(t)\rangle,
where $\hat H$ is the Hamiltonian (total energy) operator.

Superposition Principle. Because the Schrodinger equation is linear, any linear combination of valid states is also a valid state: ψ=c1ψ1+c2ψ2|\psi\rangle = c_1|\psi_1\rangle + c_2|\psi_2\rangle. This leads to interference effects, where amplitudes (not probabilities) add.

Unitary Evolution. For a time-independent Hamiltonian, the evolution operator is U^(t)=eiH^t/\hat U(t) = e^{-i\hat Ht/\hbar}. Since H^\hat H is Hermitian, U^\hat U is unitary (U^U^=I^\hat U^\dagger \hat U = \hat I), which ensures that the total probability ψψ=1\langle\psi|\psi\rangle = 1 is conserved over time.

Mathematical spine

iddtψ(t)=H^ψ(t)(Schrodinger Equation)A^an=anan,P(an)=anψ2(Eigenvalues and Probabilities)A^=ψA^ψ(Expectation Value)\begin{aligned} i\hbar\frac{d}{dt}|\psi(t)\rangle &= \hat H|\psi(t)\rangle & \text{(Schrodinger Equation)} \\ \hat A|a_n\rangle &= a_n|a_n\rangle, \quad P(a_n) = |\langle a_n|\psi\rangle|^2 & \text{(Eigenvalues and Probabilities)} \\ \langle \hat A \rangle &= \langle\psi|\hat A|\psi\rangle & \text{(Expectation Value)} \end{aligned}

Example: The Qubit. The simplest quantum system is a two-level system (qubit). A general state is ψ=α0+β1|\psi\rangle = \alpha|0\rangle + \beta|1\rangle with α2+β2=1|\alpha|^2 + |\beta|^2 = 1. Measuring in the {0,1}\{|0\rangle, |1\rangle\} basis yields 0 with probability α2|\alpha|^2.

Worked example

A qubit is prepared in the state ψ=120+12eiπ/41|\psi\rangle = \frac{1}{\sqrt{2}}|0\rangle + \frac{1}{\sqrt{2}}e^{i\pi/4}|1\rangle, where 0|0\rangle and 1|1\rangle are eigenstates of σ^z\hat\sigma_z with eigenvalues +1+1 and 1-1. We measure σz\sigma_z.

The probabilities are:

  • P(+1)=0ψ2=(1/2)2=1/2P(+1) = |\langle 0|\psi\rangle|^2 = (1/\sqrt{2})^2 = 1/2
  • P(1)=1ψ2=(1/2)2=1/2P(-1) = |\langle 1|\psi\rangle|^2 = (1/\sqrt{2})^2 = 1/2

The expectation value is

σ^z=12(+1)+12(1)=0.\langle \hat\sigma_z \rangle = \frac{1}{2}(+1) + \frac{1}{2}(-1) = 0.

If instead we measure σx\sigma_x, we rewrite ψ|\psi\rangle in the σx\sigma_x basis where +x=12(0+1)|+\rangle_x = \frac{1}{\sqrt{2}}(|0\rangle+|1\rangle) and x=12(01)|-\rangle_x = \frac{1}{\sqrt{2}}(|0\rangle-|1\rangle). The expansion gives P(+1)=cos2(π/8)0.854P(+1) = \cos^2(\pi/8) \approx 0.854 and P(1)=sin2(π/8)0.146P(-1) = \sin^2(\pi/8) \approx 0.146.

Problems with Solutions

Problem 1. Show that the time-evolution operator U^(t)=eiH^t/\hat U(t) = e^{-i\hat H t/\hbar} is unitary for a time-independent Hermitian Hamiltonian.

Solution. For H^\hat H Hermitian, H^=H^\hat H^\dagger = \hat H. Then

U^(t)=e+iH^t/=e+iH^t/=U^(t).\hat U^\dagger(t) = e^{+i\hat H^\dagger t/\hbar} = e^{+i\hat H t/\hbar} = \hat U(-t).

Therefore

U^(t)U^(t)=e+iH^t/eiH^t/=e0=I^,\hat U^\dagger(t)\hat U(t) = e^{+i\hat H t/\hbar}e^{-i\hat H t/\hbar} = e^{0} = \hat I,

so U^(t)\hat U(t) is unitary. Unitarity guarantees conservation of total probability.

Problem 2. A system is in state ψ=15(21+2+i3)|\psi\rangle = \frac{1}{\sqrt{5}}(2|1\rangle + |2\rangle + i|3\rangle), where {1,2,3}\{|1\rangle,|2\rangle,|3\rangle\} is an orthonormal basis. Normalize the state and find the probability of measuring the system in 1|1\rangle.

Solution. First check normalization:

ψψ=15(4+1+1)=651.\langle\psi|\psi\rangle = \frac{1}{5}(4 + 1 + 1) = \frac{6}{5} \neq 1.

The normalized state is ψnorm=56ψ=16(21+2+i3)|\psi_{\text{norm}}\rangle = \sqrt{\frac{5}{6}}|\psi\rangle = \frac{1}{\sqrt{6}}(2|1\rangle + |2\rangle + i|3\rangle). The probability of 1|1\rangle is

P(1)=1ψnorm2=262=46=23.P(1) = |\langle 1|\psi_{\text{norm}}\rangle|^2 = \left|\frac{2}{\sqrt{6}}\right|^2 = \frac{4}{6} = \frac{2}{3}.

Problem 3. If a measurement of observable A^\hat A on state ψ|\psi\rangle yields eigenvalue ana_n, what is the state immediately after measurement?

Solution. By the postulate of state collapse, immediately after obtaining ana_n the state collapses to the corresponding normalized eigenvector an|a_n\rangle:

ψafter=ananψanψ2=an.|\psi_{\text{after}}\rangle = \frac{|a_n\rangle\langle a_n|\psi\rangle}{\sqrt{|\langle a_n|\psi\rangle|^2}} = |a_n\rangle.

The phase is arbitrary and usually omitted.

Section summary. Quantum mechanics describes systems using state vectors in Hilbert space, with Hermitian operators representing observables and unitary operators representing time evolution.

The Wave Function

Core ideas

The state of a quantum particle is completely specified by a complex-valued wave function ψ(r,t)\psi(\bm r, t). It is a probability amplitude; the physical interpretation is given by Born’s rule.

Probability Interpretation and Normalization. The probability of finding the particle in a volume d3rd^3r is P(r,t)d3r=ψ(r,t)2d3rP(\bm r, t)d^3r = |\psi(\bm r, t)|^2 d^3r. Since the particle must be somewhere in space, the wave function must be normalized:

ψ(r,t)2d3r=1.\int_{-\infty}^{\infty} |\psi(\bm r, t)|^2 d^3r = 1.

For a wave function to be physically meaningful, it must be square-integrable (belong to the L2L^2 Hilbert space).

Expectation Values and Operators. Observables are represented by Hermitian operators A^\hat A. The average result of many measurements of AA on identically prepared states is the expectation value:

A=ψA^ψd3r.\langle A \rangle = \int \psi^* \hat A \psi \, d^3r.

Key operators in the position representation:

  • Position: r^=r\hat{\bm r} = \bm r
  • Momentum: p^=i\hat{\bm p} = -i\hbar \nabla

Probability Current. Local conservation of probability is expressed by the continuity equation:

ρt+j=0,ρ=ψ2.\frac{\partial \rho}{\partial t} + \nabla \cdot \bm j = 0, \quad \rho = |\psi|^2.

The probability current j\bm j represents the flow of probability density:

j=2mi(ψψψψ)=1mRe(ψp^ψ).\bm j = \frac{\hbar}{2mi} (\psi^* \nabla \psi - \psi \nabla \psi^*) = \frac{1}{m} \text{Re}(\psi^* \hat{\bm p} \psi).

Momentum Space and Fourier Transforms. The state can also be represented in momentum space by the function ϕ(p,t)\phi(\bm p, t). The two representations are related by a Fourier transform:

ψ(x)=1(2π)3/2ϕ(p)eipx/d3p,ϕ(p)=1(2π)3/2ψ(x)eipx/d3x.\psi(\bm x) = \frac{1}{(2\pi\hbar)^{3/2}} \int \phi(\bm p) e^{i\bm p \cdot \bm x / \hbar} d^3p, \quad \phi(\bm p) = \frac{1}{(2\pi\hbar)^{3/2}} \int \psi(\bm x) e^{-i\bm p \cdot \bm x / \hbar} d^3x.

This duality is the origin of the Heisenberg Uncertainty Principle: ΔxΔp/2\Delta x \Delta p \geq \hbar/2. A spatially localized wave packet (small Δx\Delta x) must be composed of many momentum states (large Δp\Delta p).

Mathematical spine

p^=i(Momentum Operator)j=mIm(ψψ)(Probability Current)ΔAΔB12[A^,B^](General Uncertainty Relation)\begin{aligned} \hat{\bm p} &= -i\hbar \nabla & \text{(Momentum Operator)} \\ \bm j &= \frac{\hbar}{m} \text{Im}(\psi^* \nabla \psi) & \text{(Probability Current)} \\ \Delta A \Delta B &\geq \frac{1}{2} | \langle [\hat A, \hat B] \rangle | & \text{(General Uncertainty Relation)} \end{aligned}

Example: Gaussian Wave Packet. A particle at rest localized at the origin can be modeled by:

ψ(x,0)=(1πσ2)1/4ex2/(2σ2).\psi(x, 0) = \left( \frac{1}{\pi \sigma^2} \right)^{1/4} e^{-x^2 / (2\sigma^2)}.

This state minimizes the uncertainty product: Δx=σ/2\Delta x = \sigma/\sqrt{2} and Δp=/(σ2)\Delta p = \hbar/(\sigma\sqrt{2}), so ΔxΔp=/2\Delta x \Delta p = \hbar/2. As the wave packet evolves in time, it spreads (dispersion).

Worked example

A free electron is described at t=0t=0 by the Gaussian wave packet

ψ(x,0)=(12πσ2)1/4ex2/(4σ2)eik0x,\psi(x,0) = \left(\frac{1}{2\pi\sigma^2}\right)^{1/4}e^{-x^2/(4\sigma^2)}e^{ik_0 x},

with σ=1.0nm\sigma = 1.0\,\text{nm} and k0=2.0×109m1k_0 = 2.0\times10^{9}\,\text{m}^{-1} (corresponding to momentum p0=k02.11×1025kg\cdotm/sp_0 = \hbar k_0 \approx 2.11\times10^{-25}\,\text{kg\cdotm/s}). The normalization constant is chosen so that ψ2dx=1\int_{-\infty}^{\infty}|\psi|^2dx = 1.

The probability density is

ψ(x,0)2=12πσ2ex2/(2σ2),|\psi(x,0)|^2 = \frac{1}{\sqrt{2\pi\sigma^2}}e^{-x^2/(2\sigma^2)},

which is a Gaussian centered at x=0x=0 with width Δx=σ=1.0nm\Delta x = \sigma = 1.0\,\text{nm}.

The momentum-space wave function is also Gaussian:

ϕ(p)=(2σ2π2)1/4eσ2(pp0)2/2.\phi(p) = \left(\frac{2\sigma^2}{\pi\hbar^2}\right)^{1/4}e^{-\sigma^2(p-p_0)^2/\hbar^2}.

The momentum width is Δp=/(2σ)5.28×1026kg\cdotm/s\Delta p = \hbar/(2\sigma) \approx 5.28\times10^{-26}\,\text{kg\cdotm/s}.

Check the uncertainty principle:

ΔxΔp=σ2σ=2,\Delta x\,\Delta p = \sigma \cdot \frac{\hbar}{2\sigma} = \frac{\hbar}{2},

so this state is a minimum-uncertainty wave packet.

Problems with Solutions

Problem 1. A particle has wave function ψ(x)=Aex/a\psi(x) = A e^{-|x|/a} with a=0.5nma = 0.5\,\text{nm}. Find the normalization constant AA and the probability of finding the particle in the region 0<x<a0 < x < a.

Solution. Normalization:

1=A2e2x/adx=2A20e2x/adx=2A2a2=A2a.1 = A^2\int_{-\infty}^{\infty}e^{-2|x|/a}dx = 2A^2\int_0^{\infty}e^{-2x/a}dx = 2A^2\frac{a}{2} = A^2 a.

Thus A=1/a=2nm1/21.41nm1/2A = 1/\sqrt{a} = \sqrt{2}\,\text{nm}^{-1/2} \approx 1.41\,\text{nm}^{-1/2}.

Probability:

P(0<x<a)=A20ae2x/adx=1a[a2e2x/a]0a=12(1e2)0.432.P(0<x<a) = A^2\int_0^a e^{-2x/a}dx = \frac{1}{a}\left[-\frac{a}{2}e^{-2x/a}\right]_0^a = \frac{1}{2}(1-e^{-2}) \approx 0.432.

Problem 2. For a particle with ψ(x)=2/Lsin(πx/L)\psi(x) = \sqrt{2/L}\sin(\pi x/L) in an infinite well 0<x<L0<x<L, calculate the probability current j(x)j(x).

Solution. Using j=2mi(ψψψψ)\bm j = \frac{\hbar}{2mi}(\psi^*\nabla\psi - \psi\nabla\psi^*):

ψ=ψ=2Lsin(πxL),dψdx=2LπLcos(πxL).\psi^* = \psi = \sqrt{\frac{2}{L}}\sin\left(\frac{\pi x}{L}\right), \quad \frac{d\psi}{dx} = \sqrt{\frac{2}{L}}\frac{\pi}{L}\cos\left(\frac{\pi x}{L}\right).

Then

j(x)=2mi[ψdψdxψdψdx]=0,j(x) = \frac{\hbar}{2mi}\left[\psi\frac{d\psi}{dx} - \psi\frac{d\psi}{dx}\right] = 0,

since ψ\psi is real. A stationary state in a bound potential has zero probability current.

Problem 3. Show that a Gaussian wave packet ψ(x)=(2πσ2)1/4ex2/(4σ2)\psi(x) = (2\pi\sigma^2)^{-1/4}e^{-x^2/(4\sigma^2)} satisfies ΔxΔp=/2\Delta x\,\Delta p = \hbar/2.

Solution. The position probability density is ψ2=(2πσ2)1/2ex2/(2σ2)|\psi|^2 = (2\pi\sigma^2)^{-1/2}e^{-x^2/(2\sigma^2)}, a Gaussian with variance σ2\sigma^2, so Δx=σ\Delta x = \sigma.

The momentum-space wave function is

ϕ(p)=(2σ2π2)1/4eσ2p2/2,\phi(p) = \left(\frac{2\sigma^2}{\pi\hbar^2}\right)^{1/4}e^{-\sigma^2 p^2/\hbar^2},

with probability density ϕ(p)2=2σ2/(π2)e2σ2p2/2|\phi(p)|^2 = \sqrt{2\sigma^2/(\pi\hbar^2)}\,e^{-2\sigma^2 p^2/\hbar^2}. This is a Gaussian with variance 2/(4σ2)\hbar^2/(4\sigma^2), so Δp=/(2σ)\Delta p = \hbar/(2\sigma).

Therefore ΔxΔp=σ/(2σ)=/2\Delta x\,\Delta p = \sigma \cdot \hbar/(2\sigma) = \hbar/2.

Section summary. The wave function is a normalized probability amplitude whose evolution is constrained by probability conservation and whose spread is limited by the uncertainty principle.

Time-Independent Schrodinger Equation

Core ideas

When the potential V(r)V(\bm r) is independent of time, the TDSE can be solved using separation of variables: Ψ(r,t)=ψ(r)eiEt/\Psi(\bm r, t) = \psi(\bm r) e^{-iEt/\hbar}. This leads to the Time-Independent Schrodinger Equation (TISE):

H^ψ=Eψ,[22m2+V(r)]ψ=Eψ.\hat H \psi = E \psi, \quad \left[ -\frac{\hbar^2}{2m} \nabla^2 + V(\bm r) \right] \psi = E \psi.

This is an eigenvalue problem for the energy EE. The solutions ψn\psi_n are stationary states because the probability density Ψ2=ψ2|\Psi|^2 = |\psi|^2 is constant in time.

Standard Solvable Models. Undergraduate quantum mechanics focuses on a few exactly solvable 1D potentials:

  • Infinite Square Well: V=0V=0 for 0<x<a0 < x < a, and \infty otherwise.
En=n2π222ma2,ψn(x)=2asin(nπxa),n=1,2,E_n = \frac{n^2 \pi^2 \hbar^2}{2 m a^2}, \quad \psi_n(x) = \sqrt{\frac{2}{a}} \sin\left(\frac{n\pi x}{a}\right), \quad n=1, 2, \dots
Quantization arises from the boundary condition $\psi(0) = \psi(a) = 0$.
  • Harmonic Oscillator: V(x)=12mω2x2V(x) = \frac{1}{2}m\omega^2 x^2.
En=(n+12)ω,n=0,1,2,E_n = \left( n + \frac{1}{2} \right) \hbar \omega, \quad n=0, 1, 2, \dots
The **zero-point energy** $E_0 = \frac{1}{2}\hbar\omega$ is the minimum possible energy, a direct consequence of the uncertainty principle.

Nodes and Energy. The nn-th excited state (for a 1D bound system) has nn nodes (points where ψ=0\psi=0). Higher energy states have more nodes because more “wiggles” correspond to higher curvature (2ψ\nabla^2 \psi) and thus higher kinetic energy.

Tunneling and Bound States.

  • Bound States: Discrete energy levels (E<VE < V_\infty).
  • Scattering States: Continuous energy spectrum (E>VE > V_\infty).
  • Tunneling: A particle has a non-zero probability of being found in classically forbidden regions (E<V(x)E < V(x)), where the wave function decays exponentially rather than oscillating.

Mathematical spine

H^ψ=Eψ(TISE)En=n2π222ma2(Infinite Well Energies)En=(n+12)ω(SHO Energies)\begin{aligned} \hat H \psi &= E \psi & \text{(TISE)} \\ E_n &= \frac{n^2 \pi^2 \hbar^2}{2 m a^2} & \text{(Infinite Well Energies)} \\ E_n &= \left( n + \frac{1}{2} \right) \hbar \omega & \text{(SHO Energies)} \end{aligned}

Example: Finite Square Well. Unlike the infinite well, the finite well has a finite number of bound states. The wave function “leaks” into the walls, leading to lower energy levels than an infinite well of the same width.

Worked example

An electron is confined in a 1D infinite square well of width a=2.0nma = 2.0\,\text{nm}. The allowed energies are

En=n2π222mea2=n2π2(1.055×1034J\@cdots)22(9.11×1031kg)(2.0×109m)2n2(0.094eV).E_n = \frac{n^2\pi^2\hbar^2}{2m_e a^2} = n^2\frac{\pi^2(1.055\times10^{-34}\,\text{J\cdots})^2}{2(9.11\times10^{-31}\,\text{kg})(2.0\times10^{-9}\,\text{m})^2} \approx n^2(0.094\,\text{eV}).

For the ground state (n=1n=1): E10.094eVE_1 \approx 0.094\,\text{eV}. For the first excited state (n=2n=2): E2=4E10.376eVE_2 = 4E_1 \approx 0.376\,\text{eV}. The energy spacing is ΔE21=E2E10.282eV\Delta E_{21} = E_2 - E_1 \approx 0.282\,\text{eV}.

The corresponding de Broglie wavelength for n=1n=1 is λ1=2a=4.0nm\lambda_1 = 2a = 4.0\,\text{nm}, which equals twice the well width. The wave function has nodes at the boundaries and no nodes inside.

Problems with Solutions

Problem 1. For a quantum harmonic oscillator with m=1.0×1026kgm = 1.0\times10^{-26}\,\text{kg} and ω=3.0×1014rad/s\omega = 3.0\times10^{14}\,\text{rad/s}, calculate the ground-state energy and the energy spacing between adjacent levels in electron-volts.

Solution. The ground-state energy is

E0=12ω=12(1.055×1034)(3.0×1014)J=3.165×10202J=1.583×1020J.E_0 = \frac{1}{2}\hbar\omega = \frac{1}{2}(1.055\times10^{-34})(3.0\times10^{14})\,\text{J} = \frac{3.165\times10^{-20}}{2}\,\text{J} = 1.583\times10^{-20}\,\text{J}.

Converting to eV: E0=1.583×1020/(1.602×1019)0.0988eVE_0 = 1.583\times10^{-20}/(1.602\times10^{-19}) \approx 0.0988\,\text{eV}.

The spacing between adjacent levels is ΔE=ω=2E00.198eV\Delta E = \hbar\omega = 2E_0 \approx 0.198\,\text{eV}.

Problem 2. A particle in an infinite square well of width aa is in the superposition state ψ(x)=12[ψ1(x)+ψ2(x)]\psi(x) = \frac{1}{\sqrt{2}}[\psi_1(x) + \psi_2(x)], where ψn(x)=2/asin(nπx/a)\psi_n(x) = \sqrt{2/a}\sin(n\pi x/a). Find the expectation value of the energy.

Solution. The expectation value is

E=12E1+12E2=12π222ma2+124π222ma2=5π224ma2.\langle E \rangle = \frac{1}{2}E_1 + \frac{1}{2}E_2 = \frac{1}{2}\frac{\pi^2\hbar^2}{2ma^2} + \frac{1}{2}\frac{4\pi^2\hbar^2}{2ma^2} = \frac{5\pi^2\hbar^2}{4ma^2}.

With a=1.0nma = 1.0\,\text{nm} and m=mem = m_e:

E=52E152(0.376eV)=0.940eV.\langle E \rangle = \frac{5}{2}E_1 \approx \frac{5}{2}(0.376\,\text{eV}) = 0.940\,\text{eV}.

Problem 3. Estimate the number of bound states in a finite square well of depth V0=5eVV_0 = 5\,\text{eV} and width a=1.0nma = 1.0\,\text{nm} for an electron.

Solution. A rough estimate is obtained by comparing the well depth to the energy spacing of an infinite well of the same width: E1=π22/(2mea2)0.376eVE_1^{\infty} = \pi^2\hbar^2/(2m_e a^2) \approx 0.376\,\text{eV}. The number of bound states is approximately NV0/E1=5/0.376=13.3=3.65=3N \approx \lfloor\sqrt{V_0/E_1^{\infty}}\rfloor = \lfloor\sqrt{5/0.376}\rfloor = \lfloor\sqrt{13.3}\rfloor = \lfloor 3.65 \rfloor = 3. A more accurate numerical solution for this well gives either 3 or 4 bound states depending on the exact parameters, with the crude estimate capturing the correct order of magnitude.

Section summary. The TISE converts the search for energy states into an eigenvalue problem, yielding quantized energy levels for bound systems and predicting phenomena like zero-point energy and tunneling.

Formalism

Core ideas

The mathematical language of quantum mechanics is linear algebra in Hilbert space. States are represented by vectors, and observables by Hermitian operators.

Dirac Notation and Hilbert Space.

  • Ket ψ|\psi\rangle: A state vector in Hilbert space.
  • Bra ψ\langle\psi|: The dual vector (complex conjugate transpose).
  • Inner Product ϕψ\langle\phi|\psi\rangle: A complex number representing the amplitude of ψ\psi in state ϕ\phi.
  • Completeness: For an orthonormal basis {n}\{|n\rangle\}, the sum of projection operators equals the identity: nnn=I^\sum_n |n\rangle\langle n| = \hat I.

Hermitian Operators and Measurement. An operator A^\hat A is Hermitian if A^=A^\hat A = \hat A^\dagger. Its eigenvalues are real, and its eigenvectors form a complete basis. Measurement “projects” the state:

ψ=ncnan,cn=anψ.|\psi\rangle = \sum_n c_n |a_n\rangle, \quad c_n = \langle a_n|\psi\rangle.

The probability of measuring ana_n is cn2|c_n|^2.

Commutators and CSCO. Two observables can be measured simultaneously if and only if their operators commute: [A^,B^]=0[\hat A, \hat B] = 0. A Complete Set of Commuting Observables (CSCO) is a set of operators whose joint eigenvectors uniquely specify a state. Non-commuting operators satisfy the generalized uncertainty relation.

Schrodinger vs. Heisenberg Pictures.

  • Schrodinger Picture: States evolve in time (ψ(t)|\psi(t)\rangle), operators are stationary.
  • Heisenberg Picture: Operators evolve in time (A^(t)=U^A^U^\hat A(t) = \hat U^\dagger \hat A \hat U), states are stationary. The equations of motion are dA^dt=i[H^,A^]+A^t\frac{d\hat A}{dt} = \frac{i}{\hbar}[\hat H, \hat A] + \frac{\partial \hat A}{\partial t}.

Mathematical spine

A^=ψA^ψ(Expectation Value)nnn=I^(Completeness/Resolution of Identity)dAdt=i[H^,A^]+A^t(Ehrenfest’s Theorem)\begin{aligned} \langle \hat A \rangle &= \langle\psi|\hat A|\psi\rangle & \text{(Expectation Value)} \\ \sum_n |n\rangle\langle n| &= \hat I & \text{(Completeness/Resolution of Identity)} \\ \frac{d\langle A \rangle}{dt} &= \frac{i}{\hbar} \langle [\hat H, \hat A] \rangle + \langle \frac{\partial \hat A}{\partial t} \rangle & \text{(Ehrenfest's Theorem)} \end{aligned}

Example: Spin-1/2. The state of a spin-1/2 particle (e.g., an electron) is a spinor in a 2D Hilbert space. The observables are the Pauli matrices σx,σy,σz\sigma_x, \sigma_y, \sigma_z. They do not commute, e.g., [σx,σy]=2iσz[\sigma_x, \sigma_y] = 2i\sigma_z, so spin components along different axes cannot be known simultaneously.

Worked example

An electron (spin-1/21/2) is prepared in the state ψ=13+z+23z|\psi\rangle = \frac{1}{\sqrt{3}}|+\rangle_z + \sqrt{\frac{2}{3}}|-\rangle_z, where S^z±z=±2±z\hat S_z|\pm\rangle_z = \pm\frac{\hbar}{2}|\pm\rangle_z and =1.055×1034J\@cdots\hbar = 1.055\times10^{-34}\,\text{J\cdots}. We measure SzS_z.

The probabilities are obtained from the expansion coefficients:

  • P(+/2)=+ψ2=(1/3)2=1/3P(+\hbar/2) = |\langle +|\psi\rangle|^2 = (1/\sqrt{3})^2 = 1/3
  • P(/2)=ψ2=(2/3)2=2/3P(-\hbar/2) = |\langle -|\psi\rangle|^2 = (\sqrt{2/3})^2 = 2/3

Check: 1/3+2/3=11/3 + 2/3 = 1. The expectation value is

S^z=13(+2)+23(2)=61.76×1035J\@cdots.\langle \hat S_z \rangle = \frac{1}{3}\left(+\frac{\hbar}{2}\right) + \frac{2}{3}\left(-\frac{\hbar}{2}\right) = -\frac{\hbar}{6} \approx -1.76\times10^{-35}\,\text{J\cdots}.

If the measurement yields +/2+\hbar/2, the state collapses to +z|+\rangle_z.

Problems with Solutions

Problem 1. Verify the commutator [x^,p^]=i[\hat x, \hat p] = i\hbar in the position representation.

Solution. With x^=x\hat x = x and p^=ix\hat p = -i\hbar\frac{\partial}{\partial x}, act on a test function f(x)f(x):

[x^,p^]f=x(ifx)(i)x(xf)=ixfx+if+ixfx=if.[\hat x, \hat p]f = x\left(-i\hbar\frac{\partial f}{\partial x}\right) - (-i\hbar)\frac{\partial}{\partial x}(xf) = -i\hbar x\frac{\partial f}{\partial x} + i\hbar f + i\hbar x\frac{\partial f}{\partial x} = i\hbar f.

Since this holds for all f(x)f(x), [x^,p^]=i[\hat x, \hat p] = i\hbar.

Problem 2. A system has orthonormal basis {1,2}\{|1\rangle, |2\rangle\}. Observable A^\hat A has eigenvalues a1=5a_1 = 5 with eigenvector 1|1\rangle and a2=1a_2 = -1 with eigenvector 2|2\rangle. The system is in state ψ=121+322|\psi\rangle = \frac{1}{2}|1\rangle + \frac{\sqrt{3}}{2}|2\rangle. Calculate A^\langle \hat A \rangle, A^2\langle \hat A^2 \rangle, and the uncertainty ΔA\Delta A. Solution.

A^=14(5)+34(1)=534=0.5.\langle \hat A \rangle = \frac{1}{4}(5) + \frac{3}{4}(-1) = \frac{5-3}{4} = 0.5. A^2=14(25)+34(1)=7.\langle \hat A^2 \rangle = \frac{1}{4}(25) + \frac{3}{4}(1) = 7. ΔA=A^2A^2=70.25=6.752.60.\Delta A = \sqrt{\langle \hat A^2 \rangle - \langle \hat A \rangle^2} = \sqrt{7 - 0.25} = \sqrt{6.75} \approx 2.60.

Problem 3. Show that the Pauli matrices satisfy σx2=σy2=σz2=I\sigma_x^2 = \sigma_y^2 = \sigma_z^2 = I and [σx,σy]=2iσz[\sigma_x, \sigma_y] = 2i\sigma_z.

Solution. Using σx=(0110)\sigma_x = \begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix}, σy=(0ii0)\sigma_y = \begin{pmatrix}0 & -i \\ i & 0\end{pmatrix}, σz=(1001)\sigma_z = \begin{pmatrix}1 & 0 \\ 0 & -1\end{pmatrix}:

σx2=(0110)(0110)=(1001)=I,\sigma_x^2 = \begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix}\begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix} = \begin{pmatrix}1 & 0 \\ 0 & 1\end{pmatrix} = I,

and similarly for σy2\sigma_y^2 and σz2\sigma_z^2. For the commutator:

[σx,σy]=(0110)(0ii0)(0ii0)(0110)=(i00i)(i00i)=(2i002i)=2iσz.[\sigma_x, \sigma_y] = \begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix}\begin{pmatrix}0 & -i \\ i & 0\end{pmatrix} - \begin{pmatrix}0 & -i \\ i & 0\end{pmatrix}\begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix} = \begin{pmatrix}i & 0 \\ 0 & -i\end{pmatrix} - \begin{pmatrix}-i & 0 \\ 0 & i\end{pmatrix} = \begin{pmatrix}2i & 0 \\ 0 & -2i\end{pmatrix} = 2i\sigma_z.

Section summary. The formalism provides a representation-independent framework using Dirac notation, Hermitian operators, and completeness relations to describe states and measurements.

Quantum Mechanics in Three Dimensions

Core ideas

Extending quantum mechanics to 3D involves the Laplacian 2\nabla^2. For central potentials V(r)V(r), the problem separates into radial and angular parts.

Spherical Coordinates and Separation of Variables. Using ψ(r,θ,ϕ)=R(r)Ym(θ,ϕ)\psi(r, \theta, \phi) = R(r) Y_{\ell m}(\theta, \phi), where YmY_{\ell m} are the spherical harmonics:

  • Angular Part: Solutions to the angular momentum eigenvalue problem:
L^2Ym=2(+1)Ym,L^zYm=mYm.\hat L^2 Y_{\ell m} = \hbar^2 \ell(\ell+1) Y_{\ell m}, \quad \hat L_z Y_{\ell m} = \hbar m Y_{\ell m}.
  • Radial Part: The function u(r)=rR(r)u(r) = rR(r) satisfies a 1D-like equation with an effective potential:
22md2udr2+[V(r)+2(+1)2mr2]u=Eu.-\frac{\hbar^2}{2m} \frac{d^2u}{dr^2} + \left[ V(r) + \frac{\hbar^2 \ell(\ell+1)}{2mr^2} \right] u = E u.
The term proportional to $\ell(\ell+1)/r^2$ is the **centrifugal barrier**.

Hydrogen Atom and Quantum Numbers. For the Coulomb potential V(r)=e2/(4πϵ0r)V(r) = -e^2/(4\pi\epsilon_0 r), the energies are quantized:

En=me432π2ϵ0221n213.6 eVn2.E_n = -\frac{me^4}{32\pi^2\epsilon_0^2\hbar^2} \frac{1}{n^2} \approx -\frac{13.6 \text{ eV}}{n^2}.

The state is labeled by three quantum numbers:

  • nn (Principal): n=1,2,n = 1, 2, \dots (Determines energy).
  • \ell (Azimuthal): =0,1,,n1\ell = 0, 1, \dots, n-1 (Magnitude of L\bm L).
  • mm (Magnetic): m=,,+m = -\ell, \dots, +\ell (zz-component of L\bm L).

Spin and Addition of Angular Momentum. Spin S\bm S is intrinsic angular momentum. For an electron (s=1/2s=1/2), Sz=±/2S_z = \pm \hbar/2. When combining two angular momenta J1\bm J_1 and J2\bm J_2 (e.g., LL and SS), the total JJ can range from j1j2|j_1-j_2| to j1+j2j_1+j_2. The states in the combined basis are related to the individual bases by Clebsch—Gordan coefficients.

Mathematical spine

Veff(r)=V(r)+2(+1)2mr2(Effective Potential)S^i=2σi(Spin and Pauli Matrices)J=L+S,[J2,L2]=0(Addition of Angular Momentum)\begin{aligned} V_{\rm eff}(r) &= V(r) + \frac{\hbar^2 \ell(\ell+1)}{2mr^2} & \text{(Effective Potential)} \\ \hat S_i &= \frac{\hbar}{2} \sigma_i & \text{(Spin and Pauli Matrices)} \\ \bm J &= \bm L + \bm S, \quad [J^2, L^2] = 0 & \text{(Addition of Angular Momentum)} \end{aligned}

Example: The Stern—Gerlach Experiment. This experiment demonstrated the quantization of spin by passing a beam of silver atoms through a non-uniform magnetic field, splitting the beam into two distinct spots corresponding to sz=±/2s_z = \pm \hbar/2.

Worked example

A hydrogen atom is in the n=2n=2, =1\ell=1, m=0m=0 state. The energy is E2=13.6eV/22=3.40eVE_2 = -13.6\,\text{eV}/2^2 = -3.40\,\text{eV}. The Bohr radius is a0=0.529×1010ma_0 = 0.529\times10^{-10}\,\text{m}. The radial probability density for the 2p2p state is

P21(r)=r2R21(r)2=r2(13(2a0)3/2ra0er/(2a0))2=r424a05er/a0.P_{21}(r) = r^2|R_{21}(r)|^2 = r^2\left(\frac{1}{\sqrt{3}(2a_0)^{3/2}}\frac{r}{a_0}e^{-r/(2a_0)}\right)^2 = \frac{r^4}{24a_0^5}e^{-r/a_0}.

To find the most probable radius, set dP21/dr=0dP_{21}/dr = 0:

ddr(r4er/a0)=4r3er/a0r4a0er/a0=0r=4a02.12×1010m=0.212nm.\frac{d}{dr}\left(r^4 e^{-r/a_0}\right) = 4r^3 e^{-r/a_0} - \frac{r^4}{a_0}e^{-r/a_0} = 0 \Rightarrow r = 4a_0 \approx 2.12\times10^{-10}\,\text{m} = 0.212\,\text{nm}.

This is twice the most probable radius of the 1s1s state (r=a0r = a_0), reflecting the larger spatial extent of excited states.

Problems with Solutions

Problem 1. An electron in a hydrogen atom is in the state with quantum numbers n=3n=3, =2\ell=2, m=1m=-1. What are the possible values of L2L^2, LzL_z, and the energy? Solution.

  • L2=2(+1)=2(2)(3)=626.67×1068J2\@cdots2L^2 = \hbar^2\ell(\ell+1) = \hbar^2(2)(3) = 6\hbar^2 \approx 6.67\times10^{-68}\,\text{J}^2\text{\cdots}^2
  • Lz=m=1.055×1034J\@cdotsL_z = m\hbar = -\hbar \approx -1.055\times10^{-34}\,\text{J\cdots}
  • E3=13.6eV/32=1.51eVE_3 = -13.6\,\text{eV}/3^2 = -1.51\,\text{eV}

Problem 2. What are the possible total angular momentum quantum numbers jj when combining orbital angular momentum =1\ell=1 with spin s=1/2s=1/2?

Solution. Using the addition rule j1j2jj1+j2|j_1-j_2| \leq j \leq j_1+j_2:

j=11/2,11/2+1,,1+1/2=1/2,3/2.j = |1 - 1/2|, |1 - 1/2| + 1, \dots, 1 + 1/2 = 1/2, 3/2.

So there are two possible jj values: j=1/2j=1/2 (doublet) and j=3/2j=3/2 (quartet).

Problem 3. For a particle in a central potential, show that the effective potential for the radial motion contains a centrifugal barrier term.

Solution. Starting from the 3D TISE with ψ(r,θ,ϕ)=R(r)Ym(θ,ϕ)\psi(r,\theta,\phi) = R(r)Y_{\ell m}(\theta,\phi) and substituting u(r)=rR(r)u(r) = rR(r):

22md2udr2+[V(r)+2(+1)2mr2]u=Eu.-\frac{\hbar^2}{2m}\frac{d^2u}{dr^2} + \left[V(r) + \frac{\hbar^2\ell(\ell+1)}{2mr^2}\right]u = Eu.

The term Vcf(r)=2(+1)2mr2V_{\text{cf}}(r) = \frac{\hbar^2\ell(\ell+1)}{2mr^2} is the centrifugal barrier. For >0\ell > 0, it diverges as r0r\to 0, preventing the particle from reaching the origin.

Section summary. Three-dimensional systems with central symmetry are solved by separating radial and angular motion, leading to the quantization of orbital and intrinsic (spin) angular momentum.

Identical Particles

Core ideas

Identical quantum particles cannot be labeled in a physically meaningful way. Exchanging two identical particles cannot produce a new observable state. In three dimensions, many-particle wave functions are either symmetric or antisymmetric:

ψ(x1,x2)=+ψ(x2,x1)for bosons,\psi(x_1,x_2)=+\psi(x_2,x_1)\quad \text{for bosons}, ψ(x1,x2)=ψ(x2,x1)for fermions.\psi(x_1,x_2)=-\psi(x_2,x_1)\quad \text{for fermions}.

Bosons have integer spin and can occupy the same one-particle state. Fermions have half-integer spin and obey the Pauli exclusion principle. The Pauli principle explains shell structure in atoms and the structure of the periodic table.

For two fermions the antisymmetric state can be written as a Slater determinant. For particles with spin, the total wave function includes spatial and spin parts. The product must have the correct exchange symmetry.

Mathematical spine

ψ(x1,x2)=±ψ(x2,x1),Ψfermion(1,2)=12ϕa(1)ϕb(1)ϕa(2)ϕb(2).\psi(x_1,x_2)=\pm\psi(x_2,x_1),\qquad \Psi_{\rm fermion}(1,2)=\frac{1}{\sqrt2} \begin{vmatrix} \phi_a(1)&\phi_b(1)\\ \phi_a(2)&\phi_b(2) \end{vmatrix}.

Working details

Exchange symmetry applies to the complete state, including spin and spatial parts. Two electrons in the same spatial orbital can coexist only if their spin state is antisymmetric, the singlet. If the spin state is symmetric, the spatial state must be antisymmetric, reducing the probability of finding the particles close together.

For many fermions, the Slater determinant automatically changes sign when two particle labels are exchanged and vanishes if two one-particle states are identical. This is the mathematical form of Pauli exclusion. For bosons, symmetrized products allow macroscopic occupation of one state, which underlies Bose—Einstein condensation and coherent fields.

Identical-particle physics is not a small correction. It controls atomic shells, chemical bonding, the Fermi sea in metals, blackbody radiation, phonons, superfluidity, and the structure of matter. The first check in many-body problems is whether the particles are distinguishable, bosons, or fermions.

Worked example: lithium ground state 1s22s11s^2\,2s^1. Pauli exclusion fixes the electron configuration of lithium. Two electrons fill the 1s1s shell with opposite spins, and the third must occupy the next available spin-orbital, 2s2s. Writing the spin-orbitals χ1=ϕ1sα\chi_1=\phi_{1s}\alpha, χ2=ϕ1sβ\chi_2=\phi_{1s}\beta, χ3=ϕ2sα\chi_3=\phi_{2s}\alpha, the totally antisymmetric three-electron ground state is the Slater determinant

Ψ(1,2,3)=13!ϕ1s(1)α(1)ϕ1s(1)β(1)ϕ2s(1)α(1)ϕ1s(2)α(2)ϕ1s(2)β(2)ϕ2s(2)α(2)ϕ1s(3)α(3)ϕ1s(3)β(3)ϕ2s(3)α(3).\Psi(1,2,3)=\frac{1}{\sqrt{3!}} \begin{vmatrix} \phi_{1s}(1)\alpha(1) & \phi_{1s}(1)\beta(1) & \phi_{2s}(1)\alpha(1)\\ \phi_{1s}(2)\alpha(2) & \phi_{1s}(2)\beta(2) & \phi_{2s}(2)\alpha(2)\\ \phi_{1s}(3)\alpha(3) & \phi_{1s}(3)\beta(3) & \phi_{2s}(3)\alpha(3) \end{vmatrix}.

A hypothetical 1s31s^3 configuration would place two columns with identical spin-orbitals (e.g.\ a second ϕ1sα\phi_{1s}\alpha); the determinant would then vanish identically. This is Pauli exclusion in action: it forces the third electron out of the KK-shell and into 2s2s, producing the alkali-metal chemistry of lithium.

Worked example: helium singlet vs.\ triplet. For two-electron helium, the antisymmetric total wave function factorises into spatial and spin parts. The ground state has both electrons in 1s1s: the spatial part is symmetric, so the spin part must be the antisymmetric singlet 12(αββα)\frac{1}{\sqrt2}(\alpha\beta-\beta\alpha), giving the configuration 1s21S01s^2\,{}^1S_0. Excited states like 1s12s11s^1 2s^1 admit two combinations: the symmetric spatial 12[ϕ1s(1)ϕ2s(2)+ϕ2s(1)ϕ1s(2)]\frac{1}{\sqrt2}[\phi_{1s}(1)\phi_{2s}(2)+\phi_{2s}(1)\phi_{1s}(2)] paired with the singlet (parahelium), and the antisymmetric spatial 12[ϕ1s(1)ϕ2s(2)ϕ2s(1)ϕ1s(2)]\frac{1}{\sqrt2}[\phi_{1s}(1)\phi_{2s}(2)-\phi_{2s}(1)\phi_{1s}(2)] paired with the symmetric triplet (orthohelium). The triplet lies lower in energy because the antisymmetric spatial part suppresses the probability of the two electrons being close, reducing Coulomb repulsion. This exchange splitting is a direct, measurable consequence of fermion antisymmetry.

Worked example

Two non-interacting electrons are in a 1D infinite square well of width L=1.0nmL = 1.0\,\text{nm}. The single-particle energies are En=n2E1E_n = n^2 E_1 with E1=π22/(2meL2)0.376eVE_1 = \pi^2\hbar^2/(2m_e L^2) \approx 0.376\,\text{eV}. In the ground state, both electrons occupy n=1n=1 with opposite spins (singlet). The symmetric spatial wave function is

ψ(x1,x2)=2Lsin(πx1L)sin(πx2L).\psi(x_1,x_2) = \frac{2}{L}\sin\left(\frac{\pi x_1}{L}\right)\sin\left(\frac{\pi x_2}{L}\right).

The first excited spatial configuration has one electron in n=1n=1 and one in n=2n=2. Because the electrons are fermions, the total wave function must be antisymmetric. For the spin triplet (symmetric spin), the spatial part must be antisymmetric:

ψA(x1,x2)=122L[sin(πx1L)sin(2πx2L)sin(2πx1L)sin(πx2L)].\psi_A(x_1,x_2) = \frac{1}{\sqrt{2}}\frac{2}{L}\left[\sin\left(\frac{\pi x_1}{L}\right)\sin\left(\frac{2\pi x_2}{L}\right) - \sin\left(\frac{2\pi x_1}{L}\right)\sin\left(\frac{\pi x_2}{L}\right)\right].

This vanishes at x1=x2x_1 = x_2, keeping the electrons apart and reducing Coulomb repulsion. The total energy is E1+E2=0.376+1.504=1.880eVE_1 + E_2 = 0.376 + 1.504 = 1.880\,\text{eV}.

Problems with Solutions

Problem 1. Write the Slater determinant for two electrons in spin-orbitals χ1=ϕ1sα\chi_1 = \phi_{1s}\alpha and χ2=ϕ1sβ\chi_2 = \phi_{1s}\beta. Solution.

Ψ(1,2)=12!ϕ1s(1)α(1)ϕ1s(1)β(1)ϕ1s(2)α(2)ϕ1s(2)β(2)=12ϕ1s(1)ϕ1s(2)[α(1)β(2)β(1)α(2)].\Psi(1,2) = \frac{1}{\sqrt{2!}}\begin{vmatrix}\phi_{1s}(1)\alpha(1) & \phi_{1s}(1)\beta(1) \\ \phi_{1s}(2)\alpha(2) & \phi_{1s}(2)\beta(2)\end{vmatrix} = \frac{1}{\sqrt{2}}\phi_{1s}(1)\phi_{1s}(2)[\alpha(1)\beta(2) - \beta(1)\alpha(2)].

This is the antisymmetric singlet state for two electrons in the same spatial orbital.

Problem 2. Three identical bosons can each occupy one of two single-particle states aa or bb. How many distinct symmetric many-body states exist?

Solution. For bosons, any number can occupy each state. The allowed occupation-number configurations are (Na,Nb)=(3,0),(2,1),(1,2),(0,3)(N_a, N_b) = (3,0), (2,1), (1,2), (0,3). Each corresponds to one distinct symmetric state, so there are 4 states. In contrast, three fermions could not be placed in only two spin-orbitals because of Pauli exclusion.

Problem 3. Two identical spin-00 bosons are in a 1D box. Write the symmetrized two-particle state if one is in ϕ1(x)=2/Lsin(πx/L)\phi_1(x) = \sqrt{2/L}\sin(\pi x/L) and the other in ϕ2(x)=2/Lsin(2πx/L)\phi_2(x) = \sqrt{2/L}\sin(2\pi x/L).

Solution. Since they are bosons, the total wave function must be symmetric under exchange:

ψS(x1,x2)=12[ϕ1(x1)ϕ2(x2)+ϕ2(x1)ϕ1(x2)]=1L[sin(πx1L)sin(2πx2L)+sin(2πx1L)sin(πx2L)].\psi_S(x_1,x_2) = \frac{1}{\sqrt{2}}[\phi_1(x_1)\phi_2(x_2) + \phi_2(x_1)\phi_1(x_2)] = \frac{1}{L}\left[\sin\left(\frac{\pi x_1}{L}\right)\sin\left(\frac{2\pi x_2}{L}\right) + \sin\left(\frac{2\pi x_1}{L}\right)\sin\left(\frac{\pi x_2}{L}\right)\right].

Section summary. Identical particles force quantum states to be symmetric or antisymmetric under exchange, producing bosons, fermions, and Pauli exclusion.

Part II: Applications

Core ideas

Few quantum systems are exactly solvable. For most realistic problems (multi-electron atoms, atoms in external fields, anharmonic vibrations) we expand around a solvable reference. The standard approach is Rayleigh—Schr”odinger perturbation theory: split the Hamiltonian into a solvable part plus a small correction, then expand energies and states in powers of a bookkeeping parameter λ\lambda.

Setup. Write H^(λ)=H^0+λH^\hat H(\lambda)=\hat H_0+\lambda\hat H', with H^0n(0)=En(0)n(0)\hat H_0|n^{(0)}\rangle=E_n^{(0)}|n^{(0)}\rangle the known eigenproblem. The unperturbed kets {n(0)}\{|n^{(0)}\rangle\} are orthonormal and complete. We seek series

En=En(0)+λEn(1)+λ2En(2)+,n=n(0)+λn(1)+λ2n(2)+,E_n=E_n^{(0)}+\lambda E_n^{(1)}+\lambda^2 E_n^{(2)}+\cdots,\qquad |n\rangle=|n^{(0)}\rangle+\lambda|n^{(1)}\rangle+\lambda^2|n^{(2)}\rangle+\cdots,

and set λ=1\lambda=1 at the end. Intermediate normalisation n(0)n=1\langle n^{(0)}|n\rangle=1 is conventional, so each n(k1)|n^{(k\ge1)}\rangle is orthogonal to n(0)|n^{(0)}\rangle.

Non-degenerate corrections

Substituting the series into H^n=Enn\hat H|n\rangle=E_n|n\rangle and matching powers of λ\lambda gives the standard formulas. Define the matrix element Hmnm(0)H^n(0)H'_{mn}\equiv\langle m^{(0)}|\hat H'|n^{(0)}\rangle.

  • First-order energy: En(1)=Hnn=n(0)H^n(0)E_n^{(1)}=H'_{nn}=\langle n^{(0)}|\hat H'|n^{(0)}\rangle, the expectation value of the perturbation in the unperturbed state.
  • First-order state:
n(1)=mnHmnEn(0)Em(0)m(0).|n^{(1)}\rangle=\sum_{m\neq n}\frac{H'_{mn}}{E_n^{(0)}-E_m^{(0)}}\,|m^{(0)}\rangle.
  • Second-order energy:
En(2)=mnHmn2En(0)Em(0).E_n^{(2)}=\sum_{m\neq n}\frac{|H'_{mn}|^2}{E_n^{(0)}-E_m^{(0)}}.
For the ground state every denominator is negative, so $E_0^{(2)}\le0$: second-order corrections always lower the ground-state energy.

When does the series converge?

The expansion is sensible only when each successive term is much smaller than the previous one. A practical criterion is that, for every state m(0)|m^{(0)}\rangle coupled to n(0)|n^{(0)}\rangle by H^\hat H',

HmnEn(0)Em(0).\bigl|H'_{mn}\bigr|\ll\bigl|E_n^{(0)}-E_m^{(0)}\bigr|.

Practical implications.

  • The relevant small parameter is the dimensionless ratio of a typical matrix element to the nearest energy gap, not just the strength of H^\hat H' itself.
  • Levels close in energy (small denominators) destroy convergence even for a tiny perturbation; this is precisely where degenerate perturbation theory is needed.
  • The series is generically asymptotic rather than convergent: in many physical problems (e.g. anharmonic oscillators, QED) terms first decrease, then grow. Truncating at the smallest term gives the best estimate.
  • Perturbation theory cannot describe states absent from H^0\hat H_0. Bound states produced non-perturbatively (tunnelling resonances, bound states in a new potential well) are missed at every finite order.

Degenerate perturbation theory

If En(0)E_n^{(0)} has a gg-fold degenerate subspace D={na(0)}a=1g\mathcal D=\{|n^{(0)}_a\rangle\}_{a=1}^g, the denominators inside D\mathcal D vanish. The fix is to choose the right basis inside D\mathcal D before applying the formulas above. Build the g×gg\times g matrix Wab=na(0)H^nb(0)W_{ab}=\langle n^{(0)}_a|\hat H'|n^{(0)}_b\rangle and diagonalise it; its eigenvalues are the first-order shifts and its eigenvectors are the “good” zeroth-order states that diagonalise H^\hat H' inside D\mathcal D. Couplings to states outside D\mathcal D then enter through the standard non-degenerate sums.

Worked example: linear Stark effect for n=2n=2 hydrogen. A hydrogen atom in a uniform electric field E=Ez^\bm{\mathcal E}=\mathcal E\hat{\bm z} sees the perturbation H^=eEz\hat H'=e\mathcal E\,z, where e>0e>0 is the elementary charge and zz is the electron coordinate along the field. The n=2n=2 shell is four-fold degenerate with basis

2s=200,2p0=210,2p+1=211,2p1=211.|2s\rangle=|200\rangle,\quad |2p_0\rangle=|210\rangle,\quad |2p_{+1}\rangle=|211\rangle,\quad |2p_{-1}\rangle=|21\,{-1}\rangle.

Selection rules from parity (zz is odd) and from [L^z,z]=0[\hat L_z,z]=0 (so Δm=0\Delta m=0) leave only one independent matrix element,

200z210=3a0,\langle 200|z|210\rangle=-3a_0,

with a0a_0 the Bohr radius. The perturbation matrix in the ordered basis (2s,2p0,2p+1,2p1)(|2s\rangle,|2p_0\rangle,|2p_{+1}\rangle,|2p_{-1}\rangle) is

W=eE(03a0003a000000000000).W=e\mathcal E \begin{pmatrix} 0 & -3a_0 & 0 & 0\\ -3a_0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 \end{pmatrix}.

Diagonalising the upper 2×22\times2 block gives eigenvalues ±3ea0E\pm 3ea_0\mathcal E with eigenvectors 12(2s2p0)\frac{1}{\sqrt2}(|2s\rangle\mp|2p_0\rangle); the m=±1m=\pm1 states are unshifted to first order. The fourfold level therefore splits into three: two shifted by ±3ea0E\pm 3ea_0\mathcal E and one doubly degenerate level at the original energy. The shift is linear in E\mathcal E, a signature of the accidental \ell-degeneracy of the Coulomb spectrum.

Mathematical spine

H^=H^0+λH^,Hmn=m(0)H^n(0)En(1)=Hnn,En(2)=mnHmn2En(0)Em(0)n(1)=mnHmnEn(0)Em(0)m(0),Wab=na(0)H^nb(0) (degenerate block)\begin{aligned} \hat H &= \hat H_0+\lambda\hat H', & H'_{mn}&=\langle m^{(0)}|\hat H'|n^{(0)}\rangle\\ E_n^{(1)} &= H'_{nn}, & E_n^{(2)}&=\sum_{m\neq n}\frac{|H'_{mn}|^2}{E_n^{(0)}-E_m^{(0)}}\\ |n^{(1)}\rangle &= \sum_{m\neq n}\frac{H'_{mn}}{E_n^{(0)}-E_m^{(0)}}\,|m^{(0)}\rangle, & W_{ab}&=\langle n^{(0)}_a|\hat H'|n^{(0)}_b\rangle\ (\text{degenerate block}) \end{aligned}

Example: the Zeeman effect. For a weak external field B\bm B the perturbation is H^=μB\hat H'=-\bm\mu\cdot\bm B, with μ\bm\mu the magnetic moment. Choosing B=Bz^\bm B=B\hat{\bm z} makes H^\hat H' diagonal in the standard nm|n\ell m\rangle basis (already a “good” basis), so first-order theory predicts level splittings linear in the magnetic quantum number mm, in agreement with experiment for fields where spin—orbit coupling can be ignored.

Worked example

A 1D anharmonic oscillator has Hamiltonian H^=H^0+H^\hat H = \hat H_0 + \hat H' with H^0=p^22m+12mω2x^2\hat H_0 = \frac{\hat p^2}{2m} + \frac{1}{2}m\omega^2\hat x^2 and H^=λx^4\hat H' = \lambda \hat x^4, where λ=0.01eV/nm4\lambda = 0.01\,\text{eV/nm}^4, m=1.0×1030kgm = 1.0\times10^{-30}\,\text{kg}, and ω=0.1eV\hbar\omega = 0.1\,\text{eV}. Estimate the first-order energy shift of the ground state.

Using 0x^40=34(mω)2\langle 0|\hat x^4|0\rangle = \frac{3}{4}\left(\frac{\hbar}{m\omega}\right)^2, we first compute the characteristic length a=/(mω)a = \sqrt{\hbar/(m\omega)}. With ω=0.1eV\hbar\omega = 0.1\,\text{eV} and m=1030kgm = 10^{-30}\,\text{kg}:

ω=0.1×1.602×10191.055×10341.52×1014rad/s,\omega = \frac{0.1\times1.602\times10^{-19}}{1.055\times10^{-34}} \approx 1.52\times10^{14}\,\text{rad/s}, a=1.055×1034(1030)(1.52×1014)8.3×1010m=0.83nm.a = \sqrt{\frac{1.055\times10^{-34}}{(10^{-30})(1.52\times10^{14})}} \approx 8.3\times10^{-10}\,\text{m} = 0.83\,\text{nm}.

Then

E0(1)=λ0x^40=0.01eV/nm4×34(0.83nm)40.01×34(0.474)3.6×103eV.E_0^{(1)} = \lambda\langle 0|\hat x^4|0\rangle = 0.01\,\text{eV/nm}^4 \times \frac{3}{4}(0.83\,\text{nm})^4 \approx 0.01 \times \frac{3}{4}(0.474) \approx 3.6\times10^{-3}\,\text{eV}.

The unperturbed ground-state energy is E0(0)=12ω=0.05eVE_0^{(0)} = \frac{1}{2}\hbar\omega = 0.05\,\text{eV}, so the relative shift is 7%\approx 7\%.

Problems with Solutions

Problem 1. A particle in an infinite square well (0<x<a0<x<a) is subject to perturbation H^=V0\hat H' = V_0 for a/3<x<2a/3a/3 < x < 2a/3 and zero elsewhere. Find the first-order energy shift of the nnth level.

Solution. The unperturbed states are ψn(0)(x)=2/asin(nπx/a)\psi_n^{(0)}(x) = \sqrt{2/a}\sin(n\pi x/a). The first-order shift is

En(1)=n(0)H^n(0)=V0a/32a/32asin2(nπxa)dx.E_n^{(1)} = \langle n^{(0)}|\hat H'|n^{(0)}\rangle = V_0\int_{a/3}^{2a/3}\frac{2}{a}\sin^2\left(\frac{n\pi x}{a}\right)dx.

Using sin2θ=12(1cos2θ)\sin^2\theta = \frac{1}{2}(1-\cos 2\theta):

En(1)=2V0a[x2a4nπsin(2nπxa)]a/32a/3=V0[1312nπ(sin4nπ3sin2nπ3)].E_n^{(1)} = \frac{2V_0}{a}\left[\frac{x}{2} - \frac{a}{4n\pi}\sin\left(\frac{2n\pi x}{a}\right)\right]_{a/3}^{2a/3} = V_0\left[\frac{1}{3} - \frac{1}{2n\pi}\left(\sin\frac{4n\pi}{3} - \sin\frac{2n\pi}{3}\right)\right].

Problem 2. For a two-level system with unperturbed energies E1(0)=1eVE_1^{(0)} = 1\,\text{eV} and E2(0)=3eVE_2^{(0)} = 3\,\text{eV}, the perturbation has matrix elements H11=0.1eVH'_{11} = 0.1\,\text{eV}, H22=0.1eVH'_{22} = -0.1\,\text{eV}, and H12=H21=0.2eVH'_{12} = H'_{21} = 0.2\,\text{eV}. Calculate the second-order correction to E1(0)E_1^{(0)}. Solution.

E1(2)=H212E1(0)E2(0)=(0.2eV)213=0.042eV=0.02eV.E_1^{(2)} = \frac{|H'_{21}|^2}{E_1^{(0)} - E_2^{(0)}} = \frac{(0.2\,\text{eV})^2}{1 - 3} = \frac{0.04}{-2}\,\text{eV} = -0.02\,\text{eV}.

The second-order correction lowers the ground-state energy, as required by the variational principle.

Problem 3. A two-fold degenerate level has degenerate subspace spanned by 1|1\rangle and 2|2\rangle. The perturbation matrix in this subspace is W=(2112)eVW = \begin{pmatrix}2 & 1 \\ 1 & 2\end{pmatrix}\,\text{eV}. Find the first-order energy shifts and the “good” zeroth-order states.

Solution. Diagonalize WW. The characteristic equation is (2λ)21=0(2-\lambda)^2 - 1 = 0, giving λ=3\lambda = 3 or λ=1\lambda = 1. The first-order shifts are E(1)=3eVE^{(1)} = 3\,\text{eV} and 1eV1\,\text{eV}. The eigenvectors are +=12(1+2)|+\rangle = \frac{1}{\sqrt{2}}(|1\rangle + |2\rangle) for λ=3\lambda=3 and =12(12)|-\rangle = \frac{1}{\sqrt{2}}(|1\rangle - |2\rangle) for λ=1\lambda=1. These are the good zeroth-order states.

Section summary. Perturbation theory expresses corrections as power series in a small parameter, with first- and second-order energies built from the matrix elements HmnH'_{mn} divided by unperturbed gaps. Convergence requires every Hmn|H'_{mn}| to be much smaller than the corresponding gap En(0)Em(0)|E_n^{(0)}-E_m^{(0)}|; when that fails because of degeneracy, diagonalising H^\hat H' inside the degenerate subspace, as in the linear Stark effect of n=2n=2 hydrogen, restores predictive power.

Time-Dependent Perturbation Theory

Core ideas

Time-dependent perturbation theory computes transition probabilities caused by a weak time-dependent interaction:

H^(t)=H^0+V^(t).\hat H(t)=\hat H_0+\hat V(t).

Write the state as a superposition of unperturbed energy eigenstates. The perturbation changes the coefficients.

To first order, the transition amplitude from i|i\rangle to f|f\rangle is proportional to

0teiωfitVfi(t)dt,ωfi=EfEi.\int_0^t e^{i\omega_{fi}t'}V_{fi}(t')\,dt', \qquad \omega_{fi}=\frac{E_f-E_i}{\hbar}.

If the perturbation oscillates at a frequency close to ωfi\omega_{fi}, transitions are enhanced. This gives selection rules and resonance behavior in light-matter interaction.

For long times and dense final states, the transition rate becomes Fermi’s golden rule:

Γif=2πVfi2ρ(Ef).\Gamma_{i\to f}=\frac{2\pi}{\hbar}|V_{fi}|^2\rho(E_f).

It is one of the most important formulas in atomic, nuclear, and condensed-matter physics.

Mathematical spine

Pif(t)120teiωfitVfi(t)dt2,Γif=2πVfi2ρ(Ef).P_{i\to f}(t)\approx \frac{1}{\hbar^2}\left|\int_0^t e^{i\omega_{fi}t'}V_{fi}(t')\,dt'\right|^2,\qquad \Gamma_{i\to f}=\frac{2\pi}{\hbar}|V_{fi}|^2\rho(E_f).

Working details

The perturbation drives transitions most efficiently when its frequency matches an energy difference. This is the origin of absorption and emission lines. If V^(t)\hat V(t) comes from an electromagnetic wave, the matrix element contains both the field strength and a transition dipole or multipole matrix element.

Selection rules arise when a matrix element vanishes by symmetry. For electric dipole transitions in atoms, common rules are Δ=±1\Delta \ell=\pm1 and Δm=0,±1\Delta m=0,\pm1, with details depending on polarization. A forbidden transition is often not absolutely impossible; it may be allowed by a weaker magnetic dipole, electric quadrupole, or symmetry-breaking interaction.

Fermi’s golden rule assumes weak coupling, long times compared with microscopic oscillations, and a continuum or dense set of final states. It is a rate formula, not a formula for coherent two-level oscillations. Strong resonant driving instead leads to Rabi oscillations.

Worked example: Rabi oscillations of a resonantly driven two-level atom. Consider a two-level atom with ground state g|g\rangle and excited state e|e\rangle of energies EgE_g and EeE_e, so ω0=(EeEg)/\omega_0=(E_e-E_g)/\hbar. Place it in a classical monochromatic field E(t)=E0ϵ^cosωt\bm E(t)=E_0\hat{\bm\epsilon}\cos\omega t, giving the dipole interaction V^(t)=d^E(t)\hat V(t)=-\hat{\bm d}\cdot\bm E(t). Writing ψ(t)=cg(t)eiEgt/g+ce(t)eiEet/e|\psi(t)\rangle=c_g(t)e^{-iE_g t/\hbar}|g\rangle+c_e(t)e^{-iE_e t/\hbar}|e\rangle, the Schr”odinger equation gives coupled equations for (cg,ce)(c_g,c_e). Defining the Rabi frequency

Ω=degE0,deg=ed^ϵ^g,\Omega=\frac{d_{eg}E_0}{\hbar},\qquad d_{eg}=\langle e|\hat{\bm d}\cdot\hat{\bm\epsilon}|g\rangle,

and dropping rapidly oscillating e±i(ω+ω0)t\sim e^{\pm i(\omega+\omega_0)t} terms (rotating-wave approximation) leaves only slow terms e±i(ωω0)t\sim e^{\pm i(\omega-\omega_0)t}. On exact resonance ω=ω0\omega=\omega_0 and with cg(0)=1c_g(0)=1, ce(0)=0c_e(0)=0, the solution is

cg(t)=cos ⁣Ωt2,ce(t)=isin ⁣Ωt2,c_g(t)=\cos\!\frac{\Omega t}{2},\qquad c_e(t)=-i\sin\!\frac{\Omega t}{2},

so the excited-state population oscillates coherently as

Pe(t)=ce(t)2=sin2 ⁣Ωt2.P_e(t)=|c_e(t)|^2=\sin^2\!\frac{\Omega t}{2}.

The population swaps fully between g|g\rangle and e|e\rangle with period 2π/Ω2\pi/\Omega; a pulse of area Ωt=π\Omega t=\pi (a “π\pi-pulse”) inverts the atom, while Ωt=π/2\Omega t=\pi/2 prepares an equal superposition. Off resonance, the generalised Rabi frequency Ω~=Ω2+(ωω0)2\tilde\Omega=\sqrt{\Omega^2+(\omega-\omega_0)^2} replaces Ω\Omega and the maximum excitation drops to Ω2/Ω~2<1\Omega^2/\tilde\Omega^2<1. This non-perturbative result complements first-order perturbation theory: short times or weak driving (Ωt1\Omega t\ll1) recover Pe(Ωt/2)2P_e\approx(\Omega t/2)^2, the linear-response limit.

Worked example

A hydrogen atom initially in the ground state (1s|1s\rangle, E1=13.6eVE_1 = -13.6\,\text{eV}) is exposed to a time-dependent electric field E(t)=E0et2/τ2\mathcal{E}(t) = \mathcal{E}_0 e^{-t^2/\tau^2} polarized along zz, with E0=105V/m\mathcal{E}_0 = 10^5\,\text{V/m} and τ=1.0×1014s\tau = 1.0\times10^{-14}\,\text{s}. Estimate the transition probability to the 2pz|2p_z\rangle state (E2=3.4eVE_2 = -3.4\,\text{eV}) to first order.

The perturbation is V^(t)=eE(t)z\hat V(t) = e\mathcal{E}(t) z. The transition frequency is

ω21=E2E1=3.4(13.6)eV6.582×1016eV\@cdots1.55×1016rad/s.\omega_{21} = \frac{E_2-E_1}{\hbar} = \frac{-3.4-(-13.6)\,\text{eV}}{6.582\times10^{-16}\,\text{eV\cdots}} \approx 1.55\times10^{16}\,\text{rad/s}.

The matrix element 2pzz1s=51222433a00.74a0\langle 2p_z|z|1s\rangle = \frac{512\sqrt{2}}{243\sqrt{3}}a_0 \approx 0.74\,a_0 with a0=0.529×1010ma_0 = 0.529\times10^{-10}\,\text{m}, so

V21(t)=eE0(0.74a0)et2/τ2(1.602×1019)(105)(0.74×0.529×1010)et2/τ26.3×1025et2/τ2J.V_{21}(t) = e\mathcal{E}_0(0.74\,a_0)e^{-t^2/\tau^2} \approx (1.602\times10^{-19})(10^5)(0.74\times0.529\times10^{-10})e^{-t^2/\tau^2} \approx 6.3\times10^{-25}\,e^{-t^2/\tau^2}\,\text{J}.

The first-order amplitude is

c2(1)()=ieiω21tV21(t)dt=i(6.3×1025)πτeω212τ2/4.c_2^{(1)}(\infty) = -\frac{i}{\hbar}\int_{-\infty}^{\infty}e^{i\omega_{21}t}V_{21}(t)dt = -\frac{i}{\hbar}(6.3\times10^{-25})\sqrt{\pi}\tau e^{-\omega_{21}^2\tau^2/4}.

With ω21τ1551\omega_{21}\tau \approx 155 \gg 1, the Gaussian factor is extremely small (e6000\sim e^{-6000}), so the transition probability is essentially zero. This illustrates the adiabatic limit: when the perturbation varies slowly compared with the transition period, no transitions occur.

Problems with Solutions

Problem 1. A two-level system with H^0=Eggg+Eeee\hat H_0 = E_g|g\rangle\langle g| + E_e|e\rangle\langle e| is driven by a constant perturbation V^=V0(ge+eg)\hat V = V_0(|g\rangle\langle e| + |e\rangle\langle g|) turned on at t=0t=0. If the system starts in g|g\rangle, find the transition probability to e|e\rangle at time tt to first order.

Solution. To first order:

ce(1)(t)=i0teiωegtVeg(t)dt=iV00teiωegtdt=iV0[eiωegt1iωeg]=V0ωeg(1eiωegt).c_e^{(1)}(t) = -\frac{i}{\hbar}\int_0^t e^{i\omega_{eg}t'}V_{eg}(t')dt' = -\frac{iV_0}{\hbar}\int_0^t e^{i\omega_{eg}t'}dt' = -\frac{iV_0}{\hbar}\left[\frac{e^{i\omega_{eg}t}-1}{i\omega_{eg}}\right] = \frac{V_0}{\hbar\omega_{eg}}(1-e^{i\omega_{eg}t}).

The transition probability is

Pge(t)=ce(1)(t)2=4V022ωeg2sin2(ωegt2).P_{g\to e}(t) = |c_e^{(1)}(t)|^2 = \frac{4V_0^2}{\hbar^2\omega_{eg}^2}\sin^2\left(\frac{\omega_{eg}t}{2}\right).

Problem 2. In a scattering experiment, the initial state i|i\rangle couples to a continuum of final states f|f\rangle with constant matrix element Vfi=0.01eVV_{fi} = 0.01\,\text{eV}. The density of final states at the relevant energy is ρ(Ef)=2.0eV1\rho(E_f) = 2.0\,\text{eV}^{-1}. Calculate the transition rate using Fermi’s golden rule.

Solution. Fermi’s golden rule gives

Γif=2πVfi2ρ(Ef)=2π6.582×1016eV\@cdots(0.01eV)2(2.0eV1)1.91×1012s1.\Gamma_{i\to f} = \frac{2\pi}{\hbar}|V_{fi}|^2\rho(E_f) = \frac{2\pi}{6.582\times10^{-16}\,\text{eV\cdots}}(0.01\,\text{eV})^2(2.0\,\text{eV}^{-1}) \approx 1.91\times10^{12}\,\text{s}^{-1}.

The lifetime of the initial state is τ=1/Γ5.2×1013s\tau = 1/\Gamma \approx 5.2\times10^{-13}\,\text{s}.

Problem 3. A spin-1/21/2 particle in a magnetic field B=B0z^\bm B = B_0\hat{\bm z} has Hamiltonian H^0=γB0S^z\hat H_0 = -\gamma B_0 \hat S_z with γ\gamma the gyromagnetic ratio. A weak transverse field B1B_1 is applied along xx and oscillates as B1(t)=B1cosωtB_1(t) = B_1\cos\omega t. Write the perturbation and identify the resonance condition.

Solution. The perturbation is

V^(t)=γB1(t)S^x=γB1cos(ωt)2σx=γB12cos(ωt)(0110).\hat V(t) = -\gamma B_1(t)\hat S_x = -\gamma B_1\cos(\omega t)\frac{\hbar}{2}\sigma_x = -\frac{\gamma\hbar B_1}{2}\cos(\omega t)\begin{pmatrix}0 & 1 \\ 1 & 0\end{pmatrix}.

In the basis {+z,z}\{|+\rangle_z, |-\rangle_z\}, this couples the two spin states. The energy splitting is ΔE=γB0\Delta E = \gamma\hbar B_0, so the resonance frequency is ω0=ΔE/=γB0\omega_0 = \Delta E/\hbar = \gamma B_0. When ω=ω0\omega = \omega_0, the driving field is on resonance and induces Rabi oscillations between the spin states with Rabi frequency Ω=γB1\Omega = \gamma B_1.

Section summary. Time-dependent perturbation theory computes transition probabilities and rates caused by weak time-dependent interactions.

The Variational Principle

Core ideas

The variational principle provides a powerful way to estimate the ground-state energy E0E_0 of a system when an exact solution is impossible.

The Variational Theorem. For any normalized trial wave function ψ|\psi\rangle, the expectation value of the Hamiltonian is always greater than or equal to the true ground-state energy:

E0ψH^ψ.E_0 \leq \langle \psi | \hat H | \psi \rangle.

For an unnormalized trial state ψα\psi_\alpha depending on a parameter α\alpha, we minimize the Rayleigh quotient:

ϵ(α)=ψαH^ψαψαψα.\epsilon(\alpha) = \frac{\langle \psi_\alpha | \hat H | \psi_\alpha \rangle}{\langle \psi_\alpha | \psi_\alpha \rangle}.

The best estimate for E0E_0 is the minimum value of ϵ(α)\epsilon(\alpha).

Success of the Method. The method is highly successful because even a rough guess for the wave function often yields a very accurate upper bound for the energy. This is because the error in energy is of second order in the error of the wave function.

Mathematical spine

E0ψH^ψψψ(Variational Bound)dϵdα=0(Minimization Condition)\begin{aligned} E_0 &\leq \frac{\langle \psi | \hat H | \psi \rangle}{\langle \psi | \psi \rangle} & \text{(Variational Bound)} \\ \frac{d\epsilon}{d\alpha} &= 0 & \text{(Minimization Condition)} \end{aligned}

Example: Ground State of Helium. The Helium atom has two electrons and a nucleus with charge Z=2Z=2. The Hamiltonian includes an electron-electron repulsion term that makes it unsolvable. A simple trial wave function is a product of two hydrogenic states with an effective charge ZeffZ_{\rm eff} as the variational parameter. Minimizing the energy yields Zeff=25/16=1.6875Z_{\rm eff} = 2 - 5/16 = 1.6875, showing how one electron “screens” the nucleus for the other.

Worked example

Estimate the ground-state energy of a 1D harmonic oscillator V(x)=12mω2x2V(x) = \frac{1}{2}m\omega^2 x^2 using the trial function ψα(x)=(απ)1/4eαx2/2\psi_\alpha(x) = \left(\frac{\alpha}{\pi}\right)^{1/4}e^{-\alpha x^2/2}, where α>0\alpha > 0 is a variational parameter. The exact answer is E0=12ωE_0 = \frac{1}{2}\hbar\omega.

First compute the normalization: ψαψα=1\langle\psi_\alpha|\psi_\alpha\rangle = 1 by construction. The kinetic energy expectation is

T=22mψαd2ψαdx2dx=2α4m.\langle T \rangle = -\frac{\hbar^2}{2m}\int_{-\infty}^{\infty}\psi_\alpha\frac{d^2\psi_\alpha}{dx^2}dx = \frac{\hbar^2\alpha}{4m}.

The potential energy expectation is

V=12mω2x2ψα2dx=mω24α.\langle V \rangle = \frac{1}{2}m\omega^2\int_{-\infty}^{\infty}x^2|\psi_\alpha|^2dx = \frac{m\omega^2}{4\alpha}.

The Rayleigh quotient is

ϵ(α)=2α4m+mω24α.\epsilon(\alpha) = \frac{\hbar^2\alpha}{4m} + \frac{m\omega^2}{4\alpha}.

Minimizing: dϵdα=24mmω24α2=0α=mω\frac{d\epsilon}{d\alpha} = \frac{\hbar^2}{4m} - \frac{m\omega^2}{4\alpha^2} = 0 \Rightarrow \alpha = \frac{m\omega}{\hbar}.

Substituting back:

ϵmin=24mmω+mω24mω=ω4+ω4=12ω.\epsilon_{\min} = \frac{\hbar^2}{4m}\frac{m\omega}{\hbar} + \frac{m\omega^2}{4}\frac{\hbar}{m\omega} = \frac{\hbar\omega}{4} + \frac{\hbar\omega}{4} = \frac{1}{2}\hbar\omega.

The variational estimate equals the exact ground-state energy because the trial function has the same Gaussian form as the true ground state.

Problems with Solutions

Problem 1. Use the variational principle with trial function ψ(x)=N(a2x2)\psi(x) = N(a^2 - x^2) for x<a|x|<a and zero otherwise to estimate the ground-state energy of an infinite square well of width LL (L/2<x<L/2-L/2<x<L/2). Use aa as the variational parameter with aL/2a \leq L/2.

Solution. Normalize:

1=N2aa(a2x2)2dx=N216a515N=1516a5.1 = N^2\int_{-a}^{a}(a^2-x^2)^2dx = N^2\frac{16a^5}{15} \Rightarrow N = \sqrt{\frac{15}{16a^5}}.

Kinetic energy:

T=22maa(dψdx)2dx=22mN2aa(2x)2dx=22m1516a58a33=524ma2.\langle T \rangle = \frac{\hbar^2}{2m}\int_{-a}^{a}\left(\frac{d\psi}{dx}\right)^2dx = \frac{\hbar^2}{2m}N^2\int_{-a}^{a}(2x)^2dx = \frac{\hbar^2}{2m}\frac{15}{16a^5}\frac{8a^3}{3} = \frac{5\hbar^2}{4ma^2}.

Since ψ=0\psi=0 outside x<a|x|<a, the potential is zero in the support. Minimize ϵ(a)=52/(4ma2)\epsilon(a) = 5\hbar^2/(4ma^2) subject to aL/2a \leq L/2. The minimum occurs at a=L/2a = L/2, giving

ϵ=52mL2=10π2π222mL21.013E1exact.\epsilon = \frac{5\hbar^2}{mL^2} = \frac{10}{\pi^2}\frac{\pi^2\hbar^2}{2mL^2} \approx 1.013\,E_1^{\text{exact}}.

This is only 1.3%1.3\% above the exact ground-state energy.

Problem 2. For a particle in a potential V(x)=λx4V(x) = \lambda x^4, estimate the ground-state energy using a Gaussian trial function ψα(x)=(α/π)1/4eαx2/2\psi_\alpha(x) = (\alpha/\pi)^{1/4}e^{-\alpha x^2/2}.

Solution. With x4=3/(4α2)\langle x^4 \rangle = 3/(4\alpha^2) for a Gaussian:

ϵ(α)=2α4m+3λ4α2.\epsilon(\alpha) = \frac{\hbar^2\alpha}{4m} + \frac{3\lambda}{4\alpha^2}.

Minimizing: dϵdα=24m3λ2α3=0α=(6mλ2)1/3\frac{d\epsilon}{d\alpha} = \frac{\hbar^2}{4m} - \frac{3\lambda}{2\alpha^3} = 0 \Rightarrow \alpha = \left(\frac{6m\lambda}{\hbar^2}\right)^{1/3}. Substituting:

ϵmin=24m(6mλ2)1/3+3λ4(26mλ)2/3=38(64λm2)1/30.68(4λm2)1/3.\epsilon_{\min} = \frac{\hbar^2}{4m}\left(\frac{6m\lambda}{\hbar^2}\right)^{1/3} + \frac{3\lambda}{4}\left(\frac{\hbar^2}{6m\lambda}\right)^{2/3} = \frac{3}{8}\left(\frac{6\hbar^4\lambda}{m^2}\right)^{1/3} \approx 0.68\left(\frac{\hbar^4\lambda}{m^2}\right)^{1/3}.

The exact numerical coefficient is 0.668\approx 0.668, so the variational estimate is about 2%2\% high.

Problem 3. Explain why the variational principle always gives an upper bound to the true ground-state energy.

Solution. Expand the trial state in the exact energy eigenbasis: ψ=ncnn|\psi\rangle = \sum_n c_n|n\rangle with ncn2=1\sum_n|c_n|^2 = 1. Then

ψH^ψ=ncn2EnE0ncn2=E0,\langle\psi|\hat H|\psi\rangle = \sum_n |c_n|^2 E_n \geq E_0\sum_n|c_n|^2 = E_0,

since EnE0E_n \geq E_0 for all nn. Equality holds only if ψ=0|\psi\rangle = |0\rangle, the exact ground state.

Section summary. The variational principle allows for estimating ground-state energies by optimizing trial wave functions, always providing a guaranteed upper bound.

The WKB Approximation

Core ideas

The WKB approximation is a semiclassical method for one-dimensional problems where the potential changes slowly compared with the de Broglie wavelength. Define the local classical momentum

p(x)=2m(EV(x)).p(x)=\sqrt{2m(E-V(x))}.

In the classically allowed region E>V(x)E>V(x), the wave function is approximately oscillatory:

ψ(x)Cp(x)exp(±ixp(x)dx).\psi(x)\approx \frac{C}{\sqrt{p(x)}}\exp\left(\pm\frac{i}{\hbar}\int^x p(x')\,dx'\right).

In the forbidden region E<V(x)E<V(x), the momentum is imaginary and the wave function grows or decays exponentially. This gives a simple estimate of tunneling probabilities:

Texp[2x1x2p(x)dx].T\sim \exp\left[-\frac{2}{\hbar}\int_{x_1}^{x_2}|p(x)|\,dx\right].

At turning points E=V(x)E=V(x), the naive WKB form fails and connection formulas are needed. For bound motion between two turning points, WKB gives the quantization rule

x1x2p(x)dx=(n+12)π.\int_{x_1}^{x_2}p(x)\,dx=\left(n+\frac12\right)\pi\hbar.

Mathematical spine

ψ(x)Cp(x)exp(ixp(x)dx),p(x)=2m(EV(x)),\psi(x)\approx \frac{C}{\sqrt{p(x)}}\exp\left(\frac{i}{\hbar}\int^x p(x')\,dx'\right),\qquad p(x)=\sqrt{2m(E-V(x))}, x1x2p(x)dx=(n+12)π.\int_{x_1}^{x_2}p(x)\,dx=\left(n+\frac12\right)\pi\hbar.

Working details

WKB works when the local wavelength changes slowly:

dλdBdx1,λdB=hp(x).\left|\frac{d\lambda_{\rm dB}}{dx}\right|\ll 1,\qquad \lambda_{\rm dB}=\frac{h}{p(x)}.

It fails near turning points because p(x)0p(x)\to0 and the amplitude factor diverges. Connection formulas repair the solution by matching to Airy-function behavior near the turning point.

The quantization rule is a semiclassical action condition. It says that the phase accumulated over a classical round trip must be compatible with a standing wave, with a correction from the turning points. It reproduces the exact large-nn behavior of many bound systems and gives good intuition for why classical action appears in quantum mechanics.

For tunneling, the exponential factor is usually more important than the prefactor. This makes WKB useful for alpha decay, field emission, scanning tunneling microscopy, and barrier penetration in semiconductor devices. The approximation should be checked by asking whether the barrier varies slowly on the de Broglie-wavelength scale.

Worked example

WKB quantization of a particle in a linear potential well. Consider a particle of mass mm in a triangular well defined by V(x)=FxV(x)=Fx for x>0x>0 and V(x)=V(x)=\infty for x<0x<0, where F>0F>0 is a constant force. We want to estimate the energy levels using the WKB quantization rule.

The turning points are x1=0x_1=0 (hard wall) and x2=E/Fx_2=E/F (where E=V(x)E=V(x)). The WKB quantization condition with one hard wall and one soft turning point is

0En/F2m(EnFx)dx=(n14)π,n=1,2,\int_0^{E_n/F} \sqrt{2m(E_n - Fx)}\,dx = \left(n - \frac{1}{4}\right)\pi\hbar, \quad n=1,2,\dots

Evaluating the integral:

0E/F2m(EFx)dx=22m3FE3/2.\int_0^{E/F} \sqrt{2m(E - Fx)}\,dx = \frac{2\sqrt{2m}}{3F} E^{3/2}.

Setting this equal to (n1/4)π(n-1/4)\pi\hbar and solving for EnE_n gives

En=[3πF22m(n14)]2/3.E_n = \left[\frac{3\pi\hbar F}{2\sqrt{2m}}\left(n-\frac{1}{4}\right)\right]^{2/3}.

For an electron (m=9.11×1031kgm=9.11\times10^{-31}\,\text{kg}) in a field corresponding to F=1.0×109NF=1.0\times10^{-9}\,\text{N} (a very weak force), the ground-state energy (n=1n=1) is

E1=[3π(1.055×1034Js)(1.0×109N)22(9.11×1031kg)(0.75)]2/31.8×1022J1.1×103eV.E_1 = \left[\frac{3\pi(1.055\times10^{-34}\,\text{J\,s})(1.0\times10^{-9}\,\text{N})}{2\sqrt{2(9.11\times10^{-31}\,\text{kg})}}(0.75)\right]^{2/3} \approx 1.8\times10^{-22}\,\text{J} \approx 1.1\times10^{-3}\,\text{eV}.

This demonstrates how WKB converts a classical action integral into quantized energy levels without solving the differential equation exactly.

Problems with Solutions

Problem 1. A particle of mass mm tunnels through a rectangular barrier of height V0>EV_0>E and width aa. Use the WKB approximation to estimate the transmission coefficient TT.

Solution. In the forbidden region 0<x<a0<x<a, the local momentum is imaginary: p(x)=i2m(V0E)p(x)=i\sqrt{2m(V_0-E)}. The WKB tunneling factor is

Texp ⁣[20a2m(V0E)dx]=exp ⁣[2a2m(V0E)].T \sim \exp\!\left[-\frac{2}{\hbar}\int_0^a \sqrt{2m(V_0-E)}\,dx\right] = \exp\!\left[-\frac{2a}{\hbar}\sqrt{2m(V_0-E)}\right].

For an electron (m=9.11×1031kgm=9.11\times10^{-31}\,\text{kg}) with E=1eVE=1\,\text{eV} tunneling through a barrier V0=2eVV_0=2\,\text{eV} of width a=1nma=1\,\text{nm}:

2m(V0E)=2(9.11×1031)(1.6×1019)5.40×1025kgm/s,\sqrt{2m(V_0-E)} = \sqrt{2(9.11\times10^{-31})(1.6\times10^{-19})} \approx 5.40\times10^{-25}\,\text{kg\,m/s}, 2a2m(V0E)=2(1.0×109)(5.40×1025)1.055×103410.2,\frac{2a}{\hbar}\sqrt{2m(V_0-E)} = \frac{2(1.0\times10^{-9})(5.40\times10^{-25})}{1.055\times10^{-34}} \approx 10.2,

so Te10.23.7×105T\sim e^{-10.2}\approx 3.7\times10^{-5}. The transmission is exponentially sensitive to both barrier width and height.

Problem 2. Use the WKB quantization rule to estimate the large-nn energy levels of the infinite square well V(x)=0V(x)=0 for 0<x<a0<x<a, \infty otherwise.

Solution. The WKB rule for two hard walls is 0ap(x)dx=nπ\int_0^a p(x)\,dx = n\pi\hbar with p=2mEp=\sqrt{2mE}. Thus

2mEna=nπEn=n2π222ma2.\sqrt{2mE_n}\,a = n\pi\hbar \quad\Longrightarrow\quad E_n = \frac{n^2\pi^2\hbar^2}{2ma^2}.

Remarkably, this matches the exact result for all nn because the WKB wave function happens to be exact for a constant zero potential between hard walls.

Problem 3. A harmonic oscillator has V(x)=12mω2x2V(x)=\frac{1}{2}m\omega^2 x^2. Apply the WKB quantization rule and compare with the exact answer.

Solution. The classical turning points are x=±xnx=\pm x_n with xn=2En/(mω2)x_n=\sqrt{2E_n/(m\omega^2)}. The action integral is

xnxn2m(En12mω2x2)dx=πEnω.\int_{-x_n}^{x_n}\sqrt{2m\left(E_n-\frac{1}{2}m\omega^2 x^2\right)}\,dx = \frac{\pi E_n}{\omega}.

Setting this equal to (n+1/2)π(n+1/2)\pi\hbar gives

En=(n+12)ω,E_n = \left(n+\frac{1}{2}\right)\hbar\omega,

which is exactly the known spectrum of the quantum harmonic oscillator. The WKB approximation is exact here because the turning-point corrections sum to the Maslov index 1/21/2.

Section summary. The WKB approximation connects quantum waves to classical action when the potential varies slowly.