Skip to content
Miko-pedia Miko-pedia
← Back to map
Mechanics · AI note · undergrad · #mechanics

Complete Classical Mechanics

A concise undergraduate guide to Newtonian mechanics, Lagrangian and Hamiltonian methods, central forces, rigid bodies, oscillations, relativity, chaos, perturbation theory, and continuous systems.

· 39 min
View PDF

Survey of the Elementary Principles

Core ideas

Classical mechanics describes the motion of macroscopic bodies when quantum and relativistic effects are small. A particle is an idealized object with mass mm and position r(t)\bm r(t) in a three-dimensional Euclidean space. Its motion is described by velocity v=r˙\bm v = \dot{\bm r} and acceleration a=r¨\bm a = \ddot{\bm r}.

Newton’s Laws and Inertial Frames. Newton’s laws are valid only in inertial frames---frames where a body remains at rest or in uniform motion unless acted upon by a force.

  • First Law (Inertia): In an inertial frame, a particle moves with constant velocity if the net force is zero.
  • Second Law (Dynamics): The rate of change of momentum p=mv\bm p = m\bm v equals the net force F\bm F:
F=p˙=ddt(mv).\bm F = \dot{\bm p} = \frac{d}{dt}(m\bm v).
For constant mass, this reduces to $\bm F = m\bm a$.
  • Third Law (Action-Reaction): For every force F12\bm F_{12} exerted by particle 2 on particle 1, there is an equal and opposite force F21=F12\bm F_{21} = -\bm F_{12} exerted by particle 1 on particle 2. This holds for “central” forces acting along the line joining the particles.

Work and Energy. The work done by a force F\bm F as a particle moves from point 1 to 2 is W12=12FdrW_{12} = \int_1^2 \bm F \cdot d\bm r. The Work-Energy Theorem states that the work done by the net force equals the change in kinetic energy T=12mv2T = \frac{1}{2}mv^2:

W12=T2T1=ΔT.W_{12} = T_2 - T_1 = \Delta T.

A force is conservative if the work done is independent of the path. Such forces can be derived from a potential energy V(r)V(\bm r):

F=V(r)    W12=V1V2=ΔV.\bm F = -\nabla V(\bm r) \implies W_{12} = V_1 - V_2 = -\Delta V.

In this case, total mechanical energy E=T+VE = T + V is conserved (ΔE=0\Delta E = 0).

Systems of Particles. For a system of NN particles, the total mass is M=miM = \sum m_i and the Center of Mass (CM) is R=1Mmiri\bm R = \frac{1}{M}\sum m_i \bm r_i.

  • Linear Momentum: The total momentum P=pi=MR˙\bm P = \sum \bm p_i = M\dot{\bm R} changes according to the net external force: P˙=Fext\dot{\bm P} = \bm F_{\rm ext}. If Fext=0\bm F_{\rm ext} = 0, P\bm P is conserved.
  • Angular Momentum: The total angular momentum L=ri×pi\bm L = \sum \bm r_i \times \bm p_i changes according to the net external torque Next=ri×Fi,ext\bm N_{\rm ext} = \sum \bm r_i \times \bm F_{i, \rm ext}: L˙=Next\dot{\bm L} = \bm N_{\rm ext}.
  • Kinetic Energy: T=TCM+TrelT = T_{\rm CM} + T_{\rm rel}, where TCM=12MR˙2T_{\rm CM} = \frac{1}{2}M\dot{\bm R}^2 is the kinetic energy of the CM motion and Trel=12mi(r˙i)2T_{\rm rel} = \frac{1}{2}\sum m_i (\dot{\bm r}'_i)^2 is the kinetic energy relative to the CM.

Mathematical spine

F=p˙=ma(Newton’s 2nd Law)W12=12Fdr=ΔT(Work-Energy Theorem)L=r×p,N=r×F=L˙(Angular Momentum and Torque)MR¨=Fext,L˙=Next(System Dynamics)\begin{aligned} \bm{F} &= \dot{\bm p} = m\bm{a} & \text{(Newton's 2nd Law)} \\ W_{12} &= \int_1^2 \bm F \cdot d\bm r = \Delta T & \text{(Work-Energy Theorem)} \\ \bm L &= \bm r \times \bm p, \quad \bm N = \bm r \times \bm F = \dot{\bm L} & \text{(Angular Momentum and Torque)} \\ M\ddot{\bm R} &= \bm F_{\rm ext}, \quad \dot{\bm L} = \bm N_{\rm ext} & \text{(System Dynamics)} \end{aligned}

Example: Conservative vs. Non-conservative forces. Gravity (V=mghV=mgh) and springs (V=12kx2V=\frac{1}{2}kx^2) are conservative. A conservative force does work that depends only on the endpoints; equivalently,

Fdr=0\oint \bm F\cdot d\bm r = 0

for every closed path. Kinetic friction violates this condition. Since

fk=μNv^,Wf=fkdr=μNds,\bm f_k=-\mu N\hat{\bm v}, \qquad W_f=\int \bm f_k\cdot d\bm r=-\int \mu N\,ds,

the work depends on the arclength actually traveled, not just on the initial and final positions. For a closed loop of length L>0L>0 with constant μN\mu N,

Wf=μNL<0,W_f=-\mu N L<0,

so friction cannot be represented by a single-valued potential energy. The heat produced is the physical destination of the lost mechanical energy; the mathematical reason friction is non-conservative is path dependence, or nonzero work around a closed path.

Worked example

Block on an inclined plane.
A block of mass m=5 kgm = 5\ \text{kg} slides down a plane inclined at θ=30\theta = 30^\circ to the horizontal with coefficient of kinetic friction μk=0.2\mu_k = 0.2. The normal force is N=mgcosθ=5×9.8×cos3042.4 NN = mg\cos\theta = 5\times 9.8\times\cos 30^\circ \approx 42.4\ \text{N}. The friction force is fk=μkN8.5 Nf_k = \mu_k N \approx 8.5\ \text{N}. The component of gravity along the plane is mgsinθ=24.5 Nmg\sin\theta = 24.5\ \text{N}. Newton’s second law gives ma=mgsinθfk=16.0 Nma = mg\sin\theta - f_k = 16.0\ \text{N}, so a3.2 m/s2a \approx 3.2\ \text{m/s}^2. If the plane has length d=2 md = 2\ \text{m}, the block’s speed at the bottom is v=2ad12.83.58 m/sv = \sqrt{2ad} \approx \sqrt{12.8} \approx 3.58\ \text{m/s}. The work done by friction is Wf=fkd=17.0 JW_f = -f_k d = -17.0\ \text{J}. The change in kinetic energy is ΔT=12mv20=32.0 J\Delta T = \frac{1}{2}mv^2 - 0 = 32.0\ \text{J}, which equals the work done by gravity (mgdsinθ=49.0 Jmgd\sin\theta = 49.0\ \text{J}) plus the work done by friction (17.0 J-17.0\ \text{J}), verifying the work-energy theorem.

Problems with Solutions

Problem 1. A 10 kg box is pulled across a horizontal floor by a rope at 3030^\circ above horizontal with tension T=40 NT = 40\ \text{N}. The coefficient of kinetic friction is μk=0.25\mu_k = 0.25. Find the acceleration.

Solution. Vertical forces: N+Tsin30=mgN + T\sin 30^\circ = mg, so N=10×9.840×0.5=9820=78 NN = 10\times 9.8 - 40\times 0.5 = 98 - 20 = 78\ \text{N}. Friction: fk=μkN=0.25×78=19.5 Nf_k = \mu_k N = 0.25\times 78 = 19.5\ \text{N}. Horizontal: Tcos30fk=maT\cos 30^\circ - f_k = ma, so 40×0.86619.5=34.619.5=15.1 N=10a40\times 0.866 - 19.5 = 34.6 - 19.5 = 15.1\ \text{N} = 10a, giving a=1.51 m/s2a = 1.51\ \text{m/s}^2.

Problem 2. A spring with k=200 N/mk = 200\ \text{N/m} is compressed by x=0.1 mx = 0.1\ \text{m} and launches a ball of mass m=0.5 kgm = 0.5\ \text{kg} on a frictionless horizontal track. What is the ball’s speed when the spring reaches its natural length? If the track then curves upward to a height hh, what is the maximum hh reached?

Solution. The spring potential energy is U=12kx2=12(200)(0.1)2=1.0 JU = \frac{1}{2}kx^2 = \frac{1}{2}(200)(0.1)^2 = 1.0\ \text{J}. By energy conservation, 12mv2=1.0 J\frac{1}{2}mv^2 = 1.0\ \text{J}, so v=2×1.0/0.5=2.0 m/sv = \sqrt{2\times 1.0/0.5} = 2.0\ \text{m/s}. For maximum height, kinetic energy converts to gravitational potential: mgh=1.0 Jmgh = 1.0\ \text{J}, so h=1.0/(0.5×9.8)0.204 mh = 1.0/(0.5\times 9.8) \approx 0.204\ \text{m}.

Problem 3. Two masses m1=4 kgm_1 = 4\ \text{kg} and m2=6 kgm_2 = 6\ \text{kg} are connected by a light string over a frictionless pulley. Find the acceleration of the system and the tension in the string.

Solution. Let aa be the acceleration of m1m_1 upward and m2m_2 downward. For m1m_1: Tm1g=m1aT - m_1g = m_1a. For m2m_2: m2gT=m2am_2g - T = m_2a. Adding: (m2m1)g=(m1+m2)a(m_2-m_1)g = (m_1+m_2)a, so a=(64)×9.84+6=19.610=1.96 m/s2a = \frac{(6-4)\times 9.8}{4+6} = \frac{19.6}{10} = 1.96\ \text{m/s}^2. From the first equation: T=m1(g+a)=4(9.8+1.96)=4×11.76=47.0 NT = m_1(g+a) = 4(9.8+1.96) = 4\times 11.76 = 47.0\ \text{N}.

Section summary. Newtonian mechanics builds from point particles to systems using three laws of motion. Conservation of energy, momentum, and angular momentum arise from symmetries and the nature of internal forces.

Variational Principles and Lagrange’s Equations

Core ideas

Newtonian mechanics requires identifying all forces, including unknown constraint forces (like the normal force on a bead on a wire). The Lagrangian method bypasses these by using energy and generalized coordinates qiq_i.

Generalized Coordinates and Constraints. For a system of NN particles, there are 3N3N degrees of freedom. Constraints reduce this number.

  • Holonomic constraints can be expressed as f(r1,r2,,t)=0f(r_1, r_2, \dots, t) = 0. If there are kk such constraints, the number of independent coordinates is n=3Nkn = 3N - k. These independent coordinates are the generalized coordinates q1,q2,,qnq_1, q_2, \dots, q_n.
  • Constraints that cannot be written this way are non-holonomic. Two common forms are inequalities such as g(q,t)0g(q,t)\le 0, and non-integrable velocity constraints
iai(q,t)q˙i+a0(q,t)=0.\sum_i a_i(q,t)\dot q_i+a_0(q,t)=0.
If this differential relation can be integrated to $F(q,t)=0$, it is holonomic; if it cannot, it is genuinely non-holonomic.

For example, a particle confined inside a box obeys 0x,y,zL0\le x,y,z\le L, or inside a sphere obeys x2+y2+z2R2x^2+y^2+z^2\le R^2. These are inequality constraints rather than equations F(q,t)=0F(q,t)=0. A more typical mechanics example is a car that rolls without sideways slipping. If (x,y)(x,y) is the contact point and θ\theta is the heading angle, the sideways velocity must vanish:

sinθx˙+cosθy˙=0.-\sin\theta\,\dot x+\cos\theta\,\dot y=0.

This restricts allowed velocities but does not reduce to a position-only condition F(x,y,θ)=0F(x,y,\theta)=0. A rolling wheel gives similar non-integrable relations, such as dxRcosθdϕ=0dx-R\cos\theta\,d\phi=0 and dyRsinθdϕ=0dy-R\sin\theta\,d\phi=0.

Hamilton’s Principle. The Lagrangian is defined as L(q,q˙,t)=TVL(q, \dot q, t) = T - V. Hamilton’s Principle states that the actual path taken by a system between times t1t_1 and t2t_2 is the one that makes the action SS stationary:

δS=δt1t2L(qi,q˙i,t)dt=0.\delta S = \delta \int_{t_1}^{t_2} L(q_i, \dot q_i, t) \, dt = 0.

This leads to the Euler—Lagrange equations:

ddt(Lq˙i)Lqi=0,i=1,,n.\frac{d}{dt} \left( \frac{\partial L}{\partial \dot q_i} \right) - \frac{\partial L}{\partial q_i} = 0, \quad i=1, \dots, n.

The derivation is a direct integration-by-parts calculation. Compare the actual path qi(t)q_i(t) with a varied path qi(t)+ϵηi(t)q_i(t)+\epsilon\eta_i(t), where the endpoint variations vanish: ηi(t1)=ηi(t2)=0\eta_i(t_1)=\eta_i(t_2)=0. Stationarity means

0=dSdϵϵ=0=t1t2(Lqiηi+Lq˙iη˙i)dt.0=\left.\frac{dS}{d\epsilon}\right|_{\epsilon=0} =\int_{t_1}^{t_2} \left( \frac{\partial L}{\partial q_i}\eta_i+ \frac{\partial L}{\partial \dot q_i}\dot\eta_i \right)dt .

The second term is integrated by parts:

t1t2Lq˙iη˙idt=[Lq˙iηi]t1t2t1t2ddt(Lq˙i)ηidt.\int_{t_1}^{t_2}\frac{\partial L}{\partial \dot q_i}\dot\eta_i\,dt = \left[\frac{\partial L}{\partial \dot q_i}\eta_i\right]_{t_1}^{t_2} - \int_{t_1}^{t_2} \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right)\eta_i\,dt .

The boundary term is zero because the endpoints are fixed. Therefore

δS=t1t2[Lqiddt(Lq˙i)]ηidt=0.\delta S = \int_{t_1}^{t_2} \left[ \frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right) \right]\eta_i\,dt=0 .

Since the functions ηi(t)\eta_i(t) are arbitrary inside the interval, the bracket must vanish for each coordinate, giving the Euler—Lagrange equations.

Generalized Forces and Non-conservative Systems. If some forces (like friction) are not derivable from a potential, the Euler—Lagrange equations are modified:

ddt(Lq˙i)Lqi=Qi,\frac{d}{dt} \left( \frac{\partial L}{\partial \dot q_i} \right) - \frac{\partial L}{\partial q_i} = Q_i,

where Qi=jFjncrjqiQ_i = \sum_j \bm F_j^{\rm nc} \cdot \frac{\partial \bm r_j}{\partial q_i} is the generalized force associated with coordinate qiq_i.

For a bead sliding on a circular hoop of radius RR, with coordinate θ\theta, the position is r(θ)=R(sinθ,cosθ)\bm r(\theta)=R(\sin\theta,-\cos\theta) and r/θ=R(cosθ,sinθ)\partial\bm r/\partial\theta=R(\cos\theta,\sin\theta). If a tangential friction force has magnitude bRθ˙bR\dot\theta and opposes the motion, then

Fnc=bRθ˙e^θ,Qθ=Fncrθ=bR2θ˙.\bm F^{\rm nc}=-bR\dot\theta\,\hat{\bm e}_\theta, \qquad Q_\theta=\bm F^{\rm nc}\cdot\frac{\partial \bm r}{\partial\theta} =-bR^2\dot\theta .

With L=12mR2θ˙2mgR(1cosθ)L=\frac{1}{2}mR^2\dot\theta^2-mgR(1-\cos\theta), the equation of motion becomes

mR2θ¨+mgRsinθ=bR2θ˙,mR^2\ddot\theta+mgR\sin\theta=-bR^2\dot\theta,

or

θ¨+bmθ˙+gRsinθ=0.\ddot\theta+\frac{b}{m}\dot\theta+\frac{g}{R}\sin\theta=0.

This example shows how non-conservative forces enter as generalized forces without being folded into VV.

Symmetries and Noether’s Theorem. A coordinate qjq_j that does not appear in the Lagrangian (L/qj=0\partial L / \partial q_j = 0) is called cyclic or ignorable. Its conjugate momentum pj=L/q˙jp_j = \partial L / \partial \dot q_j is conserved:

p˙j=ddt(Lq˙j)=Lqj=0    pj=const.\dot p_j = \frac{d}{dt} \left( \frac{\partial L}{\partial \dot q_j} \right) = \frac{\partial L}{\partial q_j} = 0 \implies p_j = \text{const}.

Noether’s Theorem generalizes this: every continuous symmetry of the Lagrangian corresponds to a conservation law.

  • Time translation symmetry     \implies conservation of energy (H=q˙ipiLH = \sum \dot q_i p_i - L).
  • Spatial translation symmetry     \implies conservation of linear momentum.
  • Rotational symmetry     \implies conservation of angular momentum.

Mathematical spine

L=TV(Lagrangian)δLdt=0    ddtLq˙iLqi=0(Euler–Lagrange)pi=Lq˙i(Generalized Momentum)H=q˙ipiL(Hamiltonian/Energy)\begin{aligned} L &= T - V & \text{(Lagrangian)} \\ \delta \int L dt &= 0 \implies \frac{d}{dt} \frac{\partial L}{\partial \dot q_i} - \frac{\partial L}{\partial q_i} = 0 & \text{(Euler--Lagrange)} \\ p_i &= \frac{\partial L}{\partial \dot q_i} & \text{(Generalized Momentum)} \\ H &= \sum \dot q_i p_i - L & \text{(Hamiltonian/Energy)} \end{aligned}

Example: Simple Pendulum. Coordinate: angle θ\theta. L=TV=12ml2θ˙2mg(llcosθ)L = T - V = \frac{1}{2}ml^2\dot\theta^2 - mg(l - l\cos\theta). Euler—Lagrange: ddt(ml2θ˙)(mglsinθ)=0    θ¨+glsinθ=0\frac{d}{dt}(ml^2\dot\theta) - (-mgl\sin\theta) = 0 \implies \ddot\theta + \frac{g}{l}\sin\theta = 0.

Worked example

Bead on a rotating hoop.
A bead of mass m=0.1 kgm = 0.1\ \text{kg} slides without friction on a hoop of radius R=0.2 mR = 0.2\ \text{m} rotating about its vertical diameter at constant angular velocity Ω=5 rad/s\Omega = 5\ \text{rad/s}. Using polar angle θ\theta measured from the bottom, the position is (RsinθcosΩt,RsinθsinΩt,Rcosθ)(R\sin\theta\cos\Omega t, R\sin\theta\sin\Omega t, -R\cos\theta). The velocity squared is v2=R2θ˙2+R2Ω2sin2θv^2 = R^2\dot\theta^2 + R^2\Omega^2\sin^2\theta. The Lagrangian is L=12m(R2θ˙2+R2Ω2sin2θ)+mgRcosθL = \frac{1}{2}m(R^2\dot\theta^2 + R^2\Omega^2\sin^2\theta) + mgR\cos\theta. The Euler—Lagrange equation gives mR2θ¨=mR2Ω2sinθcosθmgRsinθmR^2\ddot\theta = mR^2\Omega^2\sin\theta\cos\theta - mgR\sin\theta, or θ¨=(Ω2cosθg/R)sinθ\ddot\theta = (\Omega^2\cos\theta - g/R)\sin\theta. For equilibrium, θ¨=0\ddot\theta=0, giving solutions θ=0\theta=0, θ=π\theta=\pi, and cosθ=g/(RΩ2)=9.8/(0.2×25)=1.96\cos\theta = g/(R\Omega^2) = 9.8/(0.2\times 25) = 1.96, which is impossible here. Thus only θ=0\theta=0 (bottom) and θ=π\theta=\pi (top) are equilibria.

Problems with Solutions

Problem 1. Use the Lagrangian method to find the equation of motion for a simple pendulum of mass mm and length ll.

Solution. Coordinate: angle θ\theta from vertical. x=lsinθx = l\sin\theta, y=lcosθy = -l\cos\theta. Kinetic energy T=12m(l2θ˙2)T = \frac{1}{2}m(l^2\dot\theta^2). Potential energy V=mgl(1cosθ)V = mgl(1-\cos\theta). Lagrangian L=TV=12ml2θ˙2mgl(1cosθ)L = T-V = \frac{1}{2}ml^2\dot\theta^2 - mgl(1-\cos\theta). Euler—Lagrange: ddt(L/θ˙)L/θ=0\frac{d}{dt}(\partial L/\partial\dot\theta) - \partial L/\partial\theta = 0. L/θ˙=ml2θ˙\partial L/\partial\dot\theta = ml^2\dot\theta, so ddt(ml2θ˙)=ml2θ¨\frac{d}{dt}(ml^2\dot\theta) = ml^2\ddot\theta. L/θ=mglsinθ\partial L/\partial\theta = -mgl\sin\theta. Thus ml2θ¨+mglsinθ=0ml^2\ddot\theta + mgl\sin\theta = 0, or θ¨+(g/l)sinθ=0\ddot\theta + (g/l)\sin\theta = 0.

Problem 2. An Atwood machine consists of two masses m1=3 kgm_1 = 3\ \text{kg} and m2=2 kgm_2 = 2\ \text{kg} connected by a light string over a frictionless pulley. Use the Lagrangian method to find the acceleration.

Solution. Let xx be the vertical position of m1m_1 (downward positive). Then m2m_2 moves up by the same amount. T=12(m1+m2)x˙2=12(5)x˙2T = \frac{1}{2}(m_1+m_2)\dot x^2 = \frac{1}{2}(5)\dot x^2. V=m1gx+m2gx=(m1m2)gx=gxV = -m_1gx + m_2gx = -(m_1-m_2)gx = -gx. L=52x˙2+gxL = \frac{5}{2}\dot x^2 + gx. Euler—Lagrange: 5x¨g=05\ddot x - g = 0, so x¨=g/5=1.96 m/s2\ddot x = g/5 = 1.96\ \text{m/s}^2. The acceleration of m1m_1 is 1.96 m/s21.96\ \text{m/s}^2 downward and m2m_2 is 1.96 m/s21.96\ \text{m/s}^2 upward.

Problem 3. A particle of mass mm moves in the xyxy-plane under a central potential V(r)=krV(r) = -\frac{k}{r}. Write the Lagrangian in polar coordinates and identify any cyclic coordinates and their conserved momenta.

Solution. In polar coordinates, T=12m(r˙2+r2ϕ˙2)T = \frac{1}{2}m(\dot r^2 + r^2\dot\phi^2) and V=k/rV = -k/r. Thus L=12m(r˙2+r2ϕ˙2)+k/rL = \frac{1}{2}m(\dot r^2 + r^2\dot\phi^2) + k/r. The coordinate ϕ\phi does not appear explicitly in LL, so it is cyclic. The conserved conjugate momentum is pϕ=L/ϕ˙=mr2ϕ˙=p_\phi = \partial L/\partial\dot\phi = mr^2\dot\phi = \ell, which is the angular momentum. This is Noether’s theorem in action: rotational symmetry implies conservation of angular momentum.

Section summary. The Lagrangian formulation replaces force-based dynamics with an optimization principle. It automatically incorporates holonomic constraints and links symmetries to conservation laws via Noether’s theorem.

The Central Force Problem

Core ideas

A central force points along the line joining two particles and depends only on their separation: F=F(r)r^\bm F = F(r)\hat{\bm r}. Examples include gravity and the Coulomb force.

Reduction to a One-Body Problem. For two particles with masses m1,m2m_1, m_2 and positions r1,r2\bm r_1, \bm r_2, the Lagrangian is L=12m1r˙12+12m2r˙22V(r1r2)L = \frac{1}{2}m_1\dot{\bm r}_1^2 + \frac{1}{2}m_2\dot{\bm r}_2^2 - V(|\bm r_1 - \bm r_2|). Transforming to the center of mass R\bm R and relative coordinate r=r1r2\bm r = \bm r_1 - \bm r_2:

L=12MR˙2+12μr˙2V(r),L = \frac{1}{2}M\dot{\bm R}^2 + \frac{1}{2}\mu\dot{\bm r}^2 - V(r),

where M=m1+m2M = m_1 + m_2 and μ=m1m2m1+m2\mu = \frac{m_1m_2}{m_1+m_2} is the reduced mass. The CM moves at constant velocity, so we focus on the relative motion in the CM frame.

Conservation Laws. Since the force is central, the torque is zero, and the angular momentum L=r×μr˙\bm L = \bm r \times \mu \dot{\bm r} is conserved. This implies the motion is confined to a plane. In plane polar coordinates (r,ϕ)(r, \phi):

Lz=μr2ϕ˙==const.L_z = \mu r^2 \dot\phi = \ell = \text{const}.

Energy is also conserved: E=12μ(r˙2+r2ϕ˙2)+V(r)=constE = \frac{1}{2}\mu(\dot r^2 + r^2\dot\phi^2) + V(r) = \text{const}.

The Effective Potential. Using ϕ˙=μr2\dot\phi = \frac{\ell}{\mu r^2}, we can write the radial energy equation:

E=12μr˙2+Veff(r),Veff(r)=V(r)+22μr2.E = \frac{1}{2}\mu \dot r^2 + V_{\rm eff}(r), \quad V_{\rm eff}(r) = V(r) + \frac{\ell^2}{2\mu r^2}.

The term 22μr2\frac{\ell^2}{2\mu r^2} is the centrifugal barrier. The radial motion is effectively one-dimensional.

The Orbit Equation. Defining u=1/ru = 1/r, the differential equation for the orbit shape is:

d2udϕ2+u=μ2u2F(1/u).\frac{d^2u}{d\phi^2} + u = -\frac{\mu}{\ell^2u^2}F(1/u).

For the inverse-square law F(r)=k/r2F(r) = -k/r^2 (where k=Gm1m2k = G m_1 m_2), the solution is a conic section:

r(ϕ)=α1+ecos(ϕϕ0),α=2μk,e=1+2E2μk2.r(\phi) = \frac{\alpha}{1 + e\cos(\phi - \phi_0)}, \quad \alpha = \frac{\ell^2}{\mu k}, \quad e = \sqrt{1 + \frac{2E\ell^2}{\mu k^2}}.

The eccentricity ee determines the orbit shape:

  • e=0e=0: Circle (E=Veff,min=μk222E = V_{\rm eff, min} = -\frac{\mu k^2}{2\ell^2})
  • 0<e<10 < e < 1: Ellipse (Veff,min<E<0V_{\rm eff, min} < E < 0)
  • e=1e = 1: Parabola (E=0E = 0)
  • e>1e > 1: Hyperbola (E>0E > 0)

Kepler’s Laws.

  • Planets move in elliptical orbits with the Sun at one focus.
  • A line joining a planet and the Sun sweeps out equal areas during equal intervals of time (equivalent to conservation of angular momentum).
  • The square of the orbital period TT is proportional to the cube of the semi-major axis aa: T2=4π2μka3T^2 = \frac{4\pi^2\mu}{k}a^3.

The Runge-Lenz Vector. For the 1/r1/r potential, there is an additional conserved vector, the Laplace—Runge—Lenz vector:

A=p×Lμkr^.\bm A = \bm p \times \bm L - \mu k \hat{\bm r}.

This vector points along the major axis and its conservation explains why orbits in a 1/r1/r potential are closed and do not precess.

Mathematical spine

Veff(r)=V(r)+22μr2(Effective Potential)d2udϕ2+u=μk2(Orbit Equation for 1/r2 force)r(ϕ)=α1+ecosϕ(Conic Section Solution)\begin{aligned} V_{\rm eff}(r) &= V(r) + \frac{\ell^2}{2\mu r^2} & \text{(Effective Potential)} \\ \frac{d^2u}{d\phi^2} + u &= \frac{\mu k}{\ell^2} & \text{(Orbit Equation for $1/r^2$ force)} \\ r(\phi) &= \frac{\alpha}{1 + e\cos\phi} & \text{(Conic Section Solution)} \end{aligned}

Example: Circular Orbit. For a circular orbit at radius r0r_0, the force must equal the centripetal requirement: F(r0)=μv2r0=2μr03|F(r_0)| = \frac{\mu v^2}{r_0} = \frac{\ell^2}{\mu r_0^3}. This corresponds to the minimum of Veff(r)V_{\rm eff}(r).

Worked example

Geostationary orbit.
A satellite orbits Earth at radius rr with period T=24 h=86400 sT = 24\ \text{h} = 86400\ \text{s}. For a circular orbit, centripetal force equals gravity: mv2/r=GMm/r2mv^2/r = GMm/r^2, so v2=GM/rv^2 = GM/r. With v=2πr/Tv = 2\pi r/T, we get (2πr/T)2=GM/r(2\pi r/T)^2 = GM/r, or r3=GMT2/(4π2)r^3 = GMT^2/(4\pi^2). Using GM=gRE29.8×(6.37×106)23.98×1014 m3/s2GM = gR_E^2 \approx 9.8\times(6.37\times10^6)^2 \approx 3.98\times10^{14}\ \text{m}^3/\text{s}^2, we find r3=3.98×1014×(86400)2/(4π2)7.54×1022 m3r^3 = 3.98\times10^{14}\times(86400)^2/(4\pi^2) \approx 7.54\times10^{22}\ \text{m}^3, giving r4.22×107 m6.6REr \approx 4.22\times10^7\ \text{m} \approx 6.6R_E. The altitude is h=rRE3.58×104 kmh = r - R_E \approx 3.58\times10^4\ \text{km}. The orbital speed is v3.07 km/sv \approx 3.07\ \text{km/s}.

Problems with Solutions

Problem 1. Calculate the escape velocity from the surface of Earth (RE=6.37×106 mR_E = 6.37\times10^6\ \text{m}, g=9.8 m/s2g = 9.8\ \text{m/s}^2).

Solution. Escape requires total energy E0E \ge 0. At the surface, E=12mv2GMm/RE=0E = \frac{1}{2}mv^2 - GMm/R_E = 0, so vesc=2GM/RE=2gRE=2×9.8×6.37×1061.25×1081.12×104 m/s=11.2 km/sv_{\rm esc} = \sqrt{2GM/R_E} = \sqrt{2gR_E} = \sqrt{2\times 9.8\times 6.37\times10^6} \approx \sqrt{1.25\times10^8} \approx 1.12\times10^4\ \text{m/s} = 11.2\ \text{km/s}.

Problem 2. A comet has a parabolic orbit around the Sun (M=1.99×1030 kgM_\odot = 1.99\times10^{30}\ \text{kg}) with perihelion distance q=0.1 AUq = 0.1\ \text{AU} (1 AU=1.496×1011 m1\ \text{AU} = 1.496\times10^{11}\ \text{m}). Find its speed at perihelion.

Solution. For a parabolic orbit, E=0E=0 and e=1e=1. At perihelion, r=qr = q and v=vmaxv = v_{\rm max}. Energy conservation: 12v2GM/q=0\frac{1}{2}v^2 - GM_\odot/q = 0, so v=2GM/q=2×6.67×1011×1.99×1030/(0.1×1.496×1011)1.775×10101.33×105 m/s=133 km/sv = \sqrt{2GM_\odot/q} = \sqrt{2\times 6.67\times10^{-11}\times 1.99\times10^{30}/(0.1\times 1.496\times10^{11})} \approx \sqrt{1.775\times10^{10}} \approx 1.33\times10^5\ \text{m/s} = 133\ \text{km/s}.

Problem 3. For an elliptical orbit with semi-major axis aa and period TT, derive the relation T2a3T^2 \propto a^3 using energy and angular momentum conservation.

Solution. From the orbit equation, r(ϕ)=α/(1+ecosϕ)r(\phi) = \alpha/(1+e\cos\phi) with α=2/(μk)\alpha = \ell^2/(\mu k). The semi-major axis is a=α/(1e2)a = \alpha/(1-e^2). The energy is E=μk2/(22)(1e2)=k/(2a)E = -\mu k^2/(2\ell^2)(1-e^2) = -k/(2a). Thus 2=μka(1e2)\ell^2 = \mu k a(1-e^2). The area swept per unit time is dA/dt=/(2μ)dA/dt = \ell/(2\mu). The total area is A=πab=πa21e2A = \pi a b = \pi a^2\sqrt{1-e^2}. Integrating, T=A/(dA/dt)=2μπa21e2/=2μπa21e2/μka(1e2)=2πμ/ka3/2T = A/(dA/dt) = 2\mu\pi a^2\sqrt{1-e^2}/\ell = 2\mu\pi a^2\sqrt{1-e^2}/\sqrt{\mu k a(1-e^2)} = 2\pi\sqrt{\mu/k}\,a^{3/2}. Therefore T2=4π2μka3T^2 = \frac{4\pi^2\mu}{k}a^3, which is Kepler’s third law.

Section summary. The central force problem reduces to a 1D radial problem using conservation of angular momentum. For inverse-square forces, orbits are conic sections, consistent with Kepler’s Laws.

The Kinematics of Rigid Body Motion

Core ideas

A rigid body is a system of particles where the distance between any two particles remains constant: rirj=cij|\bm r_i - \bm r_j| = c_{ij}. It has 6 degrees of freedom (3 for translation of a reference point, 3 for rotation).

Chasles’ Theorem. Any displacement of a rigid body can be decomposed into a translation of a chosen base point plus a rotation about an axis through that point. Typically, the Center of Mass (CM) is chosen as the base point.

Angular Velocity and Rotations. The velocity of any point PP in the body is v=V+ω×r\bm v = \bm V + \bm \omega \times \bm r, where V\bm V is the velocity of the base point and ω\bm \omega is the angular velocity vector. Rotations are often described using Euler angles (ϕ,θ,ψ)(\phi, \theta, \psi) which represent a sequence of three rotations (e.g., zz-xx'-zz'' convention) to transform from a space-fixed frame to a body-fixed frame.

Why Euler angles in practice. Euler angles provide the minimal set of three independent generalized coordinates for orientation, so that rotational dynamics can be cast directly into Lagrange’s equations without redundant constraints. In the heavy symmetric top, for instance, the choice of (ϕ,θ,ψ)(\phi,\theta,\psi) makes ϕ\phi and ψ\psi cyclic, immediately yielding two conserved momenta pϕp_\phi and pψp_\psi and reducing the problem to a one-dimensional motion in θ\theta. The same coordinates are the standard yaw—pitch—roll variables used to describe spacecraft attitude, gyrocompasses, gimbal-mounted sensors, robotic end-effectors, and aircraft orientation.

In numerical practice one must remember the well-known gimbal-lock singularity at θ=0,π\theta = 0,\pi, where ϕ˙\dot\phi and ψ˙\dot\psi become indistinguishable and the kinematic map (ϕ˙,θ˙,ψ˙)ω(\dot\phi,\dot\theta,\dot\psi)\to\bm\omega loses rank. For this reason attitude-control engineers often integrate the equivalent quaternion (Euler—Rodrigues) parameters and convert back to Euler angles only for human-readable output.

The Inertia Tensor. The inertia tensor I\bm I is a symmetric 3×33 \times 3 matrix that relates angular momentum L\bm L to angular velocity ω\bm \omega: L=Iω\bm L = \bm I \bm \omega. Its components in a given basis are:

Iij=nmn(rn2δijxn,ixn,j).I_{ij} = \sum_n m_n (r_n^2 \delta_{ij} - x_{n,i}x_{n,j}).
  • Principal Axes: For any point, there exists a set of axes where I\bm I is diagonal. The diagonal elements I1,I2,I3I_1, I_2, I_3 are the principal moments of inertia.
  • Parallel Axis Theorem: If IcmI_{\rm cm} is the inertia tensor about the CM, the tensor about an axis parallel but shifted by a\bm a is I=Icm+M(a21aa)I = I_{\rm cm} + M(a^2 \bm 1 - \bm a \otimes \bm a).

Mathematical spine

v=V+ω×r(Velocity in Rigid Body)L=Iω,Trot=12ωL=12ωIω(Angular Momentum and KE)Iij=ρ(r)(r2δijxixj)d3r(Continuum Inertia Tensor)\begin{aligned} \bm v &= \bm V + \bm \omega \times \bm r & \text{(Velocity in Rigid Body)} \\ \bm L &= \bm I \bm \omega, \quad T_{\rm rot} = \frac{1}{2}\bm \omega \cdot \bm L = \frac{1}{2}\bm \omega \cdot \bm I \bm \omega & \text{(Angular Momentum and KE)} \\ I_{ij} &= \int \rho(\bm r) (r^2 \delta_{ij} - x_ix_j) d^3r & \text{(Continuum Inertia Tensor)} \end{aligned}

Example: Inertia Tensor of a Cube. For a uniform cube of side aa and mass MM about a corner, Ixx=13Ma2I_{xx} = \frac{1}{3}Ma^2, Ixy=14Ma2I_{xy} = -\frac{1}{4}Ma^2. About the center, the axes are principal and Ixx=Iyy=Izz=16Ma2I_{xx}=I_{yy}=I_{zz} = \frac{1}{6}Ma^2.

Worked example

Velocity of a point on a rolling wheel.
A bicycle wheel of radius R=0.35 mR = 0.35\ \text{m} rolls without slipping at speed V=5 m/sV = 5\ \text{m/s}. The angular velocity is ω=V/R=5/0.3514.3 rad/s\omega = V/R = 5/0.35 \approx 14.3\ \text{rad/s}. For a point on the rim at angle ϕ\phi from the top, measured in the body frame, the position relative to the center is r=R(sinϕ,cosϕ)\bm r = R(\sin\phi, -\cos\phi). The velocity is v=V+ω×r=(V,0)+(0,0,ω)×(Rsinϕ,Rcosϕ,0)=(V+ωRcosϕ,ωRsinϕ)\bm v = \bm V + \bm\omega\times\bm r = (V, 0) + (0,0,\omega)\times(R\sin\phi, -R\cos\phi, 0) = (V + \omega R\cos\phi, \omega R\sin\phi). At the top (ϕ=0\phi=0): v=(V+V,0)=(10,0) m/s\bm v = (V+V, 0) = (10, 0)\ \text{m/s}. At the bottom (ϕ=π\phi=\pi): v=(VV,0)=0\bm v = (V-V, 0) = 0, consistent with the no-slip condition.

Problems with Solutions

Problem 1. A uniform rod of mass M=2 kgM = 2\ \text{kg} and length L=1 mL = 1\ \text{m} rotates about an axis through one end and perpendicular to the rod. Find the moment of inertia and the kinetic energy when ω=3 rad/s\omega = 3\ \text{rad/s}.

Solution. For a rod about one end, I=13ML2=13(2)(1)2=23 kgm2I = \frac{1}{3}ML^2 = \frac{1}{3}(2)(1)^2 = \frac{2}{3}\ \text{kg}\cdot\text{m}^2. Rotational kinetic energy is T=12Iω2=12×23×9=3 JT = \frac{1}{2}I\omega^2 = \frac{1}{2}\times\frac{2}{3}\times 9 = 3\ \text{J}.

Problem 2. A rigid body rotates with ω=(2,3,1) rad/s\bm\omega = (2, 3, 1)\ \text{rad/s}. Find the velocity of a point at r=(0.1,0.2,0) m\bm r = (0.1, 0.2, 0)\ \text{m} relative to the rotation axis.

Solution. v=ω×r=(2,3,1)×(0.1,0.2,0)=(3010.2,10.120,20.230.1)=(0.2,0.1,0.1) m/s\bm v = \bm\omega\times\bm r = (2,3,1)\times(0.1,0.2,0) = (3\cdot0 - 1\cdot0.2, 1\cdot0.1 - 2\cdot0, 2\cdot0.2 - 3\cdot0.1) = (-0.2, 0.1, 0.1)\ \text{m/s}. The speed is v=0.04+0.01+0.01=0.060.245 m/s|\bm v| = \sqrt{0.04+0.01+0.01} = \sqrt{0.06} \approx 0.245\ \text{m/s}.

Problem 3. Use the parallel axis theorem to find the moment of inertia of a solid disk of mass M=1 kgM = 1\ \text{kg} and radius R=0.2 mR = 0.2\ \text{m} about a tangent in its plane.

Solution. About the center perpendicular to the disk, Icm=12MR2=0.5×1×0.04=0.02 kgm2I_{\rm cm} = \frac{1}{2}MR^2 = 0.5\times 1\times 0.04 = 0.02\ \text{kg}\cdot\text{m}^2. For an axis in the plane, by symmetry Ix,cm=Iy,cm=14MR2=0.01 kgm2I_{x,\rm cm} = I_{y,\rm cm} = \frac{1}{4}MR^2 = 0.01\ \text{kg}\cdot\text{m}^2. Shifting to a tangent point on the rim, the displacement is a=Ra = R perpendicular to the new axis. The parallel axis theorem gives I=Ix,cm+MR2=14MR2+MR2=54MR2=54(1)(0.04)=0.05 kgm2I = I_{x,\rm cm} + MR^2 = \frac{1}{4}MR^2 + MR^2 = \frac{5}{4}MR^2 = \frac{5}{4}(1)(0.04) = 0.05\ \text{kg}\cdot\text{m}^2.

Section summary. Rigid body kinematics describes the 6-DOF motion through CM translation and rotation, characterized by the angular velocity vector and the inertia tensor.

The Rigid Body Equations of Motion

Core ideas

Dynamics in a rotating frame requires accounting for the rotation of the basis vectors. For any vector A\bm A:

(dAdt)space=(dAdt)body+ω×A.\left(\frac{d\bm A}{dt}\right)_{\rm space} = \left(\frac{d\bm A}{dt}\right)_{\rm body} + \bm\omega \times \bm A.

Euler’s Equations. Applying this to the angular momentum L\bm L in the body-fixed frame of principal axes:

N=L˙body+ω×L.\bm N = \dot{\bm L}_{\rm body} + \bm\omega \times \bm L.

This gives Euler’s equations:

I1ω˙1(I2I3)ω2ω3=N1I2ω˙2(I3I1)ω3ω1=N2I3ω˙3(I1I2)ω1ω2=N3\begin{aligned} I_1 \dot\omega_1 - (I_2 - I_3)\omega_2\omega_3 &= N_1 \\ I_2 \dot\omega_2 - (I_3 - I_1)\omega_3\omega_1 &= N_2 \\ I_3 \dot\omega_3 - (I_1 - I_2)\omega_1\omega_2 &= N_3 \end{aligned}

Torque-Free Motion. When N=0\bm N = 0, both Erot=12Iiωi2E_{\rm rot} = \frac{1}{2}\sum I_i \omega_i^2 and L2=Ii2ωi2L^2 = \sum I_i^2 \omega_i^2 are conserved.

  • Stability (Tennis Racket Theorem): Rotation about the principal axes with the largest (ImaxI_{\rm max}) or smallest (IminI_{\rm min}) moments is stable. Rotation about the intermediate axis is unstable.
  • Symmetric Top: If I1=I2I3I_1 = I_2 \neq I_3, the angular velocity ω\bm \omega precesses around the body-fixed symmetry axis zz' with frequency Ωbody=I3I1I1ω3\Omega_{\rm body} = \frac{I_3 - I_1}{I_1}\omega_3.

The Heavy Symmetric Top. A top with I1=I2I_1=I_2 spinning in a gravitational field exhibits complex motion:

  • Precession: The symmetry axis rotates around the vertical (gravity) axis.
  • Nutation: The symmetry axis bobs up and down between two polar angles θ1\theta_1 and θ2\theta_2.
  • Stability: A “sleeping top” (vertical spin) is stable only if ω3>2I3MglI1\omega_3 > \frac{2}{I_3}\sqrt{Mg l I_1}.

Nutation amplitude and frequency (fast-spin limit). For a top released with non-zero tilt θ0\theta_0 and large spin ω3\omega_3 the energy/angular-momentum conservation reduces θ(t)\theta(t) to motion in an effective potential. Linearising about the mean angle one finds small-amplitude nutation at the angular frequency

ωnut    I3ω3I1,\omega_{\rm nut} \;\approx\; \frac{I_3\,\omega_3}{I_1},

i.e.\ the nutation is fast compared with the precession ωprecMgl/(I3ω3)\omega_{\rm prec}\approx Mgl/(I_3\omega_3), and the two satisfy the simple product

ωnutωprec    MglI1.\omega_{\rm nut}\,\omega_{\rm prec} \;\approx\; \frac{Mgl}{I_1}.

The peak-to-peak amplitude of the wobble for a top released from rest at θ0\theta_0 is

Δθ  =  θ2θ1    2Mglsinθ0I3ω32I1I3,\Delta\theta \;=\; \theta_2 - \theta_1 \;\approx\; \frac{2 Mgl\sin\theta_0}{I_3\,\omega_3^2}\,\frac{I_1}{I_3},

which vanishes as ω32\omega_3^{-2} --- a fast-spinning top exhibits steady precession with imperceptible nutation. The sleeping-top threshold above is the same statement at θ0=0\theta_0=0: nutation about the vertical is bounded only when I32ω32>4MglI1I_3^2\omega_3^2 > 4 Mgl I_1.

Mathematical spine

I1ω˙1(I2I3)ω2ω3=N1(Euler’s Equations)ωprec=MglL(Slow Precession approximation)E=12I1(θ˙2+ϕ˙2sin2θ)+12I3(ϕ˙cosθ+ψ˙)2+Mglcosθ(Heavy Top Energy)\begin{aligned} I_1\dot\omega_1 - (I_2 - I_3)\omega_2\omega_3 &= N_1 & \text{(Euler's Equations)} \\ \omega_{\rm prec} &= \frac{Mgl}{L} & \text{(Slow Precession approximation)} \\ E &= \frac{1}{2}I_1(\dot\theta^2 + \dot\phi^2\sin^2\theta) + \frac{1}{2}I_3(\dot\phi\cos\theta + \dot\psi)^2 + Mgl\cos\theta & \text{(Heavy Top Energy)} \end{aligned}

Example: Free Precession of the Earth. The Earth is slightly oblate (I3>I1I_3 > I_1). Euler’s equations predict a precession of the spin axis (the “Chandler wobble”) with a period related to the difference in moments of inertia.

Worked example

Stability of a spinning book.
A hardcover book has principal moments I1=0.05 kgm2I_1 = 0.05\ \text{kg}\cdot\text{m}^2, I2=0.08 kgm2I_2 = 0.08\ \text{kg}\cdot\text{m}^2, I3=0.10 kgm2I_3 = 0.10\ \text{kg}\cdot\text{m}^2 (with I1<I2<I3I_1 < I_2 < I_3). According to the tennis-racket theorem, rotation about the I1I_1 (thickness) and I3I_3 (height) axes is stable, while rotation about I2I_2 (width) is unstable. If the book is spun about its intermediate axis with a small perturbation, Euler’s equations give ω˙1=I2I3I1ω2ω3=0.020.05ω2ω3=0.4ω2ω3\dot\omega_1 = \frac{I_2-I_3}{I_1}\omega_2\omega_3 = \frac{-0.02}{0.05}\omega_2\omega_3 = -0.4\omega_2\omega_3. Linearizing about ω2Ω\omega_2 \approx \Omega shows ω¨1Ω2ω1\ddot\omega_1 \propto -\Omega^2\omega_1 with a positive coefficient (since (I2I3)(I1I2)/(I1I3)>0(I_2-I_3)(I_1-I_2)/(I_1I_3) > 0), leading to exponential growth: ω1(t)eλt\omega_1(t) \propto e^{\lambda t} with λ=Ω(I3I2)(I2I1)/(I1I3)\lambda = \Omega\sqrt{(I_3-I_2)(I_2-I_1)/(I_1I_3)}.

Problems with Solutions

Problem 1. A symmetric top has I1=I2=5.0×104 kgm2I_1 = I_2 = 5.0\times10^{-4}\ \text{kg}\cdot\text{m}^2 and I3=1.0×103 kgm2I_3 = 1.0\times10^{-3}\ \text{kg}\cdot\text{m}^2. It spins at ω3=100 rad/s\omega_3 = 100\ \text{rad/s}. Find the body-frame precession rate Ωbody\Omega_{\rm body}.

Solution. For a torque-free symmetric top, Ωbody=I3I1I1ω3=1.0×1035.0×1045.0×104×100=0.5×1035.0×104×100=1×100=100 rad/s\Omega_{\rm body} = \frac{I_3-I_1}{I_1}\omega_3 = \frac{1.0\times10^{-3} - 5.0\times10^{-4}}{5.0\times10^{-4}}\times 100 = \frac{0.5\times10^{-3}}{5.0\times10^{-4}}\times 100 = 1\times 100 = 100\ \text{rad/s}. The angular velocity vector precesses around the symmetry axis at 100 rad/s100\ \text{rad/s}.

Problem 2. A top of mass M=0.2 kgM = 0.2\ \text{kg} with I1=I2=1.5×104 kgm2I_1 = I_2 = 1.5\times10^{-4}\ \text{kg}\cdot\text{m}^2 and I3=3.0×104 kgm2I_3 = 3.0\times10^{-4}\ \text{kg}\cdot\text{m}^2 has its CM a distance l=0.03 ml = 0.03\ \text{m} from the pivot. What minimum spin ω3\omega_3 is needed for the sleeping-top configuration to be stable?

Solution. The sleeping-top stability condition is ω3>2I3MglI1\omega_3 > \frac{2}{I_3}\sqrt{MglI_1}. Substituting: MglI1=0.2×9.8×0.03×1.5×104=8.82×1062.97×103\sqrt{MglI_1} = \sqrt{0.2\times 9.8\times 0.03\times 1.5\times10^{-4}} = \sqrt{8.82\times10^{-6}} \approx 2.97\times10^{-3}. Then ω3>23.0×104×2.97×103=5.94×1033.0×10419.8 rad/s\omega_3 > \frac{2}{3.0\times10^{-4}}\times 2.97\times10^{-3} = \frac{5.94\times10^{-3}}{3.0\times10^{-4}} \approx 19.8\ \text{rad/s} (about 3.23.2 rev/s).

Problem 3. A cube of side a=0.1 ma = 0.1\ \text{m} and mass M=0.5 kgM = 0.5\ \text{kg} rotates about an axis through its center and perpendicular to one face. Find its angular momentum if ω=20 rad/s\omega = 20\ \text{rad/s}.

Solution. For a cube about a central axis perpendicular to a face, I=16Ma2=16(0.5)(0.1)2=0.00568.33×104 kgm2I = \frac{1}{6}Ma^2 = \frac{1}{6}(0.5)(0.1)^2 = \frac{0.005}{6} \approx 8.33\times10^{-4}\ \text{kg}\cdot\text{m}^2. Angular momentum is L=Iω=8.33×104×201.67×102 kgm2/sL = I\omega = 8.33\times10^{-4}\times 20 \approx 1.67\times10^{-2}\ \text{kg}\cdot\text{m}^2/\text{s}.

Section summary. Euler’s equations describe rigid body dynamics in the body frame, revealing the stability of principal axis rotation and the precession/nutation of tops.

Oscillations

Core ideas

Most systems in classical mechanics behave like harmonic oscillators when displaced slightly from a stable equilibrium point.

Small Oscillations around Equilibrium. Consider a system with generalized coordinates qiq_i and a potential V(q1,,qn)V(q_1, \dots, q_n). Equilibrium occurs where Vqi=0\frac{\partial V}{\partial q_i} = 0. If this is a minimum (all eigenvalues of the Hessian 2Vqiqj\frac{\partial^2 V}{\partial q_i \partial q_j} are positive), the equilibrium is stable. Defining ηi=qiqi0\eta_i = q_i - q_{i0} as the displacement, the Lagrangian for small oscillations is:

L12i,j(Tijη˙iη˙jVijηiηj),L \approx \frac{1}{2} \sum_{i,j} (T_{ij} \dot\eta_i \dot\eta_j - V_{ij} \eta_i \eta_j),

where Vij=2Vqiqj0V_{ij} = \left.\frac{\partial^2 V}{\partial q_i \partial q_j}\right|_0 and TijT_{ij} is the kinetic energy matrix (usually constant for small displacements).

Normal Modes and the Secular Equation. The equations of motion are Tη¨+Vη=0T\ddot{\bm\eta} + V\bm\eta = 0. Assuming a sinusoidal solution η(t)=aeiωt\bm\eta(t) = \bm a e^{i\omega t} leads to the generalized eigenvalue problem:

(Vω2T)a=0.(V - \omega^2 T) \bm a = 0.

Nontrivial solutions exist only if the secular equation holds:

det(Vω2T)=0.\det(V - \omega^2 T) = 0.

The roots ωk2\omega_k^2 are the normal frequencies, and the corresponding vectors ak\bm a_k define the normal modes.

Normal Coordinates. There exists a linear transformation η=Aζ\bm \eta = \bm A \bm \zeta that simultaneously diagonalizes TT and VV. The new coordinates ζk\zeta_k are normal coordinates, and each evolves independently as a simple harmonic oscillator: ζ¨k+ωk2ζk=0\ddot\zeta_k + \omega_k^2 \zeta_k = 0.

Damping and Driving. Real systems have dissipation (QQ-factor) and external forces.

  • Damping: x¨+2γx˙+ω02x=0\ddot x + 2\gamma\dot x + \omega_0^2 x = 0. Solutions can be underdamped, overdamped, or critically damped.
  • Resonance: When driven by F0cosωtF_0\cos\omega t, the amplitude is maximized near ωω0\omega \approx \omega_0. The phase shift δ\delta changes from 0 to π\pi as ω\omega passes through resonance.

Mathematical spine

det(Vω2T)=0(Secular Equation)L=12(ζ˙2ω2ζ2)(Lagrangian in Normal Coordinates)Q=ω02γ(Quality Factor)\begin{aligned} \det(V - \omega^2 T) &= 0 & \text{(Secular Equation)} \\ L &= \frac{1}{2} (\dot{\bm\zeta}^2 - \bm\omega^2 \bm\zeta^2) & \text{(Lagrangian in Normal Coordinates)} \\ Q &= \frac{\omega_0}{2\gamma} & \text{(Quality Factor)} \end{aligned}

Example: Two Coupled Pendulums. For two identical pendulums coupled by a spring, there are two modes:

  1. In-phase: ω1=g/l\omega_1 = \sqrt{g/l} (spring not stretched).
  2. Out-of-phase: ω2=g/l+2k/m\omega_2 = \sqrt{g/l + 2k/m} (spring acts as additional restoring force).

Worked example

Driven damped harmonic oscillator.
A mass m=0.5 kgm = 0.5\ \text{kg} on a spring with k=50 N/mk = 50\ \text{N/m} is damped with b=1.0 kg/sb = 1.0\ \text{kg/s} and driven by F(t)=2cos(10t) NF(t) = 2\cos(10t)\ \text{N}. The natural frequency is ω0=k/m=10 rad/s\omega_0 = \sqrt{k/m} = 10\ \text{rad/s}, the damping coefficient is γ=b/(2m)=1.0 s1\gamma = b/(2m) = 1.0\ \text{s}^{-1}, and the driving frequency is ω=10 rad/s\omega = 10\ \text{rad/s}. The steady-state amplitude is A=F0/m/(ω02ω2)2+(2γω)2=4/0+400=4/20=0.20 mA = F_0/m / \sqrt{(\omega_0^2-\omega^2)^2 + (2\gamma\omega)^2} = 4 / \sqrt{0 + 400} = 4/20 = 0.20\ \text{m}. Since ω=ω0\omega = \omega_0, the system is at resonance and the phase lag is δ=π/2\delta = \pi/2. The quality factor is Q=ω0/(2γ)=5Q = \omega_0/(2\gamma) = 5.

Problems with Solutions

Problem 1. Two identical masses mm are connected by springs of constant kk to each other and to fixed walls. Find the normal-mode frequencies.

Solution. Let displacements be x1,x2x_1, x_2. The equations are mx¨1=kx1+k(x2x1)=2kx1+kx2m\ddot x_1 = -kx_1 + k(x_2-x_1) = -2kx_1 + kx_2 and mx¨2=k(x2x1)kx2=kx12kx2m\ddot x_2 = -k(x_2-x_1) - kx_2 = kx_1 - 2kx_2. Assuming xi=Aicosωtx_i = A_i\cos\omega t, the matrix equation is (2kmω2kk2kmω2)(A1A2)=0\begin{pmatrix} 2k-m\omega^2 & -k \\ -k & 2k-m\omega^2 \end{pmatrix}\begin{pmatrix} A_1 \\ A_2 \end{pmatrix} = 0. The secular equation gives (2kmω2)2k2=0(2k-m\omega^2)^2 - k^2 = 0, so 2kmω2=±k2k-m\omega^2 = \pm k. Thus ω12=k/m\omega_1^2 = k/m (symmetric mode: A1=A2A_1=A_2) and ω22=3k/m\omega_2^2 = 3k/m (antisymmetric mode: A1=A2A_1=-A_2).

Problem 2. A damped oscillator has ω0=5 rad/s\omega_0 = 5\ \text{rad/s} and γ=3 s1\gamma = 3\ \text{s}^{-1}. Is it underdamped, critically damped, or overdamped? Write the general solution.

Solution. Compare γ\gamma to ω0\omega_0: γ=3<5=ω0\gamma = 3 < 5 = \omega_0, so the system is underdamped. The damped frequency is ω=ω02γ2=259=4 rad/s\omega' = \sqrt{\omega_0^2 - \gamma^2} = \sqrt{25-9} = 4\ \text{rad/s}. The general solution is x(t)=Ae3tcos(4t+ϕ)x(t) = Ae^{-3t}\cos(4t + \phi), where AA and ϕ\phi are determined by initial conditions.

Problem 3. An undamped oscillator with m=1 kgm=1\ \text{kg}, k=100 N/mk=100\ \text{N/m} is driven by F(t)=5cos(9t) NF(t) = 5\cos(9t)\ \text{N}. Find the steady-state amplitude and the average power supplied by the driving force.

Solution. Here ω0=10 rad/s\omega_0 = 10\ \text{rad/s} and ω=9 rad/s\omega = 9\ \text{rad/s}. The amplitude is A=F0/m/ω02ω2=5/10081=5/190.263 mA = F_0/m / |\omega_0^2 - \omega^2| = 5/|100-81| = 5/19 \approx 0.263\ \text{m}. In steady state, x(t)=Acos(9t)x(t) = A\cos(9t) and v(t)=9Asin(9t)v(t) = -9A\sin(9t). The instantaneous power is P=Fv=45Acos(9t)sin(9t)P = Fv = -45A\cos(9t)\sin(9t), which averages to zero over a cycle because there is no damping and therefore no net energy dissipation.

Section summary. Oscillation theory linearizes motion near equilibrium, reducing complex coupled dynamics to independent normal modes via an eigenvalue problem.

The Classical Mechanics of the Special Theory of Relativity

Core ideas

Special Relativity modifies Newtonian mechanics for high speeds (vcv \sim c), based on the constancy of the speed of light cc in all inertial frames.

Lorentz Transformations and Four-Vectors. The transformation between two inertial frames moving at relative velocity vv along xx is:

x=γ(xvt),t=γ(tvx/c2),γ=11v2/c2.x' = \gamma(x - vt), \quad t' = \gamma(t - vx/c^2), \quad \gamma = \frac{1}{\sqrt{1 - v^2/c^2}}.

Events are points in Minkowski spacetime described by four-vectors xμ=(ct,r)x^\mu = (ct, \bm r). The invariant interval is ds2=c2dt2dr2ds^2 = c^2dt^2 - d\bm r^2.

Relativistic Dynamics. The proper time τ\tau is the time measured in the particle’s rest frame: dτ=dt/γd\tau = dt/\gamma.

  • Four-velocity: uμ=dxμdτ=γ(c,v)u^\mu = \frac{dx^\mu}{d\tau} = \gamma(c, \bm v).
  • Four-momentum: pμ=muμ=(E/c,p)p^\mu = mu^\mu = (E/c, \bm p), where p=γmv\bm p = \gamma m \bm v and E=γmc2E = \gamma mc^2.
  • Energy-Momentum Relation: The norm of the four-momentum is invariant: pμpμ=(E/c)2p2=m2c2p^\mu p_\mu = (E/c)^2 - p^2 = m^2c^2, leading to E2=p2c2+m2c4E^2 = p^2c^2 + m^2c^4.

Relativistic Lagrangian. The action for a free particle is proportional to its proper time (the “longest” path in Minkowski space):

S=mcds=(mc21v2/c2)dt.S = -mc \int ds = \int (-mc^2\sqrt{1 - v^2/c^2}) dt.

The free particle Lagrangian is L=mc21v2/c2L = -mc^2\sqrt{1 - v^2/c^2}. For a particle in an electromagnetic field:

L=mc21v2/c2qϕ+qAv.L = -mc^2\sqrt{1 - v^2/c^2} - q\phi + q\bm A \cdot \bm v.

Mathematical spine

pμ=(E/c,p)(Four-momentum)E2=p2c2+m2c4(Energy-momentum relation)f=dpdt=ddt(γmv)(Relativistic force/Minkowski force)\begin{aligned} p^\mu &= (E/c, \bm p) & \text{(Four-momentum)} \\ E^2 &= p^2c^2 + m^2c^4 & \text{(Energy-momentum relation)} \\ \bm f &= \frac{d\bm p}{dt} = \frac{d}{dt}(\gamma m \bm v) & \text{(Relativistic force/Minkowski force)} \end{aligned}

Example: Relativistic Doppler Effect. The frequency shift for a source moving away is ν=ν01v/c1+v/c\nu = \nu_0 \sqrt{\frac{1-v/c}{1+v/c}}. Unlike the classical case, there is also a transverse Doppler effect (ν=ν0/γ\nu = \nu_0/\gamma) due to time dilation.

Worked example

Muon decay and time dilation.
Cosmic-ray muons are created at an altitude of about h=10 kmh = 10\ \text{km} with speed v=0.995cv = 0.995c (γ10\gamma \approx 10). In the muon’s rest frame, its mean lifetime is τ0=2.2 μs\tau_0 = 2.2\ \mu\text{s}. In Earth’s frame, the dilated lifetime is τ=γτ022 μs\tau = \gamma\tau_0 \approx 22\ \mu\text{s}, during which the muon travels d=vτ0.995c×22 μs6.6 kmd = v\tau \approx 0.995c \times 22\ \mu\text{s} \approx 6.6\ \text{km}. However, muons reach the ground! This is resolved by length contraction: in the muon frame, the atmosphere is contracted to h=h/γ1 kmh' = h/\gamma \approx 1\ \text{km}, which takes t=h/v3.4 μst' = h'/v \approx 3.4\ \mu\text{s} to traverse---less than the mean lifetime. Both frames consistently predict that many muons survive to sea level.

Problems with Solutions

Problem 1. An electron is accelerated to a kinetic energy of 1.0 MeV1.0\ \text{MeV}. Given mec2=0.511 MeVm_e c^2 = 0.511\ \text{MeV}, find its speed and momentum.

Solution. Total energy E=K+mc2=1.511 MeVE = K + mc^2 = 1.511\ \text{MeV}. Since E=γmc2E = \gamma mc^2, we have γ=1.511/0.5112.957\gamma = 1.511/0.511 \approx 2.957. Then v=c11/γ2c10.1140.941cv = c\sqrt{1-1/\gamma^2} \approx c\sqrt{1-0.114} \approx 0.941c. Momentum is p=E2(mc2)2/c=1.51120.5112/c2.2830.261/c1.423 MeV/cp = \sqrt{E^2 - (mc^2)^2}/c = \sqrt{1.511^2 - 0.511^2}/c \approx \sqrt{2.283 - 0.261}/c \approx 1.423\ \text{MeV}/c.

Problem 2. A spaceship travels to a star 4.3 light-years4.3\ \text{light-years} away at v=0.8cv = 0.8c. How long does the trip take according to Earth clocks and according to the ship’s clock?

Solution. Earth time: t=d/v=4.3/0.8=5.375 yearst = d/v = 4.3/0.8 = 5.375\ \text{years}. Ship time (proper time): τ=t/γ=t10.82=5.375×0.6=3.225 years\tau = t/\gamma = t\sqrt{1-0.8^2} = 5.375 \times 0.6 = 3.225\ \text{years}. Alternatively, in the ship frame the distance is contracted to d=d/γ=4.3×0.6=2.58 lyd' = d/\gamma = 4.3 \times 0.6 = 2.58\ \text{ly}, so τ=d/v=2.58/0.8=3.225 years\tau = d'/v = 2.58/0.8 = 3.225\ \text{years}.

Problem 3. Show that the relativistic kinetic energy K=(γ1)mc2K = (\gamma - 1)mc^2 reduces to the classical expression 12mv2\frac{1}{2}mv^2 when vcv \ll c.

Solution. For small xx, (1x)1/21+x/2+3x2/8+(1-x)^{-1/2} \approx 1 + x/2 + 3x^2/8 + \dots. With x=v2/c2x = v^2/c^2, γ1+12v2/c2+38v4/c4+\gamma \approx 1 + \frac{1}{2}v^2/c^2 + \frac{3}{8}v^4/c^4 + \dots. Thus K=mc2(γ1)mc2(12v2/c2+38v4/c4)=12mv2+38mv4/c2+K = mc^2(\gamma - 1) \approx mc^2(\frac{1}{2}v^2/c^2 + \frac{3}{8}v^4/c^4) = \frac{1}{2}mv^2 + \frac{3}{8}mv^4/c^2 + \dots. The first term is the classical kinetic energy; the rest are relativistic corrections.

Section summary. Relativistic mechanics replaces absolute time with proper time and Newtonian momentum with four-momentum, unified by the invariant mass-shell condition.

The Hamilton Equations of Motion

Core ideas

Hamiltonian mechanics is a reformulation of classical mechanics that emphasizes the symmetry between coordinates qiq_i and momenta pip_i. It describes motion as a flow in phase space.

The Legendre Transformation. The Hamiltonian H(q,p,t)H(q, p, t) is obtained from the Lagrangian L(q,q˙,t)L(q, \dot q, t) via a Legendre transformation:

H(q,p,t)=ipiq˙iL(q,q˙,t),pi=Lq˙i.H(q, p, t) = \sum_i p_i \dot q_i - L(q, \dot q, t), \quad p_i = \frac{\partial L}{\partial \dot q_i}.

If L=TVL = T - V and TT is a homogeneous quadratic function of q˙\dot q, then H=T+V=EH = T + V = E (the total energy).

Hamilton’s Equations. Taking the differential of HH and using Euler—Lagrange equations leads to Hamilton’s canonical equations:

q˙i=Hpi,p˙i=Hqi.\dot q_i = \frac{\partial H}{\partial p_i}, \quad \dot p_i = -\frac{\partial H}{\partial q_i}.

These are 2n2n first-order differential equations, unlike the nn second-order Euler—Lagrange equations.

Poisson Brackets. For any two functions f(q,p,t)f(q, p, t) and g(q,p,t)g(q, p, t) on phase space, the Poisson bracket is:

{f,g}=i(fqigpifpigqi).\{f, g\} = \sum_i \left( \frac{\partial f}{\partial q_i} \frac{\partial g}{\partial p_i} - \frac{\partial f}{\partial p_i} \frac{\partial g}{\partial q_i} \right).

The time evolution of any function ff is given by:

dfdt={f,H}+ft.\frac{df}{dt} = \{f, H\} + \frac{\partial f}{\partial t}.

A quantity is conserved if {f,H}=0\{f, H\} = 0 (and it has no explicit time dependence).

Phase Space and Liouville’s Theorem. A state is a point in the 2n2n-dimensional phase space. Liouville’s Theorem states that the density of points in phase space (or the volume of a region of points) is constant along the trajectories of the system.

Mathematical spine

H(q,p,t)=piq˙iL(Hamiltonian)q˙i=Hpi,p˙i=Hqi(Hamilton’s Equations)f˙={f,H}+ft(Evolution equation)\begin{aligned} H(q, p, t) &= \sum p_i \dot q_i - L & \text{(Hamiltonian)} \\ \dot q_i = \frac{\partial H}{\partial p_i}, \quad &\dot p_i = -\frac{\partial H}{\partial q_i} & \text{(Hamilton's Equations)} \\ \dot f = \{f, H\} &+ \frac{\partial f}{\partial t} & \text{(Evolution equation)} \end{aligned}

Example: Harmonic Oscillator in Phase Space. H=p22m+12mω2q2H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2. Hamilton’s equations: q˙=p/m\dot q = p/m, p˙=mω2q\dot p = -m\omega^2 q. The trajectories in phase space are ellipses.

Worked example

Particle in a uniform gravitational field.
A mass m=2 kgm = 2\ \text{kg} falls vertically in a uniform gravitational field g=9.8 m/s2g = 9.8\ \text{m/s}^2. Using coordinate q=yq = y (height) and momentum p=my˙p = m\dot y, the Hamiltonian is H=p2/(2m)+mgyH = p^2/(2m) + mgy. Hamilton’s equations give y˙=H/p=p/m\dot y = \partial H/\partial p = p/m and p˙=H/y=mg\dot p = -\partial H/\partial y = -mg. Thus y¨=p˙/m=g\ddot y = \dot p/m = -g, as expected from Newton’s second law. If released from rest at y0=10 my_0 = 10\ \text{m}, then p(0)=0p(0)=0 and y(t)=y012gt2y(t) = y_0 - \frac{1}{2}gt^2, hitting the ground at t=2y0/g1.43 st = \sqrt{2y_0/g} \approx 1.43\ \text{s} with momentum p=mgt28.0 kgm/sp = -mgt \approx -28.0\ \text{kg}\cdot\text{m/s}.

Problems with Solutions

Problem 1. A 1D harmonic oscillator has Hamiltonian H=p2/(2m)+12mω2q2H = p^2/(2m) + \frac{1}{2}m\omega^2 q^2. Write Hamilton’s equations and show that they reproduce the familiar second-order equation of motion.

Solution. q˙=H/p=p/m\dot q = \partial H/\partial p = p/m and p˙=H/q=mω2q\dot p = -\partial H/\partial q = -m\omega^2 q. Differentiating the first and substituting the second gives q¨=p˙/m=ω2q\ddot q = \dot p/m = -\omega^2 q, which is the standard harmonic oscillator equation.

Problem 2. For H=p2/(2m)αqH = p^2/(2m) - \alpha q (a particle in a constant force field), find the time evolution of q(t)q(t) and p(t)p(t) given q(0)=0q(0)=0 and p(0)=p0p(0)=p_0.

Solution. Hamilton’s equations: q˙=p/m\dot q = p/m, p˙=α\dot p = \alpha. Integrating, p(t)=p0+αtp(t) = p_0 + \alpha t and q(t)=0t(p0+αt)/mdt=p0t/m+αt2/(2m)q(t) = \int_0^t (p_0 + \alpha t')/m\,dt' = p_0 t/m + \alpha t^2/(2m). This is constant-acceleration motion with a=α/ma = \alpha/m.

Problem 3. Compute the Poisson bracket {Lz,x}\{L_z, x\} where Lz=xpyypxL_z = xp_y - yp_x.

Solution. Using {f,g}=i(fqigpifpigqi)\{f,g\} = \sum_i(\frac{\partial f}{\partial q_i}\frac{\partial g}{\partial p_i} - \frac{\partial f}{\partial p_i}\frac{\partial g}{\partial q_i}), only the terms involving yy and pyp_y survive because xx depends only on xx and pxp_x. We get {Lz,x}=(xpyypx)yxpy(xpyypx)pyxy+=(px)(0)(x)(0)+\{L_z, x\} = \frac{\partial(xp_y-yp_x)}{\partial y}\frac{\partial x}{\partial p_y} - \frac{\partial(xp_y-yp_x)}{\partial p_y}\frac{\partial x}{\partial y} + \dots = (-p_x)(0) - (x)(0) + \dots. Actually computing all terms: {xpyypx,x}={xpy,x}{ypx,x}\{xp_y-yp_x, x\} = \{xp_y, x\} - \{yp_x, x\}. The first bracket is x{py,x}+py{x,x}=0x\{p_y,x\} + p_y\{x,x\} = 0. The second is y{px,x}+px{y,x}=y(1)+0=yy\{p_x,x\} + p_x\{y,x\} = y(-1) + 0 = -y. Thus {Lz,x}=0(y)=y\{L_z,x\} = 0 - (-y) = y. (Equivalently, from the standard relation {Li,xj}=ϵijkxk\{L_i, x_j\} = \epsilon_{ijk}x_k, we have {L3,x1}=ϵ312x2=y\{L_3, x_1\} = \epsilon_{312}x_2 = y.)

Section summary. Hamiltonian mechanics treats coordinates and momenta as independent variables in phase space, providing a powerful framework for conservation laws and a bridge to quantum mechanics.

Canonical Transformations

Core ideas

A transformation from old coordinates (q,p)(q, p) to new coordinates (Q,P)(Q, P) is canonical if it preserves the form of Hamilton’s equations. This is equivalent to preserving the Poisson bracket relations:

{Qi,Qj}q,p=0,{Pi,Pj}q,p=0,{Qi,Pj}q,p=δij.\{Q_i, Q_j\}_{q,p} = 0, \quad \{P_i, P_j\}_{q,p} = 0, \quad \{Q_i, P_j\}_{q,p} = \delta_{ij}.

The Symplectic Condition. Defining η=(q1,,qn,p1,,pn)T\bm \eta = (q_1, \dots, q_n, p_1, \dots, p_n)^T, Hamilton’s equations can be written as η˙=JH\dot{\bm\eta} = \bm J \nabla H, where J=(0110)\bm J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} is the symplectic matrix. A transformation ηξ\bm\eta \to \bm\xi is canonical if the Jacobian M=ξ/η\bm M = \partial \bm\xi / \partial \bm\eta satisfies the symplectic condition:

MJMT=J.\bm M \bm J \bm M^T = \bm J.

Generating Functions. Canonical transformations are often derived from a generating function. There are four basic types depending on the choice of independent variables:

  • F1(q,Q,t)F_1(q, Q, t): pi=F1qi,Pi=F1Qip_i = \frac{\partial F_1}{\partial q_i}, \quad P_i = -\frac{\partial F_1}{\partial Q_i}.
  • F2(q,P,t)F_2(q, P, t): pi=F2qi,Qi=F2Pip_i = \frac{\partial F_2}{\partial q_i}, \quad Q_i = \frac{\partial F_2}{\partial P_i}.
  • F3(p,Q,t)F_3(p, Q, t): qi=F3pi,Pi=F3Qiq_i = -\frac{\partial F_3}{\partial p_i}, \quad P_i = -\frac{\partial F_3}{\partial Q_i}.
  • F4(p,P,t)F_4(p, P, t): qi=F4pi,Qi=F4Piq_i = -\frac{\partial F_4}{\partial p_i}, \quad Q_i = \frac{\partial F_4}{\partial P_i}.

The new Hamiltonian KK is related to the old HH by K=H+FtK = H + \frac{\partial F}{\partial t}.

Goal of Canonical Transformations. The primary use is to transform to a system where all coordinates are cyclic (K/Qi=0\partial K / \partial Q_i = 0). In such a system, the new momenta PiP_i are constants of motion, and the problem is trivially solved.

Mathematical spine

{Qi,Pj}=δij(Preservation of Poisson Brackets)MJMT=J(Symplectic Condition)K=H+Ft(Transformation of the Hamiltonian)\begin{aligned} \{Q_i, P_j\} &= \delta_{ij} & \text{(Preservation of Poisson Brackets)} \\ \bm M \bm J \bm M^T &= \bm J & \text{(Symplectic Condition)} \\ K &= H + \frac{\partial F}{\partial t} & \text{(Transformation of the Hamiltonian)} \end{aligned}

Example: Identity Transformation. The generating function F2(q,P)=qiPiF_2(q, P) = \sum q_i P_i gives pi=Pip_i = P_i and Qi=qiQ_i = q_i, which is the identity transformation. Changing this to F2=fi(q)PiF_2 = \sum f_i(q) P_i generates a point transformation.

Worked example

Transformation to center-of-mass and relative coordinates.
For two particles in 1D with masses m1,m2m_1, m_2, coordinates q1,q2q_1, q_2 and momenta p1,p2p_1, p_2, define Q1=(m1q1+m2q2)/(m1+m2)Q_1 = (m_1q_1 + m_2q_2)/(m_1+m_2) (CM position), Q2=q1q2Q_2 = q_1 - q_2 (relative coordinate), P1=p1+p2P_1 = p_1 + p_2 (total momentum), and P2=(m2p1m1p2)/(m1+m2)P_2 = (m_2p_1 - m_1p_2)/(m_1+m_2) (relative momentum). The generating function is F2=P1(m1q1+m2q2)/(m1+m2)+P2(q1q2)F_2 = P_1(m_1q_1+m_2q_2)/(m_1+m_2) + P_2(q_1-q_2). One verifies {Qi,Pj}=δij\{Q_i,P_j\} = \delta_{ij} and {Qi,Qj}={Pi,Pj}=0\{Q_i,Q_j\} = \{P_i,P_j\} = 0, confirming the transformation is canonical.

Problems with Solutions

Problem 1. Show that the transformation Q=ln(1+qcosp)Q = \ln(1+\sqrt{q}\cos p), P=2(1+qcosp)qsinpP = 2(1+\sqrt{q}\cos p)\sqrt{q}\sin p is canonical by evaluating the Poisson bracket {Q,P}q,p\{Q,P\}_{q,p}.

Solution. We need {Q,P}=QqPpQpPq\{Q,P\} = \frac{\partial Q}{\partial q}\frac{\partial P}{\partial p} - \frac{\partial Q}{\partial p}\frac{\partial P}{\partial q}. Let u=1+qcospu = 1+\sqrt{q}\cos p. Then Q=lnuQ = \ln u, P=2uqsinpP = 2u\sqrt{q}\sin p. Computing: Q/q=cosp/(2qu)\partial Q/\partial q = \cos p/(2\sqrt{q}u), Q/p=qsinp/u\partial Q/\partial p = -\sqrt{q}\sin p/u, P/q=sinpcosp/q+usinp/q=sinp(1+2qcosp)/q\partial P/\partial q = \sin p\cos p/\sqrt{q} + u\sin p/\sqrt{q} = \sin p(1+2\sqrt{q}\cos p)/\sqrt{q}, P/p=2uqcosp2qsin2p\partial P/\partial p = 2u\sqrt{q}\cos p - 2q\sin^2 p. After simplification, {Q,P}=1\{Q,P\} = 1, confirming the transformation is canonical.

Problem 2. Find a type-2 generating function F2(q,P)F_2(q,P) that produces the point transformation Q=q2Q = q^2 for a 1D system.

Solution. For a point transformation, p=F2/qp = \partial F_2/\partial q and Q=F2/P=q2Q = \partial F_2/\partial P = q^2. Integrating the second equation, F2(q,P)=q2P+g(q)F_2(q,P) = q^2 P + g(q). Then p=F2/q=2qP+g(q)p = \partial F_2/\partial q = 2qP + g'(q). The simplest choice is g=0g=0, giving F2=q2PF_2 = q^2 P and p=2qPp = 2qP, or P=p/(2q)P = p/(2q).

Problem 3. Given the Hamiltonian H=p22+q22H = \frac{p^2}{2} + \frac{q^2}{2}, show that the transformation Q=pQ = p, P=qP = -q is canonical and find the new Hamiltonian.

Solution. The Poisson brackets are {Q,P}={p,q}={p,q}=1\{Q,P\} = \{p,-q\} = -\{p,q\} = 1, {Q,Q}={p,p}=0\{Q,Q\} = \{p,p\} = 0, {P,P}={q,q}=0\{P,P\} = \{-q,-q\} = 0. Thus the transformation is canonical. Since the transformation is time-independent, K=HK = H expressed in new variables: K=P22+Q22K = \frac{P^2}{2} + \frac{Q^2}{2}. This is just a rotation by 9090^\circ in phase space.

Section summary. Canonical transformations preserve the symplectic structure of phase space, allowing for coordinate changes that simplify the Hamiltonian and reveal constants of motion.

Hamilton—Jacobi Theory and Action-Angle Coordinates

Core ideas

Hamilton—Jacobi (HJ) theory is the ultimate canonical transformation: it seeks a transformation to a frame where the new Hamiltonian is zero, meaning all new coordinates and momenta are constants of motion.

The Hamilton—Jacobi Equation. Using an F2(q,P,t)F_2(q, P, t) generating function, we call the result Hamilton’s Principal Function S(q,P,t)S(q, P, t). If the new Hamiltonian K=H+S/t=0K = H + \partial S / \partial t = 0, then Pi=αiP_i = \alpha_i (constants). The HJ equation is a first-order, non-linear partial differential equation for SS:

H(q1,,qn,Sq1,,Sqn,t)+St=0.H\left(q_1, \dots, q_n, \frac{\partial S}{\partial q_1}, \dots, \frac{\partial S}{\partial q_n}, t\right) + \frac{\partial S}{\partial t} = 0.

For time-independent Hamiltonians, we use Hamilton’s Characteristic Function WW, where S(q,α,t)=W(q,α)EtS(q, \alpha, t) = W(q, \alpha) - Et. Then:

H(q,Wq)=E.H\left(q, \frac{\partial W}{\partial q}\right) = E.

Separation of Variables. The HJ equation is often solved by separation of variables: W(q1,,qn)=Wi(qi)W(q_1, \dots, q_n) = \sum W_i(q_i). This reduces the PDE to nn independent ODEs, which can be solved by integration (quadratures).

Action—Angle Variables. For periodic systems, action—angle variables (J,θ)(J, \theta) are the most natural coordinates.

  • Action variable Ji=12πpidqiJ_i = \frac{1}{2\pi} \oint p_i dq_i, where the integral is over one cycle of the motion.
  • Angle variable θi\theta_i is the canonical conjugate to JiJ_i.
  • The Hamiltonian depends only on actions: H=H(J1,,Jn)H = H(J_1, \dots, J_n).
  • The equations of motion are trivial: J˙i=0\dot J_i = 0, θ˙i=HJi=νi\dot \theta_i = \frac{\partial H}{\partial J_i} = \nu_i (the constant frequency).

Mathematical spine

H(q,S/q,t)+S/t=0(Hamilton–Jacobi Equation)Ji=12πpidqi(Action Variable)θi(t)=νit+βi(Angle Variable evolution)\begin{aligned} H(q, \partial S / \partial q, t) + \partial S / \partial t &= 0 & \text{(Hamilton--Jacobi Equation)} \\ J_i &= \frac{1}{2\pi} \oint p_i dq_i & \text{(Action Variable)} \\ \theta_i(t) &= \nu_i t + \beta_i & \text{(Angle Variable evolution)} \end{aligned}

Example: Harmonic Oscillator. H=p22m+12mω2q2=EH = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2 = E. The action is J=E/ωJ = E/\omega. The frequency is ν=H/J=ω\nu = \partial H / \partial J = \omega. The angle θ\theta is the phase of the oscillation.

Worked example

Particle in a 1D box via Hamilton—Jacobi.
A particle of mass mm moves freely on the interval [0,L][0,L] with infinite walls. The Hamiltonian is H=p2/(2m)=EH = p^2/(2m) = E. The HJ equation is (dW/dx)2/(2m)=E(dW/dx)^2/(2m) = E, so dW/dx=±2mEdW/dx = \pm\sqrt{2mE}. Integrating, W(x)=±x2mEW(x) = \pm x\sqrt{2mE}. The action variable is J=12πpdx=12π(2L2mE)=Lπ2mEJ = \frac{1}{2\pi}\oint p\,dx = \frac{1}{2\pi}(2L\sqrt{2mE}) = \frac{L}{\pi}\sqrt{2mE}. Inverting gives E=π2J22mL2E = \frac{\pi^2 J^2}{2mL^2}. The frequency is ν=E/J=π2JmL2=πL2Em=v2L\nu = \partial E/\partial J = \frac{\pi^2 J}{mL^2} = \frac{\pi}{L}\sqrt{\frac{2E}{m}} = \frac{v}{2L}, which is the round-trip frequency as expected.

Problems with Solutions

Problem 1. For a 1D harmonic oscillator with H=p2/(2m)+12mω2q2H = p^2/(2m) + \frac{1}{2}m\omega^2 q^2, solve the time-independent HJ equation and find Hamilton’s characteristic function W(q,E)W(q,E).

Solution. The HJ equation is 12m(W/q)2+12mω2q2=E\frac{1}{2m}(\partial W/\partial q)^2 + \frac{1}{2}m\omega^2 q^2 = E. Solving for W/q=2mEm2ω2q2\partial W/\partial q = \sqrt{2mE - m^2\omega^2 q^2}, we integrate to get W(q,E)=q2mEm2ω2q2dqW(q,E) = \int^q \sqrt{2mE - m^2\omega^2 q'^2}\,dq'. This evaluates to W=mω2q2Emω2q2+Earcsin(qmω22E)W = \frac{m\omega}{2}q\sqrt{\frac{2E}{m\omega^2}-q^2} + E\arcsin\left(q\sqrt{\frac{m\omega^2}{2E}}\right).

Problem 2. A particle moves in a central potential V(r)=k/rV(r) = -k/r. Using action—angle variables, show that the radial and azimuthal frequencies are equal, confirming that the orbit is closed.

Solution. The actions are Jϕ=pϕdϕ=2πJ_\phi = \oint p_\phi\,d\phi = 2\pi\ell and Jr=prdrJ_r = \oint p_r\,dr. For the Kepler problem, evaluating the radial action gives Jr=Jϕ+πkμ/(2E)J_r = -J_\phi + \pi k\sqrt{\mu/(-2E)}. The total energy depends on Jr+JϕJ_r + J_\phi as E=μk22(Jr+Jϕ)2E = -\frac{\mu k^2}{2(J_r+J_\phi)^2}. The frequencies are νr=E/Jr\nu_r = \partial E/\partial J_r and νϕ=E/Jϕ\nu_\phi = \partial E/\partial J_\phi. Since EE depends only on the sum Jr+JϕJ_r+J_\phi, νr=νϕ\nu_r = \nu_\phi. Equal frequencies mean the radial and angular motions complete a cycle in the same time, producing a closed ellipse.

Problem 3. Find the action variable JJ for a simple pendulum of length ll and mass mm in the small-angle approximation.

Solution. For small angles, H=pθ22ml2+12mglθ2=EH = \frac{p_\theta^2}{2ml^2} + \frac{1}{2}mgl\theta^2 = E. The motion is harmonic with ω=g/l\omega = \sqrt{g/l} and E=12mglθ02E = \frac{1}{2}mgl\theta_0^2. The phase-space trajectory is an ellipse with semi-axes pmax=ml2ωθ0=2mElp_{\rm max} = ml^2\omega\theta_0 = \sqrt{2mE}\,l and θmax=θ0=2E/(mgl)\theta_{\rm max} = \theta_0 = \sqrt{2E/(mgl)}. The area is πpmaxθmax=π2mEl2E/(mgl)=2πEl/g=2πE/ω\pi p_{\rm max}\theta_{\rm max} = \pi\sqrt{2mE}\,l \cdot \sqrt{2E/(mgl)} = 2\pi E\sqrt{l/g} = 2\pi E/\omega. Thus J=12π×(2πE/ω)=E/ωJ = \frac{1}{2\pi}\times(2\pi E/\omega) = E/\omega.

Section summary. Hamilton—Jacobi theory reduces dynamics to a single PDE, while action-angle variables provide the most efficient description for integrable periodic systems.

Classical Chaos

Core ideas

Chaos refers to the complex, unpredictable behavior that can arise in deterministic nonlinear dynamical systems. It is not due to noise or randomness but is an inherent property of the system’s geometry.

Sensitivity to Initial Conditions. The hallmark of chaos is that two trajectories starting very close together in phase space will diverge exponentially:

δx(t)δx(0)eλt,|\delta x(t)| \approx |\delta x(0)| e^{\lambda t},

where λ\lambda is the Lyapunov exponent. A positive λ\lambda implies that small uncertainties in measurement grow rapidly, making long-term prediction impossible.

Phase Space and Attractors.

  • Phase Portrait: A map of all possible states in phase space. Regular systems have trajectories on circles or tori.
  • Strange Attractor: For dissipative chaotic systems, trajectories settle onto a complex, fractal-like structure in phase space.
  • Poincaré Section: A way to simplify the analysis by taking a “snapshot” of the system’s state each time it crosses a chosen surface in phase space. This reduces the continuous dynamics to a discrete map.

The KAM Theorem. The Kolmogorov—Arnold—Moser (KAM) theorem addresses what happens to an integrable system when a small nonlinear perturbation is added. It states that many of the original “invariant tori” (regular orbits) survive the perturbation, but they are increasingly destroyed as the perturbation strength increases, leading to “stochastic” or chaotic regions.

Routes to Chaos. Systems often become chaotic through a sequence of bifurcations as a parameter is varied. A common route is period-doubling, where the period of the oscillation doubles repeatedly until it becomes infinite (chaos).

Mathematical spine

λ=limt1tlnδx(t)δx(0)(Lyapunov Exponent)xn+1=rxn(1xn)(Logistic Map - simplest chaos example)\begin{aligned} \lambda &= \lim_{t\to\infty} \frac{1}{t} \ln \frac{|\delta x(t)|}{|\delta x(0)|} & \text{(Lyapunov Exponent)} \\ x_{n+1} &= r x_n (1 - x_n) & \text{(Logistic Map - simplest chaos example)} \end{aligned}

Concrete example: period-doubling cascade in the logistic map. The discrete map xn+1=rxn(1xn)x_{n+1} = r x_n(1 - x_n) on x[0,1]x\in[0,1] shows the period-doubling route to chaos as the parameter rr is increased:

  • r<1r < 1: the only stable fixed point is x=0x^*=0 (extinction).
  • 1<r<31 < r < 3: a single non-trivial fixed point x=11/rx^* = 1 - 1/r is stable.
  • r1=3r_1 = 3: first period-doubling bifurcation; a stable 2-cycle appears.
  • r23.4495r_2 \approx 3.4495: bifurcation to a stable 4-cycle.
  • r33.5441r_3 \approx 3.5441: 8-cycle.
  • r43.5644r_4 \approx 3.5644: 16-cycle, and so on.
  • r3.56995r_\infty \approx 3.56995: accumulation point --- onset of chaos.

The successive intervals shrink geometrically, and their ratios approach the universal first Feigenbaum constant

δ  =  limnrnrn1rn+1rn  =  4.669201609\delta \;=\; \lim_{n\to\infty} \frac{r_n - r_{n-1}}{r_{n+1} - r_n} \;=\; 4.669\,201\,609\ldots

The width of the bifurcating branches scales by a second universal constant α2.5029\alpha \approx 2.5029. Feigenbaum showed in 1978 that δ\delta and α\alpha are independent of the specific map: any smooth one-dimensional map with a single quadratic maximum --- a driven pendulum, a forced oscillator, even a dripping faucet --- displays the same numbers, providing the first experimentally verified case of universality in classical chaos.

Example: Driven Damped Pendulum. A simple pendulum with friction and a periodic driving force. For small drive, it oscillates regularly. As the drive increases, it can undergo period-doubling and eventually move chaotically, never repeating its path.

Worked example

Lyapunov exponent of the logistic map.
For the logistic map xn+1=rxn(1xn)x_{n+1} = r x_n(1-x_n) with r=4r=4, the Lyapunov exponent is λ=limN1Nn=0N1lnf(xn)\lambda = \lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}\ln|f'(x_n)| where f(x)=4(12x)f'(x) = 4(1-2x). Starting from x0=0.2x_0 = 0.2, iteration gives x1=0.64x_1 = 0.64, x2=0.9216x_2 = 0.9216, etc. Numerically averaging ln4(12xn)\ln|4(1-2x_n)| over N=104N=10^4 iterations yields λln20.693\lambda \approx \ln 2 \approx 0.693. Since λ>0\lambda > 0, the dynamics is chaotic: a tiny initial difference δ0\delta_0 grows as δnδ0enλ|\delta_n| \approx \delta_0 e^{n\lambda}, so after n=10n=10 iterations the uncertainty has grown by a factor of about e6.93103e^{6.93} \approx 10^3.

Problems with Solutions

Problem 1. Find the fixed points of the logistic map xn+1=rxn(1xn)x_{n+1} = r x_n(1-x_n) and determine their stability for 0<r<30 < r < 3.

Solution. Fixed points satisfy x=rx(1x)x^* = rx^*(1-x^*), giving x=0x^*=0 or x=11/rx^* = 1-1/r. Stability requires f(x)<1|f'(x^*)| < 1 where f(x)=r(12x)f'(x) = r(1-2x). At x=0x^*=0, f(0)=r|f'(0)| = |r|. For 0<r<10<r<1, this is stable; for r>1r>1, unstable. At x=11/rx^*=1-1/r, f(x)=r(12(11/r))=2r|f'(x^*)| = |r(1-2(1-1/r))| = |2-r|. This is stable when 2r<1|2-r|<1, i.e., 1<r<31<r<3.

Problem 2. A driven damped pendulum obeys θ¨+0.5θ˙+sinθ=1.5cos(0.7t)\ddot\theta + 0.5\dot\theta + \sin\theta = 1.5\cos(0.7t). Explain why the Poincar’e section is useful and describe what it would show for periodic versus chaotic motion.

Solution. The Poincar’e section records (θ,θ˙)(\theta, \dot\theta) once per drive period T=2π/0.7T = 2\pi/0.7. For periodic motion with period equal to the drive, the section shows a single fixed point. For period-doubling, it shows 2, 4, 8, … points. For chaotic motion, the points never repeat and fill a fractal structure (strange attractor), revealing the underlying order in the chaotic flow.

Problem 3. Estimate the Lyapunov exponent for a system where two nearby trajectories diverge by a factor of 10610^6 in time t=10 st = 10\ \text{s}.

Solution. Using δ(t)δ(0)eλt|\delta(t)| \approx |\delta(0)|e^{\lambda t}, we have 106=eλ(10)10^6 = e^{\lambda(10)}, so λ=ln(106)10=6ln10101.38 s1\lambda = \frac{\ln(10^6)}{10} = \frac{6\ln 10}{10} \approx 1.38\ \text{s}^{-1}. This positive value indicates chaotic dynamics.

Section summary. Chaos demonstrates that deterministic laws can lead to unpredictable behavior through exponential sensitivity to initial conditions, often visualized through strange attractors and Poincaré sections.

Canonical Perturbation Theory

Core ideas

Many realistic systems are “nearly integrable,” meaning they can be described as an integrable system plus a small perturbation parameterized by a dimensionless number ϵ1\epsilon \ll 1:

H(J,θ)=H0(J)+ϵH1(J,θ),H(J, \theta) = H_0(J) + \epsilon H_1(J, \theta),

where J=(J1,,Jn)J=(J_1,\ldots,J_n) and θ=(θ1,,θn)\theta=(\theta_1,\ldots,\theta_n) are the action—angle variables of the unperturbed problem.

Time-independent perturbation theory. The goal is to find a near-identity canonical transformation to new variables (Jˉ,θˉ)(\bar J, \bar \theta) such that the new Hamiltonian K(Jˉ)K(\bar J) depends only on Jˉ\bar J. One uses a type-2 generating function

F2(J,θˉ)=iJiθˉi+ϵW(J,θˉ),F_2(J,\bar\theta) = \sum_i J_i \bar\theta_i + \epsilon\, W(J,\bar\theta),

in which WW is the unknown “correction” to the identity. Expanding order by order in ϵ\epsilon and demanding that the angle-dependent pieces cancel, the first-order shift in the Hamiltonian is simply the angle-average of the perturbation,

K1(Jˉ)=H1(Jˉ,θ)θ=1(2π)n02πH1(Jˉ,θ)dnθ,K_1(\bar J) = \langle H_1(\bar J, \theta) \rangle_\theta = \frac{1}{(2\pi)^n} \int_0^{2\pi} H_1(\bar J, \theta)\, d^n\theta,

and the generating-function correction takes the Fourier form

W(Jˉ,θˉ)=n0ihn(Jˉ)nω(Jˉ)einθˉ,ω=H0/Jˉ,W(\bar J, \bar\theta) = \sum_{\bm n \neq 0} \frac{i\, h_{\bm n}(\bar J)}{\bm n \cdot \bm\omega(\bar J)}\, e^{i \bm n \cdot \bar\theta}, \qquad \bm\omega = \partial H_0/\partial \bar J,

where hnh_{\bm n} are the Fourier coefficients of H1H_1 and nZn{0}\bm n\in\mathbb{Z}^n\setminus\{0\}.

Secular terms and the Lindstedt—Poincar’e fix

Naive expansion in ϵ\epsilon tends to produce secular terms: terms that grow without bound as a power of tt and so spoil the validity of the series after a time of order 1/ϵ1/\epsilon. The standard cure is the Lindstedt—Poincar’e method, which absorbs the would-be secular contribution into a renormalization of the oscillation frequency.

Worked example: Duffing oscillator. Consider the weakly anharmonic equation

x¨+x+ϵx3=0,x(0)=A, x˙(0)=0.\ddot x + x + \epsilon\, x^3 = 0, \qquad x(0)=A,\ \dot x(0)=0.

A naive expansion x=x0+ϵx1+x = x_0 + \epsilon x_1 + \cdots gives x0=Acostx_0 = A\cos t and an inhomogeneous equation for x1x_1 whose driving term contains cost\cos t, producing the resonant secular response

x1(t)38A3tsint,x_1(t) \supset -\tfrac{3}{8} A^3\, t \sin t,

which diverges with time. To remove it, rescale time by τ=ωt\tau = \omega t with ω=1+ϵω1+ϵ2ω2+\omega = 1 + \epsilon\omega_1 + \epsilon^2\omega_2 + \cdots. The equation becomes

ω2x+x+ϵx3=0,\omega^2 x'' + x + \epsilon x^3 = 0,

with x=dx/dτx' = dx/d\tau. Choosing ω1\omega_1 to kill the resonant term gives the uniformly valid first-order approximation

ω=1+38ϵA2+O(ϵ2),x(t)Acos(ωt).\omega = 1 + \tfrac{3}{8}\epsilon A^2 + \mathcal{O}(\epsilon^2), \qquad x(t) \approx A\cos(\omega t).

The amplitude-dependent frequency shift is the hallmark of nonlinear oscillation.

Resonances and the pendulum normal form

Small denominators. The denominators nω\bm n\cdot\bm\omega in WW become arbitrarily small whenever the unperturbed frequencies are nearly commensurate, nω0\bm n\cdot\bm\omega \approx 0. The series then diverges --- the celebrated “problem of small divisors” in celestial mechanics, partially tamed by KAM theory for sufficiently irrational frequency ratios.

Worked resonant example. Consider two degrees of freedom near a p ⁣: ⁣qp\!:\!q resonance, pω1qω20p\omega_1 - q\omega_2 \approx 0, with perturbation

H=H0(J1,J2)+ϵhcos(pθ1qθ2).H = H_0(J_1,J_2) + \epsilon\, h\, \cos(p\theta_1 - q\theta_2).

Average over the fast (non-resonant) angles but retain the slow resonant angle ψ=pθ1qθ2\psi = p\theta_1 - q\theta_2. Introducing the canonical transformation generated by F2=ψP+θ2P2F_2 = \psi P + \theta_2 P_2 (so that J1=pPJ_1 = pP, J2=qP+P2J_2 = -qP + P_2) gives an effective one-degree-of-freedom Hamiltonian

Heff(P,ψ)=H0(pP,qP+P2)+ϵhcosψ.H_{\text{eff}}(P,\psi) = H_0(pP, -qP+P_2) + \epsilon h\cos\psi.

Expanding H0H_0 about the resonant action PP_* where pω1=qω2p\omega_1 = q\omega_2, one obtains the pendulum form

Heff12M(PP)2+ϵhcosψ,M2H0P2P.H_{\text{eff}} \approx \tfrac{1}{2} M (P - P_*)^2 + \epsilon h \cos\psi, \qquad M \equiv \frac{\partial^2 H_0}{\partial P^2}\bigg|_{P_*}.

Resonant motion thus librates like a pendulum with small-oscillation frequency Ω=Mϵh\Omega = \sqrt{|M|\,\epsilon h} and a separatrix of width ΔPϵh/M\Delta P \sim \sqrt{\epsilon h/|M|} --- the characteristic “resonance island.”

Adiabatic invariants

Concept. If a parameter λ(t)\lambda(t) of the Hamiltonian changes on a timescale much longer than the orbital period TT, i.e.\ λ˙/λ1/T\dot\lambda/\lambda \ll 1/T, then the action

J=12πpdqJ = \frac{1}{2\pi}\oint p\, dq

is conserved up to exponentially small corrections in the slowness parameter. JJ is therefore called an adiabatic invariant.

Worked example: harmonic oscillator with slowly varying frequency. For H=12p2+12ω(t)2q2H = \tfrac{1}{2}p^2 + \tfrac{1}{2}\omega(t)^2 q^2 the action of an instantaneous orbit at energy EE is the area of a phase-space ellipse with semi-axes 2E\sqrt{2E} and 2E/ω\sqrt{2E}/\omega,

J=Eω(t).J = \frac{E}{\omega(t)}.

Adiabatic invariance of JJ therefore implies

E(t)=ω(t)J=ω(t)E0ω0,E(t) = \omega(t)\, J = \omega(t)\, \frac{E_0}{\omega_0},

so the energy tracks the frequency: a pendulum whose string is slowly shortened gains energy in proportion to its raised frequency. Historically this is the result Einstein invoked at the 1911 Solvay conference and that Ehrenfest promoted to the quantization rule J=nJ = n\hbar of the old quantum theory.

Mathematical spine

H=H0(J)+ϵH1(J,θ)(perturbed Hamiltonian)K1=H1θ(first-order energy shift)ω=ω0+38ϵA2+(Lindstedt frequency, Duffing)Hres12MδP2+ϵhcosψ(pendulum near resonance)J=12πpdqconst(adiabatic invariant)\begin{aligned} H &= H_0(J) + \epsilon H_1(J,\theta) & \text{(perturbed Hamiltonian)} \\ K_1 &= \langle H_1 \rangle_\theta & \text{(first-order energy shift)} \\ \omega &= \omega_0 + \tfrac{3}{8}\epsilon A^2 + \cdots & \text{(Lindstedt frequency, Duffing)} \\ H_{\text{res}} &\approx \tfrac{1}{2} M\, \delta P^2 + \epsilon h\cos\psi & \text{(pendulum near resonance)} \\ J &= \tfrac{1}{2\pi}\oint p\,dq \approx \text{const} & \text{(adiabatic invariant)} \end{aligned}

Astrophysical aside: precession of Mercury’s perihelion. General relativity adds an effective 1/r31/r^3 correction to the Newtonian 1/r1/r potential. Treating it as ϵH1\epsilon H_1 and applying the angle-average prescription yields the celebrated 4343''/century perihelion advance.

Worked example

Anharmonic oscillator frequency shift.
Consider a particle of mass m=1 kgm=1\ \text{kg} in a potential V(x)=12x2+ϵx4V(x) = \frac{1}{2}x^2 + \epsilon x^4 (in units where m=ω0=1m=\omega_0=1) with ϵ=0.01\epsilon = 0.01 and amplitude A=0.5 mA = 0.5\ \text{m}. Using the Lindstedt—Poincar’e method, the first-order frequency correction is ω1+38ϵA2=1+38(0.01)(0.25)=1+0.0009375\omega \approx 1 + \frac{3}{8}\epsilon A^2 = 1 + \frac{3}{8}(0.01)(0.25) = 1 + 0.0009375. The period shifts from T0=2π sT_0 = 2\pi\ \text{s} to T2π/1.000946.280 sT \approx 2\pi/1.00094 \approx 6.280\ \text{s}, a decrease of about 6 ms6\ \text{ms}.

Problems with Solutions

Problem 1. A harmonic oscillator has Hamiltonian H0=12(p2+q2)H_0 = \frac{1}{2}(p^2 + q^2). A perturbation H1=q4H_1 = q^4 is added with small parameter ϵ\epsilon. Find the first-order energy shift for a state with action JJ.

Solution. In action—angle variables, q=2Jsinθq = \sqrt{2J}\sin\theta and the unperturbed frequency is ω0=1\omega_0=1. The first-order shift is the angle average H1=12π02π(2J)2sin4θdθ=4J22π3π4=3J2\langle H_1\rangle = \frac{1}{2\pi}\int_0^{2\pi}(2J)^2\sin^4\theta\, d\theta = \frac{4J^2}{2\pi}\cdot\frac{3\pi}{4} = 3J^2. Thus K=J+3ϵJ2K = J + 3\epsilon J^2 and the corrected frequency is ω=K/J=1+6ϵJ\omega = \partial K/\partial J = 1 + 6\epsilon J.

Problem 2. A pendulum of length l=1 ml=1\ \text{m} and mass m=0.5 kgm=0.5\ \text{kg} has its length shortened slowly from l0=1 ml_0=1\ \text{m} to l1=0.5 ml_1=0.5\ \text{m} over 100 periods. If the initial amplitude is θ0=0.1 rad\theta_0 = 0.1\ \text{rad}, find the final amplitude and energy.

Solution. For small oscillations, ω=g/l\omega = \sqrt{g/l} and the adiabatic invariant is J=E/ωJ = E/\omega. Since Eω2A2E \propto \omega^2 A^2 for a harmonic oscillator, JωA2J \propto \omega A^2, so A2ω=constA^2\omega = \text{const} and Aω1/2l1/4A \propto \omega^{-1/2} \propto l^{1/4}. Thus A1=A0(l1/l0)1/4=0.1×(0.5)1/40.084 radA_1 = A_0(l_1/l_0)^{1/4} = 0.1\times(0.5)^{1/4} \approx 0.084\ \text{rad}. The energy is E=12mglθ2E = \frac{1}{2}mgl\theta^2 (small-angle), so E1/E0=(l1/l0)(θ1/θ0)2=0.5×0.50.354E_1/E_0 = (l_1/l_0)(\theta_1/\theta_0)^2 = 0.5\times\sqrt{0.5} \approx 0.354. With E0=12(0.5)(9.8)(1)(0.1)20.0245 JE_0 = \frac{1}{2}(0.5)(9.8)(1)(0.1)^2 \approx 0.0245\ \text{J}, we get E18.7×103 JE_1 \approx 8.7\times10^{-3}\ \text{J}.

Problem 3. Consider the Duffing equation x¨+x+ϵx3=0\ddot x + x + \epsilon x^3 = 0 with x(0)=Ax(0)=A and x˙(0)=0\dot x(0)=0. Show that the naive perturbation expansion produces a secular term and explain how the Lindstedt—Poincar’e method removes it.

Solution. Writing x=x0+ϵx1+x = x_0 + \epsilon x_1 + \dots, the zeroth order gives x0=Acostx_0 = A\cos t. The first-order equation is x¨1+x1=x03=A3cos3t=34A3cost14A3cos3t\ddot x_1 + x_1 = -x_0^3 = -A^3\cos^3 t = -\frac{3}{4}A^3\cos t - \frac{1}{4}A^3\cos 3t. The cost\cos t term drives the harmonic oscillator at resonance, giving a particular solution x138A3tsintx_1 \supset -\frac{3}{8}A^3 t\sin t, which grows without bound (secular). To fix this, rescale time as τ=ωt\tau = \omega t with ω=1+ϵω1+\omega = 1 + \epsilon\omega_1 + \dots. The equation becomes ω2x+x+ϵx3=0\omega^2 x'' + x + \epsilon x^3 = 0. At O(ϵ)O(\epsilon), choosing ω1=38A2\omega_1 = \frac{3}{8}A^2 cancels the resonant term, leaving only bounded oscillations at the corrected frequency.

Section summary. Canonical perturbation theory systematically organizes the effects of small departures from integrability. Secular terms in the naive series are removed by frequency renormalization (Lindstedt—Poincar’e); near a resonance the same machinery reduces the dynamics locally to a pendulum; and adiabatic invariants such as E/ωE/\omega for the slowly varying oscillator capture the robust quantities that survive slow parameter drift.

Introduction to the Lagrangian and Hamiltonian Formulations for Continuous Systems and Fields

Core ideas

Classical mechanics can be extended to systems with an infinite number of degrees of freedom, such as fluids, elastic solids, and electromagnetic fields. These are described by fields ϕ(x,y,z,t)\phi(x, y, z, t).

Lagrangian Density. The total Lagrangian LL is the spatial integral of a Lagrangian density L\mathcal{L}:

L=L(ϕ,μϕ,xμ)d3x.L = \int \mathcal{L}(\phi, \partial_\mu \phi, x^\mu) d^3x.

The action is S=Ldt=Ld4xS = \int L dt = \int \mathcal{L} d^4x. Hamilton’s Principle (δS=0\delta S = 0) leads to the Euler—Lagrange equations for fields:

Lϕμ(L(μϕ))=0,\frac{\partial \mathcal{L}}{\partial \phi} - \partial_\mu \left( \frac{\partial \mathcal{L}}{\partial (\partial_\mu \phi)} \right) = 0,

where μ=(1ct,)\partial_\mu = (\frac{1}{c}\partial_t, \nabla) is the four-gradient.

Hamiltonian Density. The conjugate momentum density is π=Lϕ˙\pi = \frac{\partial \mathcal{L}}{\partial \dot \phi}. The Hamiltonian density is:

H=πϕ˙L.\mathcal{H} = \pi \dot \phi - \mathcal{L}.

The total Hamiltonian H=Hd3xH = \int \mathcal{H} d^3x gives the total energy of the field.

Noether’s Theorem and the Stress-Energy Tensor. Symmetries of the Lagrangian density lead to conserved currents.

  • Internal symmetries lead to conserved charges (like electric charge).
  • Spacetime symmetries lead to the conservation of the Stress—Energy Tensor TμνT^{\mu\nu}:
Tμν=L(μϕ)νϕgμνL.T^{\mu\nu} = \frac{\partial \mathcal{L}}{\partial (\partial_\mu \phi)} \partial^\nu \phi - g^{\mu\nu} \mathcal{L}.
Conservation $\partial_\mu T^{\mu\nu} = 0$ implies conservation of energy and momentum.

Mathematical spine

δLd4x=0    LϕμL(μϕ)=0(Field Equations)H=πϕ˙L(Hamiltonian Density)T00=H(Energy Density)\begin{aligned} \delta \int \mathcal{L} d^4x &= 0 \implies \frac{\partial \mathcal{L}}{\partial \phi} - \partial_\mu \frac{\partial \mathcal{L}}{\partial (\partial_\mu \phi)} = 0 & \text{(Field Equations)} \\ \mathcal{H} &= \pi \dot \phi - \mathcal{L} & \text{(Hamiltonian Density)} \\ T^{00} &= \mathcal{H} & \text{(Energy Density)} \end{aligned}

Example: The Vibrating String. For a string with tension τ\tau and linear density ρ\rho, L=12ρy˙212τ(xy)2\mathcal{L} = \frac{1}{2}\rho \dot y^2 - \frac{1}{2}\tau (\partial_x y)^2. The Euler—Lagrange equation yields the wave equation: ρy¨τy=0\rho \ddot y - \tau y'' = 0.

Example: The electromagnetic field. The free Maxwell field is described by the gauge potential Aμ=(ϕ/c,A)A^\mu = (\phi/c, \bm A) and the antisymmetric field-strength tensor Fμν=μAννAμF_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu. Its Lorentz-invariant Lagrangian density is

LEM  =  14μ0FμνFμν    JμAμ,\mathcal{L}_{\rm EM} \;=\; -\tfrac{1}{4\mu_0}\, F_{\mu\nu}F^{\mu\nu} \;-\; J^\mu A_\mu,

with Jμ=(cρ,J)J^\mu = (c\rho, \bm J) the four-current. The Euler—Lagrange equations applied to AμA_\mu reproduce the inhomogeneous Maxwell equations μFμν=μ0Jν\partial_\mu F^{\mu\nu} = \mu_0 J^\nu, while the homogeneous pair follows automatically from the antisymmetry of FμνF_{\mu\nu}. The associated stress—energy tensor TμνT^{\mu\nu} reproduces the energy density 12(ε0E2+B2/μ0)\tfrac{1}{2}(\varepsilon_0 E^2 + B^2/\mu_0) and the Poynting momentum density E×B/(μ0c2)\bm E\times\bm B/(\mu_0 c^2).

Example: Non-linear waves --- the sine-Gordon kink. A celebrated nonlinear field theory in 1+11{+}1 dimensions is the sine-Gordon model with Lagrangian density

LsG  =  12(tϕ)212c2(xϕ)2m2c4β2[1cos(βϕ)],\mathcal{L}_{\rm sG} \;=\; \tfrac{1}{2}(\partial_t\phi)^2 - \tfrac{1}{2}c^2(\partial_x\phi)^2 - \frac{m^2 c^4}{\beta^2}\bigl[1 - \cos(\beta\phi)\bigr],

which yields the equation of motion t2ϕc2x2ϕ+(m2c4/β)sin(βϕ)=0\partial_t^2\phi - c^2\partial_x^2\phi + (m^2 c^4/\beta)\sin(\beta\phi) = 0. Despite being nonlinear, it admits an exact static kink (soliton) solution interpolating between adjacent vacua ϕ=0\phi=0 and ϕ=2π/β\phi = 2\pi/\beta:

ϕK(x)  =  4βarctan ⁣[exp(mcx)],Ekink  =  8mc3β2.\phi_K(x) \;=\; \frac{4}{\beta}\arctan\!\bigl[\exp(mc\, x)\bigr], \qquad E_{\rm kink} \;=\; \frac{8 m c^3}{\beta^2}.

Boosting it gives a localised travelling wave that retains its shape after collisions --- a hallmark of the integrability of the sine-Gordon system. The same equation describes mechanical chains of coupled pendulums and Josephson-junction transmission lines, illustrating how classical field theory unifies wave propagation in seemingly disparate physical systems.

Worked example

Vibrating string with fixed ends.
A uniform string of length L=1.0 mL = 1.0\ \text{m}, linear density ρ=5.0×103 kg/m\rho = 5.0\times10^{-3}\ \text{kg/m}, and tension τ=20 N\tau = 20\ \text{N} is fixed at both ends. The Lagrangian density is L=12ρy˙212τ(y)2\mathcal{L} = \frac{1}{2}\rho \dot y^2 - \frac{1}{2}\tau (y')^2. The wave equation is ρy¨=τy\rho \ddot y = \tau y'', giving wave speed c=τ/ρ=20/0.005=63.2 m/sc = \sqrt{\tau/\rho} = \sqrt{20/0.005} = 63.2\ \text{m/s}. For fixed ends, normal modes are yn(x,t)=Ansin(nπx/L)cos(ωnt)y_n(x,t) = A_n \sin(n\pi x/L)\cos(\omega_n t) with frequencies ωn=nπc/L=n×198.7 rad/s\omega_n = n\pi c/L = n\times 198.7\ \text{rad/s} (n=1,2,n=1,2,\dots). The fundamental frequency is f1=ω1/(2π)31.6 Hzf_1 = \omega_1/(2\pi) \approx 31.6\ \text{Hz}.

Problems with Solutions

Problem 1. A flexible membrane with surface tension σ\sigma and surface mass density μ\mu has transverse displacement z(x,y,t)z(x,y,t). Write the Lagrangian density and derive the wave equation.

Solution. The kinetic energy density is 12μz˙2\frac{1}{2}\mu \dot z^2 and the potential energy density from stretching is 12σ[(xz)2+(yz)2]\frac{1}{2}\sigma[(\partial_x z)^2 + (\partial_y z)^2]. Thus L=12μz˙212σ[(xz)2+(yz)2]\mathcal{L} = \frac{1}{2}\mu \dot z^2 - \frac{1}{2}\sigma[(\partial_x z)^2 + (\partial_y z)^2]. The Euler—Lagrange equation gives μz¨=σ(x2z+y2z)\mu \ddot z = \sigma(\partial_x^2 z + \partial_y^2 z), or z¨=c22z\ddot z = c^2 \nabla^2 z with c2=σ/μc^2 = \sigma/\mu.

Problem 2. For a real scalar field ϕ\phi with Lagrangian density L=12(μϕ)(μϕ)12m2ϕ2\mathcal{L} = \frac{1}{2}(\partial_\mu \phi)(\partial^\mu \phi) - \frac{1}{2}m^2\phi^2, find the equation of motion and the Hamiltonian density.

Solution. Using the Euler—Lagrange equation: μ(L/(μϕ))=μμϕ=ϕ\partial_\mu(\partial\mathcal{L}/\partial(\partial_\mu\phi)) = \partial_\mu\partial^\mu\phi = \Box\phi, and L/ϕ=m2ϕ\partial\mathcal{L}/\partial\phi = -m^2\phi. The equation of motion is the Klein—Gordon equation: (+m2)ϕ=0(\Box + m^2)\phi = 0. The conjugate momentum is π=L/ϕ˙=ϕ˙\pi = \partial\mathcal{L}/\partial\dot\phi = \dot\phi. The Hamiltonian density is H=πϕ˙L=12π2+12(ϕ)2+12m2ϕ2\mathcal{H} = \pi\dot\phi - \mathcal{L} = \frac{1}{2}\pi^2 + \frac{1}{2}(\nabla\phi)^2 + \frac{1}{2}m^2\phi^2.

Problem 3. A string of length LL has displacement y(x,t)=Asin(kx)cos(ωt)y(x,t) = A\sin(kx)\cos(\omega t). If y(0,t)=0y(0,t)=0 and y(L,t)=0y(L,t)=0, find the allowed values of kk and the corresponding frequencies when c=100 m/sc = 100\ \text{m/s} and L=0.5 mL = 0.5\ \text{m}.

Solution. The boundary condition at x=0x=0 is satisfied automatically. At x=Lx=L, sin(kL)=0    kL=nπ\sin(kL)=0 \implies kL = n\pi, so kn=nπ/L=2nπ m1k_n = n\pi/L = 2n\pi\ \text{m}^{-1}. The dispersion relation is ω=ck\omega = ck, so ωn=ckn=200nπ rad/s\omega_n = c k_n = 200n\pi\ \text{rad/s} and fn=ωn/(2π)=100n Hzf_n = \omega_n/(2\pi) = 100n\ \text{Hz}.

Section summary. Field theory generalizes discrete mechanics to continuous media, using Lagrangian densities and local field equations, providing the classical foundation for Electromagnetism and Quantum Field Theory.