A concise undergraduate guide to Newtonian mechanics, Lagrangian and Hamiltonian methods, central forces, rigid bodies, oscillations, relativity, chaos, perturbation theory, and continuous systems.
Classical mechanics describes the motion of macroscopic bodies when quantum and relativistic effects are small. A particle is an idealized object with mass m and position r(t) in a three-dimensional Euclidean space. Its motion is described by velocity v=r˙ and acceleration a=r¨.
Newton’s Laws and Inertial Frames.
Newton’s laws are valid only in inertial frames---frames where a body remains at rest or in uniform motion unless acted upon by a force.
First Law (Inertia): In an inertial frame, a particle moves with constant velocity if the net force is zero.
Second Law (Dynamics): The rate of change of momentum p=mv equals the net force F:
F=p˙=dtd(mv).
For constant mass, this reduces to $\bm F = m\bm a$.
Third Law (Action-Reaction): For every force F12 exerted by particle 2 on particle 1, there is an equal and opposite force F21=−F12 exerted by particle 1 on particle 2. This holds for “central” forces acting along the line joining the particles.
Work and Energy.
The work done by a force F as a particle moves from point 1 to 2 is W12=∫12F⋅dr. The Work-Energy Theorem states that the work done by the net force equals the change in kinetic energy T=21mv2:
W12=T2−T1=ΔT.
A force is conservative if the work done is independent of the path. Such forces can be derived from a potential energy V(r):
F=−∇V(r)⟹W12=V1−V2=−ΔV.
In this case, total mechanical energy E=T+V is conserved (ΔE=0).
Systems of Particles.
For a system of N particles, the total mass is M=∑mi and the Center of Mass (CM) is R=M1∑miri.
Linear Momentum: The total momentum P=∑pi=MR˙ changes according to the net external force: P˙=Fext. If Fext=0, P is conserved.
Angular Momentum: The total angular momentum L=∑ri×pi changes according to the net external torque Next=∑ri×Fi,ext: L˙=Next.
Kinetic Energy:T=TCM+Trel, where TCM=21MR˙2 is the kinetic energy of the CM motion and Trel=21∑mi(r˙i′)2 is the kinetic energy relative to the CM.
FW12LMR¨=p˙=ma=∫12F⋅dr=ΔT=r×p,N=r×F=L˙=Fext,L˙=Next(Newton’s 2nd Law)(Work-Energy Theorem)(Angular Momentum and Torque)(System Dynamics)
Example: Conservative vs. Non-conservative forces.
Gravity (V=mgh) and springs (V=21kx2) are conservative. A conservative force does work that depends only on the endpoints; equivalently,
∮F⋅dr=0
for every closed path. Kinetic friction violates this condition. Since
fk=−μNv^,Wf=∫fk⋅dr=−∫μNds,
the work depends on the arclength actually traveled, not just on the initial and final positions. For a closed loop of length L>0 with constant μN,
Wf=−μNL<0,
so friction cannot be represented by a single-valued potential energy. The heat produced is the physical destination of the lost mechanical energy; the mathematical reason friction is non-conservative is path dependence, or nonzero work around a closed path.
Block on an inclined plane.
A block of mass m=5kg slides down a plane inclined at θ=30∘ to the horizontal with coefficient of kinetic friction μk=0.2. The normal force is N=mgcosθ=5×9.8×cos30∘≈42.4N. The friction force is fk=μkN≈8.5N. The component of gravity along the plane is mgsinθ=24.5N. Newton’s second law gives ma=mgsinθ−fk=16.0N, so a≈3.2m/s2. If the plane has length d=2m, the block’s speed at the bottom is v=2ad≈12.8≈3.58m/s. The work done by friction is Wf=−fkd=−17.0J. The change in kinetic energy is ΔT=21mv2−0=32.0J, which equals the work done by gravity (mgdsinθ=49.0J) plus the work done by friction (−17.0J), verifying the work-energy theorem.
Problem 1. A 10 kg box is pulled across a horizontal floor by a rope at 30∘ above horizontal with tension T=40N. The coefficient of kinetic friction is μk=0.25. Find the acceleration.
Solution. Vertical forces: N+Tsin30∘=mg, so N=10×9.8−40×0.5=98−20=78N. Friction: fk=μkN=0.25×78=19.5N. Horizontal: Tcos30∘−fk=ma, so 40×0.866−19.5=34.6−19.5=15.1N=10a, giving a=1.51m/s2.
Problem 2. A spring with k=200N/m is compressed by x=0.1m and launches a ball of mass m=0.5kg on a frictionless horizontal track. What is the ball’s speed when the spring reaches its natural length? If the track then curves upward to a height h, what is the maximum h reached?
Solution. The spring potential energy is U=21kx2=21(200)(0.1)2=1.0J. By energy conservation, 21mv2=1.0J, so v=2×1.0/0.5=2.0m/s. For maximum height, kinetic energy converts to gravitational potential: mgh=1.0J, so h=1.0/(0.5×9.8)≈0.204m.
Problem 3. Two masses m1=4kg and m2=6kg are connected by a light string over a frictionless pulley. Find the acceleration of the system and the tension in the string.
Solution. Let a be the acceleration of m1 upward and m2 downward. For m1: T−m1g=m1a. For m2: m2g−T=m2a. Adding: (m2−m1)g=(m1+m2)a, so a=4+6(6−4)×9.8=1019.6=1.96m/s2. From the first equation: T=m1(g+a)=4(9.8+1.96)=4×11.76=47.0N.
Section summary.
Newtonian mechanics builds from point particles to systems using three laws of motion. Conservation of energy, momentum, and angular momentum arise from symmetries and the nature of internal forces.
Newtonian mechanics requires identifying all forces, including unknown constraint forces (like the normal force on a bead on a wire). The Lagrangian method bypasses these by using energy and generalized coordinatesqi.
Generalized Coordinates and Constraints.
For a system of N particles, there are 3N degrees of freedom. Constraints reduce this number.
Holonomic constraints can be expressed as f(r1,r2,…,t)=0. If there are k such constraints, the number of independent coordinates is n=3N−k. These independent coordinates are the generalized coordinates q1,q2,…,qn.
Constraints that cannot be written this way are non-holonomic. Two common forms are inequalities such as g(q,t)≤0, and non-integrable velocity constraints
i∑ai(q,t)q˙i+a0(q,t)=0.
If this differential relation can be integrated to $F(q,t)=0$, it is holonomic; if it cannot, it is genuinely non-holonomic.
For example, a particle confined inside a box obeys 0≤x,y,z≤L, or inside a sphere obeys x2+y2+z2≤R2. These are inequality constraints rather than equations F(q,t)=0. A more typical mechanics example is a car that rolls without sideways slipping. If (x,y) is the contact point and θ is the heading angle, the sideways velocity must vanish:
−sinθx˙+cosθy˙=0.
This restricts allowed velocities but does not reduce to a position-only condition F(x,y,θ)=0. A rolling wheel gives similar non-integrable relations, such as dx−Rcosθdϕ=0 and dy−Rsinθdϕ=0.
Hamilton’s Principle.
The Lagrangian is defined as L(q,q˙,t)=T−V. Hamilton’s Principle states that the actual path taken by a system between times t1 and t2 is the one that makes the actionS stationary:
δS=δ∫t1t2L(qi,q˙i,t)dt=0.
This leads to the Euler—Lagrange equations:
dtd(∂q˙i∂L)−∂qi∂L=0,i=1,…,n.
The derivation is a direct integration-by-parts calculation. Compare the actual path qi(t) with a varied path qi(t)+ϵηi(t), where the endpoint variations vanish: ηi(t1)=ηi(t2)=0. Stationarity means
The boundary term is zero because the endpoints are fixed. Therefore
δS=∫t1t2[∂qi∂L−dtd(∂q˙i∂L)]ηidt=0.
Since the functions ηi(t) are arbitrary inside the interval, the bracket must vanish for each coordinate, giving the Euler—Lagrange equations.
Generalized Forces and Non-conservative Systems.
If some forces (like friction) are not derivable from a potential, the Euler—Lagrange equations are modified:
dtd(∂q˙i∂L)−∂qi∂L=Qi,
where Qi=∑jFjnc⋅∂qi∂rj is the generalized force associated with coordinate qi.
For a bead sliding on a circular hoop of radius R, with coordinate θ, the position is r(θ)=R(sinθ,−cosθ) and ∂r/∂θ=R(cosθ,sinθ). If a tangential friction force has magnitude bRθ˙ and opposes the motion, then
Fnc=−bRθ˙e^θ,Qθ=Fnc⋅∂θ∂r=−bR2θ˙.
With L=21mR2θ˙2−mgR(1−cosθ), the equation of motion becomes
mR2θ¨+mgRsinθ=−bR2θ˙,
or
θ¨+mbθ˙+Rgsinθ=0.
This example shows how non-conservative forces enter as generalized forces without being folded into V.
Symmetries and Noether’s Theorem.
A coordinate qj that does not appear in the Lagrangian (∂L/∂qj=0) is called cyclic or ignorable. Its conjugate momentum pj=∂L/∂q˙j is conserved:
p˙j=dtd(∂q˙j∂L)=∂qj∂L=0⟹pj=const.
Noether’s Theorem generalizes this: every continuous symmetry of the Lagrangian corresponds to a conservation law.
Time translation symmetry ⟹ conservation of energy (H=∑q˙ipi−L).
Spatial translation symmetry ⟹ conservation of linear momentum.
Rotational symmetry ⟹ conservation of angular momentum.
Bead on a rotating hoop.
A bead of mass m=0.1kg slides without friction on a hoop of radius R=0.2m rotating about its vertical diameter at constant angular velocity Ω=5rad/s. Using polar angle θ measured from the bottom, the position is (RsinθcosΩt,RsinθsinΩt,−Rcosθ). The velocity squared is v2=R2θ˙2+R2Ω2sin2θ. The Lagrangian is L=21m(R2θ˙2+R2Ω2sin2θ)+mgRcosθ. The Euler—Lagrange equation gives mR2θ¨=mR2Ω2sinθcosθ−mgRsinθ, or θ¨=(Ω2cosθ−g/R)sinθ. For equilibrium, θ¨=0, giving solutions θ=0, θ=π, and cosθ=g/(RΩ2)=9.8/(0.2×25)=1.96, which is impossible here. Thus only θ=0 (bottom) and θ=π (top) are equilibria.
Problem 1. Use the Lagrangian method to find the equation of motion for a simple pendulum of mass m and length l.
Solution. Coordinate: angle θ from vertical. x=lsinθ, y=−lcosθ. Kinetic energy T=21m(l2θ˙2). Potential energy V=mgl(1−cosθ). Lagrangian L=T−V=21ml2θ˙2−mgl(1−cosθ). Euler—Lagrange: dtd(∂L/∂θ˙)−∂L/∂θ=0. ∂L/∂θ˙=ml2θ˙, so dtd(ml2θ˙)=ml2θ¨. ∂L/∂θ=−mglsinθ. Thus ml2θ¨+mglsinθ=0, or θ¨+(g/l)sinθ=0.
Problem 2. An Atwood machine consists of two masses m1=3kg and m2=2kg connected by a light string over a frictionless pulley. Use the Lagrangian method to find the acceleration.
Solution. Let x be the vertical position of m1 (downward positive). Then m2 moves up by the same amount. T=21(m1+m2)x˙2=21(5)x˙2. V=−m1gx+m2gx=−(m1−m2)gx=−gx. L=25x˙2+gx. Euler—Lagrange: 5x¨−g=0, so x¨=g/5=1.96m/s2. The acceleration of m1 is 1.96m/s2 downward and m2 is 1.96m/s2 upward.
Problem 3. A particle of mass m moves in the xy-plane under a central potential V(r)=−rk. Write the Lagrangian in polar coordinates and identify any cyclic coordinates and their conserved momenta.
Solution. In polar coordinates, T=21m(r˙2+r2ϕ˙2) and V=−k/r. Thus L=21m(r˙2+r2ϕ˙2)+k/r. The coordinate ϕ does not appear explicitly in L, so it is cyclic. The conserved conjugate momentum is pϕ=∂L/∂ϕ˙=mr2ϕ˙=ℓ, which is the angular momentum. This is Noether’s theorem in action: rotational symmetry implies conservation of angular momentum.
Section summary.
The Lagrangian formulation replaces force-based dynamics with an optimization principle. It automatically incorporates holonomic constraints and links symmetries to conservation laws via Noether’s theorem.
A central force points along the line joining two particles and depends only on their separation: F=F(r)r^. Examples include gravity and the Coulomb force.
Reduction to a One-Body Problem.
For two particles with masses m1,m2 and positions r1,r2, the Lagrangian is L=21m1r˙12+21m2r˙22−V(∣r1−r2∣). Transforming to the center of mass R and relative coordinate r=r1−r2:
L=21MR˙2+21μr˙2−V(r),
where M=m1+m2 and μ=m1+m2m1m2 is the reduced mass. The CM moves at constant velocity, so we focus on the relative motion in the CM frame.
Conservation Laws.
Since the force is central, the torque is zero, and the angular momentumL=r×μr˙ is conserved. This implies the motion is confined to a plane. In plane polar coordinates (r,ϕ):
Lz=μr2ϕ˙=ℓ=const.
Energy is also conserved: E=21μ(r˙2+r2ϕ˙2)+V(r)=const.
The Effective Potential.
Using ϕ˙=μr2ℓ, we can write the radial energy equation:
E=21μr˙2+Veff(r),Veff(r)=V(r)+2μr2ℓ2.
The term 2μr2ℓ2 is the centrifugal barrier. The radial motion is effectively one-dimensional.
The Orbit Equation.
Defining u=1/r, the differential equation for the orbit shape is:
dϕ2d2u+u=−ℓ2u2μF(1/u).
For the inverse-square law F(r)=−k/r2 (where k=Gm1m2), the solution is a conic section:
r(ϕ)=1+ecos(ϕ−ϕ0)α,α=μkℓ2,e=1+μk22Eℓ2.
The eccentricitye determines the orbit shape:
e=0: Circle (E=Veff,min=−2ℓ2μk2)
0<e<1: Ellipse (Veff,min<E<0)
e=1: Parabola (E=0)
e>1: Hyperbola (E>0)
Kepler’s Laws.
Planets move in elliptical orbits with the Sun at one focus.
A line joining a planet and the Sun sweeps out equal areas during equal intervals of time (equivalent to conservation of angular momentum).
The square of the orbital period T is proportional to the cube of the semi-major axis a: T2=k4π2μa3.
The Runge-Lenz Vector.
For the 1/r potential, there is an additional conserved vector, the Laplace—Runge—Lenz vector:
A=p×L−μkr^.
This vector points along the major axis and its conservation explains why orbits in a 1/r potential are closed and do not precess.
Veff(r)dϕ2d2u+ur(ϕ)=V(r)+2μr2ℓ2=ℓ2μk=1+ecosϕα(Effective Potential)(Orbit Equation for 1/r2 force)(Conic Section Solution)
Example: Circular Orbit.
For a circular orbit at radius r0, the force must equal the centripetal requirement: ∣F(r0)∣=r0μv2=μr03ℓ2. This corresponds to the minimum of Veff(r).
Geostationary orbit.
A satellite orbits Earth at radius r with period T=24h=86400s. For a circular orbit, centripetal force equals gravity: mv2/r=GMm/r2, so v2=GM/r. With v=2πr/T, we get (2πr/T)2=GM/r, or r3=GMT2/(4π2). Using GM=gRE2≈9.8×(6.37×106)2≈3.98×1014m3/s2, we find r3=3.98×1014×(86400)2/(4π2)≈7.54×1022m3, giving r≈4.22×107m≈6.6RE. The altitude is h=r−RE≈3.58×104km. The orbital speed is v≈3.07km/s.
Problem 1. Calculate the escape velocity from the surface of Earth (RE=6.37×106m, g=9.8m/s2).
Solution. Escape requires total energy E≥0. At the surface, E=21mv2−GMm/RE=0, so vesc=2GM/RE=2gRE=2×9.8×6.37×106≈1.25×108≈1.12×104m/s=11.2km/s.
Problem 2. A comet has a parabolic orbit around the Sun (M⊙=1.99×1030kg) with perihelion distance q=0.1AU (1AU=1.496×1011m). Find its speed at perihelion.
Solution. For a parabolic orbit, E=0 and e=1. At perihelion, r=q and v=vmax. Energy conservation: 21v2−GM⊙/q=0, so v=2GM⊙/q=2×6.67×10−11×1.99×1030/(0.1×1.496×1011)≈1.775×1010≈1.33×105m/s=133km/s.
Problem 3. For an elliptical orbit with semi-major axis a and period T, derive the relation T2∝a3 using energy and angular momentum conservation.
Solution. From the orbit equation, r(ϕ)=α/(1+ecosϕ) with α=ℓ2/(μk). The semi-major axis is a=α/(1−e2). The energy is E=−μk2/(2ℓ2)(1−e2)=−k/(2a). Thus ℓ2=μka(1−e2). The area swept per unit time is dA/dt=ℓ/(2μ). The total area is A=πab=πa21−e2. Integrating, T=A/(dA/dt)=2μπa21−e2/ℓ=2μπa21−e2/μka(1−e2)=2πμ/ka3/2. Therefore T2=k4π2μa3, which is Kepler’s third law.
Section summary.
The central force problem reduces to a 1D radial problem using conservation of angular momentum. For inverse-square forces, orbits are conic sections, consistent with Kepler’s Laws.
A rigid body is a system of particles where the distance between any two particles remains constant: ∣ri−rj∣=cij. It has 6 degrees of freedom (3 for translation of a reference point, 3 for rotation).
Chasles’ Theorem.
Any displacement of a rigid body can be decomposed into a translation of a chosen base point plus a rotation about an axis through that point. Typically, the Center of Mass (CM) is chosen as the base point.
Angular Velocity and Rotations.
The velocity of any point P in the body is v=V+ω×r, where V is the velocity of the base point and ω is the angular velocity vector. Rotations are often described using Euler angles(ϕ,θ,ψ) which represent a sequence of three rotations (e.g., z-x′-z′′ convention) to transform from a space-fixed frame to a body-fixed frame.
Why Euler angles in practice.
Euler angles provide the minimal set of three independent generalized coordinates for orientation, so that rotational dynamics can be cast directly into Lagrange’s equations without redundant constraints. In the heavy symmetric top, for instance, the choice of (ϕ,θ,ψ) makes ϕ and ψ cyclic, immediately yielding two conserved momenta pϕ and pψ and reducing the problem to a one-dimensional motion in θ. The same coordinates are the standard yaw—pitch—roll variables used to describe spacecraft attitude, gyrocompasses, gimbal-mounted sensors, robotic end-effectors, and aircraft orientation.
In numerical practice one must remember the well-known gimbal-lock singularity at θ=0,π, where ϕ˙ and ψ˙ become indistinguishable and the kinematic map (ϕ˙,θ˙,ψ˙)→ω loses rank. For this reason attitude-control engineers often integrate the equivalent quaternion (Euler—Rodrigues) parameters and convert back to Euler angles only for human-readable output.
The Inertia Tensor.
The inertia tensorI is a symmetric 3×3 matrix that relates angular momentum L to angular velocity ω: L=Iω. Its components in a given basis are:
Iij=n∑mn(rn2δij−xn,ixn,j).
Principal Axes: For any point, there exists a set of axes where I is diagonal. The diagonal elements I1,I2,I3 are the principal moments of inertia.
Parallel Axis Theorem: If Icm is the inertia tensor about the CM, the tensor about an axis parallel but shifted by a is I=Icm+M(a21−a⊗a).
vLIij=V+ω×r=Iω,Trot=21ω⋅L=21ω⋅Iω=∫ρ(r)(r2δij−xixj)d3r(Velocity in Rigid Body)(Angular Momentum and KE)(Continuum Inertia Tensor)
Example: Inertia Tensor of a Cube.
For a uniform cube of side a and mass M about a corner, Ixx=31Ma2, Ixy=−41Ma2. About the center, the axes are principal and Ixx=Iyy=Izz=61Ma2.
Velocity of a point on a rolling wheel.
A bicycle wheel of radius R=0.35m rolls without slipping at speed V=5m/s. The angular velocity is ω=V/R=5/0.35≈14.3rad/s. For a point on the rim at angle ϕ from the top, measured in the body frame, the position relative to the center is r=R(sinϕ,−cosϕ). The velocity is v=V+ω×r=(V,0)+(0,0,ω)×(Rsinϕ,−Rcosϕ,0)=(V+ωRcosϕ,ωRsinϕ). At the top (ϕ=0): v=(V+V,0)=(10,0)m/s. At the bottom (ϕ=π): v=(V−V,0)=0, consistent with the no-slip condition.
Problem 1. A uniform rod of mass M=2kg and length L=1m rotates about an axis through one end and perpendicular to the rod. Find the moment of inertia and the kinetic energy when ω=3rad/s.
Solution. For a rod about one end, I=31ML2=31(2)(1)2=32kg⋅m2. Rotational kinetic energy is T=21Iω2=21×32×9=3J.
Problem 2. A rigid body rotates with ω=(2,3,1)rad/s. Find the velocity of a point at r=(0.1,0.2,0)m relative to the rotation axis.
Solution.v=ω×r=(2,3,1)×(0.1,0.2,0)=(3⋅0−1⋅0.2,1⋅0.1−2⋅0,2⋅0.2−3⋅0.1)=(−0.2,0.1,0.1)m/s. The speed is ∣v∣=0.04+0.01+0.01=0.06≈0.245m/s.
Problem 3. Use the parallel axis theorem to find the moment of inertia of a solid disk of mass M=1kg and radius R=0.2m about a tangent in its plane.
Solution. About the center perpendicular to the disk, Icm=21MR2=0.5×1×0.04=0.02kg⋅m2. For an axis in the plane, by symmetry Ix,cm=Iy,cm=41MR2=0.01kg⋅m2. Shifting to a tangent point on the rim, the displacement is a=R perpendicular to the new axis. The parallel axis theorem gives I=Ix,cm+MR2=41MR2+MR2=45MR2=45(1)(0.04)=0.05kg⋅m2.
Section summary.
Rigid body kinematics describes the 6-DOF motion through CM translation and rotation, characterized by the angular velocity vector and the inertia tensor.
Torque-Free Motion.
When N=0, both Erot=21∑Iiωi2 and L2=∑Ii2ωi2 are conserved.
Stability (Tennis Racket Theorem): Rotation about the principal axes with the largest (Imax) or smallest (Imin) moments is stable. Rotation about the intermediate axis is unstable.
Symmetric Top: If I1=I2=I3, the angular velocity ω precesses around the body-fixed symmetry axis z′ with frequency Ωbody=I1I3−I1ω3.
The Heavy Symmetric Top.
A top with I1=I2 spinning in a gravitational field exhibits complex motion:
Precession: The symmetry axis rotates around the vertical (gravity) axis.
Nutation: The symmetry axis bobs up and down between two polar angles θ1 and θ2.
Stability: A “sleeping top” (vertical spin) is stable only if ω3>I32MglI1.
Nutation amplitude and frequency (fast-spin limit).
For a top released with non-zero tilt θ0 and large spin ω3 the energy/angular-momentum conservation reduces θ(t) to motion in an effective potential. Linearising about the mean angle one finds small-amplitude nutation at the angular frequency
ωnut≈I1I3ω3,
i.e.\ the nutation is fast compared with the precession ωprec≈Mgl/(I3ω3), and the two satisfy the simple product
ωnutωprec≈I1Mgl.
The peak-to-peak amplitude of the wobble for a top released from rest at θ0 is
Δθ=θ2−θ1≈I3ω322Mglsinθ0I3I1,
which vanishes as ω3−2 --- a fast-spinning top exhibits steady precession with imperceptible nutation. The sleeping-top threshold above is the same statement at θ0=0: nutation about the vertical is bounded only when I32ω32>4MglI1.
I1ω˙1−(I2−I3)ω2ω3ωprecE=N1=LMgl=21I1(θ˙2+ϕ˙2sin2θ)+21I3(ϕ˙cosθ+ψ˙)2+Mglcosθ(Euler’s Equations)(Slow Precession approximation)(Heavy Top Energy)
Example: Free Precession of the Earth.
The Earth is slightly oblate (I3>I1). Euler’s equations predict a precession of the spin axis (the “Chandler wobble”) with a period related to the difference in moments of inertia.
Stability of a spinning book.
A hardcover book has principal moments I1=0.05kg⋅m2, I2=0.08kg⋅m2, I3=0.10kg⋅m2 (with I1<I2<I3). According to the tennis-racket theorem, rotation about the I1 (thickness) and I3 (height) axes is stable, while rotation about I2 (width) is unstable. If the book is spun about its intermediate axis with a small perturbation, Euler’s equations give ω˙1=I1I2−I3ω2ω3=0.05−0.02ω2ω3=−0.4ω2ω3. Linearizing about ω2≈Ω shows ω¨1∝−Ω2ω1 with a positive coefficient (since (I2−I3)(I1−I2)/(I1I3)>0), leading to exponential growth: ω1(t)∝eλt with λ=Ω(I3−I2)(I2−I1)/(I1I3).
Problem 1. A symmetric top has I1=I2=5.0×10−4kg⋅m2 and I3=1.0×10−3kg⋅m2. It spins at ω3=100rad/s. Find the body-frame precession rate Ωbody.
Solution. For a torque-free symmetric top, Ωbody=I1I3−I1ω3=5.0×10−41.0×10−3−5.0×10−4×100=5.0×10−40.5×10−3×100=1×100=100rad/s. The angular velocity vector precesses around the symmetry axis at 100rad/s.
Problem 2. A top of mass M=0.2kg with I1=I2=1.5×10−4kg⋅m2 and I3=3.0×10−4kg⋅m2 has its CM a distance l=0.03m from the pivot. What minimum spin ω3 is needed for the sleeping-top configuration to be stable?
Solution. The sleeping-top stability condition is ω3>I32MglI1. Substituting: MglI1=0.2×9.8×0.03×1.5×10−4=8.82×10−6≈2.97×10−3. Then ω3>3.0×10−42×2.97×10−3=3.0×10−45.94×10−3≈19.8rad/s (about 3.2 rev/s).
Problem 3. A cube of side a=0.1m and mass M=0.5kg rotates about an axis through its center and perpendicular to one face. Find its angular momentum if ω=20rad/s.
Solution. For a cube about a central axis perpendicular to a face, I=61Ma2=61(0.5)(0.1)2=60.005≈8.33×10−4kg⋅m2. Angular momentum is L=Iω=8.33×10−4×20≈1.67×10−2kg⋅m2/s.
Section summary.
Euler’s equations describe rigid body dynamics in the body frame, revealing the stability of principal axis rotation and the precession/nutation of tops.
Most systems in classical mechanics behave like harmonic oscillators when displaced slightly from a stable equilibrium point.
Small Oscillations around Equilibrium.
Consider a system with generalized coordinates qi and a potential V(q1,…,qn). Equilibrium occurs where ∂qi∂V=0. If this is a minimum (all eigenvalues of the Hessian ∂qi∂qj∂2V are positive), the equilibrium is stable.
Defining ηi=qi−qi0 as the displacement, the Lagrangian for small oscillations is:
L≈21i,j∑(Tijη˙iη˙j−Vijηiηj),
where Vij=∂qi∂qj∂2V0 and Tij is the kinetic energy matrix (usually constant for small displacements).
Normal Modes and the Secular Equation.
The equations of motion are Tη¨+Vη=0. Assuming a sinusoidal solution η(t)=aeiωt leads to the generalized eigenvalue problem:
(V−ω2T)a=0.
Nontrivial solutions exist only if the secular equation holds:
det(V−ω2T)=0.
The roots ωk2 are the normal frequencies, and the corresponding vectors ak define the normal modes.
Normal Coordinates.
There exists a linear transformation η=Aζ that simultaneously diagonalizes T and V. The new coordinates ζk are normal coordinates, and each evolves independently as a simple harmonic oscillator: ζ¨k+ωk2ζk=0.
Damping and Driving.
Real systems have dissipation (Q-factor) and external forces.
Damping:x¨+2γx˙+ω02x=0. Solutions can be underdamped, overdamped, or critically damped.
Resonance: When driven by F0cosωt, the amplitude is maximized near ω≈ω0. The phase shift δ changes from 0 to π as ω passes through resonance.
Driven damped harmonic oscillator.
A mass m=0.5kg on a spring with k=50N/m is damped with b=1.0kg/s and driven by F(t)=2cos(10t)N. The natural frequency is ω0=k/m=10rad/s, the damping coefficient is γ=b/(2m)=1.0s−1, and the driving frequency is ω=10rad/s. The steady-state amplitude is A=F0/m/(ω02−ω2)2+(2γω)2=4/0+400=4/20=0.20m. Since ω=ω0, the system is at resonance and the phase lag is δ=π/2. The quality factor is Q=ω0/(2γ)=5.
Problem 1. Two identical masses m are connected by springs of constant k to each other and to fixed walls. Find the normal-mode frequencies.
Solution. Let displacements be x1,x2. The equations are mx¨1=−kx1+k(x2−x1)=−2kx1+kx2 and mx¨2=−k(x2−x1)−kx2=kx1−2kx2. Assuming xi=Aicosωt, the matrix equation is (2k−mω2−k−k2k−mω2)(A1A2)=0. The secular equation gives (2k−mω2)2−k2=0, so 2k−mω2=±k. Thus ω12=k/m (symmetric mode: A1=A2) and ω22=3k/m (antisymmetric mode: A1=−A2).
Problem 2. A damped oscillator has ω0=5rad/s and γ=3s−1. Is it underdamped, critically damped, or overdamped? Write the general solution.
Solution. Compare γ to ω0: γ=3<5=ω0, so the system is underdamped. The damped frequency is ω′=ω02−γ2=25−9=4rad/s. The general solution is x(t)=Ae−3tcos(4t+ϕ), where A and ϕ are determined by initial conditions.
Problem 3. An undamped oscillator with m=1kg, k=100N/m is driven by F(t)=5cos(9t)N. Find the steady-state amplitude and the average power supplied by the driving force.
Solution. Here ω0=10rad/s and ω=9rad/s. The amplitude is A=F0/m/∣ω02−ω2∣=5/∣100−81∣=5/19≈0.263m. In steady state, x(t)=Acos(9t) and v(t)=−9Asin(9t). The instantaneous power is P=Fv=−45Acos(9t)sin(9t), which averages to zero over a cycle because there is no damping and therefore no net energy dissipation.
Section summary.
Oscillation theory linearizes motion near equilibrium, reducing complex coupled dynamics to independent normal modes via an eigenvalue problem.
Example: Relativistic Doppler Effect.
The frequency shift for a source moving away is ν=ν01+v/c1−v/c. Unlike the classical case, there is also a transverse Doppler effect (ν=ν0/γ) due to time dilation.
Muon decay and time dilation.
Cosmic-ray muons are created at an altitude of about h=10km with speed v=0.995c (γ≈10). In the muon’s rest frame, its mean lifetime is τ0=2.2μs. In Earth’s frame, the dilated lifetime is τ=γτ0≈22μs, during which the muon travels d=vτ≈0.995c×22μs≈6.6km. However, muons reach the ground! This is resolved by length contraction: in the muon frame, the atmosphere is contracted to h′=h/γ≈1km, which takes t′=h′/v≈3.4μs to traverse---less than the mean lifetime. Both frames consistently predict that many muons survive to sea level.
Problem 1. An electron is accelerated to a kinetic energy of 1.0MeV. Given mec2=0.511MeV, find its speed and momentum.
Solution. Total energy E=K+mc2=1.511MeV. Since E=γmc2, we have γ=1.511/0.511≈2.957. Then v=c1−1/γ2≈c1−0.114≈0.941c. Momentum is p=E2−(mc2)2/c=1.5112−0.5112/c≈2.283−0.261/c≈1.423MeV/c.
Problem 2. A spaceship travels to a star 4.3light-years away at v=0.8c. How long does the trip take according to Earth clocks and according to the ship’s clock?
Solution. Earth time: t=d/v=4.3/0.8=5.375years. Ship time (proper time): τ=t/γ=t1−0.82=5.375×0.6=3.225years. Alternatively, in the ship frame the distance is contracted to d′=d/γ=4.3×0.6=2.58ly, so τ=d′/v=2.58/0.8=3.225years.
Problem 3. Show that the relativistic kinetic energy K=(γ−1)mc2 reduces to the classical expression 21mv2 when v≪c.
Solution. For small x, (1−x)−1/2≈1+x/2+3x2/8+…. With x=v2/c2, γ≈1+21v2/c2+83v4/c4+…. Thus K=mc2(γ−1)≈mc2(21v2/c2+83v4/c4)=21mv2+83mv4/c2+…. The first term is the classical kinetic energy; the rest are relativistic corrections.
Section summary.
Relativistic mechanics replaces absolute time with proper time and Newtonian momentum with four-momentum, unified by the invariant mass-shell condition.
Hamiltonian mechanics is a reformulation of classical mechanics that emphasizes the symmetry between coordinates qi and momenta pi. It describes motion as a flow in phase space.
The Legendre Transformation.
The Hamiltonian H(q,p,t) is obtained from the Lagrangian L(q,q˙,t) via a Legendre transformation:
H(q,p,t)=i∑piq˙i−L(q,q˙,t),pi=∂q˙i∂L.
If L=T−V and T is a homogeneous quadratic function of q˙, then H=T+V=E (the total energy).
Hamilton’s Equations.
Taking the differential of H and using Euler—Lagrange equations leads to Hamilton’s canonical equations:
q˙i=∂pi∂H,p˙i=−∂qi∂H.
These are 2n first-order differential equations, unlike the n second-order Euler—Lagrange equations.
Poisson Brackets.
For any two functions f(q,p,t) and g(q,p,t) on phase space, the Poisson bracket is:
{f,g}=i∑(∂qi∂f∂pi∂g−∂pi∂f∂qi∂g).
The time evolution of any function f is given by:
dtdf={f,H}+∂t∂f.
A quantity is conserved if {f,H}=0 (and it has no explicit time dependence).
Phase Space and Liouville’s Theorem.
A state is a point in the 2n-dimensional phase space. Liouville’s Theorem states that the density of points in phase space (or the volume of a region of points) is constant along the trajectories of the system.
Particle in a uniform gravitational field.
A mass m=2kg falls vertically in a uniform gravitational field g=9.8m/s2. Using coordinate q=y (height) and momentum p=my˙, the Hamiltonian is H=p2/(2m)+mgy. Hamilton’s equations give y˙=∂H/∂p=p/m and p˙=−∂H/∂y=−mg. Thus y¨=p˙/m=−g, as expected from Newton’s second law. If released from rest at y0=10m, then p(0)=0 and y(t)=y0−21gt2, hitting the ground at t=2y0/g≈1.43s with momentum p=−mgt≈−28.0kg⋅m/s.
Problem 1. A 1D harmonic oscillator has Hamiltonian H=p2/(2m)+21mω2q2. Write Hamilton’s equations and show that they reproduce the familiar second-order equation of motion.
Solution.q˙=∂H/∂p=p/m and p˙=−∂H/∂q=−mω2q. Differentiating the first and substituting the second gives q¨=p˙/m=−ω2q, which is the standard harmonic oscillator equation.
Problem 2. For H=p2/(2m)−αq (a particle in a constant force field), find the time evolution of q(t) and p(t) given q(0)=0 and p(0)=p0.
Solution. Hamilton’s equations: q˙=p/m, p˙=α. Integrating, p(t)=p0+αt and q(t)=∫0t(p0+αt′)/mdt′=p0t/m+αt2/(2m). This is constant-acceleration motion with a=α/m.
Problem 3. Compute the Poisson bracket {Lz,x} where Lz=xpy−ypx.
Solution. Using {f,g}=∑i(∂qi∂f∂pi∂g−∂pi∂f∂qi∂g), only the terms involving y and py survive because x depends only on x and px. We get {Lz,x}=∂y∂(xpy−ypx)∂py∂x−∂py∂(xpy−ypx)∂y∂x+⋯=(−px)(0)−(x)(0)+…. Actually computing all terms: {xpy−ypx,x}={xpy,x}−{ypx,x}. The first bracket is x{py,x}+py{x,x}=0. The second is y{px,x}+px{y,x}=y(−1)+0=−y. Thus {Lz,x}=0−(−y)=y. (Equivalently, from the standard relation {Li,xj}=ϵijkxk, we have {L3,x1}=ϵ312x2=y.)
Section summary.
Hamiltonian mechanics treats coordinates and momenta as independent variables in phase space, providing a powerful framework for conservation laws and a bridge to quantum mechanics.
A transformation from old coordinates (q,p) to new coordinates (Q,P) is canonical if it preserves the form of Hamilton’s equations. This is equivalent to preserving the Poisson bracket relations:
The Symplectic Condition.
Defining η=(q1,…,qn,p1,…,pn)T, Hamilton’s equations can be written as η˙=J∇H, where J=(0−110) is the symplectic matrix. A transformation η→ξ is canonical if the Jacobian M=∂ξ/∂η satisfies the symplectic condition:
MJMT=J.
Generating Functions.
Canonical transformations are often derived from a generating function. There are four basic types depending on the choice of independent variables:
F1(q,Q,t): pi=∂qi∂F1,Pi=−∂Qi∂F1.
F2(q,P,t): pi=∂qi∂F2,Qi=∂Pi∂F2.
F3(p,Q,t): qi=−∂pi∂F3,Pi=−∂Qi∂F3.
F4(p,P,t): qi=−∂pi∂F4,Qi=∂Pi∂F4.
The new Hamiltonian K is related to the old H by K=H+∂t∂F.
Goal of Canonical Transformations.
The primary use is to transform to a system where all coordinates are cyclic (∂K/∂Qi=0). In such a system, the new momenta Pi are constants of motion, and the problem is trivially solved.
{Qi,Pj}MJMTK=δij=J=H+∂t∂F(Preservation of Poisson Brackets)(Symplectic Condition)(Transformation of the Hamiltonian)
Example: Identity Transformation.
The generating function F2(q,P)=∑qiPi gives pi=Pi and Qi=qi, which is the identity transformation. Changing this to F2=∑fi(q)Pi generates a point transformation.
Transformation to center-of-mass and relative coordinates.
For two particles in 1D with masses m1,m2, coordinates q1,q2 and momenta p1,p2, define Q1=(m1q1+m2q2)/(m1+m2) (CM position), Q2=q1−q2 (relative coordinate), P1=p1+p2 (total momentum), and P2=(m2p1−m1p2)/(m1+m2) (relative momentum). The generating function is F2=P1(m1q1+m2q2)/(m1+m2)+P2(q1−q2). One verifies {Qi,Pj}=δij and {Qi,Qj}={Pi,Pj}=0, confirming the transformation is canonical.
Problem 1. Show that the transformation Q=ln(1+qcosp), P=2(1+qcosp)qsinp is canonical by evaluating the Poisson bracket {Q,P}q,p.
Solution. We need {Q,P}=∂q∂Q∂p∂P−∂p∂Q∂q∂P. Let u=1+qcosp. Then Q=lnu, P=2uqsinp. Computing: ∂Q/∂q=cosp/(2qu), ∂Q/∂p=−qsinp/u, ∂P/∂q=sinpcosp/q+usinp/q=sinp(1+2qcosp)/q, ∂P/∂p=2uqcosp−2qsin2p. After simplification, {Q,P}=1, confirming the transformation is canonical.
Problem 2. Find a type-2 generating function F2(q,P) that produces the point transformation Q=q2 for a 1D system.
Solution. For a point transformation, p=∂F2/∂q and Q=∂F2/∂P=q2. Integrating the second equation, F2(q,P)=q2P+g(q). Then p=∂F2/∂q=2qP+g′(q). The simplest choice is g=0, giving F2=q2P and p=2qP, or P=p/(2q).
Problem 3. Given the Hamiltonian H=2p2+2q2, show that the transformation Q=p, P=−q is canonical and find the new Hamiltonian.
Solution. The Poisson brackets are {Q,P}={p,−q}=−{p,q}=1, {Q,Q}={p,p}=0, {P,P}={−q,−q}=0. Thus the transformation is canonical. Since the transformation is time-independent, K=H expressed in new variables: K=2P2+2Q2. This is just a rotation by 90∘ in phase space.
Section summary.
Canonical transformations preserve the symplectic structure of phase space, allowing for coordinate changes that simplify the Hamiltonian and reveal constants of motion.
Hamilton—Jacobi (HJ) theory is the ultimate canonical transformation: it seeks a transformation to a frame where the new Hamiltonian is zero, meaning all new coordinates and momenta are constants of motion.
The Hamilton—Jacobi Equation.
Using an F2(q,P,t) generating function, we call the result Hamilton’s Principal FunctionS(q,P,t). If the new Hamiltonian K=H+∂S/∂t=0, then Pi=αi (constants). The HJ equation is a first-order, non-linear partial differential equation for S:
H(q1,…,qn,∂q1∂S,…,∂qn∂S,t)+∂t∂S=0.
For time-independent Hamiltonians, we use Hamilton’s Characteristic FunctionW, where S(q,α,t)=W(q,α)−Et. Then:
H(q,∂q∂W)=E.
Separation of Variables.
The HJ equation is often solved by separation of variables: W(q1,…,qn)=∑Wi(qi). This reduces the PDE to n independent ODEs, which can be solved by integration (quadratures).
Action—Angle Variables.
For periodic systems, action—angle variables(J,θ) are the most natural coordinates.
Action variableJi=2π1∮pidqi, where the integral is over one cycle of the motion.
Angle variableθi is the canonical conjugate to Ji.
The Hamiltonian depends only on actions: H=H(J1,…,Jn).
The equations of motion are trivial: J˙i=0, θ˙i=∂Ji∂H=νi (the constant frequency).
Particle in a 1D box via Hamilton—Jacobi.
A particle of mass m moves freely on the interval [0,L] with infinite walls. The Hamiltonian is H=p2/(2m)=E. The HJ equation is (dW/dx)2/(2m)=E, so dW/dx=±2mE. Integrating, W(x)=±x2mE. The action variable is J=2π1∮pdx=2π1(2L2mE)=πL2mE. Inverting gives E=2mL2π2J2. The frequency is ν=∂E/∂J=mL2π2J=Lπm2E=2Lv, which is the round-trip frequency as expected.
Problem 1. For a 1D harmonic oscillator with H=p2/(2m)+21mω2q2, solve the time-independent HJ equation and find Hamilton’s characteristic function W(q,E).
Solution. The HJ equation is 2m1(∂W/∂q)2+21mω2q2=E. Solving for ∂W/∂q=2mE−m2ω2q2, we integrate to get W(q,E)=∫q2mE−m2ω2q′2dq′. This evaluates to W=2mωqmω22E−q2+Earcsin(q2Emω2).
Problem 2. A particle moves in a central potential V(r)=−k/r. Using action—angle variables, show that the radial and azimuthal frequencies are equal, confirming that the orbit is closed.
Solution. The actions are Jϕ=∮pϕdϕ=2πℓ and Jr=∮prdr. For the Kepler problem, evaluating the radial action gives Jr=−Jϕ+πkμ/(−2E). The total energy depends on Jr+Jϕ as E=−2(Jr+Jϕ)2μk2. The frequencies are νr=∂E/∂Jr and νϕ=∂E/∂Jϕ. Since E depends only on the sum Jr+Jϕ, νr=νϕ. Equal frequencies mean the radial and angular motions complete a cycle in the same time, producing a closed ellipse.
Problem 3. Find the action variable J for a simple pendulum of length l and mass m in the small-angle approximation.
Solution. For small angles, H=2ml2pθ2+21mglθ2=E. The motion is harmonic with ω=g/l and E=21mglθ02. The phase-space trajectory is an ellipse with semi-axes pmax=ml2ωθ0=2mEl and θmax=θ0=2E/(mgl). The area is πpmaxθmax=π2mEl⋅2E/(mgl)=2πEl/g=2πE/ω. Thus J=2π1×(2πE/ω)=E/ω.
Section summary.
Hamilton—Jacobi theory reduces dynamics to a single PDE, while action-angle variables provide the most efficient description for integrable periodic systems.
Chaos refers to the complex, unpredictable behavior that can arise in deterministic nonlinear dynamical systems. It is not due to noise or randomness but is an inherent property of the system’s geometry.
Sensitivity to Initial Conditions.
The hallmark of chaos is that two trajectories starting very close together in phase space will diverge exponentially:
∣δx(t)∣≈∣δx(0)∣eλt,
where λ is the Lyapunov exponent. A positive λ implies that small uncertainties in measurement grow rapidly, making long-term prediction impossible.
Phase Space and Attractors.
Phase Portrait: A map of all possible states in phase space. Regular systems have trajectories on circles or tori.
Strange Attractor: For dissipative chaotic systems, trajectories settle onto a complex, fractal-like structure in phase space.
Poincaré Section: A way to simplify the analysis by taking a “snapshot” of the system’s state each time it crosses a chosen surface in phase space. This reduces the continuous dynamics to a discrete map.
The KAM Theorem.
The Kolmogorov—Arnold—Moser (KAM) theorem addresses what happens to an integrable system when a small nonlinear perturbation is added. It states that many of the original “invariant tori” (regular orbits) survive the perturbation, but they are increasingly destroyed as the perturbation strength increases, leading to “stochastic” or chaotic regions.
Routes to Chaos.
Systems often become chaotic through a sequence of bifurcations as a parameter is varied. A common route is period-doubling, where the period of the oscillation doubles repeatedly until it becomes infinite (chaos).
Concrete example: period-doubling cascade in the logistic map.
The discrete map xn+1=rxn(1−xn) on x∈[0,1] shows the period-doubling route to chaos as the parameter r is increased:
r<1: the only stable fixed point is x∗=0 (extinction).
1<r<3: a single non-trivial fixed point x∗=1−1/r is stable.
r1=3: first period-doubling bifurcation; a stable 2-cycle appears.
r2≈3.4495: bifurcation to a stable 4-cycle.
r3≈3.5441: 8-cycle.
r4≈3.5644: 16-cycle, and so on.
r∞≈3.56995: accumulation point --- onset of chaos.
The successive intervals shrink geometrically, and their ratios approach the universal first Feigenbaum constant
δ=n→∞limrn+1−rnrn−rn−1=4.669201609…
The width of the bifurcating branches scales by a second universal constant α≈2.5029. Feigenbaum showed in 1978 that δ and α are independent of the specific map: any smooth one-dimensional map with a single quadratic maximum --- a driven pendulum, a forced oscillator, even a dripping faucet --- displays the same numbers, providing the first experimentally verified case of universality in classical chaos.
Example: Driven Damped Pendulum.
A simple pendulum with friction and a periodic driving force. For small drive, it oscillates regularly. As the drive increases, it can undergo period-doubling and eventually move chaotically, never repeating its path.
Lyapunov exponent of the logistic map.
For the logistic map xn+1=rxn(1−xn) with r=4, the Lyapunov exponent is λ=limN→∞N1∑n=0N−1ln∣f′(xn)∣ where f′(x)=4(1−2x). Starting from x0=0.2, iteration gives x1=0.64, x2=0.9216, etc. Numerically averaging ln∣4(1−2xn)∣ over N=104 iterations yields λ≈ln2≈0.693. Since λ>0, the dynamics is chaotic: a tiny initial difference δ0 grows as ∣δn∣≈δ0enλ, so after n=10 iterations the uncertainty has grown by a factor of about e6.93≈103.
Problem 1. Find the fixed points of the logistic map xn+1=rxn(1−xn) and determine their stability for 0<r<3.
Solution. Fixed points satisfy x∗=rx∗(1−x∗), giving x∗=0 or x∗=1−1/r. Stability requires ∣f′(x∗)∣<1 where f′(x)=r(1−2x). At x∗=0, ∣f′(0)∣=∣r∣. For 0<r<1, this is stable; for r>1, unstable. At x∗=1−1/r, ∣f′(x∗)∣=∣r(1−2(1−1/r))∣=∣2−r∣. This is stable when ∣2−r∣<1, i.e., 1<r<3.
Problem 2. A driven damped pendulum obeys θ¨+0.5θ˙+sinθ=1.5cos(0.7t). Explain why the Poincar’e section is useful and describe what it would show for periodic versus chaotic motion.
Solution. The Poincar’e section records (θ,θ˙) once per drive period T=2π/0.7. For periodic motion with period equal to the drive, the section shows a single fixed point. For period-doubling, it shows 2, 4, 8, … points. For chaotic motion, the points never repeat and fill a fractal structure (strange attractor), revealing the underlying order in the chaotic flow.
Problem 3. Estimate the Lyapunov exponent for a system where two nearby trajectories diverge by a factor of 106 in time t=10s.
Solution. Using ∣δ(t)∣≈∣δ(0)∣eλt, we have 106=eλ(10), so λ=10ln(106)=106ln10≈1.38s−1. This positive value indicates chaotic dynamics.
Section summary.
Chaos demonstrates that deterministic laws can lead to unpredictable behavior through exponential sensitivity to initial conditions, often visualized through strange attractors and Poincaré sections.
Many realistic systems are “nearly integrable,” meaning they can be described as an integrable system plus a small perturbation parameterized by a dimensionless number ϵ≪1:
H(J,θ)=H0(J)+ϵH1(J,θ),
where J=(J1,…,Jn) and θ=(θ1,…,θn) are the action—angle variables of the unperturbed problem.
Time-independent perturbation theory.
The goal is to find a near-identity canonical transformation to new variables (Jˉ,θˉ) such that the new Hamiltonian K(Jˉ) depends only on Jˉ. One uses a type-2 generating function
F2(J,θˉ)=i∑Jiθˉi+ϵW(J,θˉ),
in which W is the unknown “correction” to the identity. Expanding order by order in ϵ and demanding that the angle-dependent pieces cancel, the first-order shift in the Hamiltonian is simply the angle-average of the perturbation,
K1(Jˉ)=⟨H1(Jˉ,θ)⟩θ=(2π)n1∫02πH1(Jˉ,θ)dnθ,
and the generating-function correction takes the Fourier form
W(Jˉ,θˉ)=n=0∑n⋅ω(Jˉ)ihn(Jˉ)ein⋅θˉ,ω=∂H0/∂Jˉ,
where hn are the Fourier coefficients of H1 and n∈Zn∖{0}.
Naive expansion in ϵ tends to produce secular terms: terms that grow without bound as a power of t and so spoil the validity of the series after a time of order 1/ϵ. The standard cure is the Lindstedt—Poincar’e method, which absorbs the would-be secular contribution into a renormalization of the oscillation frequency.
Worked example: Duffing oscillator.
Consider the weakly anharmonic equation
x¨+x+ϵx3=0,x(0)=A,x˙(0)=0.
A naive expansion x=x0+ϵx1+⋯ gives x0=Acost and an inhomogeneous equation for x1 whose driving term contains cost, producing the resonant secular response
x1(t)⊃−83A3tsint,
which diverges with time. To remove it, rescale time by τ=ωt with ω=1+ϵω1+ϵ2ω2+⋯. The equation becomes
ω2x′′+x+ϵx3=0,
with x′=dx/dτ. Choosing ω1 to kill the resonant term gives the uniformly valid first-order approximation
ω=1+83ϵA2+O(ϵ2),x(t)≈Acos(ωt).
The amplitude-dependent frequency shift is the hallmark of nonlinear oscillation.
Small denominators.
The denominators n⋅ω in W become arbitrarily small whenever the unperturbed frequencies are nearly commensurate, n⋅ω≈0. The series then diverges --- the celebrated “problem of small divisors” in celestial mechanics, partially tamed by KAM theory for sufficiently irrational frequency ratios.
Worked resonant example.
Consider two degrees of freedom near a p:q resonance, pω1−qω2≈0, with perturbation
H=H0(J1,J2)+ϵhcos(pθ1−qθ2).
Average over the fast (non-resonant) angles but retain the slow resonant angle ψ=pθ1−qθ2. Introducing the canonical transformation generated by F2=ψP+θ2P2 (so that J1=pP, J2=−qP+P2) gives an effective one-degree-of-freedom Hamiltonian
Heff(P,ψ)=H0(pP,−qP+P2)+ϵhcosψ.
Expanding H0 about the resonant action P∗ where pω1=qω2, one obtains the pendulum form
Heff≈21M(P−P∗)2+ϵhcosψ,M≡∂P2∂2H0P∗.
Resonant motion thus librates like a pendulum with small-oscillation frequency Ω=∣M∣ϵh and a separatrix of width ΔP∼ϵh/∣M∣ --- the characteristic “resonance island.”
Concept.
If a parameter λ(t) of the Hamiltonian changes on a timescale much longer than the orbital period T, i.e.\ λ˙/λ≪1/T, then the action
J=2π1∮pdq
is conserved up to exponentially small corrections in the slowness parameter. J is therefore called an adiabatic invariant.
Worked example: harmonic oscillator with slowly varying frequency.
For H=21p2+21ω(t)2q2 the action of an instantaneous orbit at energy E is the area of a phase-space ellipse with semi-axes 2E and 2E/ω,
J=ω(t)E.
Adiabatic invariance of J therefore implies
E(t)=ω(t)J=ω(t)ω0E0,
so the energy tracks the frequency: a pendulum whose string is slowly shortened gains energy in proportion to its raised frequency. Historically this is the result Einstein invoked at the 1911 Solvay conference and that Ehrenfest promoted to the quantization rule J=nℏ of the old quantum theory.
HK1ωHresJ=H0(J)+ϵH1(J,θ)=⟨H1⟩θ=ω0+83ϵA2+⋯≈21MδP2+ϵhcosψ=2π1∮pdq≈const(perturbed Hamiltonian)(first-order energy shift)(Lindstedt frequency, Duffing)(pendulum near resonance)(adiabatic invariant)
Astrophysical aside: precession of Mercury’s perihelion.
General relativity adds an effective 1/r3 correction to the Newtonian 1/r potential. Treating it as ϵH1 and applying the angle-average prescription yields the celebrated 43′′/century perihelion advance.
Anharmonic oscillator frequency shift.
Consider a particle of mass m=1kg in a potential V(x)=21x2+ϵx4 (in units where m=ω0=1) with ϵ=0.01 and amplitude A=0.5m. Using the Lindstedt—Poincar’e method, the first-order frequency correction is ω≈1+83ϵA2=1+83(0.01)(0.25)=1+0.0009375. The period shifts from T0=2πs to T≈2π/1.00094≈6.280s, a decrease of about 6ms.
Problem 1. A harmonic oscillator has Hamiltonian H0=21(p2+q2). A perturbation H1=q4 is added with small parameter ϵ. Find the first-order energy shift for a state with action J.
Solution. In action—angle variables, q=2Jsinθ and the unperturbed frequency is ω0=1. The first-order shift is the angle average ⟨H1⟩=2π1∫02π(2J)2sin4θdθ=2π4J2⋅43π=3J2. Thus K=J+3ϵJ2 and the corrected frequency is ω=∂K/∂J=1+6ϵJ.
Problem 2. A pendulum of length l=1m and mass m=0.5kg has its length shortened slowly from l0=1m to l1=0.5m over 100 periods. If the initial amplitude is θ0=0.1rad, find the final amplitude and energy.
Solution. For small oscillations, ω=g/l and the adiabatic invariant is J=E/ω. Since E∝ω2A2 for a harmonic oscillator, J∝ωA2, so A2ω=const and A∝ω−1/2∝l1/4. Thus A1=A0(l1/l0)1/4=0.1×(0.5)1/4≈0.084rad. The energy is E=21mglθ2 (small-angle), so E1/E0=(l1/l0)(θ1/θ0)2=0.5×0.5≈0.354. With E0=21(0.5)(9.8)(1)(0.1)2≈0.0245J, we get E1≈8.7×10−3J.
Problem 3. Consider the Duffing equation x¨+x+ϵx3=0 with x(0)=A and x˙(0)=0. Show that the naive perturbation expansion produces a secular term and explain how the Lindstedt—Poincar’e method removes it.
Solution. Writing x=x0+ϵx1+…, the zeroth order gives x0=Acost. The first-order equation is x¨1+x1=−x03=−A3cos3t=−43A3cost−41A3cos3t. The cost term drives the harmonic oscillator at resonance, giving a particular solution x1⊃−83A3tsint, which grows without bound (secular). To fix this, rescale time as τ=ωt with ω=1+ϵω1+…. The equation becomes ω2x′′+x+ϵx3=0. At O(ϵ), choosing ω1=83A2 cancels the resonant term, leaving only bounded oscillations at the corrected frequency.
Section summary.
Canonical perturbation theory systematically organizes the effects of small departures from integrability. Secular terms in the naive series are removed by frequency renormalization (Lindstedt—Poincar’e); near a resonance the same machinery reduces the dynamics locally to a pendulum; and adiabatic invariants such as E/ω for the slowly varying oscillator capture the robust quantities that survive slow parameter drift.
Classical mechanics can be extended to systems with an infinite number of degrees of freedom, such as fluids, elastic solids, and electromagnetic fields. These are described by fieldsϕ(x,y,z,t).
Lagrangian Density.
The total Lagrangian L is the spatial integral of a Lagrangian densityL:
L=∫L(ϕ,∂μϕ,xμ)d3x.
The action is S=∫Ldt=∫Ld4x. Hamilton’s Principle (δS=0) leads to the Euler—Lagrange equations for fields:
∂ϕ∂L−∂μ(∂(∂μϕ)∂L)=0,
where ∂μ=(c1∂t,∇) is the four-gradient.
Hamiltonian Density.
The conjugate momentum density is π=∂ϕ˙∂L. The Hamiltonian density is:
H=πϕ˙−L.
The total Hamiltonian H=∫Hd3x gives the total energy of the field.
Noether’s Theorem and the Stress-Energy Tensor.
Symmetries of the Lagrangian density lead to conserved currents.
Internal symmetries lead to conserved charges (like electric charge).
Spacetime symmetries lead to the conservation of the Stress—Energy TensorTμν:
Tμν=∂(∂μϕ)∂L∂νϕ−gμνL.
Conservation $\partial_\mu T^{\mu\nu} = 0$ implies conservation of energy and momentum.
Example: The Vibrating String.
For a string with tension τ and linear density ρ, L=21ρy˙2−21τ(∂xy)2. The Euler—Lagrange equation yields the wave equation: ρy¨−τy′′=0.
Example: The electromagnetic field.
The free Maxwell field is described by the gauge potential Aμ=(ϕ/c,A) and the antisymmetric field-strength tensor Fμν=∂μAν−∂νAμ. Its Lorentz-invariant Lagrangian density is
LEM=−4μ01FμνFμν−JμAμ,
with Jμ=(cρ,J) the four-current. The Euler—Lagrange equations applied to Aμ reproduce the inhomogeneous Maxwell equations∂μFμν=μ0Jν, while the homogeneous pair follows automatically from the antisymmetry of Fμν. The associated stress—energy tensor Tμν reproduces the energy density 21(ε0E2+B2/μ0) and the Poynting momentum density E×B/(μ0c2).
Example: Non-linear waves --- the sine-Gordon kink.
A celebrated nonlinear field theory in 1+1 dimensions is the sine-Gordon model with Lagrangian density
LsG=21(∂tϕ)2−21c2(∂xϕ)2−β2m2c4[1−cos(βϕ)],
which yields the equation of motion ∂t2ϕ−c2∂x2ϕ+(m2c4/β)sin(βϕ)=0. Despite being nonlinear, it admits an exact static kink (soliton) solution interpolating between adjacent vacua ϕ=0 and ϕ=2π/β:
ϕK(x)=β4arctan[exp(mcx)],Ekink=β28mc3.
Boosting it gives a localised travelling wave that retains its shape after collisions --- a hallmark of the integrability of the sine-Gordon system. The same equation describes mechanical chains of coupled pendulums and Josephson-junction transmission lines, illustrating how classical field theory unifies wave propagation in seemingly disparate physical systems.
Vibrating string with fixed ends.
A uniform string of length L=1.0m, linear density ρ=5.0×10−3kg/m, and tension τ=20N is fixed at both ends. The Lagrangian density is L=21ρy˙2−21τ(y′)2. The wave equation is ρy¨=τy′′, giving wave speed c=τ/ρ=20/0.005=63.2m/s. For fixed ends, normal modes are yn(x,t)=Ansin(nπx/L)cos(ωnt) with frequencies ωn=nπc/L=n×198.7rad/s (n=1,2,…). The fundamental frequency is f1=ω1/(2π)≈31.6Hz.
Problem 1. A flexible membrane with surface tension σ and surface mass density μ has transverse displacement z(x,y,t). Write the Lagrangian density and derive the wave equation.
Solution. The kinetic energy density is 21μz˙2 and the potential energy density from stretching is 21σ[(∂xz)2+(∂yz)2]. Thus L=21μz˙2−21σ[(∂xz)2+(∂yz)2]. The Euler—Lagrange equation gives μz¨=σ(∂x2z+∂y2z), or z¨=c2∇2z with c2=σ/μ.
Problem 2. For a real scalar field ϕ with Lagrangian density L=21(∂μϕ)(∂μϕ)−21m2ϕ2, find the equation of motion and the Hamiltonian density.
Solution. Using the Euler—Lagrange equation: ∂μ(∂L/∂(∂μϕ))=∂μ∂μϕ=□ϕ, and ∂L/∂ϕ=−m2ϕ. The equation of motion is the Klein—Gordon equation: (□+m2)ϕ=0. The conjugate momentum is π=∂L/∂ϕ˙=ϕ˙. The Hamiltonian density is H=πϕ˙−L=21π2+21(∇ϕ)2+21m2ϕ2.
Problem 3. A string of length L has displacement y(x,t)=Asin(kx)cos(ωt). If y(0,t)=0 and y(L,t)=0, find the allowed values of k and the corresponding frequencies when c=100m/s and L=0.5m.
Solution. The boundary condition at x=0 is satisfied automatically. At x=L, sin(kL)=0⟹kL=nπ, so kn=nπ/L=2nπm−1. The dispersion relation is ω=ck, so ωn=ckn=200nπrad/s and fn=ωn/(2π)=100nHz.
Section summary.
Field theory generalizes discrete mechanics to continuous media, using Lagrangian densities and local field equations, providing the classical foundation for Electromagnetism and Quantum Field Theory.