Mathematics
A concise guide to mathematics.
Suggest improvements to this note, or request a new note.
View PDF
Overview. This complete note provides a unified undergraduate review of core mathematical topics, from logic and discrete structures through calculus, linear algebra, probability, and modern analysis. Each section is designed to stand alone for exam review: definitions, key theorems, representative examples, and concise summaries are included throughout. The sections follow a natural progression from foundational reasoning to advanced topics such as topology and numerical methods. Use this overview to locate weak areas and target your study efficiently.
The Language of Mathematics: Propositions, Sets, Functions, and Proofs
Core ideas
Mathematics rests on a rigorous foundation of logic and set theory. A proposition is a declarative statement that is unambiguously either true or false. Propositions are combined using logical connectives: (negation), (conjunction), (disjunction), (implication), and (biconditional). An implication is vacuously true if is false. A predicate is a statement whose truth depends on a variable , quantified by (for all'') or $\exists$ (there exists”). De Morgan’s laws for quantifiers state and .
A set is a collection of distinct objects. Basic operations include union , intersection , set difference , and the power set (the set of all subsets of ). The Cartesian product produces ordered pairs. A relation on and is a subset . An equivalence relation on is a relation that is reflexive (), symmetric (), and transitive (). It partitions into disjoint equivalence classes .
A function is a relation where each is paired with exactly one . A function is injective (one-to-one) if ; surjective (onto) if with ; and bijective if both hold. Bijections establish that two sets have the same cardinality . A set is countable if it is in bijection with a subset of . Cantor’s Theorem states that , proving the existence of uncountably infinite sets (e.g., ).
Proof techniques are the tools of mathematical reasoning: Direct proof: assume hypotheses, deduce conclusion via logical steps. Proof by contrapositive: prove by showing . Proof by contradiction: assume the negation of the desired statement and derive a logical impossibility. Mathematical induction: prove (base case) and (inductive step) for all .
- Logic: connectives, quantifiers, De Morgan’s laws.
- Set Theory: operations, relations, equivalence classes, power sets, cardinality.
- Functions: injectivity, surjectivity, bijections, and inverse functions.
- Proof methods: direct, contrapositive, contradiction, induction.
Mathematical spine
Example
Consider the function defined by .
- Injectivity: Suppose . Then , which simplifies to . Thus is injective.
- Surjectivity: For any , set . Then . Thus is surjective.
- Inverse: Since is bijective, it has an inverse .
Next, prove by induction that for all .
- Base case (): LHS , RHS .
- Inductive step: Assume . Then Thus the formula holds for all .
Problems with Solutions
-
Problem. Prove by contrapositive: If is even, then is even.
Solution. We prove : if is odd, then is odd. If is odd, for some . Then , which is odd. Hence the original statement is true.
-
Problem. Let and . Compute , , , and .
Solution. , , . The Cartesian product has elements.
-
Problem. Use induction to prove that for all .
Solution. Base case : . Assume true for . Then
Thus the statement holds for all .
Section summary. The language of mathematics is built upon formal logic and set theory. Propositions, predicates, and quantifiers provide the syntax for rigorous statements. Sets, relations, and functions offer the foundational structures for modeling mathematical objects. Mastery of proof techniques---including induction, contrapositive, and contradiction---is required to rigorously verify properties of these structures.
Discrete Structures: Induction, Combinatorics, Graphs, and Algorithms
Core ideas
Discrete mathematics studies countable, distinct structures. Mathematical induction (weak and strong) is the standard proof technique for statements concerning natural numbers. Recursion defines sequences or sets in terms of previous elements, often solvable via generating functions (e.g., representing sequence as a formal power series ) or characteristic equations.
Combinatorics counts finite arrangements. The multiplication principle: if task has ways and task has ways, the pair has ways. A permutation is an ordered arrangement: . A combination is an unordered selection: . The Binomial Theorem states . The Pigeonhole Principle guarantees that placing items into boxes results in at least one box with items. The Principle of Inclusion-Exclusion (PIE) computes the union of overlapping sets: , expanding to alternating sums for more sets.
A graph consists of vertices and edges . The Handshaking Lemma states . A tree is a connected acyclic graph on vertices, having exactly edges. A graph is bipartite if its vertices can be partitioned into two sets where all edges connect the two sets; equivalently, it contains no odd cycles. An Eulerian circuit traverses every edge exactly once (exists iff all vertices have even degrees and is connected). A Hamiltonian cycle visits every vertex exactly once. For connected planar graphs (drawable without edge crossings), Euler’s Formula states , where is the number of faces. Graph coloring assigns colors to vertices so adjacent vertices differ; the chromatic number is the minimum colors needed. By the Four Color Theorem, for planar graphs.
Algorithms provide step-by-step procedures to solve discrete problems. Big-O notation, , describes worst-case asymptotic upper bounds: for sufficiently large . The Euclidean algorithm computes in steps. Graph algorithms include Breadth-first search (BFS) for shortest paths in unweighted graphs, Depth-first search (DFS) for topological sorting and cycle detection, and Dijkstra’s algorithm for shortest paths with non-negative weights.
- Counting: permutations, combinations, Pigeonhole Principle, PIE, generating functions.
- Graphs: trees, bipartite graphs, Eulerian/Hamiltonian paths, planar graphs (Euler’s formula).
- Algorithms: BFS, DFS, Dijkstra, Euclidean algorithm, Big-O complexity.
- Induction: strong induction, structural induction.
Mathematical spine
Example
A computer password must be exactly characters long and is formed using uppercase letters ( choices) and digits ( choices). A valid password must contain at least one digit. How many valid passwords exist?
Solution. The total number of -character strings is . The number of strings with no digits (only letters) is . By the subtraction principle, the number of valid passwords is
Now consider a simple graph with and . Does contain an Eulerian circuit?
Solution. The degrees are , , , . Since vertices and have odd degree, the graph does not have an Eulerian circuit. (An Eulerian circuit requires all vertices to have even degree.)
Problems with Solutions
-
Problem. A password is characters long, chosen from letters and digits. How many such passwords contain at least one digit?
Solution. Total strings: . Strings with no digits: . Valid passwords: .
-
Problem. Compute .
Solution.
-
Problem. A graph has vertices with degrees . How many edges does it have?
Solution. By the Handshaking Lemma, . The sum of degrees is . Hence , so .
Section summary. Discrete structures encompass combinatorial counting (permutations, combinations, inclusion-exclusion), graph theory (trees, planarity, colorings), and algorithmic analysis (Big-O notation, graph traversal). Mastery of these principles is critical for analyzing finite structures, solving recurrence relations, and establishing computational complexity.
Single Variable Calculus: Change, Approximation, and Integration
Core ideas
Calculus studies continuous change and accumulation. The limit means approaches as approaches . A function is continuous at if . For sequences, means terms become arbitrarily close to as grows. The Intermediate Value Theorem states: if is continuous on , it takes on every value between and .
The derivative measures the instantaneous rate of change:
Differentiability implies continuity. Key differentiation rules: product rule ; quotient rule ; chain rule . The Mean Value Theorem (MVT): if is continuous on and differentiable on , then for some . Applications include optimization (critical points where ), and L’H^opital’s rule for limits of indeterminate forms like . Functions can be locally approximated by polynomials. Taylor’s Theorem states , where the remainder for some between and (Lagrange form).
Integration calculates accumulation. The definite integral is rigorously defined as the limit of Riemann sums as the partition width . It represents the signed area under the curve. The Fundamental Theorem of Calculus (FTC) bridges derivatives and integrals. Part 1: if is continuous, . Part 2: where . Integration techniques include substitution (), integration by parts (), and partial fractions.
- Limits & Continuity: one-sided limits, sequence limits, IVT.
- Derivatives: rules, MVT, Taylor series with remainder, optimization.
- Integrals: Riemann sums, definite/indefinite integrals, FTC.
- Techniques: substitution, integration by parts, L’H^opital’s rule.
Mathematical spine
Example
Approximate using the second-order Taylor polynomial for centered at , and estimate the error.
Solution. We have , , , and . At : , , . The second-order Taylor polynomial is
Evaluating at :
The exact value is , so the error is about . Using the Lagrange remainder:
for some . Since , , consistent with the actual error.
Problems with Solutions
-
Problem. Evaluate using L’Hôpital’s rule.
Solution. The limit is . Applying L’Hôpital three times:
-
Problem. Find the derivative of .
Solution. Using the product and chain rules:
-
Problem. Compute using integration by parts.
Solution. Let , . Then , .
Section summary. Single-variable calculus formalizes continuous change and area. Limits define both the derivative (instantaneous rate of change) and the integral (accumulation via Riemann sums). Taylor series extend linear approximation to polynomial approximation. The Fundamental Theorem of Calculus profoundly unites these two branches, enabling analytical evaluation of integrals and solving differential equations.
Linear Algebra: Spaces, Maps, Eigenvalues, and Projections
Core ideas
Linear algebra studies vector spaces and linear transformations. A vector space over a field is a set closed under vector addition and scalar multiplication. A subspace is a nonempty subset closed under these operations. The span of vectors is the set of all linear combinations. A set is linearly independent if implies all . A basis is a linearly independent spanning set; its size is the dimension . The dual space is the vector space of all linear maps (functionals) from to .
A linear transformation preserves operations. The kernel (null space) and image (range) are subspaces. The rank-nullity theorem states . Every linear map between finite-dimensional spaces can be represented by a matrix. The determinant of a square matrix measures the volume scaling factor; is invertible iff .
An eigenvector of satisfies , where is the eigenvalue. Eigenvalues are roots of the characteristic polynomial . The Cayley-Hamilton Theorem states that every square matrix satisfies its own characteristic equation: . A matrix is diagonalizable if it has a basis of eigenvectors. The Spectral Theorem guarantees every real symmetric matrix is orthogonally diagonalizable with real eigenvalues. If a matrix is not diagonalizable, it can be put into Jordan canonical form. The Singular Value Decomposition (SVD) factorizes any as , with orthogonal and diagonal.
An inner product space is a vector space with an inner product (generalizing the dot product). It induces a norm and a notion of orthogonality (). Orthogonal projection onto a subspace minimizes the distance to . The Gram-Schmidt process converts any basis into an orthonormal basis. In systems with no exact solution, the least squares solution minimizes via the normal equations .
- Vector space: basis, dimension, dual space, linear independence.
- Linear maps: rank-nullity theorem, matrix representation, determinants.
- Eigen-theory: characteristic polynomial, Cayley-Hamilton, Spectral Theorem, SVD.
- Inner products: Gram-Schmidt, orthogonal projections, least squares.
Mathematical spine
Example
Find the eigenvalues and eigenvectors of . Determine whether is diagonalizable.
Solution. The characteristic polynomial is
Thus the eigenvalues are and .
For : solve :
An eigenvector is .
For : solve :
An eigenvector is .
Since we found two linearly independent eigenvectors, is diagonalizable: with and .
Problems with Solutions
-
Problem. Determine whether the vectors , , are linearly independent.
Solution. We check if has only the trivial solution. Note that . Hence , a nontrivial relation. The vectors are linearly dependent.
-
Problem. Compute the determinant of .
Solution. For an upper triangular matrix, the determinant is the product of the diagonal entries:
-
Problem. Find the least-squares solution to where and .
Solution. The normal equations are .
Solving : from the first equation . Substituting into the second: . Then . The least-squares solution is .
Section summary. Linear algebra provides the framework for analyzing flat spaces and linear mappings. Core topics include dimension and basis (coordinates), the rank-nullity theorem, and determinants. Eigenvalues, diagonalization, and the Cayley-Hamilton theorem analyze matrix powers and structure, while inner products formalize geometry (orthogonality and projection), forming the bedrock for fields ranging from quantum mechanics to machine learning.
Multivariable Analysis: Gradients, Multiple Integrals, and Vector Fields
Core ideas
Multivariable calculus extends the concepts of limits, derivatives, and integrals to functions of several variables. For a scalar function , the partial derivative represents the rate of change of in the direction of the -axis, holding all other variables constant. Clairaut’s Theorem ensures that if the mixed second-order partial derivatives are continuous, they are equal: .
The gradient vector, denoted , is central to optimization because it points in the direction of steepest ascent. Furthermore, the gradient is everywhere orthogonal to the level sets of . The directional derivative computes the rate of change of in the direction of an arbitrary unit vector .
For a vector-valued function , the Jacobian matrix linearly approximates . Its entries are . The multivariable chain rule is concisely expressed as matrix multiplication: . The Inverse Function Theorem states that if is a square matrix and invertible (i.e., ), then is locally a bijection near . Similarly, the Implicit Function Theorem guarantees that an equation locally defines as a differentiable function of provided the partial derivative with respect to is non-zero.
For optimization, critical points occur where . We classify these points using the symmetric Hessian matrix of second partial derivatives. If is positive definite, the point is a local minimum; if negative definite, a local maximum; if indefinite, a saddle point. Lagrange multipliers provide a method for finding extrema of subject to constraints by solving the system and .
Multiple integrals generalize the definite integral to compute areas, volumes, and hypervolumes. When changing variables from to , the area element is scaled by the absolute value of the determinant of the Jacobian: . Canonical coordinate systems simplify integration over symmetric domains:
- Polar coordinates (2D): .
- Cylindrical coordinates (3D): .
- Spherical coordinates (3D): .
A vector field assigns a vector to every point in space. Two fundamental differential operators act on 3D vector fields:
- Divergence (): A scalar measuring the local outward flux or “source/sink” nature of the field.
- Curl (): A vector measuring the local rotation or “swirl” of the field.
A line integral accumulates the tangential component of along a curve , computing the total work done by a force field. A vector field is conservative if it is the gradient of a scalar potential (). For conservative fields, (on simply connected domains), and the line integral is path-independent: .
The core of vector calculus is a trio of theorems that are special cases of the generalized Stokes’ Theorem on manifolds (which links the integral of a differential form over a boundary to the integral of its exterior derivative over the interior):
- Green’s Theorem (2D): Relates a line integral around a simple closed curve to a double integral over the enclosed plane region: .
- Stokes’ Theorem (3D): Relates the surface integral of the curl of a field to the line integral around the surface’s boundary: .
- Divergence Theorem (Gauss’s Theorem, 3D): Relates the flux of a field through a closed surface to the integral of its divergence over the enclosed volume: .
- Derivatives: Gradient, Jacobian, Hessian, Inverse/Implicit Function Theorems, Clairaut’s Theorem.
- Optimization: Critical points, second derivative test (Hessian), Lagrange multipliers.
- Integration: Multiple integrals, change of variables (Jacobian determinant), coordinate systems.
- Vector Calculus: Divergence, curl, conservative fields, Green’s, Stokes’, and Divergence theorems.
Example (Green’s Theorem for Area). We can use Green’s theorem to compute the area of a region . Let be traversed counterclockwise. If we choose a field , its 2D curl is . By Green’s theorem, .
Mathematical spine
Example
Let .
(a) Compute the gradient and the directional derivative at in the direction of .
(b) Evaluate where is the disk .
Solution.
(a) The partial derivatives are and . At : . The directional derivative is
(b) In polar coordinates, and . The region is , .
Problems with Solutions
-
Problem. Find and classify the critical points of .
Solution. Set . This gives and . Substituting yields , so or . The critical points are and . The Hessian is . At : , , so is a saddle point. At : , and , so is a local minimum.
-
Problem. Use Green’s Theorem to evaluate where is the circle oriented counterclockwise.
Solution. Green’s Theorem gives
The disk has area . Hence the integral equals .
-
Problem. Evaluate where is the solid bounded by in the first octant using spherical coordinates.
Solution. In spherical coordinates, , , and the first octant means , , .
The -integral is , and the -integral is . Thus the value is .
Section summary. Multivariable analysis translates single-variable calculus to higher dimensions. The Jacobian and Hessian matrices capture linear and quadratic approximations, enabling optimization and the Implicit/Inverse Function Theorems. Multiple integration relies on the Jacobian determinant for coordinate transformations. The crowning achievements are the integral theorems of Green, Stokes, and Gauss, which unify geometry and calculus by proving that the aggregate behavior of a field within a domain is determined entirely by its behavior on the domain’s boundary.
Probability: Conditional Probability, Random Variables, and Distributions
Core ideas
Probability mathematically quantifies uncertainty. A sample space is the set of all outcomes. An event is a subset. A probability measure satisfies Kolmogorov’s axioms: , , and for mutually exclusive events.
Conditional probability updates belief given . Events and are independent if . Bayes’ theorem, , reverses the conditioning. The law of total probability states for a partition .
Example (Bayes’ Theorem): Consider a disease affecting of a population. A test is accurate. If a person tests positive, the probability they actually have the disease is . This demonstrates the base rate fallacy, highlighting the importance of the prior probability.
A random variable assigns numbers to outcomes. Its cumulative distribution function (CDF) is . Discrete has a probability mass function (PMF) ; continuous has a probability density function (PDF) such that . Expected value or . The variance , with standard deviation .
For multiple random variables, the joint distribution captures their simultaneous behavior. Marginal distributions are obtained by summing or integrating out other variables (e.g., ). Covariance measures joint linear variation: .
Key distributions: Binomial models successes in independent trials. Poisson models rare events in time/space. Normal is the symmetric bell curve . Exponential models memoryless waiting times.
Limit theorems describe large-scale behavior. Markov’s and Chebyshev’s inequalities bound tail probabilities: . The Law of Large Numbers (Weak and Strong versions) guarantees that the sample mean as . The Central Limit Theorem (CLT) states that the sum of i.i.d.\ variables with mean and variance converges in distribution to Normal.
Proof Sketch (Chebyshev’s Inequality): By definition, . Dividing by yields the result.
- Axioms & Bayes: Conditional probability, independence, Bayes’ rule.
- Random variables: PMF/PDF, joint/marginal distributions, expectation, variance.
- Distributions: Binomial, Poisson, Normal, Exponential.
- Limit theorems: Chebyshev’s inequality, Law of Large Numbers, CLT.
Mathematical spine
Example
A fair six-sided die is rolled once. Let be the outcome.
(a) Compute the expected value .
(b) Compute the variance .
Solution.
(a) Each outcome has probability . Thus
(b) First compute . Then
Problems with Solutions
-
Problem. A bag contains red and blue marbles. Two marbles are drawn without replacement. Find the probability that both are red.
Solution.
-
Problem. Let . Compute .
Solution.
-
Problem. The lifetime of a device is exponentially distributed with mean years. Find the probability it lasts more than years.
Solution. For an exponential distribution, . The survival function is . Hence
Section summary. Probability grounds uncertainty in rigorous measure theory. Core concepts include conditional updates (Bayes’ rule) and distributions of random variables (joint and marginal). The field culminates in the Law of Large Numbers, which justifies expected values as long-run averages, and the Central Limit Theorem, which explains the universality of the Normal distribution.
Statistics: Estimation, Testing, Regression, and Inference
Core ideas
Statistics uses data to make inferences about populations. A parameter is a numerical characteristic of a population; a statistic is a numerical characteristic of a sample. Point estimation produces a single best guess; interval estimation gives a plausible range. Maximum likelihood estimation (MLE) chooses maximizing the likelihood . For example, the MLE of the mean for Normal data is .
Example (MLE for Poisson): Let . The likelihood is . The log-likelihood is . Setting yields the MLE .
The sampling distribution of a statistic is its probability distribution over repeated samples. The standard error is the standard deviation of the sampling distribution. For i.i.d.\ data with variance , . A confidence interval for the mean when is known: , where is the quantile of the standard normal. When is unknown, use the -distribution: where is the sample standard deviation.
Hypothesis testing assesses evidence against a null hypothesis . The -value is . If (significance level), reject . A Type I error rejects a true ; Type II error fails to reject a false . The -test for a mean: . The -test uses .
Linear regression models , with i.i.d. The least squares estimates minimize : , . The statistic measures the proportion of variance explained. Multiple regression extends to predictors: , with .
Proof Sketch (OLS Estimator): The objective function is . Taking the gradient with respect to and setting it to zero gives , leading to the normal equations . Assuming is invertible, we obtain .
Bayesian inference treats parameters as random variables with prior and posterior . The posterior mean serves as a Bayesian estimator, and credible intervals are the Bayesian analog of confidence intervals.
- Estimation: MLE, confidence intervals for mean ( and ).
- Testing: null/alternative, -value, -test, -test, Type I/II errors.
- Regression: least squares; ; .
- Bayesian: posterior likelihood prior.
For review, be able to: compute MLEs for common distributions; construct and interpret confidence intervals; perform - and -tests; interpret -values; fit and interpret linear regression; derive posterior distributions for simple conjugate priors. Identify the estimator, the test statistic null distribution, the regression assumptions, and the prior-posterior relationship.
Mathematical spine
Example
A random sample of lightbulbs yields a sample mean lifetime hours and a sample standard deviation hours.
(a) Construct a confidence interval for the true mean lifetime .
(b) Test versus at significance level .
Solution.
(a) Because is large, we use the -interval:
The confidence interval is hours.
(b) The test statistic is
Since , we reject . There is strong evidence that the true mean differs from hours.
Problems with Solutions
-
Problem. Given the data set , compute the sample mean and the sample variance.
Solution. The sample mean is . The sample variance is
-
Problem. In a -test of with known , a sample of size yields . Compute the test statistic and the two-tailed -value.
Solution. The test statistic is
The two-tailed -value is .
-
Problem. Fit a simple linear regression to the points .
Solution. We have , , .
Hence and . The fitted line is .
Section summary. Statistics bridges data and decisions through estimation, testing, and modeling. Point estimates (MLE), interval estimates (confidence intervals), and hypothesis tests (, ) form classical inference. Linear regression models relationships. Bayesian methods incorporate prior knowledge via Bayes’ theorem. These tools are the foundation of data analysis and scientific discovery.
Real Analysis: Limits, Continuity, Convergence, and Rigorous Integration
Core ideas
Real analysis provides rigorous foundations for calculus. The real numbers form a complete ordered field: they are ordered (totally), form a field under and , and satisfy the completeness axiom (every nonempty set bounded above has a supremum). A sequence converges to if such that . A sequence is Cauchy if such that . In , a sequence converges iff it is Cauchy. The Bolzano-Weierstrass theorem: every bounded sequence in has a convergent subsequence.
For a function , means such that . is continuous at if . Equivalent: preimages of open sets are open. The Intermediate Value Theorem: a continuous function on attains every value between and . The Extreme Value Theorem: a continuous function on a closed bounded interval attains its maximum and minimum.
Uniform continuity strengthens continuity: , that works for all simultaneously (independent of the point). On closed bounded intervals, continuity implies uniform continuity (Heine-Cantor theorem). A sequence of functions converges pointwise to if for each , and uniformly if . Uniform convergence preserves continuity and commutes with integration.
Proof Sketch (Uniform Limit of Continuous Functions is Continuous): Let uniformly, with continuous. To show is continuous at , we use an argument. . Choose such that for all (by uniform convergence). Then, choose such that (by continuity of ). Thus, .
Riemann integration: the integral is the limit of Riemann sums as the partition mesh goes to zero. A function is Riemann integrable if the upper and lower sums converge (which happens if the set of discontinuities has measure zero). The Fundamental Theorem of Calculus (rigorous version): if is integrable on and uniformly, then . The Lebesgue integral generalizes Riemann integration by partitioning the range rather than the domain, allowing integration of more functions and better convergence theorems (dominated convergence, monotone convergence).
Series converge if the partial sums converge. Tests: ratio test, root test, integral test, comparison test. Power series have a radius of convergence . Within the radius, they can be differentiated and integrated termwise.
- Sequences: - definition; Cauchy criterion; Bolzano-Weierstrass.
- Functions: - continuity; uniform continuity; IVT, EVT.
- Integration: Riemann sums; FTC; Lebesgue generalization.
- Series: convergence tests; power series; radius of convergence.
Example (- proof). Prove . Let be given. We need such that . Factor: . If we first require , then implies , so . Choose . Then gives . This is the standard pattern: bound to control .
For review, be able to: write - and - proofs; prove convergence/divergence of sequences and series; determine uniform versus pointwise convergence; apply IVT and EVT; define and work with Riemann integrals; prove properties of continuous functions on compact sets. Identify the bound, the uniform modulus of continuity, the dominating function for convergence theorems, and the partition refinement for integration.
Mathematical spine
Example
(a) Prove using the - definition that .
(b) Determine whether the series converges.
Solution.
(a) Let be given. We need such that implies . Simplifying:
for . We want , which gives . Choose . Then for all , the inequality holds, proving the limit.
(b) The function is positive, continuous, and decreasing on . By the integral test:
Since the integral converges, the series also converges.
Problems with Solutions
-
Problem. Prove using the definition that .
Solution. Let . We need such that . Choose . Then for , we have , so . Hence the limit is .
-
Problem. Show that is uniformly continuous on .
Solution. Let . For any , . Choose . Then . Since does not depend on or , is uniformly continuous on .
-
Problem. Find the radius of convergence of the power series .
Solution. Using the ratio test:
for all . Since for every , the radius of convergence is .
Section summary. Real analysis places calculus on rigorous foundations using - arguments. Key concepts: convergence of sequences and series (Cauchy criterion, Bolzano-Weierstrass), continuity (IVT, EVT, uniform continuity), Riemann integration (FTC, integrability conditions), and function space convergence (pointwise vs.\ uniform). This material develops the logical infrastructure underpinning all of calculus.
Algebra: Groups, Rings, Fields, and Symmetry
Core ideas
Abstract algebra studies algebraic structures. A group is a set with a binary operation satisfying: closure, associativity , identity , and inverses . A group is abelian if it is commutative (). Examples: , , the symmetric group (permutations of elements), (invertible matrices). A subgroup is a subset closed under the operation and inverses. The order of a group is its cardinality; the order of element is the smallest with .
Example (Symmetric Group): is the group of permutations of . It has order . It is the smallest non-abelian group.
A homomorphism preserves the operation: . The kernel is a normal subgroup; the image is a subgroup of . An isomorphism is a bijective homomorphism; isomorphic groups are structurally identical. Cosets of subgroup are ; they partition . Lagrange’s theorem: divides . A normal subgroup satisfies for all , allowing the quotient group to be formed. The First Isomorphism Theorem: .
A ring has two operations: is an abelian group, multiplication is associative and distributes over addition. Examples: , polynomial ring , matrix ring . A field is a commutative ring where nonzero elements have multiplicative inverses. Examples: for prime . Polynomials over a field form a Euclidean domain with division algorithm and unique factorization. The Fundamental Theorem of Algebra: every nonconstant polynomial in has a root in .
Proof Sketch (Lagrange’s Theorem): Let be a subgroup of a finite group . The left cosets partition into disjoint equivalence classes. For any , the map is a bijection from to , so all cosets have size . Since is partitioned into cosets of size , we have , proving divides .
Group actions formalize symmetry. A group action satisfies and . The orbit and stabilizer ; the orbit-stabilizer theorem: . Applications include counting (Burnside’s lemma) and classifying symmetries of geometric objects.
- Group: closure, associativity, identity, inverses; subgroup, cosets, Lagrange.
- Homomorphism: kernel, image, isomorphism theorems; quotient groups.
- Rings and fields: polynomial rings, Euclidean domain, .
- Group actions: orbit, stabilizer, Burnside’s lemma.
For review, be able to: verify group axioms; find subgroups and cosets; compute in ; construct quotient groups; apply isomorphism theorems; factor polynomials over fields; use group actions to count configurations. Identify the group operation and identity, the kernel of a homomorphism, the field characteristic, and the orbit decomposition.
Mathematical spine
Example
Consider the additive group under addition modulo .
(a) Find all subgroups of .
(b) Let . Verify that is a subgroup and find its left cosets. Confirm Lagrange’s theorem.
Solution.
(a) The subgroups of a cyclic group correspond to the divisors of . The divisors of are . Hence the subgroups are:
- (order )
- (order )
- (order )
- (order )
- (order )
- (order )
(b) is closed under addition mod , contains , and contains inverses (, , ), so it is a subgroup. The left cosets are:
There are cosets, each of size . Indeed , verifying Lagrange’s theorem.
Problems with Solutions
-
Problem. Show that the set is a subgroup of .
Solution. is nonempty (). If , then . The inverse of is . Thus is a subgroup.
-
Problem. In the symmetric group , compute the product and find its order.
Solution. Applying right to left: , , . Thus , a 3-cycle. Its order is .
-
Problem. Factor the polynomial completely over .
Solution. .
Section summary. Algebra abstracts arithmetic to structures: groups (symmetry via invertible operations), rings (arithmetic-like structures with addition and multiplication), and fields (where division works). Key results include Lagrange’s theorem (divisibility of group orders), isomorphism theorems (structural relationships), and unique factorization in polynomial rings. Group actions connect algebra to geometry and combinatorics.
Topology: Connectedness, Compactness, and Invariants of Shape
Core ideas
Topology studies properties preserved under continuous deformations (stretching, bending, but not tearing or gluing). A topological space consists of a set and a collection of open subsets satisfying: (i) ; (ii) arbitrary unions of open sets are open; (iii) finite intersections of open sets are open. The complement of an open set is closed. The metric topology is induced by a metric : open balls form a basis. with the Euclidean metric is the motivating example.
A function between topological spaces is continuous if preimages of open sets are open (the topological definition). Equivalently for metric spaces, . A homeomorphism is a bijective continuous function with continuous inverse; homeomorphic spaces are topologically equivalent (they have the same “shape”). For example, a coffee mug and a donut (torus) are homeomorphic.
Connectedness: is connected if it cannot be written as a disjoint union of two nonempty open subsets. Equivalently, the only clopen (both open and closed) subsets are and . A subset of is connected iff it is an interval. The continuous image of a connected set is connected (generalizing IVT). Path-connectedness (every pair of points joined by a continuous path) implies connectedness; the converse is false in general. A space is totally disconnected if its only connected subsets are singletons (e.g., ).
Compactness: is compact if every open cover has a finite subcover. In , the Heine-Borel theorem says is compact iff it is closed and bounded. The continuous image of a compact set is compact. A continuous function on a compact set attains its maximum and minimum (Extreme Value Theorem). Compactness also implies uniform continuity on metric spaces. Sequential compactness (every sequence has a convergent subsequence) is equivalent to compactness in metric spaces.
Proof Sketch (Continuous image of compact set is compact): Let be continuous and compact. Consider an open cover of in . Since is continuous, the preimages form an open cover of . Because is compact, this has a finite subcover . The corresponding covers . Thus, is compact.
Homotopy and the fundamental group classify spaces up to homotopy equivalence. A homotopy between maps is a continuous with , . Two spaces are homotopy equivalent if there exist maps , with and . The fundamental group consists of homotopy classes of loops based at . For the circle , ; for (), is trivial. Simply connected spaces have trivial fundamental group. The Euler characteristic for polyhedra is a topological invariant; for a sphere , for a torus .
- Topological space: open sets, continuity, homeomorphism.
- Connectedness: connected no nontrivial clopen sets; path-connectedness.
- Compactness: Heine-Borel in ; extreme value theorem; uniform continuity.
- Algebraic topology: fundamental group , homotopy, Euler characteristic.
For review, be able to: determine if a set is open in a given topology; prove continuity via inverse images of open sets; check connectedness and path-connectedness; apply Heine-Borel for compactness arguments; compute fundamental groups of simple spaces; understand homotopy equivalence. Identify the open cover, the connected component, the compact subset, and the loop homotopy class.
Mathematical spine
Example
Consider with the standard metric .
(a) Show that the interval is an open set.
(b) Show that is compact.
Solution.
(a) Let . Choose . Then the open ball is contained in . Since every point has an open neighborhood inside , the set is open.
(b) By the Heine-Borel theorem, a subset of is compact if and only if it is closed and bounded. The set is bounded (contained in ) and closed (its complement is open). Therefore is compact.
Problems with Solutions
-
Problem. In with the Euclidean metric, show that the open ball is an open set.
Solution. Let , so . Let . For any , the triangle inequality gives . Hence , proving the ball is open.
-
Problem. Prove that the continuous image of a connected set is connected. Use this to show there is no continuous surjection (where has the discrete topology).
Solution. Let be continuous and connected. Suppose with disjoint nonempty open sets in . Then and are disjoint nonempty open sets in whose union is , contradicting connectedness. Thus is connected. The space is connected, but is disconnected. If a continuous surjection existed, would be the continuous image of a connected set and hence connected, a contradiction.
-
Problem. Show that is not compact in .
Solution. The set is not closed in (its closure is ). By the Heine-Borel theorem, a subset of is compact iff it is closed and bounded. Since is not closed, it is not compact. (Alternatively, the open cover has no finite subcover.)
Section summary. Topology abstracts notions of proximity and shape. A topological space defines continuity via open sets. Connectedness captures “being all in one piece”; compactness generalizes finiteness and guarantees extreme values and uniform continuity. Algebraic topology uses the fundamental group and Euler characteristic to classify spaces up to deformation, providing invariants that distinguish spheres, tori, and other manifolds.
Numerical Mathematics: Stability, Matrix Decompositions, and Optimization
Core ideas
Numerical mathematics develops algorithms for continuous mathematical problems with guaranteed accuracy. Conditioning measures how much a problem’s output changes relative to input perturbations. The condition number ; problems with large are ill-conditioned. Numerical stability means the algorithm produces nearly the exact output for nearly correct input (i.e., backward stable). Forward error = ; backward error = smallest such that .
Example (Catastrophic Cancellation): Consider for large . If , and . Subtracting them loses significant digits. Rewriting as completely avoids cancellation, demonstrating how algebraic manipulation improves numerical stability.
Floating-point arithmetic approximates real numbers with finite precision machine epsilon in double precision. Rounding produces relative error . Catastrophic cancellation occurs when subtracting nearly equal numbers. Algorithms should avoid such cancellation and use numerically stable formulas.
Matrix decompositions are the workhorses of numerical linear algebra. LU decomposition (permutation, lower triangular, upper triangular) solves linear systems in time via forward/backward substitution. Cholesky decomposition applies to symmetric positive definite matrices (twice as fast). QR decomposition (orthogonal , upper triangular ) provides stable least-squares solutions. Singular value decomposition gives the optimal low-rank approximation (Eckart-Young theorem): truncating to rank minimizes over rank- matrices. Eigenvalue computation uses the QR algorithm (iterative on shifted ).
Iterative methods solve large sparse systems: Jacobi iteration ; Gauss-Seidel uses latest components; conjugate gradient for SPD matrices minimizes error in the energy norm. The condition number of affects convergence rate.
Numerical optimization finds minima of functions. Gradient descent converges linearly for well-conditioned problems. Newton’s method converges quadratically near the solution but requires Hessian inversion. Quasi-Newton methods (BFGS) approximate the Hessian. For constrained optimization, Lagrange multipliers and KKT conditions characterize optimality. Linear programming minimizes subject to , via the simplex method or interior-point methods.
Numerical integration (quadrature) approximates : trapezoidal rule has error , Simpson’s rule , Gaussian quadrature . Adaptive quadrature refines subintervals where the function varies rapidly. Differential equation solvers: Euler’s method (first-order), Runge-Kutta methods (classical RK4 is fourth-order). Stiff equations require implicit methods (backward Euler, BDF) for stability.
- Stability: condition number, forward/backward error, machine epsilon.
- Decompositions: LU, Cholesky, QR, SVD; QR algorithm for eigenvalues.
- Iterative methods: Jacobi, Gauss-Seidel, conjugate gradient.
- Optimization: gradient descent, Newton, BFGS, KKT, linear programming.
- Quadrature & ODEs: trapezoidal, Simpson, RK4, adaptive methods.
For review, be able to: compute condition number and interpret it; perform and apply LU, QR, and SVD factorizations; derive convergence criteria for iterative solvers; set up gradient descent and Newton iterations; understand error bounds for quadrature rules; identify stiffness in ODEs. Identify the source of ill-conditioning, the best matrix factorization for the problem, the convergence rate of iterative methods, and the dominant error term in approximation.
Mathematical spine
Example
Solve the linear system
using LU decomposition (without pivoting).
Solution. Write . We seek and such that . From the first row: , . From the second row: , and . Thus
Solve with :
- .
Solve :
- .
The solution is , .
Problems with Solutions
-
Problem. Estimate the condition number of at using the relative condition number formula .
Solution. We have . Then
The problem is well-conditioned.
-
Problem. Perform one step of gradient descent on starting from with step size .
Solution. The gradient is . At , .
The new iterate is .
-
Problem. Approximate using the Trapezoidal rule with subintervals.
Solution. The nodes are , , with . The Trapezoidal rule gives
Section summary. Numerical mathematics bridges continuous mathematics and computational implementation. Key themes: conditioning and stability ensure reliable computation; matrix decompositions (LU, QR, SVD) provide the algorithmic backbone for linear algebra; iterative methods and optimization algorithms solve large-scale problems; quadrature and ODE solvers approximate continuous processes. The interplay of mathematical insight and algorithmic design produces efficient, accurate, and stable numerical methods.