Every square matrix satisfies its own characteristic polynomial—one of the most elegant and useful results in linear algebra.
The Big Idea: If is the characteristic polynomial, then substituting the matrix for gives (the zero matrix).
A matrix polynomial is an expression where are scalars. Note: the constant term is , not just .
Let be an matrix over a field . If is the characteristic polynomial, then:
The matrix satisfies its own characteristic polynomial.
If , then:
Every power of beyond can be expressed in terms of lower powers!
For :
Step 1: Compute characteristic polynomial:
Step 2: Verify :
For (diagonal):
Check: ✓
For any matrix , the adjugate matrix satisfies:
Proof of Cayley-Hamilton using adjugate:
Step 1: The adjugate is a matrix whose entries are polynomials in of degree at most . Write:
Step 2: From the adjugate identity:
Step 3: Expand the left side and equate coefficients of each power of .
Step 4: Multiply each coefficient equation by the appropriate power of and sum:
If where is Jordan form, then . Since each Jordan block satisfies , the characteristic polynomial annihilates , hence .
If is invertible with , then and:
From Cayley-Hamilton:
Rearrange:
Divide by :
For with :
Cayley-Hamilton:
Rearrange: , so
For any , can be expressed as a polynomial of degree at most in :
For with :
Cayley-Hamilton:
Compute :
Compute :
Pattern: for this specific matrix.
The minimal polynomial of a matrix is the monic polynomial of smallest degree such that .
For any matrix :
By Cayley-Hamilton, . By definition, is the minimal polynomial that annihilates . By the division algorithm, where . Since and , we have . By minimality of , , so .
Diagonal matrix:
(no repeated factors needed)
For Jordan block :
(minimal equals characteristic)
A matrix is diagonalizable if and only if its minimal polynomial has no repeated roots.
χ_A(λ) is a scalar polynomial in λ. χ_A(A) substitutes the matrix A for λ, interpreting constants as multiples of I. The result is the zero MATRIX.
The constant term becomes , NOT just the scalar .
This is NOT what Cayley-Hamilton says! We evaluate the polynomial χ_A at the matrix A, NOT compute det(A - A·I) = det(0) = 0.
Convention matters: vs . They differ by .
For the 90° rotation :
(eigenvalues are )
Cayley-Hamilton: , so
This confirms: rotating by 90° twice gives rotation by 180°, which is !
Also: (rotating by -90° = -1 times rotating by 90°)
For :
Cayley-Hamilton: ✓
For nilpotent matrices, Cayley-Hamilton says some power is zero.
The companion matrix of :
The characteristic polynomial equals , and Cayley-Hamilton gives .
For any analytic function and matrix , can be expressed as a polynomial of degree at most in :
For with :
Using , , , ...:
This is the rotation matrix by angle !
Hamilton originally proved a version for quaternions: every quaternion satisfies a quadratic polynomial. Cayley extended this to matrices.
Cayley-Hamilton shows that is a finite-dimensional algebra over . Every element satisfies a polynomial of degree .
Via the correspondence between linear transformations and -modules, Cayley-Hamilton follows from structure theorems for finitely generated modules over PIDs.
Extensions exist for matrices over commutative rings, bounded operators on Banach spaces, and more general algebraic structures.
as polynomial in
as deg ≤ n-1 polynomial
Challenge 1
Prove Cayley-Hamilton for diagonalizable matrices directly (without using the adjugate proof).
Challenge 2
For , find using Cayley-Hamilton.
Challenge 3
Show that if (idempotent), then the only eigenvalues are 0 and 1.
Challenge 4
Prove: If , then and satisfy a common polynomial.
Q: Why doesn't χ_A(A) = det(A - A·I) = det(0) = 0 work?
This "proof" confuses scalar and matrix arithmetic. det(λI - A) is a polynomial in λ. We substitute the matrix A, not the scalar, getting a matrix polynomial evaluation.
Q: What's the geometric meaning?
The characteristic polynomial encodes how A scales along each eigendirection. Cayley-Hamilton says applying these scaling factors to A itself produces zero—each eigenvector gets multiplied by (λᵢ - λᵢ) = 0.
Q: When is minimal poly = char poly?
When the Jordan form has exactly one block for each eigenvalue (i.e., geometric multiplicity = 1 for all eigenvalues).
| Concept | Formula / Property |
|---|---|
| Cayley-Hamilton | |
| Matrix inverse | |
| Power reduction | |
| Minimal polynomial | , same roots |
| Diagonalizability | iff has no repeated roots |
Problem 1
Verify Cayley-Hamilton for .
Answer: . Check .
Problem 2
For , find using Cayley-Hamilton.
Answer: , so . Thus .
Problem 3
Express in terms of for a 2×2 matrix with .
Hint: First express , then compute recursively.
Problem 4
If , express in lower terms.
Answer:
Problem 5
Find the minimal polynomial of .
Answer: . (Compare: )
In control theory, for system , the controllability matrix is:
By Cayley-Hamilton, higher powers for can be expressed using the first terms.
For the system , the solution is .
Using Cayley-Hamilton, is a polynomial of degree in :
where are determined by eigenvalues and their multiplicities.
For a Markov transition matrix , powers approach the steady-state matrix.
Using Cayley-Hamilton, we can express as a polynomial in , which helps analyze convergence rates.
The Fibonacci sequence can be written using:
, so can be computed efficiently using Cayley-Hamilton.
For diagonalizable with :
Since each is a root of , we have , giving .
For any matrix, where is Jordan form. Each Jordan block satisfies . Since contains where , we get .
Diagonalizable matrices are dense in . Since Cayley-Hamilton holds for all diagonalizable matrices and is continuous in , it holds for all matrices by continuity.
| Matrix | Char Poly | Cayley-Hamilton |
|---|---|---|
When using Cayley-Hamilton:
For , verify Cayley-Hamilton.
Step 1: Compute :
Step 2: Compute :
Step 3: Verify :
For , find using Cayley-Hamilton.
From :
Therefore:
For , find .
Cayley-Hamilton:
For this matrix, we can show: where .
For , express in terms of .
Cayley-Hamilton:
Compute :
Substitute :
Express as polynomial in :
Reduce high powers:
= polynomial of degree ≤ n-1 in
Compute :
with
Solve linear recurrences:
Transform to matrix form, apply Cayley-Hamilton
If
If
If
| Property | Minimal | Characteristic |
|---|---|---|
| Definition | Smallest degree with | |
| Degree | ≤ n | = n |
| Roots | Eigenvalues (each once or more) | Eigenvalues with mult |
| Relationship | , same roots | |
| Diagonalizable iff | No repeated roots | Geometric = Algebraic mult |
Goal: Find A^-1?
Compute , rearrange to solve for , divide by .
Goal: Compute A^k for large k?
Express in terms of lower powers, substitute recursively.
Goal: Verify Cayley-Hamilton?
Compute , compute all powers of , substitute and verify = 0.
Goal: Find minimal polynomial?
Start with divisors of , test which smallest degree polynomial annihilates .
In this comprehensive module on the Cayley-Hamilton theorem, you learned:
Q: Why is Cayley-Hamilton important for computing matrix functions?
Any analytic function can be expressed as a polynomial of degree < n in . This makes computation tractable.
Q: What happens if we use a different polynomial that A also satisfies?
The minimal polynomial is the unique monic polynomial of smallest degree. Any other annihilating polynomial is a multiple of it.
Q: Does Cayley-Hamilton work for infinite matrices?
Not in general. Extensions exist for certain operators (compact, trace-class) but the characteristic polynomial concept changes.
Q: Can two different matrices have the same characteristic polynomial?
Yes! Similar matrices share the same characteristic polynomial, but non-similar matrices can also have identical . The Jordan form provides more fine-grained distinction.
Arthur Cayley (1821-1895): British mathematician who first stated the theorem for 2×2 and 3×3 matrices in 1858, verifying it by direct computation. He wrote: "I have not thought it necessary to undertake the labor of a formal proof of the theorem in the general case of a matrix of any degree."
William Rowan Hamilton (1805-1865): Irish mathematician who proved a version for quaternions in 1853. Quaternions can be represented as certain 2×2 complex matrices.
Ferdinand Frobenius (1849-1917): German mathematician who gave the first rigorous general proof in 1878.
Modern significance: The theorem is fundamental to matrix theory, control theory, signal processing, and computational linear algebra.
The Theorem
"A matrix satisfies its own characteristic equation"
Inverse Formula
"Solve χ(A)=0 for I, divide by A"
Power Reduction
"Aⁿ expressed using Aⁿ⁻¹, ..., I only"
Minimal Polynomial
"Smallest degree, divides char poly"
Previous
6.4 Jordan Form
Current
6.5 Cayley-Hamilton
Next
7.1 Inner Products
With Cayley-Hamilton mastered, you've completed the Eigenvalues chapter! Next up:
The Cayley-Hamilton theorem states that every square matrix satisfies its own characteristic polynomial. This elegant result has powerful applications for computing matrix inverses, reducing high powers, and connecting to the minimal polynomial.
13+
Theorems
15+
Examples
8
Quiz Questions
8
FAQs
❌ Thinking gives eigenvalues → this gives characteristic equation
❌ Confusing minimal with characteristic polynomial — minimal has smallest degree
❌ Substituting scalar operations directly — matrix multiplication is not commutative
Every square matrix A satisfies its characteristic polynomial: if χ_A(λ) = det(λI - A), then χ_A(A) = 0 (the zero matrix, not the number zero).
If χ_A(λ) = λⁿ + c₁λⁿ⁻¹ + ... + cₙ, then Aⁿ + c₁Aⁿ⁻¹ + ... + cₙI = 0. Rearrange and multiply by A^{-1} to get A^{-1} = -(Aⁿ⁻¹ + c₁Aⁿ⁻² + ... + cₙ₋₁I)/cₙ.
The monic polynomial m(λ) of smallest degree such that m(A) = 0. It always divides the characteristic polynomial. A is diagonalizable iff minimal poly has no repeated roots.
det(A - λI) is a polynomial in the scalar λ. We substitute the matrix A for λ, interpreting constants as multiples of I. The result is a matrix, not a scalar.
Cayley-Hamilton can be proven using Jordan form: each Jordan block satisfies (J - λI)^k = 0 for appropriate k, and the full Jordan matrix satisfies the char poly.
Not directly—Cayley-Hamilton is for finite-dimensional spaces. Extensions exist for certain classes of operators in functional analysis.
Named after Arthur Cayley (who stated it for 2×2 and 3×3 matrices in 1858) and William Rowan Hamilton (who proved a version for quaternions).
The theorem holds regardless—χ_A(A) = 0 even if eigenvalues are complex. The computation uses matrix arithmetic, not finding roots.