← mapLogic & Proof

Induction

⚗ Dr. Möbius, from the lab

How do you prove a statement about all of the infinitely many natural numbers in finite time? You can't check them one by one — you'd die first, and I'd be very annoyed. Induction is the cheat code: knock over the first domino, prove every domino topples the next, and walk away while infinity falls for you. It's the closest thing mathematics has to a perpetual motion machine, and unlike those fraudulent devices cluttering every crackpot's garage, this one actually fucking works.

THE BIG IDEA

If a statement holds for the base case and each case forces the next, then it holds for every natural number — by the principle of mathematical induction.

The domino principle, made rigorous

You want to prove P(n)P(n) is true for every natural number n1n \ge 1 (or 0\ge 0). Induction says you only need two things:

  1. Base case. Prove P(1)P(1) — the first domino falls.
  2. Inductive step. Prove that for every kk, if P(k)P(k) holds then P(k+1)P(k+1) holds — each domino knocks over the next.

If you establish both, the principle of mathematical induction guarantees P(n)P(n) holds for all n1n \ge 1. Why? Because P(1)P(1) is true (base case), and the step carries it forward: P(1)P(2)P(3)P(1) \Rightarrow P(2) \Rightarrow P(3) \Rightarrow \cdots, reaching every natural number in finitely many hops. The reasoning bottoms out in the successor structure of N\mathbb{N} from the very first lesson — every natural is 00 wearing finitely many SS's, so induction catches every one of them.

The two parts are a package deal. The base case alone proves one thing, exactly one thing. The step alone proves "if any domino falls, the rest do" — useless if none of them falls. Together they're infinity-conquering. Apart they're nothing. Worse than nothing, actually, because they look plausible.

Anatomy of the inductive step: the hypothesis is free loot

The inductive step is where everyone freezes, so slow down. You are proving an implication: "if P(k)P(k), then P(k+1)P(k+1)." By everything you learned in Direct Proof, you prove an implication by assuming the hypothesis — here, assume P(k)P(k) is true. That assumption has a name: the inductive hypothesis (IH).

Students panic: "isn't assuming P(k)P(k) assuming what I'm trying to prove?!" No, and this confusion will haunt you if you don't kill it right now. You're not assuming P(k+1)P(k+1), the thing you actually want. You're assuming P(k)P(k) — one specific, earlier domino — and using it to reach the next one. The IH is free loot: a gift you get to spend, handed to you by the logic of implication. The whole art of the inductive step is finding where to spend it — algebraically connecting P(k+1)P(k+1) back to P(k)P(k) so the IH can do its work.

Worked: 1 + 2 + ⋯ + n = n(n+1)/2

Claim. For all n1n \ge 1: 1+2++n=n(n+1)2\displaystyle 1 + 2 + \cdots + n = \frac{n(n+1)}{2}.

Proof (by induction on nn).

Base case (n=1n = 1): The left side is 11. The right side is 122=1\frac{1 \cdot 2}{2} = 1. Equal. ✓

Inductive step: Fix k1k \ge 1 and assume the IH: 1+2++k=k(k+1)21 + 2 + \cdots + k = \frac{k(k+1)}{2}. We must show P(k+1)P(k+1), i.e. 1+2++(k+1)=(k+1)(k+2)21 + 2 + \cdots + (k+1) = \frac{(k+1)(k+2)}{2}.

Start from the left side of P(k+1)P(k+1) and split off the last term to expose the IH: 1+2++kthis is the IH+(k+1)=k(k+1)2+(k+1).\underbrace{1 + 2 + \cdots + k}_{\text{this is the IH}} + (k+1) = \frac{k(k+1)}{2} + (k+1). That substitution — replacing the first kk terms with k(k+1)2\frac{k(k+1)}{2} — is exactly where we spent the free loot. Now just algebra: k(k+1)2+(k+1)=k(k+1)+2(k+1)2=(k+1)(k+2)2.\frac{k(k+1)}{2} + (k+1) = \frac{k(k+1) + 2(k+1)}{2} = \frac{(k+1)(k+2)}{2}. That's precisely the right side of P(k+1)P(k+1). So P(k)P(k+1)P(k) \Rightarrow P(k+1). By induction, the formula holds for all n1n \ge 1. \blacksquare

The Gauss story, for contrast. Legend has it a child Carl Friedrich Gauss, told to add 11 through 100100, instantly answered 50505050. His teacher probably wanted to die. He paired the terms: 1+100=1011 + 100 = 101, 2+99=1012 + 99 = 101, …, 5050 pairs each summing to 101101, giving 50×101=5050=100101250 \times 101 = 5050 = \frac{100 \cdot 101}{2}. Same formula, totally different proof — pairing gives insight into why; induction gives a guarantee that it holds for every nn. Keep both in your kit: insight finds the formula, induction certifies it.

Worked: the sum of the first n odd numbers is n²

Claim. For all n1n \ge 1: 1+3+5++(2n1)=n21 + 3 + 5 + \cdots + (2n - 1) = n^2.

Proof (by induction). Base (n=1n=1): left side =1= 1, right side =12=1= 1^2 = 1. ✓

Step: Assume the IH: 1+3++(2k1)=k21 + 3 + \cdots + (2k - 1) = k^2. The (k+1)(k+1)-th odd number is 2(k+1)1=2k+12(k+1) - 1 = 2k + 1. So 1+3++(2k1)IH=k2+(2k+1)=k2+(2k+1)=(k+1)2.\underbrace{1 + 3 + \cdots + (2k-1)}_{\text{IH} = k^2} + (2k + 1) = k^2 + (2k + 1) = (k+1)^2. That's P(k+1)P(k+1). By induction, the sum of the first nn odd numbers is n2n^2 for all n1n \ge 1. \blacksquare

Worked: a divisibility example — 6 divides n³ − n

Claim. For all n0n \ge 0: 6(n3n)6 \mid (n^3 - n).

Proof (by induction). Base (n=0n = 0): 030=00^3 - 0 = 0, and 606 \mid 0. ✓

Step: Assume the IH: 6(k3k)6 \mid (k^3 - k), i.e. k3k=6mk^3 - k = 6m for some integer mm. Consider (k+1)3(k+1)(k+1)^3 - (k+1) and expand: (k+1)3(k+1)=(k3+3k2+3k+1)(k+1)=(k3k)+3k2+3k.(k+1)^3 - (k+1) = (k^3 + 3k^2 + 3k + 1) - (k + 1) = (k^3 - k) + 3k^2 + 3k. The first chunk is 6m6m by the IH. For the rest, 3k2+3k=3k(k+1)3k^2 + 3k = 3k(k+1), and k(k+1)k(k+1) is a product of two consecutive integers, hence even — so 3k(k+1)3k(k+1) is divisible by 66. Thus (k+1)3(k+1)=6m+3k(k+1)=6m+6t=6(m+t)(k+1)^3 - (k+1) = 6m + 3k(k+1) = 6m + 6t = 6(m + t) for some integer tt, which is divisible by 66. So P(k)P(k+1)P(k) \Rightarrow P(k+1), and by induction 6(n3n)6 \mid (n^3 - n) for all n0n \ge 0. \blacksquare

The pattern in every inductive step is the same: carve P(k+1)P(k+1) open until the P(k)P(k) piece appears, slot in the IH, clean up. That is the entire goddamn technique. There is no secret beyond that.

Where induction fails: skip the base case and lose your mind

Both parts are mandatory, and skipping the base case produces disasters that are hilarious only if they're not yours. Behold the "all horses are the same color" non-theorem — a specimen I keep preserved in the lab as a warning:

Claim: in any group of nn horses, all are the same color. "Step": given k+1k+1 horses, remove one — by IH the remaining kk match; put it back, remove a different one — those kk match too; overlap them and all k+1k+1 match.

The inductive step looks fine for large kk — but it secretly fails at the very first transition, k=1k=2k=1 \to k=2: two horses, remove one (the other is "all the same color," trivially), remove the other — but the two groups of one don't overlap, so nothing forces the two horses to match. The chain never gets off the ground. A broken or missing base case means dominoes that never start falling — and you can "prove" any insane damn thing. Always nail the base case. This is not optional. This is not a style choice. Now go conquer some infinities.

🔬 SPECIMENS (worked examples)

Worked example 1 — sum of the first n squares

Prove by induction: for all n1n \ge 1,  12+22++n2=n(n+1)(2n+1)6\ 1^2 + 2^2 + \cdots + n^2 = \dfrac{n(n+1)(2n+1)}{6}.

Proof (by induction on nn).

Base (n=1n=1): left side =12=1= 1^2 = 1; right side =1236=1= \frac{1 \cdot 2 \cdot 3}{6} = 1. Equal. ✓

Step: Assume the IH: 12++k2=k(k+1)(2k+1)61^2 + \cdots + k^2 = \frac{k(k+1)(2k+1)}{6}. Show P(k+1)P(k+1). Split off the last term and substitute the IH: 12++k2IH+(k+1)2=k(k+1)(2k+1)6+(k+1)2.\underbrace{1^2 + \cdots + k^2}_{\text{IH}} + (k+1)^2 = \frac{k(k+1)(2k+1)}{6} + (k+1)^2. Factor out (k+1)/6(k+1)/6: =(k+1)[k(2k+1)+6(k+1)]6=(k+1)(2k2+7k+6)6=(k+1)(k+2)(2k+3)6.= \frac{(k+1)\big[k(2k+1) + 6(k+1)\big]}{6} = \frac{(k+1)(2k^2 + 7k + 6)}{6} = \frac{(k+1)(k+2)(2k+3)}{6}. That last factoring uses 2k2+7k+6=(k+2)(2k+3)2k^2 + 7k + 6 = (k+2)(2k+3). The result is exactly (k+1)((k+1)+1)(2(k+1)+1)6\frac{(k+1)((k+1)+1)(2(k+1)+1)}{6}, which is P(k+1)P(k+1). By induction the formula holds for all n1n \ge 1. \blacksquare

The engine is identical to the lesson's examples: peel off the (k+1)(k+1)-th term, slot in the IH, grind the algebra.

Worked example 2 — a geometric-style sum

Prove by induction: for all n1n \ge 1,  1+2+4++2n1=2n1\ 1 + 2 + 4 + \cdots + 2^{n-1} = 2^n - 1.

Proof (by induction on nn).

Base (n=1n=1): left side is the single term 20=12^{0} = 1; right side =211=1= 2^1 - 1 = 1. Equal. ✓

Step: Assume the IH: 1+2++2k1=2k11 + 2 + \cdots + 2^{k-1} = 2^k - 1. The (k+1)(k+1)-th term is 2(k+1)1=2k2^{(k+1)-1} = 2^k. So 1+2++2k1IH=2k1+2k=(2k1)+2k=22k1=2k+11.\underbrace{1 + 2 + \cdots + 2^{k-1}}_{\text{IH} = 2^k - 1} + 2^k = (2^k - 1) + 2^k = 2 \cdot 2^k - 1 = 2^{k+1} - 1. That's P(k+1)P(k+1). By induction, 1+2++2n1=2n11 + 2 + \cdots + 2^{n-1} = 2^n - 1 for all n1n \ge 1. \blacksquare

This is the "doubling beats adding" fact from the very first lesson, now nailed down: kk binary slots reach all the way to 2k12^k - 1. Induction certifies what place-value promised.

Worked example 3 — the trap: perfect step, dead base case, everything is ash

Consider the claim: "for all n1n \ge 1,  1+2++n=n(n+1)2+1\ 1 + 2 + \cdots + n = \frac{n(n+1)}{2} + 1." The inductive step "goes through." Find the flaw.

Watch the step appear to work. Assume the (false) IH 1++k=k(k+1)2+11 + \cdots + k = \frac{k(k+1)}{2} + 1. Then 1++k+(k+1)=k(k+1)2+1+(k+1)=(k+1)(k+2)2+1,1 + \cdots + k + (k+1) = \frac{k(k+1)}{2} + 1 + (k+1) = \frac{(k+1)(k+2)}{2} + 1, which is exactly the claimed formula at n=k+1n = k+1. So the inductive step is valid! The implication P(k)P(k+1)P(k) \Rightarrow P(k+1) genuinely holds.

So why is the theorem false? The base case fails. At n=1n = 1: left side =1= 1, but the claimed right side is 122+1=2\frac{1 \cdot 2}{2} + 1 = 2. 121 \ne 2. ✗ The first domino never falls, so the perfectly good step has nothing to propagate — every domino is poised to knock over the next, but none of them is pushed.

This is the lesson's whole warning, weaponized: a flawless inductive step is worthless without a true base case. The step says "P(k)P(k+1)P(k) \Rightarrow P(k+1)"; if P(1)P(1) is false, that implication just relays falsehood forever. Always, always check the base case. \blacksquare

☠ KNOWN HAZARDS

  • Skipping or faking the base case. Without a true P(1)P(1), the dominoes never start. The "all horses same color" fraud dies exactly because its base/first-step fails.

  • Thinking the IH is circular. Assuming P(k)P(k) is not assuming P(k+1)P(k+1). You assume an earlier case to reach the next one — that's the legitimate logic of proving an implication, not a cheat.

  • Not actually using the inductive hypothesis. If your step never invokes P(k)P(k), you haven't done induction (and probably have a hidden error). The whole point is to connect P(k+1)P(k+1) back to P(k)P(k).

  • Proving the step for one value of kk. The inductive step must hold for every kk — argue about an arbitrary kk, exactly like proving any \forall statement.

TL;DR

  • Induction = base case P(1)P(1) + inductive step (P(k)P(k+1)P(k) \Rightarrow P(k+1) for all kk) \Rightarrow P(n)P(n) for all n1n \ge 1. Dominoes, rigorously.

  • The inductive hypothesis P(k)P(k) is free loot you assume — proving an implication means assuming its hypothesis. You are NOT assuming P(k+1)P(k+1).

  • The step always works the same way: open up P(k+1)P(k+1), locate the P(k)P(k) piece, substitute the IH, simplify. E.g. 1++(k+1)=k(k+1)2+(k+1)=(k+1)(k+2)21+\cdots+(k+1) = \frac{k(k+1)}2 + (k+1) = \frac{(k+1)(k+2)}2.

  • Classic results: 1+2++n=n(n+1)21+2+\cdots+n = \frac{n(n+1)}2, sum of first nn odds =n2= n^2, and 6(n3n)6 \mid (n^3 - n).

  • Both parts are mandatory. Skip the base case and you get nonsense like "all horses are the same color" — the chain never starts. Always verify P(1)P(1).

unlocks