The functions that respect the structure
Back in the Functions stratum a function was any rule sending inputs to outputs. Most functions are barbarians — they shred whatever structure they touch. A linear map (or linear transformation) is a function between vector spaces that respects the two operations:
- Additivity: for all .
- Homogeneity: for all , .
Add-then-map equals map-then-add; scale-then-map equals map-then-scale. The structure survives the trip — the map cannot corrupt what it's supposed to preserve. (One immediate consequence, worth proving yourself: — set in homogeneity. A linear map always pins the origin, no exceptions.) The 2D linear transformations from the Matrices stratum were exactly this, in the special case . Now we free the idea from arrows and let it do what it was always meant to do.
A linear map is determined by a basis (the theorem nobody frames)
Here is the most useful theorem in the subject and almost no textbook says it loudly. Let me say it loudly, clearly, and with genuine fury at everyone who buried it in a footnote.
Theorem. Let be a basis of . If you know — just the images of the basis vectors — then is forced for every . The map is completely pinned down.
Proof. Take any . Because is a basis, has unique coordinates (the uniqueness theorem from Basis & Dimension): . Now apply and use linearity repeatedly: where additivity splits the sum and homogeneity pulls out each scalar. Every term on the right is known. So is determined — there's no freedom left.
Read what that means: to specify a linear map out of an -dimensional space, you don't supply infinitely many input-output pairs. You supply of them — where the basis goes — and linearity does the rest. That's a staggering compression, and it's why finite matrices can capture maps on infinite sets of inputs. If this doesn't make you feel something, check your pulse.
The matrix of a linear map: the decoder ring generalizes
This is exactly the Matrices stratum's "columns are the images of the basis vectors" rule, now revealed as a theorem rather than a recipe. To build the matrix of , feed in each standard basis vector and write the outputs as columns: Then for all — because and linearity gives , which is precisely . The 2D decoder ring you learned generalizes verbatim to any dimensions, and to any bases (you read coordinates relative to a chosen basis). Same move, bigger stage.
Kernel and image, and both are subspaces
Every linear map carries two subspaces with it, like a murderer's two weapons — the two questions "what gets crushed?" and "what gets hit?".
- Kernel: — everything sent to zero (the Subspaces node's null space, generalized).
- Image: — everything actually reached.
Theorem. is a subspace of and is a subspace of .
Proof (kernel). Three-condition test. (1) , so . (2) If then , so . (3) If , , so . ✓
Proof (image). Again three conditions, in . (1) . (2) If are in the image, then . (3) . ✓ Both subspaces, both proved by the same machine from Subspaces.
kernel = {0} ⟺ injective (beautiful and short)
Now the gem. Recall injective (from the Functions stratum) means one-to-one: distinct inputs give distinct outputs.
Theorem. A linear map is injective .
Proof. () If is injective: always, so no other vector can map to (that would be two inputs sharing an output). Hence .
() Suppose . Take any with . Then by linearity so , forcing , i.e. . So distinct inputs can't share an output — is injective.
Savor how short that is. For a general function, injectivity means checking all pairs of inputs — infinite pairs, potentially. For a linear map, you check one thing: does anything besides get crushed to ? That's it. Linearity collapses a global property to a single local one. This is what structure buys you, and this is why we went through the hell of those ten axioms.
A jolt: differentiation is a linear map
Final beat, and it should rearrange your skull a little. Take , polynomials of degree , and define the derivative as a formal rule — no calculus prerequisite, just the pattern extended linearly. So . This is a linear map: (Differentiation splitting over sums and pulling out constants — those are linearity, stated.)
Now compute its kernel: which polynomials does send to ? Exactly the ones with derivative zero — the constants. So Hold on. WHAT? The kernel of differentiation is the constants. That bland, boring fact from a calculus class ("the derivative of a constant is zero") is secretly a statement about a linear map's null space — a subspace, one-dimensional, sitting inside a polynomial vector space. The whole machinery we built for — kernel, image, subspace, dimension — governs calculus too, because calculus operators are linear maps. That's the business model of abstraction collecting its biggest dividend yet, and we didn't even need to build a reactor for it. File it away: it detonates across all of higher mathematics.