← mapVectors & Matrices

Matrix Multiplication

requiresMatrices & Their Arithmetic ⚠The Dot Product ⚠

⚗ Dr. Möbius, from the lab

Listen carefully, because I'm about to explain why matrix multiplication looks the way it does — and if you've ever stared at the row-times-column rule and thought "who the hell designed this?", the answer is: mathematics itself, not a sadist. The rule is forced on us, inevitably, the moment you ask "what matrix represents two transformations applied one after the other?" That question has one answer. We're going to derive it right now, in this lab, and when we're done you will never again look at that rule and feel confused. You may feel other things. Possibly awe. Possibly mild existential dread. Both are correct responses.

THE BIG IDEA

Matrix multiplication is defined by the requirement that it represent function composition: (AB)v = A(Bv) for every vector v, and this forces the row-times-column dot-product rule.

The rule, and why it exists

Before I state the rule, I need you to understand where it comes from — because without the "why," you're just memorizing a mechanical procedure, and that is a waste of both our time. We built functions in the Sets stratum. Matrices are functions on $\mathbb{R}^n$ (full story next lesson). When you compose two functions $g \circ f$ , you want a single object that represents "apply $f$ first, then $g$ ." For matrices, that single object is the product $AB$ — and the requirement that $(AB)\vec{v} = A(B\vec{v})$ for every vector $\vec{v}$ is what forces the multiplication rule. The rule is not arbitrary. It is the only rule consistent with composition. Remember when the Sets stratum showed that composition is almost never commutative? We're about to see that explode spectacularly, and it'll be fucking glorious.

The product rule: For $A$ ( $m \times n$ ) and $B$ ( $n \times p$ ), the product $AB$ is the $m \times p$ matrix whose entry in row $i$ , column $j$ is the dot product of row $i$ of $A$ with column $j$ of $B$ :

$(AB)_{ij} = \sum_{k=1}^{n} a_{ik} b_{kj} = (\text{row } i \text{ of } A) \cdot (\text{col } j \text{ of } B)$

Every entry of $AB$ is a dot product. This is not coincidental — it's the algebraic trace of the composition requirement.

Dimension compatibility: the handshake requirement

For $AB$ to exist, the inner dimensions must match: $A$ is $m \times n$ , $B$ must be $n \times p$ for some $p$ . The result is $m \times p$ .

$(\underbrace{m \times n}_{A})(\underbrace{n \times p}_{B}) = (\underbrace{m \times p}_{AB})$

The inner $n$ 's must agree — they "cancel". Think of it as a handshake: the number of columns in the left matrix must equal the number of rows in the right matrix, or they refuse to shake hands and the product is undefined.

A concrete product

Let $A = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}$ and $B = \begin{pmatrix} 5 & 6 \\ 7 & 8 \end{pmatrix}$ , both $2\times 2$ .

$(AB)_{11} = (1)(5) + (2)(7) = 5 + 14 = 19$ $(AB)_{12} = (1)(6) + (2)(8) = 6 + 16 = 22$ $(AB)_{21} = (3)(5) + (4)(7) = 15 + 28 = 43$ $(AB)_{22} = (3)(6) + (4)(8) = 18 + 32 = 50$

$AB = \begin{pmatrix} 19 & 22 \\ 43 & 50 \end{pmatrix}$

Each entry required a dot product: row of the left matrix against column of the right. Every. Single. Entry.

$AB \ne BA$ in general

Now watch this. Using the same matrices:

$(BA)_{11} = (5)(1) + (6)(3) = 5 + 18 = 23$ $(BA)_{12} = (5)(2) + (6)(4) = 10 + 24 = 34$ $(BA)_{21} = (7)(1) + (8)(3) = 7 + 24 = 31$ $(BA)_{22} = (7)(2) + (8)(4) = 14 + 32 = 46$

$BA = \begin{pmatrix} 23 & 34 \\ 31 & 46 \end{pmatrix} \ne AB = \begin{pmatrix} 19 & 22 \\ 43 & 50 \end{pmatrix}$

Matrix multiplication is not commutative. This should not be surprising — you already knew from the Sets stratum that composition of functions is not commutative. "Put on socks then shoes" and "put on shoes then socks" are not the same sequence. $AB$ is "apply $B$ first, then $A$ ". $BA$ is "apply $A$ first, then $B$ ". Different order, different result. The non-commutativity of matrix multiplication is not a bug, it is not a quirk, it is not something to be apologized for. It is the direct mathematical signature of the non-commutativity of function composition, made concrete and inescapable. Every time a student writes $AB = BA$ without checking, somewhere in this lab a beaker cracks.

The matrix-vector product: columns are key

A special and vital reading of matrix multiplication: $A\vec{x}$ for a matrix $A$ ( $m \times n$ ) and a column vector $\vec{x}$ ( $n \times 1$ ) is a matrix product yielding an $m \times 1$ vector.

$A\vec{x} = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = \begin{pmatrix} a_{11} x_1 + a_{12} x_2 \\ a_{21} x_1 + a_{22} x_2 \end{pmatrix}$

But here's the reading you should burn into your memory: if $\vec{c}_1, \vec{c}_2$ are the columns of $A$ , then

$A\vec{x} = x_1 \vec{c}_1 + x_2 \vec{c}_2$

$A\vec{x}$ is a linear combination of $A$ 's columns, with the entries of $\vec{x}$ as the coefficients. The components of $\vec{x}$ are the mixing weights. This is THE most important reading of matrix-vector multiplication, and we will use it constantly — burn it into your skull: in understanding systems of equations ( $A\vec{x} = \vec{b}$ is solvable iff $\vec{b}$ is a combination of $A$ 's columns), in understanding transformations (next lesson), and throughout the Vector Spaces stratum. If you take nothing else from this lesson, take that sentence. Write it on your hand. Tattoo it somewhere tasteful.

🔬 SPECIMENS (worked examples)

Worked example 1 — computing a product step by step▸

Compute $AB$ for $A = \begin{pmatrix} 2 & 1 \\ -1 & 3 \end{pmatrix}$ and $B = \begin{pmatrix} 1 & 0 \\ 4 & -2 \end{pmatrix}$ .

Both matrices are $2\times 2$ , so $AB$ is $2\times 2$ . Compute all four entries as dot products:

$(AB)_{11} = (\text{row }1\text{ of }A)\cdot(\text{col }1\text{ of }B) = (2)(1) + (1)(4) = 2 + 4 = 6$

$(AB)_{12} = (\text{row }1\text{ of }A)\cdot(\text{col }2\text{ of }B) = (2)(0) + (1)(-2) = 0 - 2 = -2$

$(AB)_{21} = (\text{row }2\text{ of }A)\cdot(\text{col }1\text{ of }B) = (-1)(1) + (3)(4) = -1 + 12 = 11$

$(AB)_{22} = (\text{row }2\text{ of }A)\cdot(\text{col }2\text{ of }B) = (-1)(0) + (3)(-2) = 0 - 6 = -6$

$AB = \begin{pmatrix} 6 & -2 \\ 11 & -6 \end{pmatrix}$

To verify: check $(AB)_{21}$ again. Row 2 of $A$ is $(-1, 3)$ ; column 1 of $B$ is $(1, 4)^T$ . Dot product: $-1+12=11$ . Correct.

Worked example 2 — the columns-of-A reading▸

Express $A\vec{x}$ as a linear combination of the columns of $A$ , for $A = \begin{pmatrix} 3 & -1 \\ 2 & 5 \end{pmatrix}$ and $\vec{x} = \begin{pmatrix} 4 \\ -2 \end{pmatrix}$ .

Column reading: The columns of $A$ are $\vec{c}_1 = \begin{pmatrix}3\\2\end{pmatrix}$ and $\vec{c}_2 = \begin{pmatrix}-1\\5\end{pmatrix}$ .

$A\vec{x} = x_1 \vec{c}_1 + x_2 \vec{c}_2 = 4\begin{pmatrix}3\\2\end{pmatrix} + (-2)\begin{pmatrix}-1\\5\end{pmatrix}$

$= \begin{pmatrix}12\\8\end{pmatrix} + \begin{pmatrix}2\\-10\end{pmatrix} = \begin{pmatrix}14\\-2\end{pmatrix}$

Verify with the row rule: Row 1 of $A$ times $\vec{x}$ : $(3)(4) + (-1)(-2) = 12 + 2 = 14$ . Row 2: $(2)(4) + (5)(-2) = 8 - 10 = -2$ . Same answer, different view. Both views are always correct — use whichever serves the problem better.

Worked example 3 — AB and BA differ: the non-commutativity specimen▸

Let $A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$ and $B = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$ . Compute both $AB$ and $BA$ and compare.

Compute $AB$ : $(AB)_{11} = (0)(0) + (1)(1) = 1, \quad (AB)_{12} = (0)(0) + (1)(0) = 0$ $(AB)_{21} = (0)(0) + (0)(1) = 0, \quad (AB)_{22} = (0)(0) + (0)(0) = 0$ $AB = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}$

Compute $BA$ : $(BA)_{11} = (0)(0) + (0)(0) = 0, \quad (BA)_{12} = (0)(1) + (0)(0) = 0$ $(BA)_{21} = (1)(0) + (0)(0) = 0, \quad (BA)_{22} = (1)(1) + (0)(0) = 1$ $BA = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}$

$AB \ne BA$ . In fact, $AB$ has its $1$ in the top-left, $BA$ has its $1$ in the bottom-right. These matrices are as different as possible for $2\times 2$ diagonal matrices. This is your go-to example when you need to prove noncommutativity of matrix multiplication to a skeptic.

☠ KNOWN HAZARDS

Trying to commute. $AB = BA$ is almost never true. Always re-derive the product in the order specified. Never flip the order to make computation easier.
Ignoring dimension compatibility. Trying to multiply a $2\times 3$ matrix by a $2\times 3$ matrix is undefined — the inner dimensions $3$ and $2$ don't match. Check the handshake before starting any computation.
Multiplying entrywise. Matrix multiplication is NOT $a_{ij} \cdot b_{ij}$ (that's the Hadamard product, a different and much less useful operation that you will basically never encounter in serious work). It's row-times-column dot products. Period. Full stop. No exceptions. Do not make me say this again.
Thinking $AB = 0$ implies $A = 0$ or $B = 0$ . This is false for matrices — demonstrably, provably, infuriatingly false. You can have two nonzero matrices whose product is zero. The zero-product property of real numbers does not survive the journey into matrix-land. Another thing you assumed was universal that isn't. Welcome to abstraction; it keeps pulling the rug out.

TL;DR

▸
Matrix multiplication is forced by function composition: $(AB)\vec{v} = A(B\vec{v})$ for all $\vec{v}$ .
▸
Rule: $(AB)_{ij} = (\text{row }i \text{ of }A) \cdot (\text{col }j\text{ of }B)$ . Every entry is a dot product.
▸
Dimension compatibility: $(m\times n)(n\times p) = (m\times p)$ . Inner dimensions must match.
▸
$AB \ne BA$ in general — noncommutativity is inherited from function composition.
▸
$A\vec{x} = x_1\vec{c}_1 + x_2\vec{c}_2 + \cdots$ — the matrix-vector product is a linear combination of $A$ 's columns.

unlocks

Linear Transformations of the Plane →