Matrices
Contents
3.3. Matrices#
Matrices are very useful objects with many applications. Loosely speaking, they are a two-dimensional array of or table of other mathematical objects.
Matrices are useful both as objects themselves and as representations of other relationships or properties. As we will explore, matrices can be used to encode binary relations, functions, and even ore abstract mathematical objects like graphs (see Section 7).
3.3.1. Definitions#
A matrix is some table of numbers, symbols, or mathematical objects coming from a set. A matrix has a “height” and a “width” corresponding to the number of rows and the number of columns, respectively. We describe a matrix first by its number of rows and then by its number of columns. A “two by three matrix of integers”, shown below, has two rows, three columns, and its entries are integers.
For a particular \((i,j)\) index, we can say the \((i,j)\)th entry of a matrix is the element of the matrix in the \(i\)th row and the \(j\)th column. In the previous matrix, \(5\) is the \((1,2)\) entry. \(-7\) is the \((2,1)\) entry. Generally, we can have an \(m\) by \(n\) matrix with \(m\) rows and \(n\) columns. An \(m\) by \(n\) matrix can be denoted as \(A_{m\times n}\)
Definition (matrix)
An \(m\) by \(n\) matrix is a rectangular array of entries coming from some set organized into \(m\) rows and \(n\) columns.
We give special names to matrices with certain dimensions.
- When \(m = 1\) we have a row vector. Below is a row vector with \(7\) entries. 
- When \(n = 1\) we have a column vector. Below is a column vector with \(6\) entries. 
- When \(m=n\), we have a square matrix. We say a square matrix is of order \(n\) for a matrix with dimension \(n\) by \(n\). A square matrix of order \(3\) is shown below. 
We can denote a \(m\) by \(n\) matrix \(A\) as \(A = (a_{i,j})\). This is similar to how we expressed sequences, but now with two indices. Unlike sequences, which may start with a term \(a_0\), matrices (almost) always start with \(a_{1,1}\) and end with \(a_{m,n}\). Since entries in a matrix come from some set \(S\), and it has a total of \(m \times n\) entries, we can view matrices as elements of \(S^{m \times n}\).
We can consider the \(i\)th row of an \(m\) by \(n\) matrix as a sequence with \(n\) terms, an \(n\)-tuple, or as a row vector with \(n\) entries.
Similarly, the \(j\)th column is a column vector with \(n\) entries.
When the entries of a matrix comes from a particular set of numbers, we can declare that matrix, for example, a “real matrix”, a “complex matrix”, or an “integer matrix”, to say its entries are real numbers, complex numbers, or integers, respectively.
Although we can view, for example, \(2\) by \(3\) real matrices as elements of \(\mathbb{R}^{2 \times 3}\). We must remember that matrices have a notion of “dimension” that generic elements of \(\mathbb{R}^{2\times3} = \mathbb{R}^6\) do not. Indeed, a \(2\) by \(3\) matrix is not the same as a \(3\) by \(2\) matrix, although both can be viewed as elements of \(\mathbb{R^6}\). Performing arithmetic on matrices will show this explicitly.
3.3.2. Operations on Matrices#
When matrices have entries coming from a set with supports addition and multiplication, like real numbers, integers, or even more general objects like polynomials, it is possible to combine them with a generalized form of arithmetic.
There are three basic operations: scalar multiplication, addition, and matrix multiplication. As we will see, two matrices must have so-called compatible dimensions, in order to be combined arithmetically.
Scalar Multiplication#
Scalar multiplication is a simple operation which multiplies a single number against each entry of a matrix to produce another matrix of the same dimensions.
Say \(A = (a_{i,j})\) is an \(m\) by \(n\) integer matrix. Given some other integer \(c\), \(cA\) is another \(m\) by \(n\) matrix where:
Scalar multiplication
Addition and Subtraction#
Two matrices can be added or subtracted when they have the same dimensions. For two \(m\) by \(n\) matrices, \(A = (a_{i,j})\) and \(B = (b_{i,j})\), their sum \(A + B\) is equal to \((a_{i,j} + b_{i,j})\). That is, each entry of \(A\) is added to the corresponding entry in \(B\) in the same position.
Thanks to scalar multiplication, we can define subtraction via addition.
To add or subtract matrices they must be of the same dimensions. Every entry in the first matrix must have a corresponding entry in the second matrix to be added to.
Matrix addition
Matrix Multiplication#
Matrix multiplication is much more involved than addition. First, we must consider under which conditions two matrices can be multiples.
Matrix multiplication between two matrices is only defined when the number of columns in the left-hand matrix equals the number of rows in the right-hand matrix. The result of the multiplication is another matrix whose number of rows equals the left-hand matrix’s and whose number of columns equals the right-hand matrix’s.
In this notation, the “inner” dimensions must be the same, and the “outer” dimensions give the dimensions of the product. In this case, \(n = n\) are the inner dimensions, and \(m, p\) are the outer dimensions.
But how do we define the entries of the product \(C = (c_{i,j})\)? Each entry of the matrix product \(c_{i,j}\) is an inner product of the \(i\)th row of \(A\) and the \(j\)th column of \(B\).
Therefore, the matrix multiplication \(A_{m\times n} \cdot B_{n\times p} = C_{m \times p}\) is actually \(m \times p\) individual inner products.
Fig. 3.20 A \(4\) by \(4\) matrix multiplication showing the inner product producing \(c_{2,2}\)#
Matrix multiplication
Matrix commutativity
\(A = \begin{bmatrix} 3 & 2 \\ 0 & -1 \end{bmatrix}\), \(B = \begin{bmatrix} 2 & -2 & 5 \\ 0 & -1 & 3 \\ \end{bmatrix} \)
Compute \(A \cdot B\), \(A \cdot A\), and \(B \cdot A\), if possible.
Matrix commutativity
The previous checkpoint brings to light an interesting consequence of matrix multiplication. For non-square matrices, if \(A \cdot B\) is defined, then \(B \cdot A\) is not. For square matrices, \(A \cdot B\) and \(B \cdot A\) are both defined, but it is very unlikely that \(AB = BA\).
Tip
Matrix multiplication is not commutative. In general, \(AB \neq BA\).
The zero and identity matrices#
Of course, it is sometimes possible for \(AB =BA\). Two important examples are when one of the matrices is the zero matrix or the identity matrix.
The zero matrix is a matrix with all zero entries. The \(m\) by \(n\) zero matrix is denoted by \(0_{m,n}\).
Assuming matrix multiplication is defined, the product of a zero matrix by any other matrix is the zero-matrix (of possibly different dimension).
Zero multiplication
For a matrix \(A = (a_{i,j})\) its main diagonalis the entries \(a_{i,j}\) for which \(i = j\). In a square matrix of order \(n\), there are \(n\) entries along the main diagonal. In a rectangular matrix, the smaller of the two dimensions determines the number of entries along the main diagonal.
Note that the other entries need not be 0. For example, the three main diagonals of the matrices in the Zero multiplication example are \((0,0)\), \((2,3)\), and \((0,0,0)\). If all entries not on the main diagonal are 0, the we call that matrix a diagonal matrix.
The identity matrix is a square diagonal matrix with all entries of the main diagonal equal to \(1\). The identity matrix of order \(n\) is denoted \(I_n\).
The property of the identity matrix is that any other matrix multiplied by the identity is equal to itself. For an \(m\) by \(n\) matrix \(A\), we have:
If \(A\) is a square matrix of order \(n\), then we have:
We can also leave the order of the identity matrix implicit. For an \(m\) by \(n\) matrix \(A\), we write \(AI\) to mean \(A\) times \(I_n\). The dimensions of the other matrix dictate the order of the identity matrix.
Identity multiplication
Powers of Matrices#
For square matrices, we can define the power of a matrix as a the repeated multiplication of the matrix with itself.
Let \(A\) be a square matrix of order \(n\). Then:
The square of a matrix
Matrix Transpose#
A transpose is an operation performed on a matrix which reverses its dimensions. In particular, it exchanges the rows of the matrix with its columns.
Given an \(m\) by \(n\) matrix \(A\), its transpose is denoted \(A^T\) and is an \(n\) by \(m\) matrix. \(A^T\) is defined as:
Notice that the second column becomes the second row.
Since a transpose changes the dimensions, the transpose of a “tall” matrix, is a “wide” matrix. Visually, matrix transposition “flips” or “rotates” the entries of the matrix along its main diagonal. The entries on the main diagonal do not change. See the below animation.
 
Fig. 3.21 Matrix transpose animated.#
Matrix transposition is easily reversible. Simply apply the transpose a second time.
Notice also that the identity matrix is not affected by transposition.
Matrix transpose
Compute the transpose of the below matrices.
Matrix transpose
3.3.3. Zero-One Matrices#
A special class of matrices whose entries come from the set \(\{0,1\}\) are called zero-one matrices. These matrices have numerous and important applications in computer science. Indeed, \(\{0,1\}\) may represent binary digits, ubiquitous in computer science.
Moreover, we can use \(\{0,1\}\) to encode truth values \(\mathbf{F}\) and \(\mathbf{T}\), respectively. This extends propositional logic to matrices. We have the following symmetries with propositional logic:
| \(a\) | \(b\) | \(a \land b\) | 
|---|---|---|
| 0 | 0 | 0 | 
| 0 | 1 | 0 | 
| 1 | 0 | 0 | 
| 1 | 1 | 1 | 
| \(a\) | \(b\) | \(a \lor b\) | 
|---|---|---|
| 0 | 0 | 0 | 
| 0 | 1 | 1 | 
| 1 | 0 | 1 | 
| 1 | 1 | 1 | 
Join and Meet#
For two zero-one matrices, we can define operations similar to matrix addition, but using \(\land\) and \(\lor\) instead of \(+\). Let \(A = (a_{i,j})\) and \(B = (b_{i,j})\) be \(m\) by \(n\) matrices.
The join of \(A\) and \(B\) is the \(m\) by \(n\) zero-one matrix defined as:
The meet of \(A\) and \(B\) is the \(m\) by \(n\) zero-one matrix defined as:
Joins and meets
The join of \(A\) and \(B\) is:
The meet of \(A\) and \(B\) is:
Boolean Product#
Similar to matrix multiplication, we can define a multiplication-like operation using zero-one matrices.
The typical inner product used matrix multiplication replaced by a “disjunction of conjunctions”.
Let \(A\) be an \(m\) by \(n\) zero-one matrix and \(B\) be an \(n\) by \(p\) zero-one matrix. The boolean product of \(A\) and \(B\), denoted \(A \odot B\), is an \(m\) by \(p\) zero-one matrix with elements \(c_{i,j}\) defined as:
Boolean product
For square zero-one matrices, we can extend Boolean product to Boolean power. Denote by \(A^{[n]}\) the Boolean product of \(A\) with itself \(n\) times. Since Boolean products are associative, the order of operations does not matter for a Boolean power.
3.3.4. Relations as Matrices#
As a first application of zero-one matrices, let us recall binary relations. A binary relation \(\mathcal{R}\) from set \(A\) to set \(B\) is a subset of the Cartesian product \(A \times B\).
We previously saw that one possible view of binary relation is as a table. This table is similar to the one of the Cartesian product \(A \times B\), except that now we are only interested in some of the cells of the table. In particular, the cells corresponding to the elements of the relations.
Let \(A = \{1,2,3\}\) and \(B = \{a,b,c,d,e\}\). A binary relation from \(A\) to \(B\) could be:
and its representation as a table:
| \(\ \) | \(a\) | \(b\) | \(c\) | \(d\) | \(e\) | 
| \(1\) | X | \(\ \) | X | X | \(\ \) | 
| \(2\) | X | X | \(\ \) | \(\ \) | \(\ \) | 
| \(3\) | \(\ \) | \(\ \) | X | \(\ \) | X | 
This lends itself very easily to a representation as a zero-one matrix. Since \(|A| = 3\) and \(|B| = 5\), this relation can be represented as a \(3\) by \(5\) zero-one matrix. Entries of this matrix are 1 if the corresponding tuple is in the relation, and 0 otherwise.
Relational Properties and Matrices#
As a consequence of representing binary relations as matrices, we can give a visual meaning to properties to reflexive, symmetric, and antisymmetric.
A binary relation \(\mathcal{R}\) on a set \(S\) can be represented as a square zero-one matrix of order \(|S|\). Let that corresponding zero-one matrix be \(A = (a_{i,j})\).
- The relation is reflexive if and only if \(A\) has a 1 for every entry along its main diagonal. 
- The relation is symmetric if and only if \(A = A^T\). 
- The relation is antisymmetric if and only if \(a_{i,j} = 0\) or \(a_{j,i} = 0\) for \(i \neq j\). 
An equivalent formulation for antisymmetry based on the matrix representation is:
A reflexive and antisymmetric matrix
Let \(\mathcal{R} = \{(1,1), (1,2), (2,2), (2, 3), (2,4), (3,3), (4,1), (4,4)\}\) be a binary relation on \(\{1,2,3,4\}\).
The zero-one matrix \(A\) represents this binary relation.
Notice that the main diagonal of \(A\) is \((1,1,1,1)\). Therefore, \(\mathcal{R}\) is reflexive. Moreover, \(\mathcal{R}\) is antisymmetric since there are no off-diagonal entries with the same value.
3.3.5. Exercises#
Exercise 3.26
Compute the matrix product.
Solution to Exercise 3.26
\(B = \begin{bmatrix} 14 & 4 \\ 8 & 9 \\ 7 & 13 \\ 8 & 2 \\ \end{bmatrix} \)
Exercise 3.27
\(A = \begin{bmatrix} 1 & 2 \\ -2 & 4 \end{bmatrix}\), \(B = \begin{bmatrix} 3 & -4 & 5 \\ 2 & 1 & 0 \\ \end{bmatrix} \), \(C = \begin{bmatrix} 3 & 1 \\ 4 & 0 \\ 1 &6 \\ \end{bmatrix} \)
Are the following multiplications defined? If so, compute the product.
- \(A \cdot B\) 
- \(B \cdot A\) 
- \(C \cdot A\) 
- \(A \cdot C\) 
- \(B \cdot C\) 
- \(A \cdot A\) 
- \(B \cdot B\) 
Solution to Exercise 3.27
- \(\begin{bmatrix} 7 & -2 & 5 \\ 2 & 12 & -10 \end{bmatrix}\) 
- Not defined. 
- \(\begin{bmatrix} 1 & 10 \\ 4 & 8 \\ -11 & 26 \end{bmatrix}\) 
- Not defined. 
- \(\begin{bmatrix} -2 & 33 \\ 10 & 2 \end{bmatrix}\) 
- \(\begin{bmatrix} -3 & 10 \\ -10 & 12 \end{bmatrix}\) 
- Not defined. 
Exercise 3.28
Find all unique Boolean powers of the zero-one matrix:
Solution to Exercise 3.28
For all \(n \geq 5\), \(A^{[n]} = A^{[5]}\)
Exercise 3.29
Consider the binary relation \(\mathcal{R}\) on the set \(\{1,2,3,8,9\}\) defined as
Give a zero-one matrix representing this relation. Is this relation reflexive? symmetric? antisymmetric? Explain why.
Solution to Exercise 3.29
We have
Its matrix encoding representation is:
This relation is reflexive since its main diagonal is all 1s. This relation is not symmetric since \((2,8)\) is in the relation but \((8,2)\) is not. This relation is antisymmetric since we have \(a_{i,j} = 1 \rightarrow a_{j,i} = 0\) for all \(i \neq j\).
Exercise 3.30
Consider the binary relation \(\mathcal{R}\) on the set \(\{2,4,7,8,12\}\) defined as
Give a zero-one matrix representing this relation. Is this relation reflexive? symmetric? antisymmetric? Explain why.
Solution to Exercise 3.30
We have $\( \mathcal{R} = \{(2,2), (2,4), (2,8), (2,12), (4,4), (4,8), (4,12), (7,7), (8,8), (12,12)\}. \)$
Its matrix encoding representation is: $$ A =
$$
This relation is reflexive since its main diagonal is all 1s. This relation is not symmetric since we have \((2,4)\) but not \((4,2)\). This relation is antisymmetric since the matrix is upper triangular.