Matrix (mathematics)
{{Short description|Array of numbers}} {{CS1 config|mode=cs2}} {{Redirect|Matrix theory|the physics topic|Matrix theory (physics)}} {{Good article}} [[File:MatrixLabelled.svg|alt=Two tall square brackets with {{mvar|m}}-many rows each containing {{mvar|n}}-many subscripted letter '{{mvar|a}}' variables. Each letter '{{mvar|a}}' is given a row number and column number as its subscript.|thumb|An {{times|{{mvar|m}}|{{mvar|n}}}} matrix: the {{mvar|m}} rows are horizontal and the {{mvar|n}} columns are vertical. Each element of a matrix is often denoted by a variable with two [[index notation|subscripts]]. For example, {{math|''a''{{sub|2,1}}}} represents the element at the second row and first column of the matrix.]]
In [[mathematics]], a '''matrix''' ({{plural form}}: '''matrices''') is a rectangular array of [[number]]s or other [[mathematical objects]] with elements or entries arranged in rows and columns, usually satisfying certain properties of [[Matrix addition|addition]] and [[Matrix multiplication|multiplication]].
For example, \begin{bmatrix}1 & 9 & -13 \20 & 5 & -6 \end{bmatrix} denotes a matrix with two rows and three columns. This is often referred to as a "two-by-three matrix", a {{times|2|3}} matrix, or a matrix of dimension {{times|2|3}}.
In [[linear algebra]], matrices are used as [[linear map]]s. In [[geometry]], matrices are used for [[geometric transformation]]s (for example [[rotation (mathematics)|rotation]]s) and [[coordinate change]]s. In [[numerical analysis]], many computational problems are solved by reducing them to a matrix computation, and this often involves computing with matrices of huge dimensions. Matrices are used in most areas of mathematics and scientific fields, either directly, or through their use in geometry and numerical analysis.
''[[Square matrices]]'', matrices with the same number of rows and columns, play a major role in matrix theory. The [[determinant]] of a square matrix is a number associated with the matrix, which is fundamental for the study of a square matrix; for example, a square matrix is [[invertible matrix|invertible]] if and only if it has a nonzero determinant and the [[eigenvalue]]s of a square matrix are the roots of its [[characteristic polynomial]], \det(\lambda I - A).
'''Matrix theory''' is the [[branch of mathematics]] that focuses on the study of matrices. It was initially a sub-branch of linear algebra, but soon grew to include subjects related to [[graph theory]], [[algebra]], [[combinatorics]] and [[statistics]].
== Definition == A matrix is a rectangular array of [[number]]s (or other mathematical objects), called the "entries" of the matrix. Matrices are subject to standard [[operation (mathematics)|operations]] such as [[#Basic operations |addition]] and [[#Matrix multiplication |multiplication]].{{sfnp|Lang|2002|loc=Chapter XIII}} Most commonly, a matrix over a [[field (mathematics)|field]] F is a rectangular array of [[element (mathematics)|elements]] of {{tmath|1= F }}.{{sfnp|Fraleigh|1976|p=209}}{{sfnp|Nering|1970|p=37}} A '''real matrix''' and a '''complex matrix''' are matrices whose entries are respectively [[real number]]s or [[complex number]]s. More general types of entries are discussed [[#Matrices with more general entries |below]]. For instance, this is a real matrix: \mathbf{A} = \begin{bmatrix} -1.3 & 0.6 \ 20.4 & 5.5 \ 9.7 & -6.2 \end{bmatrix}.
The numbers (or other objects) in the matrix are called its ''entries'' or its ''elements''. The horizontal and vertical lines of entries in a matrix are respectively called ''rows'' and ''columns''.{{sfnp|Brown|1991|p=1}}
=== Size === The size of a matrix is defined by the number of rows and columns it contains. There is no limit to the number of rows and columns that a matrix (in the usual sense) can have as long as they are positive integers. A matrix with {{mvar|m}} rows and {{mvar|n}} columns is called an {{times|{{mvar|m}}|{{mvar|n}}}} matrix,{{sfnp|Brown|1991|p=1}} or {{mvar|m}}-by-{{mvar|n}} matrix,{{sfnp|Golub|Van Loan|1996|p=3}} where {{mvar|m}} and {{mvar|n}} are called its ''{{dfn|dimensions}}''.{{sfnp|Horn|Johnson|1985|p=5}} For example, the matrix {\mathbf{A}} above is a {{times|3|2}} matrix.
Matrices with a single row are called ''[[row matrix|row matrices]]'' or ''row vectors'', and those with a single column are called ''[[column matrix|column matrices]]'' or ''column vectors''. A matrix with the same number of rows and columns is called a ''[[square matrix]]''.{{sfnp|Gbur|2011|p=89}} A matrix with an infinite number of rows or columns (or both) is called an [[#Infinite matrices|'' infinite matrix'']]. In some contexts, such as [[Computer algebra system|computer algebra programs]], it is useful to consider a matrix with no rows or no columns, called an [[#Empty matrix|'' empty matrix'']].
{| class="wikitable" style="max-width:45em;" |+Overview of a matrix size |- !scope="col"| Name !scope="col"| Size !scope="col"| Example !scope="col"| Description |- !scope="row"| [[Row matrix]] | 1\times n || style="text-align:center;" | \begin{bmatrix}3 & 7 & 2 \end{bmatrix} | A matrix with one row and more than one columns, sometimes used to represent a vector
|- !scope="row"| [[Column matrix]] | n\times 1 || style="text-align:center;" | \begin{bmatrix}4 \ 1 \ 8 \end{bmatrix} | A matrix with one column and more than one rows, sometimes used to represent a vector |- !scope="row"| [[Square matrix]] | n\times n || style="text-align:center;" | \begin{bmatrix} 9 & 13 & 5 \ 1 & 11 & 7 \ 2 & 6 & 3 \end{bmatrix} | A matrix with the same number of rows and columns, sometimes used to represent a [[#Linear transformations|linear transformation]] from a vector space to itself, such as [[reflection (mathematics)|reflection]], [[rotation (mathematics)|rotation]], or [[shear mapping|shearing]]. |}
== Notation == The specifics of symbolic matrix notation vary widely, with some prevailing trends. Matrices are commonly written in [[square bracket]]s or [[parentheses]],{{sfnp|Ramachandra Rao|Bhimasankaram|2000|p=[https://books.google.com/books?id=ZfJdDwAAQBAJ&pg=PA71 71]}} so that an {{times|{{mvar|m}}|{{mvar|n}}}} matrix \mathbf{A} is represented as \mathbf{A} = \begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1n} \ a_{21} & a_{22} & \cdots & a_{2n} \ \vdots & \vdots & \ddots & \vdots \ a_{m1} & a_{m2} & \cdots & a_{mn} \end{bmatrix} = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \ a_{21} & a_{22} & \cdots & a_{2n} \ \vdots & \vdots & \ddots & \vdots \ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix}. This may be abbreviated by writing only a single generic term, possibly along with indices, as in \mathbf{A} = \left(a_{ij}\right), \quad \left[ a_{ij}\right], \quad \text{or} \quad \left(a_{ij}\right){1\leq i\leq m, ; 1\leq j\leq n} or \mathbf{A}=(a{i,j})_{1\leq i,j\leq n} in the case that {{tmath|1= n=m }}.
Matrices are usually symbolized using [[upper-case]] letters (such as {\mathbf{A}} in the examples above),{{sfnp|Hamilton|1987|p=[https://books.google.com/books?id=W5o4AAAAIAAJ&pg=PA29 29]}} while the corresponding [[lower-case]] letters, with two subscript indices (e.g., {{tmath|1= a_{11} }}, or {{tmath|1= a_{1,1} }}), represent the entries.{{sfnp|Gentle|1998|pp=[https://books.google.com/books?id=2J0ndF_LmqoC&pg=PA52 52–53]}} In addition to using upper-case letters to symbolize matrices, many authors use a special [[emphasis (typography)|typographical style]], commonly boldface roman (non-italic), to further distinguish matrices from other mathematical objects. An alternative notation involves the use of a double-underline with the variable name, with or without boldface style, as in {{tmath|1= \underline{\underline{A} } }}.{{sfnp|Bauchau|Craig|2009|p=[https://books.google.com/books?id=GYRX8ZYVNYQC&pg=PA915 915]}}
The entry in the {{math|''i''}}th row and {{math|''j''}}th column of a matrix {{math|'''A'''}} is sometimes referred to as the {i,j} or (i,j) entry of the matrix, and commonly denoted by a_{i,j} or {{tmath|1= a_{ij} }}.{{sfnp|Johnston|2021|p=[https://books.google.com/books?id=y24vEAAAQBAJ&pg=PA21 21]}} Alternative notations for that entry are {\mathbf{A}[i,j]} and {{tmath|1= \mathbf{A}{i,j} }}. For example, the (1,3) entry of the following matrix \mathbf{A} is {{math|5}} (also denoted {{tmath|1= a{1 3} }}, {{tmath|1= a_{1,3} }}, \mathbf{A}[1,3] or {{tmath|1= {\mathbf{A} }_{1,3} }}): \mathbf{A}=\begin{bmatrix} 4 & -7 & \color{red}{5} & 0 \ -2 & 0 & 11 & 8 \ 19 & 1 & -3 & 12 \end{bmatrix}
Sometimes, the entries of a matrix can be defined by a formula such as {{tmath|1= a_{i,j}=f(i,j) }}. For example, each of the entries of the following matrix \mathbf{A} is determined by the formula {{tmath|1= a_{ij}=i-j }}. \mathbf A = \begin{bmatrix} 0 & -1 & -2 & -3\ 1 & 0 & -1 & -2\ 2 & 1 & 0 & -1 \end{bmatrix} In this case, the matrix itself is sometimes defined by that formula, within square brackets or double parentheses. For example, the matrix above is defined as {\mathbf{A}}=[i-j] or {{tmath|1= \mathbf{A}=((i-j)) }}. If matrix size is {{times|{{mvar|m}}|{{mvar|n}}}}, the above-mentioned formula f(i,j) is valid for any i=1,\dots,m and any {{tmath|1= j=1,\dots,n }}. This can be specified separately or indicated using {{times|{{mvar|m}}|{{mvar|n}}}} as a subscript. For instance, the matrix \mathbf{A} above is {{times|3|4}}, and can be defined as {\mathbf{A}}=[i-j] (i=1,2,3; j=1,\dots,4) or {{tmath|1= \mathbf{A}=[i-j]_{3\times4} }}.
Some programming languages utilize doubly subscripted arrays (or arrays of arrays) to represent an {{{mvar|m}}-by-{{mvar|n}} matrix. Some programming languages start the numbering of array indexes at zero, in which case the entries of an {{times|{{mvar|m}}|{{mvar|n}}}} matrix are indexed by 0\leq i \leq m-1 and {{tmath|1= 0\leq j \leq n-1 }}.{{sfnp|Oualline|2003|loc=Ch. 5}} This article follows the more common convention in mathematical writing where enumeration starts from {{math|1}}.
The [[set (mathematics)|set]] of all {{mvar|m}}-by-{{mvar|n}} real matrices is often denoted {{tmath|1= \mathcal{M}(m, n) }}, or {{tmath|1= \mathcal{M}{m \times n}(\R) }}. The set of all {{times|{{mvar|m}}|{{mvar|n}}}} matrices over another [[field (mathematics)|field]], or over a [[ring (mathematics)|ring]] {{math|''R''}}, is similarly denoted {{tmath|1= \mathcal{M}(m, n, R) }}, or {{tmath|1= \mathcal{M}{m \times n}(R) }}. If {{math|''m'' {{=}} ''n''}}, such as in the case of [[square matrices]], one does not repeat the dimension: {{tmath|1= \mathcal{M}(n, R) }}, or {{tmath|1= \mathcal{M}_n(R) }}.{{sfnp|Pop|Furdui|2017}} Often, {{tmath|1= M }}, or {{tmath|1= \operatorname{Mat} }}, is used in place of {{tmath|1= \mathcal M }}.For example, for {{tmath|1= M }}, see {{harvtxt|Mello|2017}}, [https://books.google.com/books?id=RC4tDwAAQBAJ&pg=PA48 p. 48]; for {{tmath|1= \operatorname{Mat} }}, see {{harvtxt|Axler|1997}}, [https://books.google.com/books?id=ovIYVIlithQC&pg=PA50 p. 50].
== Basic operations == Several basic operations can be applied to matrices. Some, such as ''transposition'' and ''submatrix'' do not depend on the nature of the entries. Others, such as ''matrix addition'', ''scalar multiplication'', ''matrix multiplication'', and ''row operations'' involve operations on matrix entries and therefore require that matrix entries are numbers or belong to a [[field (mathematics)|field]] or a [[ring (mathematics)|ring]].{{sfnp|Brown|1991|loc=Definition I.2.1 (addition), Definition I.2.4 (scalar multiplication), and Definition I.2.33 (transpose)}}
In this section, it is supposed that matrix entries belong to a fixed ring, which is typically a field of numbers.
=== Addition === {{Main|Matrix addition}} [[File:Matrix addition qtl2.svg|thumb|Illustration of the addition of two matrices.]]
Matrix addition and subtraction require matrices of a consistent size, and are calculated entrywise. The ''sum'' {{math|'''A''' + '''B'''}} and the difference {{math|'''A''' − '''B'''}} of two {{times|{{mvar|m}}|{{mvar|n}}}} matrices are:{{sfnp|Whitelaw|1991|p=29}}
\begin{align} (\bold A + \bold B){i, j} = \bold A{i,j} + \bold B_{i,j}, \quad 1 \leq i \leq m, \quad 1 \leq j \leq n. \ (\bold A - \bold B){i, j} = \bold A{i,j} - \bold B_{i,j}, \quad 1 \leq i \leq m, \quad 1 \leq j \leq n. \end{align}
For example, \begin{bmatrix} 1 & 3 & 1 \ 1 & 0 & 0 \end{bmatrix} + \begin{bmatrix} 0 & 0 & 5 \ 7 & 5 & 0 \end{bmatrix}
\begin{bmatrix} 1+0 & 3+0 & 1+5 \ 1+7 & 0+5 & 0+0 \end{bmatrix}
\begin{bmatrix} 1 & 3 & 6 \ 8 & 5 & 0 \end{bmatrix} {{pb}}
Familiar properties of numbers extend to these operations on matrices: for example, addition is [[commutative]], that is, the matrix sum does not depend on the order of the summands: {{math|1='''A''' + '''B''' = '''B''' + '''A'''}}.{{sfnp|Brown|1991|loc=Theorem I.2.6}}
=== Scalar multiplication === {{main| Scalar multiplication}}
The product {{math|''c'''''A'''}} of a number {{mvar|c}} (also called a [[scalar (mathematics)|scalar]] in this context) and a matrix {{math|'''A'''}} is computed by multiplying each entry of {{math|'''A'''}} by {{mvar|c}}:{{sfnp|Whitelaw|1991|p=30}} (c\bold A){i,j} = c \cdot \bold A{i,j} This operation is called ''scalar multiplication'', but its result is not named "scalar product" to avoid confusion, since "scalar product" is often used as a synonym for "[[inner product]]".{{sfnp|Maxwell|1969|p=[https://books.google.com/books?id=oQk9AAAAIAAJ&pg=PA46 46]}} For example: 2 \cdot \begin{bmatrix} 1 & 8 & -3 \ 4 & -2 & 5 \end{bmatrix}
\begin{bmatrix} 2 \cdot 1 & 2\cdot 8 & 2\cdot -3 \ 2\cdot 4 & 2\cdot -2 & 2\cdot 5 \end{bmatrix}
\begin{bmatrix} 2 & 16 & -6 \ 8 & -4 & 10 \end{bmatrix} {{pb}}
Matrix subtraction is consistent with composition of matrix addition with scalar multiplication by {{math|–1}}:{{sfnp|Lancaster|Tismenetsky|1985|pp=[https://books.google.com/books?id=2c011Aptsa8C&pg=PA6 6–7]}} \mathbf A - \mathbf B = \mathbf A +(-1)\cdot \mathbf B{{pb}}
=== Transpose === {{main|Transpose}}
The ''transpose'' of an {{times|{{mvar|m}}|{{mvar|n}}}} matrix {{math|'''A'''}} is the {{times|{{mvar|n}}|{{mvar|m}}}} matrix {{math|'''A'''{{sup|T}}}} (also denoted {{math|'''A'''{{sup|tr}}}} or {{math|{{sup|t}}'''A'''}}) formed by turning rows into columns and vice versa: \left( \bold A^{\rm T} \right){i,j} = \bold A{j,i}. For example: \begin{bmatrix} 1 & 2 & 3 \ 0 & -6 & 7 \end{bmatrix}^\mathrm{T} = \begin{bmatrix} 1 & 0 \ 2 & -6 \ 3 & 7 \end{bmatrix}
The transpose is compatible with addition and scalar multiplication, as expressed by {{math|1=(''c'''''A'''){{sup|T}} = ''c''('''A'''{{sup|T}})}} and {{math|1=('''A''' + '''B'''){{sup|T}} = '''A'''{{sup|T}} + '''B'''{{sup|T}}}}. Finally, {{math|1=('''A'''{{sup|T}}){{sup|T}} = '''A'''}}.{{sfnp|Andrilli|Hecker|2022|loc=The transpose of a matrix and its properties|p=[https://books.google.com/books?id=WtpVEAAAQBAJ&pg=PA38 38]}}
=== Matrix multiplication === {{Main|Matrix multiplication}} [[File:MatrixMultiplication.png|thumb|300px|Schematic depiction of the matrix product {{math|'''AB'''}} of two matrices {{math|'''A'''}} and {{math|'''B'''}}]] ''Multiplication'' of two matrices corresponds to the composition of [[#Linear transformations|linear transformations]] represented by each matrix. It is defined if and only if the number of columns of the left matrix is the same as the number of rows of the right matrix. If {{math|'''A'''}} is an {{times|{{mvar|m}}|{{mvar|n}}}} matrix and {{math|'''B'''}} is an {{times|{{mvar|n}}|{{mvar|p}}}} matrix, then their ''matrix product'' {{math|'''AB'''}} is the {{times|{{mvar|m}}|{{mvar|p}}}} matrix whose entries are given by the [[dot product]] of the corresponding row of {{math|'''A'''}} and the corresponding column of {{math|'''B'''}}:{{sfnp|Lancaster|Tismenetsky|1985|p=[https://books.google.com/books?id=4nfNCgAAQBAJ&pg=PA9 9]}} [\mathbf{AB}]{i,j} = a{i,1}b_{1,j} + a_{i,2}b_{2,j} + \cdots + a_{i,n}b_{n,j} = \sum_{r=1}^n a_{i,r}b_{r,j}, where {{math|1 ≤ ''i'' ≤ ''m''}} and {{math|1 ≤ ''j'' ≤ ''p''}}.{{sfnp|Brown|1991|loc=Definition I.2.20}} For example, the underlined entry 2340 in the product is calculated as {{math|(2 × 1000) + (3 × 100) + (4 × 10) {{=}} 2340:}} \begin{align} \begin{bmatrix} \underline{2} & \underline 3 & \underline 4 \ 1 & 0 & 0 \ \end{bmatrix}
\begin{bmatrix} 0 & \underline{1000} \ 1 & \underline{100} \ 0 & \underline{10} \ \end{bmatrix} &= \begin{bmatrix} 3 & \underline{2340} \ 0 & 1000 \ \end{bmatrix}. \end{align}
Matrix multiplication satisfies the rules {{math|1=('''AB''')'''C''' = '''A'''('''BC''')}} ([[associativity]]), and {{math|1=('''A''' + '''B''')'''C''' = '''AC''' + '''BC'''}} as well as {{math|1='''C'''('''A''' + '''B''') = '''CA''' + '''CB'''}} (left and right [[distributivity]]), whenever the size of the matrices is such that the various products are defined.{{sfnp|Brown|1991|loc=Theorem I.2.24}} The product {{math|'''AB'''}} may be defined without {{math|'''BA'''}} being defined, namely if {{math|'''A'''}} and {{math|'''B'''}} are {{times|{{mvar|m}}|{{mvar|n}}}} and {{times|{{mvar|n}}|{{mvar|k}}}} matrices, respectively, and {{math|''m'' ≠ ''k''.}} Even if both products are defined, they generally need not be equal, that is:{{sfnp|Boas|2005|p=117}} \bold{AB} \ne \bold{BA}.
In other words, matrix multiplication is not [[commutative property|commutative]], in marked contrast to (rational, real, or complex) numbers, whose product is independent of the order of the factors.{{sfnp|Lancaster|Tismenetsky|1985|p=[https://books.google.com/books?id=4nfNCgAAQBAJ&pg=PA9 9]}} An example of two matrices not commuting with each other is: \begin{bmatrix} 1 & 2\ 3 & 4\ \end{bmatrix}
\begin{bmatrix} 0 & 1\ 0 & 0\ \end{bmatrix}= \begin{bmatrix} 0 & 1\ 0 & 3\ \end{bmatrix}, whereas \begin{bmatrix} 0 & 1\ 0 & 0\ \end{bmatrix}
\begin{bmatrix} 1 & 2\ 3 & 4\ \end{bmatrix}= \begin{bmatrix} 3 & 4\ 0 & 0\ \end{bmatrix}.
Besides the ordinary matrix multiplication just described, other less frequently used operations on matrices that can be considered forms of multiplication also exist, such as the [[Hadamard product (matrices)|Hadamard product]] and the [[Kronecker product]].{{sfnp|Horn|Johnson|1985|loc=Ch. 4 and 5}} They arise in solving matrix equations such as the [[Sylvester equation]].{{sfnp|Van Loan|2000}}
=== Row operations === {{Main|Row operations}} There are three types of row operations:{{sfnp|Perrone|2024|p=[http://books.google.com/books?id=JO8GEQAAQBAJ&pg=PA119 119–120]}}{{sfnp|Lang|1986|p=[http://books.google.com/books?id=c_NEBAAAQBAJ&pg=PA71 71]}}
row addition, that is, adding a row to another.
row multiplication, that is, multiplying all entries of a row by a non-zero constant;
row switching, that is, interchanging two rows of a matrix;
These operations are used in several ways, including solving [[linear equation]]s and finding [[matrix inverse]]s with [[Gauss elimination]] and Gauss–Jordan elimination, respectively.{{sfnp|Watkins|2002|p=[http://books.google.com/books?id=xi5omWiQ-3kC&pg=PA102 102]}}
=== Submatrix === A '''submatrix''' of a matrix is a matrix obtained by deleting any collection of rows or columns or both.{{sfnp|Bronson|1970|p=16}}{{sfnp|Kreyszig|1972|p=220}}{{sfnp|Protter|Morrey|1970|p=869}} For example, from the following {{times|3|4}} matrix, we can construct a {{times|2|3}} submatrix by removing row 3 and column 2: \mathbf{A}=\begin{bmatrix} 1 & \color{red}{2} & 3 & 4 \ 5 & \color{red}{6} & 7 & 8 \ \color{red}{9} & \color{red}{10} & \color{red}{11} & \color{red}{12} \end{bmatrix} \rightarrow \begin{bmatrix} 1 & 3 & 4 \ 5 & 7 & 8 \end{bmatrix}.
The [[minor (linear algebra)|minors]] and cofactors of a matrix are found by computing the [[determinant]] of certain submatrices.{{sfnp|Protter|Morrey|1970|p=869}}{{sfnp|Kreyszig|1972|pp=241,244}}
A '''principal submatrix''' is a square submatrix obtained by removing certain rows and columns. The definition varies from author to author. According to some authors, a principal submatrix is a submatrix in which the set of row indices that remain is the same as the set of column indices that remain.{{sfnp|Schneider|Barker|2012}}{{sfnp|Perlis|1991}} Other authors define a principal submatrix as one in which the first {{mvar|k}} rows and columns, for some number {{mvar|k}}, are the ones that remain;{{sfnp|Anton|2010}} this type of submatrix has also been called a '''leading principal submatrix'''.{{Citation |title=Matrix Analysis|first1=Roger A.|last1=Horn|first2=Charles R.|last2=Johnson|edition=2nd|publisher=Cambridge University Press|year=2012|isbn=978-0-521-83940-2|page=17|url=https://books.google.com/books?id=5I5AYeeh0JUC&pg=PA17}}.
== Linear equations ==
{{Main|Linear equation|System of linear equations|Coefficient matrix}} Matrices can be used to compactly write and work with multiple linear equations, that is, systems of linear equations. For example, if {{math|'''A'''}} is an {{times|{{mvar|m}}|{{mvar|n}}}} matrix, {{math|'''x'''}} designates a column vector (that is, {{times|{{mvar|n}}|1}} matrix) of {{mvar|n}} variables {{math|''x''{{sub|1}}, ''x''{{sub|2}}, ..., ''x''{{sub|''n''}},}} and {{math|'''b'''}} is an {{times|{{mvar|m}}|1}} column vector, then the matrix equation \mathbf{Ax} = \mathbf{b} is equivalent to the system of linear equations{{sfnp|Brown|1991|loc=I.2.21 and 22}} \begin{align} a_{1,1}x_1 + a_{1,2}x_2 + &\cdots + a_{1,n}x_n = b_1 \ &\ \ \vdots \ a_{m,1}x_1 + a_{m,2}x_2 + &\cdots + a_{m,n}x_n = b_m \end{align}
Using matrices, this can be solved more compactly than would be possible by writing out all the equations separately. If {{math|1=''n'' = ''m''}} and the equations are [[independent equation|independent]], then this can be done by writing{{sfnp|Gbur|2011|p=95}} \mathbf{x} = \mathbf{A}^{-1} \mathbf{b} where {{math|'''A'''{{sup|−1}}}} is the [[inverse matrix]] of {{math|'''A'''}}. If {{math|'''A'''}} has no inverse, solutions—if any—can be found using its [[generalized inverse]].{{sfnp|Ben-Israel|Greville|2003|pp=1–2}}
== Linear transformations == {{Main|Linear transformation|Transformation matrix}} [[File:Area parallellogram as determinant.svg|thumb|right|The vectors represented by a {{times|2|2}} matrix correspond to the sides of a unit square transformed into a parallelogram.]] Matrices and matrix multiplication reveal their essential features when related to ''linear transformations'', also known as ''linear maps''. A real {{mvar|m}}-by-{{mvar|n}} matrix {{math|'''A'''}} gives rise to a linear transformation \R^n \to \R^m mapping each vector {{math|'''x'''}} in {{tmath|\R^n}} to the (matrix) product {{math|'''Ax'''}}, which is a vector in {{tmath|\R^m.}} Conversely, each linear transformation f: \R^n \to \R^m arises from a unique {{mvar|m}}-by-{{mvar|n}} matrix {{math|'''A'''}}: explicitly, the {{math|(''i'', ''j'')}}-entry of {{math|'''A'''}} is the {{mvar|i}}th coordinate of {{math|''f'' ('''e'''{{sub|''j''}})}}, where {{math|1='''e'''{{sub|''j''}} = (0, ..., 0, 1, 0, ..., 0)}} is the [[unit vector]] with {{math|1}} in the {{mvar|j}}th position and {{math|0}} elsewhere. The matrix {{math|'''A'''}} is said to represent the linear map {{mvar|f}}, and {{math|'''A'''}} is called the ''transformation matrix'' of {{mvar|f}}.{{sfnp|Grossman|1994|pp=494–495}}
For example, the {{times|2|2}} matrix \mathbf{A} = \begin{bmatrix} a & c\b & d \end{bmatrix} can be viewed as the transform of the [[unit square]] into a [[parallelogram]] with vertices at {{math|(0, 0)}}, {{math|(''a'', ''b'')}}, {{math|(''a'' + ''c'', ''b'' + ''d'')}}, and {{math|(''c'', ''d'')}}. The parallelogram pictured at the right is obtained by multiplying {{math|'''A'''}} with each of the column vectors {{tmath|1= \left[\begin{smallmatrix} 0 \ 0 \end{smallmatrix}\right]}}, {{tmath|1= \left[\begin{smallmatrix} 1 \ 0 \end{smallmatrix}\right] }}, {{tmath|1= \left[\begin{smallmatrix} 1 \ 1 \end{smallmatrix}\right] }}, and {{tmath|1= \left[\begin{smallmatrix} 0 \ 1 \end{smallmatrix}\right] }} in turn. These vectors define the vertices of the unit square.{{sfnp|Bierens|2004|p=[https://books.google.com/books?id=ZrBaRPVRLRoC&pg=PA263 263]}} The following table shows several {{times|2|2}} real matrices with the associated linear maps of {{tmath|\R^2}}. The {{font color | blue | blue }} original is mapped to the {{font color | green | green }} grid and shapes. The origin {{math|(0, 0)}} is marked with a black point. {| class="wikitable" style="text-align:center; margin:1em auto 1em auto;" |- | [[Shear mapping|Horizontal shear]]{{sfnp|Johnston|2021|p=[https://books.google.com/books?id=y24vEAAAQBAJ&pg=PA56 56]}}{{br}}with ''m'' = 1.25. | [[Reflection (mathematics)|Reflection]]{{sfnp|Pettofrezzo|1978|p=[https://books.google.com/books?id=2wzp9zQhA_IC&pg=PA60 60]}} through the vertical axis | [[Squeeze mapping]]{{sfnp|Han|Kim|Noz|1997}}{{br}}with ''r'' = 3/2 | [[Scaling (geometry)|Scaling]]{{sfnp|Jeffrey|2010|p=[https://books.google.com/books?id=uan0Dkn9HY8C&pg=PA264 264]}}{{br}}by a factor of 3/2 |[[Rotation matrix|Rotation]]{{sfnp|Han|Kim|Noz|1997}}{{br}}by {{pi}}/6 = 30° |- | \begin{bmatrix} 1 & 1.25 \ 0 & 1 \end{bmatrix} | \begin{bmatrix} -1 & 0 \ 0 & 1 \end{bmatrix} | \begin{bmatrix} \frac{3}{2} & 0 \ 0 & \frac{2}{3} \end{bmatrix} |\begin{bmatrix} \frac{3}{2} & 0 \ 0 & \frac{3}{2} \end{bmatrix} |\begin{bmatrix} \cos\left(\frac{\pi}{6}\right) & -\sin\left(\frac{\pi}{6}\right) \ \sin\left(\frac{\pi}{6}\right) & \cos\left(\frac{\pi}{6}\right) \end{bmatrix} |- | width="20%" | [[File:VerticalShear m=1.25.svg|175px]] | width="20%" | [[File:Flip map.svg|150px]] | width="20%" | [[File:Squeeze r=1.5.svg|150px]] | width="20%" | [[File:Scaling by 1.5.svg|125px]] | width="20%" | [[File:Rotation by pi over 6.svg|125px]] |}
Under the [[bijection|1-to-1 correspondence]] between matrices and linear maps, matrix multiplication corresponds to [[function composition|composition]] of maps:{{harvtxt|Greub|1975|p=90}}. Note however that Greub follows a transposed convention of representing a transformation by multiplying a row vector by a matrix, rather than multiplying a matrix by a column vector, leading to the reversed order for the two matrices in the product that represents a composition. if a {{mvar|k}}-by-{{mvar|m}} matrix {{math|'''B'''}} represents another linear map {{tmath|1= g: \R^m \to \R^k }}, then the composition {{math|''g'' ∘ ''f''}} is represented by {{math|'''BA'''}} since{{sfnp|Lang|1986|loc=§VI.1}} (g \circ f)(\bold x) = g(f(\bold x)) = g(\bold{Ax}) = \bold B(\bold {Ax}) = (\bold{BA}) \bold x.
The last equality follows from the above-mentioned associativity of matrix multiplication.
The [[rank of a matrix]] {{math|'''A'''}} is the maximum number of [[linear independence|linearly independent]] row vectors of the matrix, which is the same as the maximum number of linearly independent column vectors.{{sfnp|Brown|1991|loc=Definition II.3.3}} Equivalently it is the [[Hamel dimension|dimension]] of the [[image (mathematics)|image]] of the linear map represented by {{math|'''A'''}}.{{sfnp|Greub|1975|loc=Section III.1}} The [[rank–nullity theorem]] states that the dimension of the [[kernel (matrix)|kernel]] of a matrix plus the rank equals the number of columns of the matrix.{{sfnp|Brown|1991|loc=Theorem II.3.22}}
== Square matrix == {{Main|Square matrix}} A [[square matrix]] is a matrix with the same number of rows and columns. An {{mvar|n}}-by-{{mvar|n}} matrix is known as a square matrix of order {{mvar|n}}. Any two square matrices of the same order can be added and multiplied. The entries {{mvar|a{{sub|ii}}}} form the [[main diagonal]] of a square matrix. They lie on the imaginary line running from the top left corner to the bottom right corner of the matrix.{{sfnp|Anton|2010|p=[https://books.google.com/books?id=YmcQJoFyZ5gC&pg=PA27 27]}}
Square matrices of a given dimension form a [[noncommutative ring]], which is one of the most common examples of a noncommutative ring.{{sfnp|Reyes|2025}}
=== Main types === : {| class="wikitable" style="float:right; margin:0ex 0ex 2ex 2ex;" |- ! Name !! Example with {{math|1=''n'' = 3}} |- | [[Diagonal matrix]] || style="text-align:center;" | \begin{bmatrix} a_{11} & 0 & 0 \ 0 & a_{22} & 0 \ 0 & 0 & a_{33} \ \end{bmatrix} |- | [[Lower triangular matrix]] || style="text-align:center;" | \begin{bmatrix} a_{11} & 0 & 0 \ a_{21} & a_{22} & 0 \ a_{31} & a_{32} & a_{33} \ \end{bmatrix} |- | [[Upper triangular matrix]] || style="text-align:center;" | \begin{bmatrix} a_{11} & a_{12} & a_{13} \ 0 & a_{22} & a_{23} \ 0 & 0 & a_{33} \ \end{bmatrix} |}
==== Diagonal and triangular matrix ==== If all entries of {{math|'''A'''}} below the main diagonal are zero, {{math|'''A'''}} is called an ''upper [[triangular matrix]]''. Similarly, if all entries of {{math|'''A'''}} above the main diagonal are zero, {{math|'''A'''}} is called a ''lower triangular matrix''.{{sfnp|Anton|2010|p=[https://books.google.com/books?id=YmcQJoFyZ5gC&pg=PA68 68]}} If all entries outside the main diagonal are zero, {{math|'''A'''}} is called a [[diagonal matrix]].{{sfnp|Gbur|2011|p=91}}
==== Identity matrix ==== {{Main|Identity matrix}} The ''identity matrix'' {{math|'''I'''{{sub|''n''}}}} of size {{mvar|n}} is the {{mvar|n}}-by-{{mvar|n}} matrix in which all the elements on the [[main diagonal]] are equal to {{math|1}} and all other elements are equal to {{math|0}},{{sfnp|Boas|2005|p=118}} for example, \begin{align} \mathbf{I}_1 &= \begin{bmatrix} 1 \end{bmatrix}, \[4pt] \mathbf{I}_2 &= \begin{bmatrix} 1 & 0 \ 0 & 1 \end{bmatrix}, \[4pt] \vdots& \[4pt] \mathbf{I}_n &= \begin{bmatrix} 1 & 0 & \cdots & 0 \ 0 & 1 & \cdots & 0 \ \vdots & \vdots & \ddots & \vdots \ 0 & 0 & \cdots & 1 \end{bmatrix} \end{align} It is a square matrix of order {{mvar|n}}, and also a special kind of [[diagonal matrix]]. It is called an identity matrix because multiplication with it leaves a matrix unchanged:{{sfnp|Boas|2005|p=118}} \bold{AI}_n = \bold I_m \bold A = \bold A for any {{mvar|m}}-by-{{mvar|n}} matrix {{math|'''A'''}}.
A scalar multiple of an identity matrix is called a ''scalar'' matrix.{{sfnp|Horn|Johnson|1985|loc=§0.9.1 Diagonal matrices}}
==== Symmetric or skew-symmetric matrix ==== A square matrix {{math|'''A'''}} that is equal to its transpose, that is, {{math|1='''A''' = '''A'''{{sup|T}}}}, is a [[symmetric matrix]]. If instead, {{math|'''A'''}} is equal to the negative of its transpose, that is, {{math|1='''A''' = −'''A'''{{sup|T}}}}, then {{math|'''A'''}} is a [[skew-symmetric matrix]]. In complex matrices, symmetry is often replaced by the concept of [[Hermitian matrix|Hermitian matrices]], which satisfies {{math|1='''A'''{{sup|∗}} = '''A'''}}, where the star or [[asterisk]] denotes the [[conjugate transpose]] of the matrix, that is, the transpose of the [[complex conjugate]] of {{math|'''A'''}}.{{sfnp|Boas|2005|p=138}}
By the [[spectral theorem]], real symmetric matrices and complex Hermitian matrices have an [[eigenbasis]]; that is, every vector is expressible as a [[linear combination]] of eigenvectors. In both cases, all eigenvalues are real.{{sfnp|Horn|Johnson|1985|loc=Theorem 2.5.6}} This theorem can be generalized to infinite-dimensional situations related to matrices with infinitely many rows and columns.{{sfnp|Conway|1990|pp=262–263}}
==== Invertible matrix and its inverse ==== A square matrix {{math|'''A'''}} is called ''[[invertible matrix|invertible]]'' or ''non-singular'' if there exists a matrix {{math|'''B'''}} such that{{sfnp|Brown|1991|loc=Definition I.2.28}}{{sfnp|Brown|1991|loc=Definition I.5.13}} \bold{AB} = \bold{BA} = \bold I_n, where {{math|'''I'''{{sub|''n''}}}} is the {{times|{{mvar|n}}|{{mvar|n}}}} [[identity matrix]] with {{math|1}} for each entry on the [[main diagonal]] and {{math|0}} elsewhere. If {{math|'''B'''}} exists, it is unique and is called the ''[[Invertible matrix|inverse matrix]]'' of {{math|'''A'''}}, denoted {{math|'''A'''{{sup|−1}}}}.{{sfnp|Anton|2010|p=[https://books.google.com/books?id=YmcQJoFyZ5gC&pg=PA62 62]}}
There are many [[algorithm]]s for testing whether a square matrix is invertible, and, if it is, computing its inverse. One of the oldest, which is still in common use is [[Gaussian elimination]].{{sfnp|Gbur|2011|pp=99–100}}
==== Definite matrix ==== {| class="wikitable" style="float:right; text-align:center; margin:0ex 0ex 2ex 2ex;" |- ! [[Positive definite matrix]] !! [[Indefinite matrix]] |- | \begin{bmatrix} \frac{1}{4} & 0 \ 0 & 1 \ \end{bmatrix} | \begin{bmatrix} \frac{1}{4} & 0 \ 0 & -\frac{1}{4} \end{bmatrix} |- | Q(x,y)= \frac{1}{4}x^2+y^2 | Q(x,y)= \frac{1}{4}x^2-\frac{1}{4}y^2 |- | [[File:Ellipse in coordinate system with semi-axes labelled.svg|150px]] {{br}}Points such that Q(x,y)= 1 {{br}} ([[Ellipse]]) | [[File:Hyperbola2 SVG.svg|150px]] {{br}} Points such that Q(x,y)= 1{{br}} ([[Hyperbola]]) |} A symmetric real matrix {{math|'''A'''}} is called [[positive-definite matrix|''positive-definite'']] if the associated [[quadratic form]] f(\bold x) = \bold x^{\rm T} \bold{Ax} has a positive value for every nonzero vector {{math|'''x'''}} in {{tmath| \R^n }}. If {{math|{{itco|''f''}}('''x''')}} yields only negative values then {{math|'''A'''}} is [[definiteness of a matrix#Negative definite|''negative-definite'']]; if {{mvar|f}} does produce both negative and positive values then {{math|'''A'''}} is [[definiteness of a matrix#Indefinite|'' indefinite'']].{{sfnp|Horn |Johnson|1985|loc=Chapter 7}} If the quadratic form {{mvar|f}} yields only non-negative values (positive or zero), the symmetric matrix is called ''positive-semidefinite'' (or if only non-positive values, then negative-semidefinite); hence the matrix is indefinite precisely when it is neither positive-semidefinite nor negative-semidefinite.{{sfnp|Anton|2010|loc=Thm. 7.3.2}}
A symmetric matrix is positive-definite if and only if all its eigenvalues are positive, that is, the matrix is positive-semidefinite and it is invertible.{{sfnp|Horn|Johnson|1985|loc=Theorem 7.2.1}} The table at the right shows two possibilities for 2-by-2 matrices. The eigenvalues of a diagonal matrix are simply the entries along the diagonal,{{sfnp|Boas|2005|p=150}} and so in these examples, the eigenvalues can be read directly from the matrices themselves. The first matrix has two eigenvalues that are both positive, while the second has one that is positive and another that is negative.
Allowing as input two different vectors instead yields the [[bilinear form]] associated to {{math|'''A'''}}:{{sfnp|Horn |Johnson|1985|p=169|loc=Example 4.0.6}} B_{\bold A} (\bold x, \bold y) = \bold x^{\rm T} \bold{Ay}.
In the case of complex matrices, the same terminology and results apply, with ''symmetric matrix'', ''quadratic form'', ''bilinear form'', and ''transpose'' {{math|'''x'''{{sup|T}}}} replaced respectively by [[Hermitian matrix]], [[Hermitian form]], [[sesquilinear form]], and [[conjugate transpose]] {{math|'''x'''{{sup|H}}}}.{{sfnp|Lang|1986|loc=Appendix. Complex numbers}}
==== Orthogonal matrix ==== {{Main|Orthogonal matrix}} An ''orthogonal matrix'' is a square matrix with [[real number|real]] entries whose columns and rows are [[orthogonal]] [[unit vector]]s (that is, [[orthonormality|orthonormal]] vectors).{{sfnp|Horn|Johnson|1985|pp=66–67}} Equivalently, a matrix {{math|'''A'''}} is orthogonal if its [[transpose]] is equal to its [[invertible matrix|inverse]]: \mathbf{A}^\mathrm{T}=\mathbf{A}^{-1}, , which entails \mathbf{A}^\mathrm{T} \mathbf{A} = \mathbf{A} \mathbf{A}^\mathrm{T} = \mathbf{I}_n, where {{math|'''I'''{{sub|''n''}}}} is the [[identity matrix]] of size {{mvar|n}}.{{sfnp|Gbur|2011|pp=102–103}}
An orthogonal matrix {{math|'''A'''}} is necessarily [[invertible matrix|invertible]] (with inverse {{math|1='''A'''{{sup|−1}} = '''A'''{{sup|T}}}}), [[unitary matrix|unitary]] ({{math|1='''A'''{{sup|−1}} = '''A'''}}), and [[normal matrix|normal]] ({{math|1='''A''''''A''' = '''AA'''*}}). The [[determinant]] of any orthogonal matrix is either {{math|+1}} or {{math|−1}}. A ''special orthogonal matrix'' is an orthogonal matrix with [[determinant]] {{math|+1}}. As a [[linear transformation]], every orthogonal matrix with determinant {{math|+1}} is a pure [[rotation (mathematics)|rotation]] without reflection, i.e., the transformation preserves the orientation of the transformed structure, while every orthogonal matrix with determinant {{math|−1}} reverses the orientation, i.e., is a composition of a pure [[reflection (mathematics)|reflection]] and a (possibly null) rotation. The identity matrices have determinant {{math|1}} and are pure rotations by an angle zero.{{sfnp|Boas|2005|pp=127,153–154}}
The [[complex number|complex]] analog of an orthogonal matrix is a [[unitary matrix]].{{sfnp|Boas|2005|p=141}}
=== Main operations ===
==== Trace ==== The [[trace of a matrix|trace]], {{math|tr('''A''')}} of a square matrix {{math|'''A'''}} is the sum of its diagonal entries. While matrix multiplication is not commutative as mentioned [[# noncommutative |above]], the trace of the product of two matrices is independent of the order of the factors:{{sfnp|Horn|Johnson|1985|pp=40,42}} \operatorname{tr}(\mathbf{AB}) =\operatorname{tr}(\mathbf{BA}). This is immediate from the definition of matrix multiplication:{{sfnp|Lang|1986|loc=p. 281}} \operatorname{tr}(\mathbf{AB}) = \sum_{i=1}^m \sum_{j=1}^n a_{ij} b_{ji} = \operatorname{tr}(\mathbf{BA}). It follows that the trace of the product of more than two matrices is independent of [[cyclic permutation]]s of the matrices; however, this does not in general apply for arbitrary permutations. For example, {{math|tr('''ABC''') ≠ tr('''BAC''')}}, in general.{{sfnp|Tang|2006|p=226}} Also, the trace of a matrix is equal to that of its transpose,{{sfnp|Bernstein|2009|p=94}} that is, \operatorname{tr}(\bold A) = \operatorname{tr}(\bold A^{\rm T}).
==== Determinant ==== {{Main|Determinant}} [[File:Determinant example.svg|thumb|300px|right|A linear transformation on {{tmath|\R^2}} given by the indicated matrix. The determinant of this matrix is {{math|−1}}, as the area of the green parallelogram at the right is {{math|1}}, but the map reverses the [[orientation (mathematics)|orientation]], since it turns the counterclockwise orientation of the vectors to a clockwise one.]]
The ''determinant'' of a square matrix {{math|'''A'''}} (denoted {{math|det('''A''')}} or {{math|{{abs|'''A'''}}}}) is a number encoding certain properties of the matrix. A matrix is invertible [[if and only if]] its determinant is nonzero.{{sfnp|Horn|Johnson|1985|loc=§0.5 Nonsingularity}} Its [[absolute value]] equals the area (in {{tmath|\R^2}}) or volume (in {{tmath|\R^3}}) of the image of the unit square (or cube), while its sign corresponds to the orientation of the corresponding linear map: the determinant is positive if and only if the orientation is preserved.{{sfnp|Margalit|Rabinoff|2019}}
The determinant of {{times|2|2}} matrices is given by{{cite web|title=Matrix {{!}} mathematics|url=https://britannica.com/science/matrix-mathematics|access-date=2020-08-19|website=Encyclopedia Britannica}} \det \begin{bmatrix}a&b\c&d\end{bmatrix} = ad-bc. The determinant of {{times|3|3}} matrices involves six terms ([[rule of Sarrus]]). The more lengthy [[Leibniz formula for determinants|Leibniz formula]] generalizes these two formulae to all dimensions.{{sfnp|Brown|1991|loc=Definition III.2.1}}
The determinant of a product of square matrices equals the product of their determinants: \det(\bold{AB}) = \det(\bold A) \cdot \det(\bold B), or using alternate notation:{{sfnp|Brown|1991|loc=Theorem III.2.12}} |\bold{AB}| = |\bold A| \cdot |\bold B|. Adding a multiple of any row to another row, or a multiple of any column to another column, does not change the determinant. Interchanging two rows or two columns affects the determinant by multiplying it by {{math|−1}}.{{sfnp|Brown|1991|loc=Corollary III.2.16}} Using these operations, any matrix can be transformed to a lower (or upper) triangular matrix, and for such matrices, the determinant equals the product of the entries on the main diagonal; this provides a method to calculate the determinant of any matrix. Finally, the [[Laplace expansion]] expresses the determinant in terms of [[minor (linear algebra)|minors]], that is, determinants of smaller matrices.{{sfnp|Mirsky|1990|loc=Theorem 1.4.1}} This expansion can be used for a recursive definition of determinants (taking as starting case the determinant of a {{times|1|1}} matrix, which is its unique entry, or even the determinant of a {{times|0|0}} matrix, which is {{math|1}}), that can be seen to be equivalent to the Leibniz formula. Determinants can be used to solve [[linear system]]s using [[Cramer's rule]], where the division of the determinants of two related square matrices equates to the value of each of the system's variables.{{sfnp|Brown|1991|loc=Theorem III.3.18}}
==== Eigenvalues and eigenvectors ==== {{Main|Eigenvalue, eigenvector and eigenspace|l1=Eigenvalues and eigenvectors}} A number \lambda and a nonzero vector {{math|'''v'''}} satisfying \mathbf{A} \mathbf{v} = \lambda \mathbf{v} are called an ''eigenvalue'' and an ''eigenvector'' of {{math|'''A'''}}, respectively.''Eigen'' means "own" in [[German language|German]] and in [[Dutch language|Dutch]]. See [https://en.wiktionary.org/wiki/eigen Wiktionary].{{sfnp|Brown|1991|loc=Definition III.4.1}} The number {{mvar|λ}} is an eigenvalue of an {{times|{{mvar|n}}|{{mvar|n}}}} matrix {{math|'''A'''}} if and only if {{math|('''A''' − ''λ'''''I'''{{sub|''n''}})}} is not invertible, which is [[logical equivalence|equivalent]] to{{sfnp|Brown|1991|loc=Definition III.4.9}} \det(\mathbf{A}-\lambda \mathbf{I}) = 0. The polynomial {{math|''p''{{sub|'''A'''}}}} in an [[indeterminate (variable)|indeterminate]] {{mvar|X}} given by evaluation of the determinant {{math|det({{itco|''X''}}'''I'''{{sub|''n''}} − '''A''')}} is called the [[characteristic polynomial]] of {{math|'''A'''}}. It is a [[monic polynomial]] of [[degree of a polynomial|degree]] {{mvar|n}}. Therefore the polynomial equation {{math|1=''p''{{sub|'''A'''}}(''λ'') = 0}} has at most {{mvar|n}} different solutions, that is, eigenvalues of the matrix.{{sfnp|Brown|1991|loc=Corollary III.4.10}} They may be complex even if the entries of {{math|'''A'''}} are real.{{sfnp|Anton|2010|pp=[https://books.google.com/books?id=ypROEAAAQBAJ&pg=PA317 317–319]}} According to the [[Cayley–Hamilton theorem]], {{math|1=''p''{{sub|'''A'''}}('''A''') = '''0'''}}, that is, the result of substituting the matrix itself into its characteristic polynomial yields the [[zero matrix]].{{sfnp|Bernstein|2009|p=265}}
== Computational aspects == Matrix calculations can be often performed with different techniques. Many problems can be solved by both direct algorithms and iterative approaches. For example, the eigenvectors of a square matrix can be obtained by finding a [[sequence (mathematics)|sequence]] of vectors {{math|'''x'''{{sub|''n''}}}} [[limit of a sequence|converging]] to an eigenvector when {{mvar|n}} tends to [[infinity]].{{sfnp|Householder|1975|loc=Ch. 7}}
To choose the most appropriate algorithm for each specific problem, it is important to determine both the effectiveness and precision of all the available algorithms. The domain studying these matters is called [[numerical linear algebra]].{{sfnp|Bau III|Trefethen|1997}} As with other numerical situations, two main aspects are the [[complexity analysis|complexity]] of algorithms and their [[numerical stability]].
Determining the complexity of an algorithm means finding [[upper bound]]s or estimates of how many elementary operations such as additions and multiplications of scalars are necessary to perform some algorithm, for example, multiplication of matrices. Calculating the matrix product of two {{mvar|n}}-by-{{mvar|n}} matrices using the definition given above needs {{math|''n''{{sup|3}}}} multiplications, since for any of the {{math|''n''{{sup|2}}}} entries of the product, {{mvar|n}} multiplications are necessary. The [[Strassen algorithm]] outperforms this "naive" algorithm; it needs only {{math|''n''{{sup|2.807}}}} multiplications.{{sfnp|Golub|Van Loan|1996|loc=Algorithm 1.3.1}} Theoretically faster but impractical [[matrix multiplication algorithm]]s have been developed,{{sfnp|Vassilevska Williams|Xu|Xu|Zhou|2024}} as have speedups to this problem using [[parallel algorithm]]s or [[distributed computation]] systems such as [[MapReduce]].{{sfnp|Misra|Bhattacharya|Ghosh|2022}}
In many practical situations, additional information about the matrices involved is known. An important case concerns [[sparse matrix|sparse matrices]], that is, matrices whose entries are mostly zero. There are specifically adapted algorithms for, say, solving linear systems {{math|1='''Ax''' = '''b'''}} for sparse matrices {{math|'''A'''}}, such as the [[conjugate gradient method]].{{sfnp|Golub|Van Loan|1996|loc=Chapters 9 and 10, esp. section 10.2}}
An algorithm is, roughly speaking, numerically stable if little deviations in the input values do not lead to big deviations in the result. For example, one can calculate the inverse of a matrix by computing its [[adjugate matrix]]: \bold A^{-1} = \operatorname{adj}(\bold A) / \det(\bold A). However, this may lead to significant rounding errors if the determinant of the matrix is very small. The [[matrix norm|norm of a matrix]] can be used to capture the [[condition number|conditioning]] of linear algebraic problems, such as computing a matrix's inverse.{{sfnp|Golub|Van Loan|1996|loc=Chapter 2.3}}
== Decomposition == {{Main|Matrix decomposition|Matrix diagonalization|Gaussian elimination|Bareiss algorithm}} There are several methods to render matrices into a more easily accessible form. They are generally referred to as ''matrix decomposition'' or ''matrix factorization'' techniques. These techniques are of interest because they can make computations easier.
The [[LU decomposition]] factors matrices as a product of lower ({{math|'''L'''}}) and an upper [[triangular matrix|triangular matrices]] ({{math|'''U'''}}).{{sfnp|Press|Flannery|Teukolsky|Vetterling|1992}} Once this decomposition is calculated, linear systems can be solved more efficiently by a simple technique called [[forward substitution|forward and back substitution]]. Likewise, inverses of triangular matrices are algorithmically easier to calculate. The ''Gaussian elimination'' is a similar algorithm; it transforms any matrix to [[row echelon form]].{{sfnp|Stoer|Bulirsch|2002|loc=Section 4.1}} Both methods proceed by multiplying the matrix by suitable [[elementary matrix|elementary matrices]], which correspond to [[permutation matrix|permuting rows or columns]] and adding multiples of one row to another row. [[Singular value decomposition]] (SVD) expresses any matrix {{math|'''A'''}} as a product {{math|'''UDV'''{{sup|∗}}}}, where {{math|'''U'''}} and {{math|'''V'''}} are [[unitary matrix|unitary matrices]] and {{math|'''D'''}} is a diagonal matrix.{{sfnp|Gbur|2011|pp=146–153}}
[[File:Jordan blocks.svg|right|thumb|250px|An example of a matrix in Jordan normal form. The grey blocks are called Jordan blocks.]] The [[eigendecomposition]] or ''diagonalization'' expresses {{math|'''A'''}} as a product {{math|'''VDV'''{{sup|−1}}}}, where {{math|'''D'''}} is a diagonal matrix and {{math|'''V'''}} is a suitable invertible matrix.{{sfnp|Horn|Johnson|1985|loc=Theorem 2.5.4}} If {{math|'''A'''}} can be written in this form, it is called [[diagonalizable matrix|diagonalizable]]. More generally, and applicable to all matrices, the Jordan decomposition transforms a matrix into [[Jordan normal form]], that is to say matrices whose only nonzero entries are the eigenvalues {{math|''λ''{{sub|1}}}} to {{mvar|λ{{sub|n}}}} of {{math|'''A'''}}, placed on the main diagonal and possibly entries equal to one directly above the main diagonal, as shown at the right.{{sfnp|Horn|Johnson|1985|loc=Ch. 3.1, 3.2}} Given the eigendecomposition, the {{mvar|n}}th power of {{math|'''A'''}} (that is, {{mvar|n}}-fold iterated matrix multiplication) can be calculated via \bold A^n = (\bold{VDV}^{-1})^{n} = \bold{VDV}^{-1} \bold{VDV}^{-1} \ldots \bold{VDV}^{-1} = \bold{VD}^n \bold V^{-1} and the power of a diagonal matrix can be calculated by taking the corresponding powers of the diagonal entries, which is much easier than doing the exponentiation for {{math|'''A'''}} instead. This can be used to compute the [[matrix exponential]] {{math|''e''{{sup|'''A'''}}}}, a need frequently arising in solving [[linear differential equation]]s, [[matrix logarithm]]s and [[square root of a matrix|square roots of matrices]].{{sfnp|Arnold|1992|loc=Sections 14.5, 7, 8}} To avoid numerically [[condition number|ill-conditioned]] situations, further algorithms such as the [[Schur decomposition]] can be employed.{{sfnp|Bronson|1989|loc=Ch. 15}}
== Abstract algebraic aspects and generalizations == Matrices can be generalized in different ways. Abstract algebra uses matrices with entries in more general [[field (mathematics)|fields]] or even [[ring (mathematics)|rings]], while linear algebra codifies properties of matrices in the notion of linear maps. It is possible to consider matrices with infinitely many columns and rows. Another extension is [[tensor]]s, which can be seen as higher-dimensional arrays of numbers, as opposed to vectors, which can often be realized as sequences of numbers, while matrices are rectangular or two-dimensional arrays of numbers.{{sfnp|Coburn|1955|loc=Ch. V}} Matrices, subject to certain requirements tend to form [[group (mathematics)|groups]] known as matrix groups.{{sfnp|Tapp|2016}} Similarly under certain conditions matrices form [[ring (mathematics)|rings]] known as [[matrix ring]]s.{{sfnp|Lam|1999|loc=Chapter 7, §17 Matrix Rings, §17A Characterization and Examples|pp=461–470}} Though the product of matrices is not in general commutative, certain matrices form [[field (mathematics)|fields]] sometimes called matrix fields.{{sfnp|Hachenberger|Jungnickel|2020|loc=Definition 7.2.1|p=[https://books.google.com/books?id=iSAAEAAAQBAJ&pg=PA302 302]}} (However the term "matrix field" is ambiguous, also referring to certain forms of physical [[Field (physics)|fields]] that continuously map points of some space to matrices.{{sfnp|Ydri|2016}}) In general, matrices over any ring and their [[matrix multiplication|multiplication]] can be represented as the arrows and composition of arrows in a [[category (mathematics)|category]], the [[category of matrices]] over that ring. The objects of this category are natural numbers, representing the dimensions of the matrices.{{sfnp|Riehl|2016|pp=[https://books.google.com/books?id=6B9MDgAAQBAJ&pg=PA4 4-6]}}
=== Matrices with entries in a field or ring === This article focuses on matrices whose entries are real or complex numbers. However, matrices can be considered with much more general types of entries than real or complex numbers. As a first step of generalization, any [[field (mathematics)|field]], that is, a [[set (mathematics)|set]] where [[addition]], [[subtraction]], [[multiplication]], and [[division (mathematics)|division]] operations are defined and well-behaved, may be used instead of {{tmath|\R}} or {{tmath|\C}}, for example [[rational number]]s or [[finite field]]s. For example, [[coding theory]] makes use of matrices over finite fields.{{sfnp|Roth|2006|p=[https://books.google.com/books?id=fk7u8awR0hIC&pg=PA27 27]}} Wherever [[eigenvalue]]s are considered, as these are roots of a polynomial, they may exist only in a larger field than that of the entries of the matrix. For instance, they may be complex in the case of a matrix with real entries. The possibility to reinterpret the entries of a matrix as elements of a larger field (for example, to view a real matrix as a complex matrix whose entries happen to be all real) then allows considering each square matrix to possess a full set of eigenvalues.{{sfnp|Chahal|2018|pp=[https://books.google.com/books?id=ZIqADwAAQBAJ&pg=PA115 115–116]}} Alternatively one can consider only matrices with entries in an [[algebraically closed field]], such as {{tmath|\C,}} from the outset.{{sfnp|Meckes|Meckes|2018|pp=[https://books.google.com/books?id=wK1XDwAAQBAJ&pg=PA360 360–361]}}
Matrices whose entries are [[polynomial]]s,{{sfnp|Edwards|2004|p=[https://books.google.com/books?id=ylFR4h5BIDEC&pg=PA80 80]}} and more generally, matrices with entries in a [[ring (mathematics)|ring]] {{mvar|R}} are widely used in mathematics.{{sfnp|Lang|2002|loc=Chapter XIII}} Rings are a more general notion than fields in that a division operation need not exist. The very same addition and multiplication operations of matrices extend to this setting, too. The set {{math|M(''n'', ''R'')}} (also denoted {{math|M''n''(R)}}{{sfnp|Pop|Furdui|2017}}) of all square {{mvar|n}}-by-{{mvar|n}} matrices over {{mvar|R}} is a ring called [[matrix ring]], isomorphic to the [[endomorphism ring]] of the left {{mvar|R}}-[[module (mathematics)|module]] {{mvar|R{{sup|n}}}}.{{sfnp|Lang|2002|p=643|loc=XVII.1}} If the ring {{mvar|R}} is [[commutative ring|commutative]], that is, its multiplication is commutative, then the ring {{math|M(''n'', ''R'')}} is also an [[associative algebra]] over {{math|''R''}}. The [[determinant]] of square matrices over a commutative ring {{mvar|R}} can still be defined using the [[Leibniz formula (determinant)|Leibniz formula]]; such a matrix is invertible if and only if its determinant is [[invertible]] in {{mvar|R}}, generalizing the situation over a field {{mvar|F}}, where every nonzero element is invertible.{{sfnp|Lang|2002|loc=Proposition XIII.4.16}} Matrices over [[superring]]s are called [[supermatrix|supermatrices]].{{sfnp|Reichl|2004|loc=Section L.2}}
Matrices do not always have all their entries in the same ring{{nbsp}}– or even in any ring at all. One special but common case is [[block matrix|block matrices]], which may be considered as matrices whose entries themselves are matrices. The entries need not be square matrices, and thus need not be members of any [[ring (mathematics)|ring]]; but in order to multiply them, their sizes must fulfill certain conditions: each pair of submatrices that are multiplied in forming the overall product must have compatible sizes.{{sfnp|Jeffrey|2010|loc=3.7 Partitioning of matrices|pp=[https://books.google.com/books?id=uan0Dkn9HY8C&pg=PA54 54ff]}}
=== Relationship to linear maps === Linear maps \R^n \to \R^m are equivalent to {{mvar|m}}-by-{{mvar|n}} matrices, as described [[#linear maps|above]]. More generally, any linear map {{math|''f'' : ''V'' → ''W''}} between finite-[[Hamel dimension|dimensional]] [[vector space]]s can be described by a matrix {{math|1='''A''' = (''a{{sub|ij}}'')}}, after choosing [[basis (linear algebra)|bases]] {{math|'''v'''{{sub|1}}, ..., '''v'''{{sub|''n''}}}} of {{mvar|V}}, and {{math|'''w'''{{sub|1}}, ..., '''w'''{{sub|''m''}}}} of {{mvar|W}} (so {{mvar|n}} is the dimension of {{mvar|V}} and {{mvar|m}} is the dimension of {{mvar|W}}), which is such that f(\mathbf{v}j) = \sum{i=1}^m a_{i,j} \mathbf{w}_i\qquad\mbox{for}\ j=1,\ldots,n. In other words, column {{mvar|j}} of {{math|'''A'''}} expresses the image of {{math|'''v'''{{sub|''j''}}}} in terms of the basis vectors {{math|'''w'''{{sub|''i''}}}} of {{mvar|W}}; thus this relation uniquely determines the entries of the matrix {{math|'''A'''}}. The matrix depends on the choice of the bases: different choices of bases give rise to different, but [[matrix equivalence|equivalent matrices]].{{sfnp|Greub|1975|loc=Section III.3}} Many of the above concrete notions can be reinterpreted in this light, for example, the transpose matrix {{math|'''A'''{{sup|T}}}} describes the [[transpose of a linear map|transpose of the linear map]] given by {{math|'''A'''}}, concerning the [[dual space|dual bases]].{{sfnp|Greub|1975|loc=Section III.3.13}}
These properties can be restated more naturally: the [[category of matrices]] with entries in a field k with multiplication as composition is [[equivalence of categories|equivalent]] to the category of finite-dimensional [[vector space]]s and linear maps over this field.{{sfnp|Perrone|2024|pp=99-100}}
More generally, the set of {{times|{{mvar|m}}|{{mvar|n}}}} matrices can be used to represent the {{mvar|R}}-linear maps between the free modules {{mvar|R{{sup|m}}}} and {{mvar|R{{sup|n}}}} for an arbitrary ring {{mvar|R}} with unity. When {{math|1=''n'' = ''m''}} composition of these maps is possible, and this gives rise to the [[matrix ring]] of {{times|{{mvar|n}}|{{mvar|n}}}} matrices representing the [[endomorphism ring]] of {{mvar|R{{sup|n}}}}.{{sfnp|Hungerford|1980|loc=VII.1: Matrices and maps|pages=328–335}}
=== Matrix groups === {{Main|Matrix group}} A [[Group (mathematics)|group]] is a mathematical structure consisting of a set of objects together with a [[binary operation]], that is, an operation combining any two objects to a third, subject to certain requirements.{{sfnp|Horn|Johnson|1985|p=69}} A group in which the objects are [[Invertible matrix|invertible]] {{tmath|n \times n}} matrices and the group operation is matrix multiplication is called a ''matrix group'' of degree {{tmath|n}}.{{sfnp|Baker|2003|loc=Def. 1.30}} Every such matrix group is a [[subgroup]] of (that is, a smaller group contained within) the group of ''all'' invertible {{tmath|n \times n}} matrices, the [[general linear group]] of degree {{tmath|n}}.{{sfnp|Cameron|2014}}
Any property of square matrices that is preserved under matrix products and inverses can be used to define a matrix group. For example, the set of all {{tmath|n \times n}} matrices whose determinant is {{math|1}} form a group called the [[special linear group]] of degree {{tmath|n}}.{{sfnp|Baker|2003|loc=Theorem 1.2}} The set of [[orthogonal matrices]], determined by the condition \bold M^{\rm T} \bold M = \bold I, form the [[orthogonal group]].{{sfnp|Artin|1991|loc=Chapter 4.5}} Every orthogonal matrix has [[determinant]] {{math|1}} or {{math|−1}}. Orthogonal matrices with determinant {{math|1}} form a group called the ''special orthogonal group''.{{sfnp|Serre|2007|p=[https://books.google.com/books?id=vY_xBwAAQBAJ&pg=PA20 20]}}
Every [[finite group]] is [[isomorphic]] to a matrix group, as one can see by considering the [[regular representation]] of the [[symmetric group]].{{sfnp|Rowen|2008|p=198|loc=Example 19.2}} General groups can be studied using matrix groups, which are comparatively well understood, using [[representation theory]].See any reference in representation theory or [[group representation]].
=== Infinite matrices ===
It is also possible to consider matrices with infinitely many rows and columns.See the item "Matrix" in {{Harvard citations|last=Itô | year=1987 |nb=yes}}. The basic operations introduced above are defined the same way in this case. Matrix multiplication, however, and all operations stemming therefrom are only meaningful when restricted to certain matrices, since the [[#matrix_product|sum featuring in the above definition]] of the matrix product will contain an infinity of summands.{{sfnp|Boos|2000|loc=2.2 Dealing with infinite matrices|pp=[https://books.google.com/books?id=kZ9cy6XyidEC&pg=PA34 34–39]}} An easy way to circumvent this issue is to restrict to ''finitary matrices'' all of whose rows (or columns) contain only finitely many nonzero terms.{{sfnp|Grillet|2007|p=[https://books.google.com/books?id=LJtyhu8-xYwC&pg=PA334 334]}} As in the finite case (see [[#linear_maps|above]]), where matrices describe linear maps, infinite matrices can be used to describe [[Hilbert_space#Operators_on_Hilbert_spaces|operators on Hilbert spaces]], where convergence and [[Continuous function|continuity]] questions arise. However, the explicit point of view of matrices tends to obfuscate the matter, "Not much of matrix theory carries over to infinite-dimensional spaces, and what does is not so useful, but it sometimes helps." {{Harvard citations | last1=Halmos |year=1982 | loc=p. 23, Chapter 5|nb=yes}}. and the abstract and more powerful tools of [[functional analysis]] are used instead, by relating matrices to linear maps (as in the finite case [[#linear_maps|above]]), but imposing additional convergence and continuity constraints.
=== Empty matrix === An ''empty matrix'' is a matrix in which the number of rows or columns (or both) is zero."Empty Matrix: A matrix is empty if either its row or column dimension is zero", [https://omatrix.com/manual/glossary.htm Glossary] {{Webarchive|url=https://web.archive.org/web/20090429015728/http://www.omatrix.com/manual/glossary.htm |date=2009-04-29 }}, O-Matrix v6 User Guide"A matrix having at least one dimension equal to zero is called an empty matrix", [https://system.nada.kth.se/unix/software/matlab/Release_14.1/techdoc/matlab_prog/ch_dat29.html MATLAB Data Structures] {{Webarchive|url=https://web.archive.org/web/20091228102653/http://www.system.nada.kth.se/unix/software/matlab/Release_14.1/techdoc/matlab_prog/ch_dat29.html |date=2009-12-28 }} Empty matrices can be a useful [[Base case (recursion)|base case]] for certain [[recursion|recursive]] constructions,{{sfnp|Coleman|Van Loan|1988|p=[https://books.google.com/books?id=zUh9I2mSKxgC&pg=PA213 213]}} and can help to deal with maps involving the [[zero vector space]].{{sfnp|Hazewinkel|Gubareni|2017|p=[http://books.google.com/books?id=5w6lDgAAQBAJ&pg=PA151 151]}} For example, if {{math|'''A'''}} is a {{times|3|0}} matrix and {{math|'''B'''}} is a {{times|0|3}} matrix, then {{math|'''AB'''}} is the {{times|3|3}} [[zero matrix]] corresponding to the null map from a 3-dimensional space {{mvar|V}} to itself, while {{math|'''BA'''}} is a {{times|0|0}} matrix. There is no common notation for empty matrices, but most [[computer algebra system]]s allow creating and computing with them.The notation of empty matrix is used differently from some sources like {{harvp|Bernstein|2009|p=[http://books.google.com/books?id=-c0NxJg4vHMC&pg=PA90 90]}} use {{nowrap| 0_{0 \times n} ,}} resembling the [[zero matrix]]; {{harvp|Hazewinkel|Gubareni|2017|p=[http://books.google.com/books?id=5w6lDgAAQBAJ&pg=PA151 151]}} use {{nowrap| \mathfrak{I}_{0 \times n} .}} The determinant of the {{times|0|0}} matrix is conventionally defined to be 1, consistent with the [[empty product]] occurring in the Leibniz formula for the determinant.{{sfnp|West|2020|p=[https://books.google.com/books?id=0-3vDwAAQBAJ&pg=PA750 750]}} This value is also needed for consistency with the {{times|2|2}} case of the [[Desnanot–Jacobi identity]] relating determinants to the determinants of smaller matrices.{{sfnp|Brualdi|Carmona|van den Driessche|Kirkland|2018|p=[https://books.google.com/books?id=LMRTDwAAQBAJ&pg=PA19 19]}}
=== Matrices with entries in a semiring === A [[semiring]] is similar to a ring, but elements need not have [[additive inverse]]s, therefore one cannot do subtraction freely there. The definition of addition and multiplication of matrices with entries in a ring applies to matrices with entries in a semiring without modification. Matrices of fixed size with entries in a semiring form a [[commutative monoid]] \operatorname{Mat}(m,n;R) under addition.{{sfnp|Farid|Khan|Wang|2013|loc=2087}} Square matrices of fixed size with entries in a semiring form a semiring \operatorname{Mat}(n;R) under addition and multiplication.{{sfnp|Farid|Khan|Wang|2013|loc=2087}}
The determinant of an {{times|{{mvar|n}}|{{mvar|n}}}} square matrix M with entries in a [[commutative semiring]] R cannot be defined in general because the definition would involve additive inverses of semiring elements. What plays its role instead is the pair of positive and negative determinants :\det\nolimits_+M=\sum_{\sigma\in\operatorname{Alt}(n)}M_{1\sigma(1)}\cdots M_{n\sigma(n)} :\det\nolimits_-M=\sum_{\sigma\in\operatorname{Sym}(n)\setminus\operatorname{Alt}(n)}M_{1\sigma(1)}\cdots M_{n\sigma(n)} where the sums are taken over [[even permutation]]s and odd permutations, respectively.{{sfnp|Reutenauer|Straubing|1984|loc=351}}{{sfnp|Ghosh|1996|loc=222}}
=== Matrices with entries in a category === Matrices and their multiplication can be defined with entries objects of a [[category (mathematics)|category]] equipped with a "[[monoidal category|tensor product]]" similar to multiplication in a ring, having [[coproduct]]s similar to addition in a ring, in that the former is [[distributive property|distributive]] over the latter.{{sfnp|Carboni|Kasangian|Walters|1987|loc=137}} However, the multiplication thus defined may be only associative in a sense weaker than usual. These are part of a bigger structure called the ''bicategory of matrices''. The complete description of the above summary for interested readers follows.
Let (\mathcal C,\otimes,I) be a [[monoidal category]] satisfying the following two conditions:
- All (small) [[coproduct]]s exist; in particular, let \varnothing be an [[initial object]].
- The functor \otimes is distributive over coproducts; i.e., for all object X and a family of objects (Y_i){i\in I} in \mathcal C, the canonical \mathcal C-morphisms \coprod{i\in I}(X\otimes Y_i)\to X\otimes\coprod_{i\in I}Y_i\coprod_{i\in I}(Y_i\otimes X)\to\left(\coprod_{i\in I}Y_i\right)\otimes X are [[isomorphism]]s. In particular, the canonical morphisms \varnothing\to X\otimes\varnothing and \varnothing\to\varnothing\otimes X are isomorphisms. Then, the [[bicategory]] of \mathcal C-matrices \operatorname{Mat}(\mathcal C) is as follows:{{sfnp|Carboni|Kasangian|Walters|1987|loc=137}}
- The objects are the sets.
- A [[1-morphism]] M\colon A\to B is a map M\colon A\times B\to\operatorname{Ob}(\mathcal C); this is just a matrix over \mathcal C. ** The composition of 1-morphisms M\colon A\to B and N\colon B\to C, which can be understood as matrix multiplication, is (N\circ M)(a,c)=\coprod_{b\in B}M(a,b)\otimes N(b,c). ** The identity 1-morphism on A is \operatorname{id}_A(a,b)=\begin{cases} I & a=b \ \varnothing & a\ne b \end{cases}.
- A 2-morphism between 1-morphisms M,N\colon A\to B is a family of \mathcal C-morphisms (f_{ab}\colon M(a,b)\to N(a,b))_{(a,b)\in A\times B}. The definition of vertical and horizontal composition of 2-morphisms is natural: the vertical composition is componentwise composition of \mathcal C-morphisms; the horizontal composition is that derived from the functoriality of \otimes and the [[universal property]] of coproducts. In general, the bicategory of matrices need not be a strict [[2-category]]. For example, the composition of 1-morphisms may not be associative in the usual strict sense, but only up to [[Coherency (homotopy theory)|coherent]] isomorphism.
== Applications == There are numerous applications of matrices, both in mathematics and other sciences. Some of them merely take advantage of the compact representation of a set of numbers in a matrix. For example, [[Text mining]] and automated [[thesaurus]] compilation makes use of [[document-term matrix|document-term matrices]] such as [[tf-idf]] to track frequencies of certain words in several documents.{{sfnp|Manning|Schütze|1999 |loc=Section 15.3.4}}
Complex numbers can be represented by particular real 2-by-2 matrices via a + ib \leftrightarrow \begin{bmatrix} a & -b \ b & a \end{bmatrix}, under which addition and multiplication of complex numbers and matrices correspond to each other. For example, 2-by-2 rotation matrices represent the multiplication with some complex number of [[absolute value]] 1, as [[#rotation matrix|above]]. A similar interpretation is possible for [[quaternion]]s{{sfnp|Ward|1997|loc=Ch. 2.8}} and [[Clifford algebra]]s in general.{{sfnp|Abłamowicz|2000|p=[https://books.google.com/books?id=yvCC94xzJG8C&pg=PA436 436]}}
In [[game theory]] and [[economics]], the [[payoff matrix]] encodes the payoff for two players, depending on which out of a given (finite) set of strategies the players choose.{{sfnp|Fudenberg|Tirole|1983|loc=Section 1.1.1}} The expected outcome of the game, when both players play [[mixed strategy|mixed strategies]], is obtained by multiplying this matrix on both sides by vectors representing the strategies.{{sfnp|McHugh|2025|loc=11.2.3 The expected payoff as a vector–matrix–vector product|p=[https://books.google.com/books?id=U9slEQAAQBAJ&pg=PA390 390]}} The [[minimax theorem]] central to game theory is closely related to the [[Dual linear program|duality theory of linear program]]s, which are often formulated in terms of matrix-vector products.{{sfnp|Matoušek|Gärtner|2007|pp=[https://books.google.com/books?id=6MO_RS4z0w8C&pg=PA136 136–137]}}
Early [[encryption]] techniques such as the [[Hill cipher]] also used matrices. However, due to the linear nature of matrices, these codes are comparatively easy to break.{{sfnp|Stinson|2005|loc=Ch. 1.1.5 and 1.2.4}} [[Computer graphics]] uses matrices to represent objects; to calculate transformations of objects using affine [[rotation matrix|rotation matrices]] to accomplish tasks such as projecting a three-dimensional object onto a two-dimensional screen, corresponding to a theoretical camera observation; and to apply image convolutions such as sharpening, blurring, edge detection, and more.{{sfnp|ISRD Group|2005|loc=Ch. 7}} Matrices over a [[polynomial ring]] are important in the study of [[control theory]].{{sfnp|Bhaya|Kaszkurewicz|2006|p=[https://books.google.com/books?id=3X7S_965jywC&pg=PA230 230]}}
[[Chemistry]] makes use of matrices in various ways, particularly since the use of [[quantum mechanics|quantum theory]] to discuss [[chemical bond|molecular bonding]] and [[spectroscopy]]. Examples are the [[overlap matrix]] and the [[Fock matrix]] used in solving the [[Roothaan equations]] to obtain the [[molecular orbital]]s of the [[Hartree–Fock method]].{{sfnp|Jensen|1999|p=[https://archive.org/details/introductiontoco0000jens/page/65/mode/2up?q=matrix 65–69]}}
=== Graph theory === [[File:Labelled undirected graph.svg|150px|thumb|right|An undirected graph with adjacency matrix: \begin{bmatrix} 1 & 1 & 0 \ 1 & 0 & 1 \ 0 & 1 & 0 \end{bmatrix}.]] The [[adjacency matrix]] of a [[finite graph]] is a basic notion of [[graph theory]].{{sfnp|Godsil|Royle|2004|loc=Ch. 8.1}} It records which vertices of the graph are connected by an edge. Matrices containing just two different values ({{math|1}} and {{math|0}} meaning for example "yes" and "no", respectively) are called [[logical matrix|logical matrices]]. The [[distance matrix|distance (or cost) matrix]] contains information about the distances of the edges.{{sfnp|Punnen|Gutin|2002}} These concepts can be applied to [[website]]s connected by [[hyperlink]]s,{{sfnp|Zhang|Yu|Hou|2006|p=[http://books.google.com/books?id=0xhra9vKCnUC&pg=PA7 7]}} or cities connected by roads etc., in which case (unless the connection network is extremely dense) the matrices tend to be [[sparse matrix|sparse]], that is, contain few nonzero entries. Therefore, specifically tailored matrix algorithms can be used in [[network theory]].{{sfnp|Scott|Tůma|2023}}
=== Analysis and geometry === The [[Hessian matrix]] of a [[differentiable function]] f: \R^n \to \R consists of the [[second derivative]]s of {{mvar|ƒ}} concerning the several coordinate directions, that is,{{sfnp|Lang|1987|loc=Ch. XVI.6}} H(f) = \left [\frac {\partial^2f}{\partial x_i , \partial x_j} \right ]. [[File:Saddle Point SVG.svg|left|thumb|At the [[saddle point]] {{math|1=(''x'' = 0, ''y'' = 0)}} (red) of the function {{math|1=''f'' (''x'',−''y'') = ''x''{{sup|2}} − ''y''{{sup|2}}}}, the Hessian matrix \begin{bmatrix} 2 & 0 \ 0 & -2 \end{bmatrix} is [[indefinite matrix|indefinite]].]]It encodes information about the local growth behavior of the function: given a [[critical point (mathematics)|critical point]] {{math|1='''x''' = (''x''{{sub|1}}, ..., ''x{{sub|n}}'')}}, that is, a point where the first [[partial derivative]]s \partial f / \partial x_i of {{mvar|f}} vanish, the function has a [[local minimum]] if the Hessian matrix is [[definiteness of a matrix|positive definite]]. [[Quadratic programming]] can be used to find global minima or maxima of quadratic functions closely related to the ones attached to matrices (see [[#quadratic forms|above]]).{{sfnp|Nocedal|Wright|2006|loc=Ch. 16}}
Another matrix frequently used in geometrical situations is the [[Jacobian matrix and determinant|Jacobi matrix]] of a differentiable map {{tmath|1= f: \R^n \to \R^m }}. If {{math|''f''{{sub|1}}, ..., ''f{{sub|m}}''}} denote the components of {{mvar|f}}, then the Jacobi matrix is defined as{{sfnp|Lang|1987|loc=Ch. XVI.1}} J(f) = \left [\frac {\partial f_i}{\partial x_j} \right ]_{1 \leq i \leq m, 1 \leq j \leq n}. If {{math|''n'' > ''m''}}, and if the rank of the Jacobi matrix attains its maximal value {{mvar|m}}, {{mvar|f}} is locally invertible at that point, by the [[implicit function theorem]].{{Harvard citations |last1=Lang |year=1987 |nb=yes |loc=Ch. XVI.5}}. For a more advanced, and more general statement see {{Harvard citations|last1=Lang|year=1969|nb=yes|loc=Ch. VI.2}}.
[[Partial differential equation]]s can be classified by considering the matrix of coefficients of the highest-order differential operators of the equation. For [[elliptic partial differential equation]]s this matrix is positive definite, which has a decisive influence on the set of possible solutions of the equation in question.{{sfnp|Gilbarg|Trudinger|2001}}
The [[finite element method]] is an important numerical method to solve partial differential equations, widely applied in simulating complex physical systems. It attempts to approximate the solution to some equation by piecewise linear functions, where the pieces are chosen concerning a sufficiently fine grid, which in turn can be recast as a matrix equation.{{Harvard citations |last1=Šolin |year=2005 |nb=yes |loc=Ch. 2.5}}. See also [[stiffness method]].
{{Clear}}
=== Probability theory and statistics === [[File:Markov chain SVG.svg|right|thumb|280px|Two different Markov chains. The chart depicts the number of particles (of a total of 1000) in state "2". Both limiting values can be determined from the transition matrices, which are given by \left[\begin{smallmatrix} 0.7 & 0\ 0.3 & 1 \end{smallmatrix}\right] (red) and \left[\begin{smallmatrix} 0.7 & 0.2\ 0.3 & 0.8 \end{smallmatrix}\right] (black).]] [[Stochastic matrix|Stochastic matrices]] are square matrices whose rows are [[probability vector]]s, that is, whose entries are non-negative and sum up to one. Stochastic matrices are used to define [[Markov chain]]s with finitely many states.{{sfnp|Latouche|Ramaswami|1999}} A row of the stochastic matrix gives the probability distribution for the next position of some particle currently in the state that corresponds to the row. Properties of the Markov chain—like [[absorbing state]]s, that is, states that any particle attains eventually—can be read off the eigenvectors of the transition matrices.{{sfnp|Mehata|Srinivasan|1978|loc=Ch. 2.8}}
Statistics also makes use of matrices in many different forms.{{Citation |last=Healy |first=Michael |title=Matrices for Statistics |year=1986 |publisher=[[Oxford University Press]] |isbn=978-0-19-850702-4 |author-link=Michael Healy (statistician)}} [[Descriptive statistics]] is concerned with describing data sets, which can often be represented as [[data matrix (multivariate statistics)|data matrices]], which may then be subjected to [[dimensionality reduction]] techniques. The [[covariance matrix]] encodes the mutual [[variance]] of several [[random variable]]s.{{sfnp|Krzanowski|1988|p=60|loc=Ch. 2.2}} Another technique using matrices are [[linear least squares]], a method that approximates a finite set of pairs {{math|(''x''{{sub|1}}, ''y''{{sub|1}}), (''x''{{sub|2}}, ''y''{{sub|2}}), ..., (''x''{{sub|''N''}}, ''y''{{sub|''N''}})}}, by a linear function y_i \approx ax_i + b, \quad i = 1, \ldots, N which can be formulated in terms of matrices, related to the [[singular value decomposition]] of matrices.{{sfnp|Krzanowski|1988|loc=Ch. 4.1}}
[[Random matrix|Random matrices]] are matrices whose entries are random numbers, subject to suitable [[probability distribution]]s, such as [[matrix normal distribution]]. Beyond probability theory, they are applied in domains ranging from [[number theory]] to [[physics]].{{Harvard citations |authorlink1=Brian Conrey|last1=Conrey |year=2007 |nb=yes}}{{Harvard citations |last1=Zabrodin |last2=Brézin |last3=Kazakov |last4=Serban |last5=Wiegmann |year=2006 |nb=yes}}
=== Quantum mechanics and particle physics === The first model of [[quantum mechanics]] ([[Werner Heisenberg|Heisenberg]], 1925) used infinite-dimensional matrices to define the operators that took over the role of variables like position, momentum and energy from classical physics.{{sfnp|Schiff|1968|loc=Ch. 6}} (This is sometimes referred to as [[matrix mechanics]].{{sfnp|Peres|1993|p=20}}) Matrices, both finite and infinite-dimensional, have since been employed for many purposes in quantum mechanics. One particular example is the [[density matrix]], a tool used in calculating the [[probabilities]] of the outcomes of [[Measurement in quantum mechanics|measurements]] performed on [[physical system]]s.{{sfnp|Bohm|2001|loc=sections I.8, II.4, and II.8}}{{sfnp|Peres|1993|p=73}}
Linear transformations and the associated [[symmetry|symmetries]] play a [[symmetry in physics|key role in modern physics]]. For example, [[elementary particle]]s in [[quantum field theory]] are classified as representations of the [[Lorentz group]] of special relativity and, more specifically, by their behavior under the [[spin group]]. Concrete representations involving the [[Pauli matrices]] and more general [[gamma matrices]] are an integral part of the physical description of [[fermion]]s, which behave as [[spinor]]s.{{sfnp|Itzykson|Zuber|1980|loc=Ch. 2}} For the three lightest [[quark]]s, there is a group-theoretical representation involving the [[special unitary group]] SU(3); for their calculations, physicists use a convenient matrix representation known as the [[Gell-Mann matrices]], which are also used for the SU(3) [[gauge group]] that forms the basis of the modern description of strong nuclear interactions, [[quantum chromodynamics]]. The [[Cabibbo–Kobayashi–Maskawa matrix]], in turn, expresses the fact that the basic quark states that are important for [[weak interaction]]s are not the same as, but linearly related to the basic quark states that define particles with specific and distinct [[mass]]es.{{sfnp|Burgess|Moore|2007|loc=section 1.6.3. (SU(3)), section 2.4.3.2. (Kobayashi–Maskawa matrix)}}
Another matrix serves as a key tool for describing the scattering experiments that form the cornerstone of experimental particle physics: Collision reactions such as occur in [[particle accelerator]]s, where non-interacting particles head towards each other and collide in a small interaction zone, with a new set of non-interacting particles as the result, can be described as the scalar product of outgoing particle states and a linear combination of ingoing particle states. The linear combination is given by a matrix known as the [[S-matrix]], which encodes all information about the possible interactions between particles.{{sfnp|Weinberg|1995|loc=Ch. 3}}
=== Normal modes === A general application of matrices in physics is the description of linearly coupled harmonic systems. The [[equations of motion]] of such systems can be described in matrix form, with a mass matrix multiplying a generalized velocity to give the kinetic term, and a [[force]] matrix multiplying a displacement vector to characterize the interactions. The best way to obtain solutions is to determine the system's [[eigenvector]]s, its [[normal mode]]s, by diagonalizing the matrix equation. Techniques like this are crucial when it comes to the internal dynamics of [[molecules]]: the internal vibrations of systems consisting of mutually bound component atoms.{{sfnp|Wherrett|1987|loc=part II}} They are also needed for describing mechanical vibrations, and oscillations in electrical circuits.{{sfnp|Riley|Hobson|Bence|1997|loc=7.17}}
=== Geometrical optics === [[Geometrical optics]] provides further matrix applications. In this approximative theory, the [[light wave|wave nature]] of light is neglected. The result is a model in which [[ray (optics)|light rays]] are indeed [[ray (geometry)|geometrical rays]]. If the deflection of light rays by optical elements is small, the action of a [[lens (optics)|lens]] or reflective element on a given light ray can be expressed as multiplication of a two-component vector with a two-by-two matrix called [[ray transfer matrix analysis]]: the vector's components are the light ray's slope and its distance from the optical axis, while the matrix encodes the properties of the optical element. There are two kinds of matrices, viz. a ''refraction matrix'' describing the refraction at a lens surface, and a ''translation matrix'', describing the translation of the plane of reference to the next refracting surface, where another refraction matrix applies. The optical system, consisting of a combination of lenses and reflective elements, is simply described by the matrix resulting from the product of the components' matrices.{{sfnp|Guenther|1990|loc=Ch. 5}}
The [[Jones calculus]] models the [[Polarization (physics)|polarization]] of a light source as a 2\times 2 vector, and the effects of [[optical filter]]s on this polarization vector as a matrix.{{sfnp|Han|Kim|Noz|1997}}
=== Electronics === Electronic circuits that are composed of linear components (such as resistors, inductors and capacitors) obey [[Kirchhoff's circuit laws]], which leads to a system of linear equations, which can be described with a matrix equation that relates the source currents and voltages to the resultant currents and voltages at each point in the circuit, and where the matrix entries are determined by the circuit.{{sfnp|Suresh Kumar|2009|pp=747–749}}
== History == Matrices have a long history of application in solving [[linear equation]]s but they were known as arrays until the 1800s. The [[Chinese mathematics|Chinese text]] ''[[The Nine Chapters on the Mathematical Art]]'' written in the 10th–2nd century BCE is the first example of the use of array methods to solve [[system of linear equations|simultaneous equations]],{{Harvard citations |last1=Shen |last2=Crossley |last3=Lun |year=1999 |nb=yes}} cited by {{Harvard citations |last1=Bretscher |year=2005|nb=yes|loc=p. 1}} including the concept of [[determinant]]s. In 1545 Italian mathematician [[Gerolamo Cardano]] introduced the method to Europe when he published ''Ars Magna''.{{sfnp|Dossey|2002|pp=564–565}} The [[Japanese mathematics|Japanese mathematician]] [[Seki Kowa|Seki]] used the same array methods to solve simultaneous equations in 1683.{{cite book |last1=Needham |first1=Joseph |author-link1=Joseph Needham |last2=Wang Ling |author-link2=Wang Ling (historian) |title=Science and Civilisation in China |url=https://books.google.com/books?id=jfQ9E0u4pLAC&pg=PA117 |volume=III |year=1959 |publisher=Cambridge University Press |location=Cambridge |isbn=((978-0-521-05801-8)) |page=117}} The Dutch mathematician'' ''[[Jan de Witt]] represented transformations using arrays in his 1659 book ''Elements of Curves'' (1659).{{sfnp|Dossey|2002|p=564}} Between 1700 and 1710 [[Gottfried Wilhelm Leibniz]] publicized the use of arrays for recording information or solutions and experimented with over 50 different systems of arrays.{{sfnp|Dossey|2002|pp=564–565}} [[Gabriel Cramer|Cramer]] presented [[Cramer's rule|his rule]] in 1750.{{sfnp|Cramer|1750}}{{sfnp|Kosinski|2001}}
This use of the term ''matrix'' in mathematics (an English word for "womb" in the 19th century, from Latin, as well as a jargon word [[Matrix (printing)|in printing]], [[Matrix (biology)|in biology]] and [[Matrix (geology)|in geology]]{{Citation |url=https://archive.org/details/oed6barch/page/238/mode/1up |title=Matrix |work=[[A New English Dictionary on Historical Principles]] |volume=6, pt. 2 (M–N) |year=1908 |publisher=Clarendon Press |place=Oxford |page=238 |editor1-last=Murray |editor1-first=James |editor1-link=James Murray (lexicographer) |editor2-last=Bradley |editor2-first=Henry |editor2-link=Henry Bradley}}) was coined by [[James Joseph Sylvester]] in 1850,The earliest published example is J. J. Sylvester (1850) "Additions to the articles in the September number of this journal, 'On a new class of theorems,' and on Pascal's theorem," ''The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science'', '''37''': 363-370. [https://books.google.com/books?id=CBhDAQAAIAAJ&pg=PA369 From page 369]: "For this purpose, we must commence, not with a square, but with an oblong arrangement of terms consisting, suppose, of m lines and n columns. This does not in itself represent a determinant, but is, as it were, a Matrix out of which we may form various systems of determinants ... " who understood a matrix as an object giving rise to several determinants today called [[minor (linear algebra)|minors]], that is to say, determinants of smaller matrices that derive from the original one by removing columns and rows. In an 1851 paper, Sylvester explains:{{sfnp|Sylvester|1904|loc=[https://books.google.com/books?id=5GQPlxWrDiEC&pg=PA247 Paper 37]|p=247}}
{{Blockquote|I have in previous papers defined a "Matrix" as a rectangular array of terms, out of which different systems of determinants may be engendered from the womb of a common parent.}}
[[Arthur Cayley]] published a treatise on geometric transformations using matrices that were not rotated versions of the coefficients being investigated as had previously been done. Instead, he defined operations such as addition, subtraction, multiplication, and division as transformations of those matrices and showed the associative and distributive properties held. Cayley investigated and demonstrated the non-commutative property of matrix multiplication as well as the commutative property of matrix addition.{{sfnp|Dossey|2002|pp=564–565}} Early matrix theory had limited the use of arrays almost exclusively to determinants and Cayley's abstract matrix operations were revolutionary. He was instrumental in proposing a matrix concept independent of equation systems. In 1858, Cayley published his ''A memoir on the theory of matrices''{{sfnp|Cayley|1858}}{{sfnp|Dieudonné|1978|loc=Vol. 1, Ch. III, p. 96}} in which he proposed and demonstrated the [[Cayley–Hamilton theorem]].{{sfnp|Dossey|2002|pp=564–565}}
The English mathematician [[Cuthbert Edmund Cullis]] was the first to use modern bracket notation for matrices in 1913 and he simultaneously demonstrated the first significant use of the notation {{math|1='''A''' = [''a''{{sub|''i'',''j''}}]}} to represent a matrix where {{math|''a''{{sub|''i'',''j''}}}} refers to the {{mvar|i}}th row and the {{mvar|j}}th column.{{sfnp|Dossey|2002|pp=564–565}}
The modern study of determinants sprang from several sources.{{sfnp|Knobloch|1994}} [[number theory|Number-theoretical]] problems led [[Gauss]] to relate coefficients of [[quadratic form]]s, that is, expressions such as {{math|''x''{{sup|2}} + ''xy'' − 2''y''{{sup|2}}}}, and [[linear map]]s in three dimensions to matrices. [[Gotthold Eisenstein|Eisenstein]] further developed these notions, including the remark that, in modern parlance, [[matrix product]]s are [[non-commutative]]. [[Augustin-Louis Cauchy|Cauchy]] was the first to prove general statements about determinants, using as the definition of the determinant of a matrix {{math|1='''A''' = [''a''{{sub|''i'',''j''}}]}} the following: replace the powers {{math|''a''{{sub|''j''}}{{sup|''k''}}}} by {{math|''a''{{sub|''j'',''k''}}}} in the [[polynomial]] a_1 a_2 \cdots a_n \prod_{i < j} (a_j - a_i), where \textstyle\prod denotes the [[multiplication|product]] of the indicated terms. He also showed, in 1829, that the [[eigenvalue]]s of symmetric matrices are real.{{sfnp|Hawkins|1975}} [[Carl Gustav Jacob Jacobi|Jacobi]] studied "functional determinants"—later called [[Jacobian matrix and determinant|Jacobi determinant]]s by Sylvester—which can be used to describe geometric transformations at a local (or [[infinitesimal]]) level, see [[#Jacobi matrix|above]]. [[Leopold Kronecker|Kronecker]]'s ''Vorlesungen über die Theorie der Determinanten''{{Harvard citations |last1=Kronecker |editor1-last=Hensel |year=1897 |nb=yes}} and [[Karl Weierstrass|Weierstrass]]'s ''Zur Determinantentheorie'',{{Harvard citations |last1=Weierstrass |year=1915 |volume=3 |loc=pp. 271–286 |nb=yes}} both published in 1903, first treated determinants [[axiom]]atically, as opposed to previous more concrete approaches such as the mentioned formula of Cauchy. At that point, determinants were firmly established.{{sfnp||Miller|1930}}{{sfnp|Knobloch|1994}}
Many theorems were first established for small matrices only, for example, the [[Cayley–Hamilton theorem]] was proved for {{times|2|2}} matrices by Cayley in the aforementioned memoir, and by [[William Rowan Hamilton|Hamilton]] for {{times|4|4}} matrices. [[Georg Frobenius|Frobenius]], working on [[bilinear form]]s, generalized the theorem to all dimensions (1898). Also at the end of the 19th century, the [[Gauss–Jordan elimination]] (generalizing a special case now known as [[Gauss elimination]]) was established by [[Wilhelm Jordan (geodesist)|Wilhelm Jordan]]. In the early 20th century, matrices attained a central role in linear algebra,{{sfnp|Bôcher|2004}} partially due to their use in the classification of the [[hypercomplex number]] systems of the previous century.{{sfnp|Hawkins|1972}}
The inception of [[matrix mechanics]] by [[Werner Heisenberg|Heisenberg]], [[Max Born|Born]] and [[Pascual Jordan|Jordan]] led to studying matrices with infinitely many rows and columns.{{sfnp|van der Waerden|2007|pp=28–40}} Later, [[John von Neumann|von Neumann]] carried out the [[mathematical formulation of quantum mechanics]], by further developing [[functional analysis|functional analytic]] notions such as [[linear operator]]s on [[Hilbert space]]s, which, very roughly speaking, correspond to [[Euclidean space]], but with an infinity of [[Hamel dimension|independent directions]].{{sfnp|Peres|1993|pp=79,106–107}}
=== Other historical usages of the word "matrix" in mathematics === The word has been used in unusual ways by at least two authors of historical importance.
[[Bertrand Russell]] and [[Alfred North Whitehead]] in their ''[[Principia Mathematica]]'' (1910–1913) use the word "matrix" in the context of their [[axiom of reducibility]]. They proposed this axiom as a means to reduce any function to one of lower type, successively, so that at the "bottom" (0 order) the function is identical to its [[extension (predicate logic)|extension]]:Whitehead, Alfred North; and Russell, Bertrand (1913) ''Principia Mathematica to *56'', Cambridge at the University Press, Cambridge UK (republished 1962) cf page 162ff.
{{Blockquote|Let us give the name of ''matrix'' to any function, of however many variables, that does not involve any [[apparent variable]]s. Then, any possible function other than a matrix derives from a matrix using generalization, that is, by considering the proposition that the function in question is true with all possible values or with some value of one of the arguments, the other argument or arguments remaining undetermined.}}
For example, a function {{math|Φ(''x'', ''y'')}} of two variables {{mvar|x}} and {{mvar|y}} can be reduced to a ''collection'' of functions of a single variable, such as {{mvar|y}}, by "considering" the function for all possible values of "individuals" {{mvar|a{{sub|i}}}} substituted in place of a variable {{mvar|x}}. And then the resulting collection of functions of the single variable {{mvar|y}}, that is, {{math|∀''a''{{sub|''i''}}: Φ(''a''{{sub|''i''}}, ''y'')}}, can be reduced to a "matrix" of values by "considering" the function for all possible values of "individuals" {{math|''b''{{sub|''i''}}}} substituted in place of variable {{mvar|y}}: \forall b_j \forall a_i \colon \phi(a_i, b_j).
[[Alfred Tarski]] in his 1941 ''Introduction to Logic'' used the word "matrix" synonymously with the notion of [[truth table]] as used in mathematical logic.{{sfnp|Tarski|1941|p=[https://books.google.com/books?id=5MeNCgAAQBAJ&pg=PA40 40]}}
== See also == {{Portal|Mathematics}}
- [[List of named matrices]]
- {{annotated link|Gram–Schmidt process}}
- [[Irregular matrix]]
- {{annotated link|Matrix calculus}}
- {{annotated link|Matrix function}}
== Notes == {{reflist|colwidth=20em}} {{Reflist|group=nb|3}}
== References == === Mathematical references === {{refbegin|30em}}
- {{citation|title=Elementary Linear Algebra|first1=Stephen|last1=Andrilli|first2=David|last2=Hecker|edition=6th|publisher=Academic Press|year=2022|isbn=9780323984263}}
- {{Citation |title=Elementary Linear Algebra|first=Howard|last=Anton|edition=10th|publisher=John Wiley & Sons|year=2010|isbn=978-0-470-45821-1|page=414|url=https://books.google.com/books?id=YmcQJoFyZ5gC&pg=PA414}}
- {{Citation |last1=Arnold |first1=Vladimir I. |author-link1=Vladimir Arnold |translator-last=Cooke |translator-first=Roger |title=Ordinary differential equations |publisher=[[Springer-Verlag]] |location=Berlin, DE; New York, NY |isbn=978-3-540-54813-3 |year=1992}}
- {{Citation |last1=Artin |first1=Michael |author1-link=Michael Artin |title=Algebra |publisher=[[Prentice Hall]] |isbn=978-0-89871-510-1 |year=1991}}
- {{citation|title=Linear Algebra Done Right|series=Undergraduate Texts in Mathematics|first=Sheldon|last=Axler|edition=2nd|publisher=Springer|year=1997|isbn= 9780387982595}}
- {{Citation |last1=Baker |first1=Andrew J. |title=Matrix Groups: An Introduction to Lie Group Theory |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |isbn=978-1-85233-470-3 |year=2003 |url-access=registration |url=https://archive.org/details/matrixgroupsintr0000bake }}
- {{Citation |last1=Bau III |first1=David |last2=Trefethen |first2=Lloyd N. |author2-link=Lloyd N. Trefethen |title=Numerical linear algebra |publisher=Society for Industrial and Applied Mathematics |location=Philadelphia, PA |isbn=978-0-89871-361-9 |year=1997}} *{{citation|first1=Adi|last1=Ben-Israel|first2=Thomas Nall Eden|last2=Greville |title=Generalized Inverses: Theory and Applications| edition=2nd|location= New York, NY|publisher=Springer|year=2003|isbn=978-0-387-00293-4 |doi=10.1007/b97366|author-link2=Thomas N. E. Greville|author-link=Adi Ben-Israel}}
- {{citation |last=Bernstein |first=Dennis S. |title=Matrix mathematics: theory, facts, and formulas |date=2009 |publisher=Princeton University Press |isbn=978-1-4008-3334-4 |edition=2nd |location=Princeton, N.J}}
- {{citation|title=Control Perspectives on Numerical Algorithms and Matrix Problems|volume=10|series=Advances in Design and Control|first1=Amit|last1=Bhaya|first2=Eugenius|last2=Kaszkurewicz|publisher=SIAM|year=2006|isbn=9780898716023}}
- {{citation|title=Introduction to the Mathematical and Statistical Foundations of Econometrics|first=Herman J.|last=Bierens|publisher=Cambridge University Press|year=2004|isbn=9780521542241}}
- {{citation|title=Classical and Modern Methods in Summability|series=Oxford mathematical monographs|first=Johann|last=Boos|publisher=Oxford University Press|year=2000|isbn=9780198501657}}
- {{Citation |last1=Bretscher |first1=Otto |title=Linear Algebra with Applications |publisher=Prentice Hall |edition=3rd |year=2005}}
- {{Citation |first1=Richard |last1=Bronson |year=1970 |lccn=70097490 |title=Matrix Methods: An Introduction |publisher=[[Academic Press]] |location=New York}}
- {{Citation |last1=Bronson |first1=Richard |title=Schaum's outline of theory and problems of matrix operations |publisher=[[McGraw–Hill]] |location=New York |isbn=978-0-07-007978-6 |year=1989}}
- {{Citation |last1=Brown |first1=William C. |title=Matrices and vector spaces |publisher=[[Marcel Dekker]] |location=New York, NY |isbn=978-0-8247-8419-5 |year=1991 |url-access=registration |url=https://archive.org/details/matricesvectorsp0000brow }}
- {{citation | last1 = Brualdi | first1 = Richard A. | author1-link = Richard A. Brualdi | last2 = Carmona | first2 = Ángeles | last3 = van den Driessche | first3 = P. | author3-link = Pauline van den Driessche | last4 = Kirkland | first4 = Stephen | last5 = Stevanović | first5 = Dragan | editor1-last = Encinas | editor1-first = Andrés M. | editor2-last = Mitjana | editor2-first = Margarida | doi = 10.1007/978-3-319-70953-6 | isbn = 978-3-319-70952-9 | mr = 3791450 | publisher = Birkhäuser/Springer, Cham | series = Advanced Courses in Mathematics. CRM Barcelona | title = Combinatorial Matrix Theory | year = 2018}}
- {{citation | last = Cameron | first = Peter J. | editor-last = Hogben | editor-first = Leslie | contribution = Matrix groups | contribution-url = https://webspace.maths.qmul.ac.uk/p.j.cameron/preprints/mgo.pdf | edition = 2nd | isbn = 978-1-4665-0728-9 | mr = 3013937 | publisher = CRC Press, Boca Raton, FL | series = Discrete Mathematics and its Applications (Boca Raton) | title = Handbook of Linear Algebra | year = 2014}}
- {{citation|last1=Carboni|first1=Aurelio|last2=Kasangian|first2=Stefano|last3=Walters|first3=Robert|title=An axiomatics for bicategories of modules|journal=Journal of Pure and Applied Algebra|volume=45|issue=2|pages=127–141|date=1987|doi=10.1016/0022-4049(87)90065-X|mr=0889588|zbl=0615.18006}}
- {{citation|title=Fundamentals of Linear Algebra|first=J. S.|last=Chahal|publisher=CRC Press|year=2018|isbn=9780429758119}}
- {{Citation |last1=Coburn |first1=Nathaniel |title=Vector and tensor analysis |publisher=Macmillan |location=New York, NY |oclc=1029828 |year=1955}}
- {{citation|title=Handbook for Matrix Computations|volume=4|series=Frontiers in Applied Mathematics|first1=Thomas F.|last1=Coleman|first2=Charles|last2=Van Loan|publisher=SIAM|year=1988|isbn=9780898712278}}
- {{Citation |last1=Conrey |first1=J. Brian |title=Ranks of elliptic curves and random matrix theory |publisher=[[Cambridge University Press]] |isbn=978-0-521-69964-8 |year=2007}}
- {{citation | last = Dossey | first = John A. | edition = 4th | isbn = 9780321079121 | publisher = Addison Wesley | title = Discrete Mathematics | year = 2002}}
- {{Citation | last = Conway | first = John B. | author-link = John B. Conway | title = A Course in Functional Analysis | publisher = Springer | year = 1990 | edition = 2nd | series = Graduate Texts in Mathematics | volume = 96 | isbn = 0-387-97245-5 }}
- {{citation|title=Linear Algebra|first=Harold M.|last=Edwards|author-link=Harold Edwards (mathematician)|publisher=Springer Science & Business Media|year=2004|isbn=9780817643706}}
- {{citation|last1=Farid|first1=F. O.|last2=Khan|first2=Israr Ali|last3=Wang|first3=Qing-Wen|title=On matrices over an arbitrary semiring and their generalized inverses|journal=Linear Algebra and its Applications|volume=439|issue=7|pages=2085–2105|date=2013|doi=10.1016/j.laa.2013.06.002|mr=3090456|zbl=1283.15016}}
- {{Citation |first1=John B. |last1=Fraleigh |year=1976 |isbn=0-201-01984-1 |title=A First Course In Abstract Algebra |edition=2nd |publisher=[[Addison-Wesley]] |location=Reading}}
- {{Citation |last1=Fudenberg |first1=Drew |last2=Tirole |first2=Jean |author2-link=Jean Tirole |title=Game Theory |publisher=[[MIT Press]] |year=1983}}
- {{citation | last = Gentle | first = James E. | isbn = 9780387985428 | publisher = Springer | title = Numerical Linear Algebra for Applications in Statistics | year = 1998}}
- {{citation|last1=Ghosh|first1=Shamik|title=Matrices over semirings|journal=Information Sciences|volume=90|issue=1-4|pages=221–230|date=1996|doi=10.1016/0020-0255(95)00283-9|mr=1388422|zbl=0884.15010}}
- {{Citation |last1=Gilbarg |first1=David |last2=Trudinger |first2=Neil S. |author2-link=Neil Trudinger |title=Elliptic partial differential equations of second order |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |edition=2nd |isbn=978-3-540-41160-4 |year=2001}}
- {{Citation |first1=Chris |last1=Godsil |author-link1=Chris Godsil |first2=Gordon |last2=Royle |author-link2=Gordon Royle |title=Algebraic Graph Theory |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |series=Graduate Texts in Mathematics |isbn=978-0-387-95220-8 |year=2004 |volume=207}}
- {{Citation |last1=Golub |first1=Gene H. |author1-link=Gene H. Golub |last2=Van Loan |first2=Charles F. |author2-link=Charles F. Van Loan |title=Matrix Computations |publisher=Johns Hopkins |edition=3rd |isbn=978-0-8018-5414-9 |year=1996}}
- {{Citation |last1=Greub |first1=Werner Hildbert |title=Linear algebra |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |series=Graduate Texts in Mathematics |isbn=978-0-387-90110-7 |year=1975}}
- {{citation|title=Abstract Algebra|volume=242|series=[[Graduate Texts in Mathematics]]|first=Pierre Antoine|last=Grillet|edition=2nd|publisher=Springer|year=2007|isbn=9780387715681}} *{{citation | last1 = Hachenberger | first1 = Dirk | last2 = Jungnickel | first2 = Dieter | doi = 10.1007/978-3-030-60806-4 | isbn = 978-3-030-60804-0 | location = Cham | mr = 4233161 | publisher = Springer | series = Algorithms and Computation in Mathematics | title = Topics in Galois Fields | volume = 29 | year = 2020}}
- {{Citation |last1=Halmos |first1=Paul Richard |author1-link=Paul Halmos |title=A Hilbert space problem book |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |edition=2nd |series=Graduate Texts in Mathematics |isbn=978-0-387-90685-0 |mr=675952 |year=1982 |volume=19}}
- {{citation|title=Elementary Linear Algebra|first=Stanley I.|last=Grossman|edition=5th|publisher=Saunders College Pub.|year=1994|isbn= 9780030973543}}
- {{citation | last = Hamilton | first = A. G. | isbn = 9780521310413 | publisher = Cambridge University Press | title = A First Course in Linear Algebra: With Concurrent Examples | year = 1987}}
- {{citation | title = Algebras, Rings and Modules, Volume 2: Non-commutative Algebras and Rings | first1 = Michiel | last1 = Hazewinkel | first2 = Nadiya M. | last2 = Gubareni | publisher = CRC Press | year = 2017 | edition = 2nd }}
- {{Citation |last1=Horn |first1=Roger A. |author1-link=Roger Horn |last2=Johnson |first2=Charles R. |author2-link=Charles Royal Johnson |title=Matrix Analysis |publisher=Cambridge University Press |isbn=978-0-521-38632-6 |year=1985}}
- {{Citation |last1=Householder |first1=Alston S. |title=The theory of matrices in numerical analysis |publisher=[[Dover Publications]] |location=New York, NY |mr=0378371 |year=1975}}
- {{citation | last = Hungerford | first = Thomas W. | isbn = 0-387-90518-9 | mr = 600654 | publisher = Springer-Verlag, New York-Berlin | series = Graduate Texts in Mathematics | title = Algebra | volume = 73 | year = 1980}}
- {{Citation |author=ISRD Group |title=Computer Graphics |publisher=Tata McGraw–Hill |isbn=978-0-07-059376-3 |year=2005}}
- {{Citation |editor1-last=Itô |editor1-first=Kiyosi |title=Encyclopedic dictionary of mathematics. Vol. I-IV |publisher=MIT Press |edition=2nd |isbn=978-0-262-09026-1 |mr=901762 |year=1987}}
- {{citation|title=Matrix Operations for Engineers and Scientists: An Essential Guide in Linear Algebra|first=Alan|last=Jeffrey|publisher=Springer|year=2010|isbn=9789048192748}}
- {{citation|title=Introduction to Linear and Matrix Algebra|first=Nathaniel|last=Johnston|publisher=Springer Nature|year=2021|isbn=9783030528119}}
- {{Citation |first1=Erwin |last1=Kreyszig |year=1972 |isbn=0-471-50728-8 |title=Advanced Engineering Mathematics |edition=3rd |publisher=[[John Wiley & Sons|Wiley]] |location=New York |url=https://archive.org/details/advancedengineer00krey}}.
- {{Citation |last1=Krzanowski |first1=Wojtek J. |title=Principles of multivariate analysis |publisher=The Clarendon Press Oxford University Press |series=Oxford Statistical Science Series |isbn=978-0-19-852211-9 |mr=969370 |year=1988 |volume=3}} *{{citation | last = Lam | first = T. Y. | doi = 10.1007/978-1-4612-0525-8 | isbn = 0-387-98428-3 | mr = 1653294 | publisher = Springer-Verlag, New York | series = Graduate Texts in Mathematics | title = Lectures on Modules and Rings | volume = 189 | year = 1999}}
- {{citation|title=The Theory of Matrices: With Applications|first1=Peter|last1=Lancaster|first2=Miron|last2=Tismenetsky|edition=2nd|publisher=Elsevier|year=1985|isbn=9780080519081}}
- {{Citation |last1=Lang |first1=Serge |author1-link=Serge Lang |title=Analysis II |publisher=[[Addison-Wesley]] |year=1969}}
- {{Citation | last = Lang | first = Serge | author1-link=Serge Lang | title = Introduction to Linear Algebra | publisher = Springer | year = 1986 | edition = 2nd | isbn = 9781461210702 }}
- {{Citation |last1=Lang |first1=Serge |title=Calculus of several variables |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |edition=3rd |isbn=978-0-387-96405-8 |year=1987 |url=https://archive.org/details/calculusofsevera0000lang}}
- {{Lang Algebra}}
- {{Citation |last1=Latouche |first1=Guy |last2=Ramaswami |first2=Vaidyanathan |title=Introduction to matrix analytic methods in stochastic modeling |publisher=Society for Industrial and Applied Mathematics |location=Philadelphia, PA |edition=1st |isbn=978-0-89871-425-8 |year=1999}}
- {{Citation |last1=Manning |first1=Christopher D. |last2=Schütze |first2=Hinrich |title=Foundations of statistical natural language processing |publisher=MIT Press |isbn=978-0-262-13360-9 |year=1999}}
- {{Citation |last1=Margalit |first1=Dan |last2=Rabinoff |first2=Joseph |website=Interactive Linear Algebra |title=Determinants and Volumes |url=https://textbooks.math.gatech.edu/ila/determinants-volumes.html |publisher=Georgia Institute of Technology |access-date=2025-05-10 |year=2019}}
- {{citation|title=Understanding and Using Linear Programming|first1=Jiří|last1=Matoušek|author1-link=Jiří Matoušek (mathematician)|first2=Bernd|last2=Gärtner|publisher=Springer Science & Business Media|year=2007|isbn=9783540307174}}
- {{citation|title=Algebraic Structure and Matrices, Being Part II of Advanced Algebra|first=E. A.|last=Maxwell|year=1969|publisher=Cambridge University Press}}
- {{citation|title=Finite Mathematics: An Introduction with Applications in Business, Social Sciences, and Music|first=Andrew|last=McHugh|publisher=Academic Press|year= 2025|isbn= 9780443290954}}
- {{citation|title=Linear Algebra|first1=Elizabeth S.|last1=Meckes|author1-link=Elizabeth Meckes|first2=Mark W.|last2=Meckes|publisher=Cambridge University Press|year= 2018|isbn=9781316836026}}
- {{Citation |last1=Mehata |first1=K. M. |last2=Srinivasan |first2=S. K. |title=Stochastic processes |publisher=McGraw–Hill |location=New York, NY |isbn=978-0-07-096612-3 |year=1978}}
- {{citation|title=Invitation to Linear Algebra|series=Textbooks in Mathematics|first=David C.|last=Mello|publisher=CRC Press|year= 2017|isbn= 9781498779586}}
- {{Citation |last1=Mirsky |first1=Leonid |author-link=Leon Mirsky |title=An Introduction to Linear Algebra |url=https://books.google.com/books?id=ULMmheb26ZcC&q=linear+algebra+determinant&pg=PA1 |publisher=Courier Dover Publications |isbn=978-0-486-66434-7 |year=1990}}
- {{citation | last1 = Misra | first1 = Chandan | last2 = Bhattacharya | first2 = Sourangshu | last3 = Ghosh | first3 = Soumya K. | date = June 2022 | doi = 10.1109/tbdata.2020.2977326 | issue = 3 | journal = IEEE Transactions on Big Data | pages = 699–710 | title = Stark: Fast and scalable Strassen's matrix multiplication using Apache Spark | volume = 8| arxiv = 1811.07325 }}
- {{Citation |first1=Evar D. |last1=Nering |year=1970 |title=Linear Algebra and Matrix Theory |edition=2nd |publisher=[[John Wiley & Sons|Wiley]] |location=New York |lccn=76-91646}}
- {{Citation |last1=Nocedal |first1=Jorge |last2=Wright |first2=Stephen J. |title=Numerical Optimization |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |edition=2nd |isbn=978-0-387-30303-1 |year=2006 |page=449}}
- {{Citation |last1=Oualline |first1=Steve |title=Practical C++ programming |publisher=[[O'Reilly Media|O'Reilly]] |isbn=978-0-596-00419-4 |year=2003}}
- {{Citation |last = Perrone |first = Paolo |title = Starting Category Theory |date = 2024 |publisher = World Scientific|doi = 10.1142/9789811286018_0005 |isbn = 978-981-12-8600-1|url = https://www.worldscientific.com/worldscibooks/10.1142/13670|url-access = subscription}}
- {{citation|title=Matrices and Transformations|series=Dover Books on Mathematics|first=Anthony J.|last=Pettofrezzo|publisher=Courier Corporation|year=1978|isbn=9780486636344}}
- {{Citation |title=Theory of Matrices|series=Dover books on advanced mathematics|first=Sam|last=Perlis|publisher=Courier Dover Corporation|year=1991|isbn=978-0-486-66810-9|page=103|url=https://books.google.com/books?id=5_sxtcnvLhoC&pg=PA103}}
- {{Citation|last1=Pop|last2=Furdui|year=2017|title=Square Matrices of Order 2|publisher=Springer International Publishing|isbn=978-3-319-54938-5}}
- {{Citation |last1=Press |first1=William H. |last2=Flannery |first2=Brian P. |last3=Teukolsky |first3=Saul A. |author3-link=Saul Teukolsky |last4=Vetterling |first4=William T. |title=Numerical Recipes in FORTRAN: The Art of Scientific Computing |chapter-url=https://mpi-hd.mpg.de/astrophysik/HEA/internal/Numerical_Recipes/f2-3.pdf |publisher=Cambridge University Press |edition=2nd |year=1992 |chapter=LU Decomposition and Its Applications |pages=34–42 |url-status=unfit |archive-url=https://web.archive.org/web/20090906113144/http://www.mpi-hd.mpg.de/astrophysik/HEA/internal/Numerical_Recipes/f2-3.pdf |archive-date=2009-09-06}}
- {{ citation |first1=Murray H. |last1=Protter | first2=Charles B. Jr. | last2=Morrey |year=1970 |lccn=76087042 |title=College Calculus with Analytic Geometry |edition=2nd |publisher=[[Addison-Wesley]] |location=Reading}}
- {{Citation |last1=Punnen |first1=Abraham P. |last2=Gutin |first2=Gregory |title=The traveling salesman problem and its variations |publisher=Kluwer Academic Publishers |location=Boston, MA |isbn=978-1-4020-0664-7 |year=2002}}
- {{citation|title=Linear Algebra|volume=19|series=Texts and Readings in Mathematics|first1=A.|last1=Ramachandra Rao|first2=P.|last2=Bhimasankaram|edition=2nd|publisher=Springer|year=2000|isbn=9789386279019}}
- {{citation|last1=Reutenauer|first1=Christophe|last2=Straubing|first2=Howard|title=Inversion of matrices over a commutative semiring|journal=Journal of Algebra|volume=88|issue=2|pages=350–360|date=1984|doi=10.1016/0021-8693(84)90070-X|mr=0747520|zbl=0563.15011}}
- {{citation | last = Reyes | first = Manuel | arxiv = 2409.08421 | doi = 10.1090/noti3100 | issue = 2 | journal = Notices of the American Mathematical Society | mr = 4854325 | pages = 145–153 | title = A tour of noncommutative spectral theories | volume = 72 | year = 2025}}
- {{citation|last = Riehl |first = Emily | author-link = Emily Riehl |title = Category Theory in Context |date = 2016 |publisher = Dover |url = https://books.google.com/books?id=6B9MDgAAQBAJ |isbn = 9780486809038 }}
- {{citation|title=Introduction to Coding Theory|first=Ron|last=Roth|publisher=Cambridge University Press|year=2006|isbn=9780521845045}}
- {{Citation |last1=Rowen |first1=Louis Halle |title=Graduate Algebra: noncommutative view |publisher=[[American Mathematical Society]] |location=Providence, RI |isbn=978-0-8218-4153-2 |year=2008}}
- {{Citation |title=Matrices and Linear Algebra |series=Dover Books on Mathematics |first1=Hans |last1=Schneider |first2=George Phillip |last2=Barker |publisher=Courier Dover Corporation |year=2012 |isbn=978-0-486-13930-2 |page=251 |url=https://books.google.com/books?id=9vjBAgAAQBAJ&pg=PA251}}
- {{Citation | last1 = Scott | first1 = J. | last2 = Tůma | first2 = M. | title = Algorithms for Sparse Linear Systems | chapter = Sparse Matrices and Their Graphs | year = 2023 | publisher = Birkhäuser | location = Cham | series = Nečas Center Series | pages = 19–30 | isbn = 978-3-031-25819-0 | doi = 10.1007/978-3-031-25820-6_2 | doi-access = free }}
- {{citation|title=Matrices: Theory and Applications|volume=216|series=[[Graduate Texts in Mathematics]]|first=Denis|last=Serre|publisher=Springer Science & Business Media|year=2007|isbn= 9780387227580|doi=10.1007/978-1-4419-7683-3}}
- {{Citation |last1=Šolin |first1=Pavel |title=Partial Differential Equations and the Finite Element Method |publisher=[[Wiley-Interscience]] |isbn=978-0-471-76409-0 |year=2005}}
- {{Citation |last1=Stinson |first1=Douglas R. |title=Cryptography |publisher=Chapman & Hall/CRC |series=Discrete Mathematics and its Applications |isbn=978-1-58488-508-5 |year=2005}}
- {{Citation |last1=Stoer |first1=Josef |last2=Bulirsch |first2=Roland |title=Introduction to Numerical Analysis |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |edition=3rd |isbn=978-0-387-95452-3 |year=2002}}
- {{Citation |last=Suresh Kumar |first=K. S. |title=Electric Circuits and Networks |publisher=Dorling Kindersley |year=2009 |isbn=978-81-317-1390-7}}
- {{Citation |last=Tang |first=K. T. |title=Mathematical Methods for Engineers and Scientists 1: Complex Analysis, Determinants and Matrices |publisher=Springer |year=2006 |isbn=978-3-540-30273-5}} *{{citation | last = Tapp | first = Kristopher | doi = 10.1090/stml/079 | edition = 2nd | isbn = 978-1-4704-2722-1 | location = Providence, Rhode Island | mr = 3468869 | publisher = American Mathematical Society | series = Student Mathematical Library | title = Matrix Groups for Undergraduates | volume = 79 | year = 2016}} *{{citation | last = Van Loan | first = Charles F. | author-link = Charles F. Van Loan | doi = 10.1016/S0377-0427(00)00393-9 | issue = 1–2 | journal = [[Journal of Computational and Applied Mathematics]] | mr = 1798520 | pages = 85–100 | title = The ubiquitous Kronecker product | volume = 123 | year = 2000| bibcode = 2000JCoAM.123...85L }}
- {{citation |last1=Vassilevska Williams |first1=Virginia |last2=Xu |first2=Yinzhan |last3=Xu |first3=Zixuan |last4=Zhou |first4=Renfei |contribution=New bounds for matrix multiplication: from alpha to omega |title=Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA) |year=2024|pages=3792–3835 |arxiv=2307.07970 |doi=10.1137/1.9781611977912.134|isbn=978-1-61197-791-2 }}
- {{Citation |last1=Ward |first1=J. P. |title=Quaternions and Cayley numbers |publisher=Kluwer Academic Publishers Group |location=Dordrecht, NL |series=Mathematics and its Applications |isbn=978-0-7923-4513-8 |mr=1458894 |year=1997 |volume=403 |doi=10.1007/978-94-011-5768-1 |url-access=registration |url=https://archive.org/details/quaternionscayle0000ward }}
- {{citation | last = Watkins | first = David S. | title = Fundamentals of Matrix Computations | year = 2002 | url = https://books.google.com/books?id=xi5omWiQ-3kC | publisher = [[John Wiley & Sons]] | isbn = 978-0-471-46167-8 }}
- {{citation|title=Combinatorial Mathematics|first=Douglas B.|last=West|author-link=Douglas West (mathematician)|publisher=Cambridge University Press|year=2020|isbn=9781108889520}}
- {{citation |title=Introduction to Linear Algebra |edition=2nd |first=T. A. |last=Whitelaw |publisher=CRC Press |year=1991 |isbn=9780751401592 |page=29 |url=https://books.google.com/books?id=6M_kDzA7-qIC }}
- {{citation | last1 = Zhang | first1 = Yanchun | last2 = Yu | first2 = Jeffrey Xu | last3 = Hou | first3 = Jingyu | title = Web Communities: Analysis and Construction | year = 2006 | publisher = Springer | isbn = 978-3-540-27737-8 }} {{refend}}
=== Physics references === {{refbegin|30em}}
- {{citation|title=Clifford Algebras and their Applications in Mathematical Physics, Volume 1: Algebra and Physics|first=Rafał|last=Abłamowicz|volume=18|series=Progress in Mathematical Physics|publisher=Birkhäuser / Springer|year=2000|isbn=9780817641825}}
- {{citation | last1 = Bauchau | first1 = O. A. | last2 = Craig | first2 = J. I. | isbn = 9789048125166 | publisher = Springer | series = Solid Mechanics and Its Applications | title = Structural Analysis: With Applications to Aerospace Structures | volume = 163 | year = 2009}}
- {{citation |last=Boas |first=Mary L. |author-link=Mary L. Boas |title=Mathematical Methods in the Physical Sciences |title-link=Mathematical Methods in the Physical Sciences |year=2005 |edition=3rd |publisher=John Wiley & Sons |isbn=978-0-471-19826-0}}
- {{Citation |last=Bohm |first=Arno |title=Quantum Mechanics: Foundations and Applications |publisher=Springer |year=2001 |isbn=0-387-95330-2}}
- {{Citation |last1=Burgess |first1=Cliff |last2=Moore |first2=Guy |title=The Standard Model. A Primer |publisher=Cambridge University Press |year=2007 |bibcode=2007smp..book.....B |isbn=978-0-521-86036-9}}
- {{Citation |last=Gbur |first=Greg |author-link=Greg Gbur |title=Mathematical Methods in Optical Physics and Engineering |year=2011 |publisher=Cambridge University Press |bibcode=2011mmop.book.....G |isbn=978-0-521-51610-5}}
- {{Citation |last=Guenther |first=Robert D. |title=Modern Optics |publisher=John Wiley |year=1990 |isbn=0-471-60538-7}}
- {{citation | last1 = Han | first1 = D. | last2 = Kim | first2 = Y. S. | last3 = Noz | first3 = Marilyn E. | date = September 1997 | doi = 10.1364/josaa.14.002290 | issue = 9 | journal = Journal of the Optical Society of America A | page = 2290 | publisher = Optica Publishing Group | title = Jones-matrix formalism as a representation of the Lorentz group | url = https://scholar.archive.org/work/g6cqiliqqrcjxalo44oayibjw4 | volume = 14| arxiv = physics/9703032 | bibcode = 1997JOSAA..14.2290H }}
- {{Citation |last1=Itzykson |first1=Claude|author1-link=Claude Itzykson |last2=Zuber |first2=Jean-Bernard|author2-link=Jean-Bernard Zuber |title=Quantum Field Theory |publisher=McGraw–Hill |year=1980 |isbn=0-07-032071-3 |url-access=registration |url=https://archive.org/details/quantumfieldtheo0000itzy }}
- {{citation |last1=Jensen |first1=Frank |title=Introduction to Computational Chemistry |year=1999 |publisher=John Wiley & Sons |isbn=0-471-98085-4 |url=https://archive.org/details/introductiontoco0000jens}}
- {{citation|last=Peres |first=Asher |author-link=Asher Peres |title=Quantum Theory: Concepts and Methods |title-link=Quantum Theory: Concepts and Methods |year=1993 |publisher=Kluwer |isbn=978-0-7923-3632-7 }}
- {{Citation |last=Reichl |first=Linda E.|author-link=Linda Reichl |title=The transition to chaos: conservative classical systems and quantum manifestations |publisher=Springer-Verlag |location=Berlin, DE; New York, NY |isbn=978-0-387-98788-0 |year=2004}}
- {{Citation |last1=Riley |first1=Kenneth F.|author1-link=Ken Riley (physicist) |last2=Hobson |first2=Michael P. |last3=Bence |first3=Stephen J. |title=Mathematical methods for physics and engineering |publisher=Cambridge University Press |year=1997 |isbn=0-521-55506-X}}
- {{Citation |last=Schiff |first=Leonard I. |author-link=Leonard I. Schiff |title=Quantum Mechanics |edition=3rd |publisher=McGraw–Hill |year=1968}}
- {{Citation |last=Weinberg |first=Steven |author-link=Steven Weinberg |title=The Quantum Theory of Fields. Volume I: Foundations |publisher=Cambridge University Press |year=1995 |isbn=0-521-55001-7 |url=https://archive.org/details/quantumtheoryoff00stev}}
- {{Citation |last=Wherrett |first=Brian S. |year=1987 |title=Group Theory for Atoms, Molecules and Solids |publisher=Prentice–Hall International |isbn=0-13-365461-3}}
- {{citation|title=Lectures on Matrix Field Theory|volume=929|series=Lecture Notes in Physics|first=Badis|last=Ydri|publisher =Springer|year=2016|isbn= 9783319460031}}
- {{Citation |last1=Zabrodin |first1=Anton |last2=Brézin |first2=Édouard |author2-link=Édouard Brézin|last3=Kazakov |first3=Vladimir |last4=Serban |first4=Didina |last5=Wiegmann |first5=Paul|author5-link=Paul Wiegmann |title=Applications of Random Matrices in Physics (NATO Science Series II: Mathematics, Physics and Chemistry) |publisher=[[Springer-Verlag]] |location=Berlin, DE; New York, NY |isbn=978-1-4020-4530-1 |year=2006}} {{refend}}
=== Historical references === {{refbegin|30em}}
- {{Citation |last1=Bôcher |first1=Maxime |author1-link=Maxime Bôcher |title=Introduction to Higher Algebra |publisher=[[Dover Publications]] |location=New York, NY |isbn=978-0-486-49570-5 |year=2004}}, reprint of the 1907 original edition
- {{citation | last = Cayley | first = Arthur | author-link = Arthur Cayley | date = December 1858 | doi = 10.1098/rstl.1858.0002 | journal = [[Philosophical Transactions of the Royal Society of London]] | jstor = 108649 | pages = 17–37 | title = A memoir on the theory of matrices | volume = 148}}; reprinted in ''The collected mathematical papers of Arthur Cayley'', vol. II, Cambridge University Press, 1889, [https://archive.org/details/collectedmathema02cayluoft/page/474 pp. 475–496].
- {{Citation |last1=Cayley |first1=Arthur |author1-link=Arthur Cayley |title=The collected mathematical papers of Arthur Cayley |url=https://quod.lib.umich.edu/cgi/t/text/pageviewer-idx?c=umhistmath;cc=umhistmath;rgn=full%20text;idno=ABS3153.0001.001;didno=ABS3153.0001.001;view=image;seq=00000140 |publisher=[[Cambridge University Press]] |year=1889 |volume=I (1841–1853) |pages=123–126}}
- {{citation |title = Introduction à l'Analyse des lignes Courbes algébriques |last = Cramer | first = Gabriel | author-link = Gabriel Cramer |year = 1750 |location = Geneva |language = fr |url = https://www.europeana.eu/resolve/record/03486/E71FE3799CEC1F8E2B76962513829D2E36B63015 |access-date = 2012-05-18 |publisher = Europeana |pages = 656–659}}
- {{Citation |editor1-last=Dieudonné |editor1-first=Jean |editor1-link=Jean Dieudonné |title=Abrégé d'histoire des mathématiques 1700-1900 |publisher=Hermann |location=Paris, FR |year=1978}}
- {{citation | last = Hawkins | first = Thomas | author-link = Thomas W. Hawkins Jr. | doi = 10.1007/bf00328434 | issue = 4 | journal = [[Archive for History of Exact Sciences]] | pages = 243–287 | title = Hypercomplex numbers, Lie groups, and the creation of group representation theory | volume = 8 | year = 1972}}
- {{Citation |last1=Hawkins |first1=Thomas | author-link = Thomas W. Hawkins Jr. |title=Cauchy and the spectral theory of matrices |mr=0469635 |year=1975 |journal=[[Historia Mathematica]] |volume=2 |pages=1–29 |doi=10.1016/0315-0860(75)90032-4|doi-access= }}
- {{citation | last = Knobloch | first = Eberhard | author-link = Eberhard Knobloch | editor1-last = Sasaki | editor1-first = Chikara | editor2-last = Sugiura | editor2-first = Mitsuo | editor3-last = Dauben | editor3-first = Joseph W. | contribution = From Gauß to Weierstraß: determinant theory and its historical evaluations | doi = 10.1007/978-3-0348-7521-9_5 | isbn = 3-7643-5029-6 | mr = 1308079 | pages = 51–66 | publisher = Birkhäuser | series = Science Networks: Historical Studies | title = The Intersection of History and Mathematics | volume = 15 | year = 1994}}
- {{citation |last = Kosinski |first = A. A. |title = Cramer's Rule is due to Cramer |journal = [[Mathematics Magazine]] |volume = 74 |pages = 310–312 |year = 2001 |issue = 4 |doi = 10.2307/2691101 |jstor = 2691101 }}
- {{Citation |last1=Kronecker |first1=Leopold |author1-link=Leopold Kronecker |editor1-last=Hensel |editor1-first=Kurt |editor1-link=Kurt Hensel |title=Leopold Kronecker's Werke |url=https://quod.lib.umich.edu/cgi/t/text/text-idx?c=umhistmath;idno=AAS8260.0002.001 |publisher=Teubner |year=1897}}
- {{citation | last = Miller | first = G. A. | author-link = George Abram Miller | date = May 1930 | doi = 10.1080/00029890.1930.11987058 | issue = 5 | journal = [[The American Mathematical Monthly]] | jstor = 2299112 | pages = 216–219 | title = On the history of determinants | volume = 37}}
- {{Citation |last1=Shen |first1=Kangshen |last2=Crossley |first2=John N. |author2-link=John Newsome Crossley|last3=Lun |first3=Anthony Wah-Cheung |title=Nine Chapters of the Mathematical Art, Companion and Commentary |publisher=[[Oxford University Press]] |edition=2nd |isbn=978-0-19-853936-0 |year=1999}}
- {{citation |first=J. J.|last=Sylvester|author-link=James Joseph Sylvester|editor-first=H. F.|editor-last=Baker|editor-link=Henry F. Baker|title=The Collected Mathematical Papers of James Joseph Sylvester, Volume I (1837–1853)|location=Cambridge, England|publisher=Cambridge University Press|year=1904|url=https://archive.org/details/collectedmathem01sylvrich}}
- {{Citation |editor-last1=van der Waerden |editor-first1=B. L. |editor-link1=Bartel Leendert van der Waerden |title=Sources of Quantum Mechanics |publisher=Dover |year=2007 |orig-year=1968 |isbn=978-0-486-45892-2}}
- {{citation|last=Tarski|first=Alfred|author-link=Alfred Tarski|year=1941|title=Introduction to Logic and the Methodology of Deductive Sciences|publisher=Oxford University Press|mr=0003375}}; reprint of 1946 corrected printing, Dover Publications, 1995, {{ISBN|0-486-28462-X}}
- {{Citation |last1=Weierstrass |first1=Karl |author1-link=Karl Weierstrass |title=Collected Works |url=https://quod.lib.umich.edu/cgi/t/text/text-idx?c=umhistmath;idno=AAN8481.0003.001 |year=1915 |volume=3}} {{refend}}
== Further reading ==
- {{SpringerEOM|title=Matrix|id=p/m062780}}
- {{Citation |title=The Matrix Cookbook |url=https://math.uwaterloo.ca/~hwolkowi//matrixcookbook.pdf |date=November 15, 2012|first1=Kaare Brandt|last1=Petersen|first2=Michael Syskind|last2=Petersen|publisher=University of Waterloo|access-date=24 March 2014 }}
- {{Citation |last1=Brookes |first1=Mike |title=The Matrix Reference Manual |url=https://ee.ic.ac.uk/hp/staff/dmb/matrix/intro.html |publisher=[[Imperial College]] |location=London |year=2005 |access-date=10 Dec 2008 |archive-date=16 December 2008 |archive-url=https://web.archive.org/web/20081216124433/http://www.ee.ic.ac.uk/hp/staff/dmb/matrix/intro.html |url-status=dead }}
== External links == {{sister project links|d=y|c=Category:matrix|b=Linear Algebra|v=Linear algebra#Matrices|s=no|m=no|mw=no|wikt=matrix|voy=no|species=no|q=no|n=no}}
- {{citation|url=https://mathshistory.st-andrews.ac.uk/HistTopics/Matrices_and_determinants/ |title=Matrices and determinants|work=[[MacTutor History of Mathematics Archive]]|publisher=[[University of St Andrews]]|first1=J. J.|last1=O'Connor|first2=E. F.|last2=Robertson|date=February 1996}}
- [https://economics.soton.ac.uk/staff/aldrich/matrices.htm Matrices and Linear Algebra on the Earliest Uses Pages]
- [https://jeff560.tripod.com/matrices.html Earliest Uses of Symbols for Matrices and Vectors]
{{Linear algebra}} {{Tensors}} {{Matrix classes}}
{{authority control}}
[[Category:Matrices (mathematics)| ]]
From MOAI Insights

공장의 뇌는 어떻게 생겼는가 — 제조운영 AI 아키텍처 해부
지식관리, 업무자동화, 의사결정지원 — 따로 보면 다 있던 것들입니다. 제조 AI의 진짜 차이는 이 셋이 순환하면서 '우리 공장만의 지능'을 만든다는 데 있습니다.

그 30분을 18년 동안 매일 반복했습니다 — 품질팀장이 본 AI Agent
18년차 품질팀장이 매일 아침 30분씩 반복하던 데이터 분석을 AI Agent가 3분 만에 해냈습니다. 챗봇과는 완전히 다른 물건 — 직접 시스템에 접근해서 데이터를 꺼내고 분석하는 AI의 현장 도입기.

ERP 20년, 나는 왜 AI를 얹기로 했나
ERP 20년차 제조IT본부장의 고백: 3,200만 행의 데이터가 잠들어 있었다. ERP를 바꾸지 않고 AI를 얹자, 일주일 걸리던 불량 분석이 수 초로 줄었다.
Want to apply this in your factory?
MOAI helps manufacturing companies adopt AI tailored to their operations.
Talk to us →