Authors:Evan Chrisney, echrisney@gmail.com (Fall 2018), Trey Henrichsen, treyhenrichsen@gmail.com (Winter 2020)
Matrix inverses are matrices which when multiplied by their oringinal matrices equate to the identity matrix. Only square matrices are invertible, but rectangular matrices may have left or right inverses. A matrix A has a left inverse if there exists a matrix B such that BA=I, and a right inverse if there exists a matrix C such that AC=I (Moon and Stirling, 2000). Square matrices that are not invertible are called singular matrices. A matrix is invertible if it has both a right and left inverse.
Matrix inverses are important because they provide solutions to the ubiquitous equation Ax=b. There are many conditions that satisfy the invertibility of a matrix which will be explained in detail below. If a true matrix inverse does not exist for A, or if A is not square, then the left and right pseudo inverses may be used to solve Ax=b.
Let A be an m×n matrix.
Matrix B is the left inverse of A iff BA=I. The left inverse can only exist when m≥n, i.e. A is either tall or square. Additionally, for A to have a left inverse, the columns of A must be linearly independant. If A has a left inverse, then Ax=b has at most one solution.
Likewise, matrix C is the right inverse of matrix A iff AC=I. The right inverse can only exist when m≤n, i.e. the matrix is either wide or square. Additionally, the columns of A must span the whole space. In other words, rank(A)=m. If A has a right inverse, then Ax=b has at least one solution.
Neither left nor right inverses are necessarily unique. For example, take the matrices A=[100010]
Likewise, for the matrix A=[100100],
If matrix A has both a left and a right inverse, those inverses are the same, and are called the inverse of A, and A is invertible. As can be seen from the requirements for a left inverse (m≥n) and right inverse(m≤n), only square matrices can be invertible.
If A has an inverse B, then: AB=BA=I
Below is a list of properties equivalent to invertibility for square matrices. In other words, all invertible matrices have the following properties, and any n×n matrix A that has any one of these properties is invertible and has all other properties on this list.
There are many methods for calculation matrix inverses. These methods vary in complexity and speed. In some situations, such as diagonal matrices, matrix inversion may be simplified. In general, it is best to use existing implementations, such as those in NumPy, MatLab, and Eigen. There may be times where you have to implement matrix inversion yourself, and so two are given below.
The adjugate of A can be used to determine A−1 as A−1=1det(A)adj(A),
Invertible matrices can be diagonalized as A=SΛS−1, where Λ is a diagonal matrix of the eigenvalues of A and S is formed by stacking the column eigenvectors of A. The inverse of A can be calculated as A−1=SΛ−1S−1
A non-square (n x m) matrix may have a left or right inverse, as explained above. These inverses may or may not be unique. The simplest way to calculate a right or left inverse is with the Moore-Penrose pseudoinverse. B=(AHA)−1AH(left inverse)
In general the pseudo inverse of A is written as A†, whether it is a left or right inverse. Note that the Moore-Penrose pseudo inverse requires a matrix inversion. Matrices which do not have a pseudo inverse will cause this inversion to fail.
Here is some simple python code that shows the concepts of matrix inversion for a square matrix, using Cramer's rule, as well as computation of the left and right inverses, which all should be equal for this example.
import numpy as np
# numpy includes its own inverse, but this function drives home the topics above.
######################################################
# Compute the matrix inverse of a square matrix using the adjugate method
# Declare a matrix that we know is invertible
A = np.matrix([[-3, 2, -5], [-1, 0, 2], [3, -4, 1]])
# get the determinant
m = np.linalg.det(A)
# Grab length of A which we know is nxn
l = len(A)
# Initialize the adjugate matrix to 0's
C = np.zeros((l,l))
# Loop through A and compute the adjugate
for i in range(l):
for j in range(l):
# Single out the rows/cols to compute minor determinants
#
# I realize i and j are swapped, this saves the step of
# transposing C later.
temp = np.delete(A, (j), axis = 0)
temp = np.delete(temp, (i), axis = 1)
# Compute minor determinant
M = np.linalg.det(temp)
# Compute the adjugate
C[i][j] = (-1)**(i+j)*M
# Finish by computing the adjugate
A_inv = 1/m*C
# Print out matrix inverse
print(A_inv)
#####################################################
# Now find the left inverse, which should be the same
A_tran = A.getH()
Grammian_l = np.linalg.inv(A_tran*A)
A_dagger_l = Grammian_l*A_tran
print(A_dagger_l)
#####################################################
# Now find the right inverse which is the same
Grammian_r = np.linalg.inv(A*A_tran)
A_dagger_r = A_tran*Grammian_r
print(A_dagger_r)
[[-0.26666667 -0.6 -0.13333333] [-0.23333333 -0.4 -0.36666667] [-0.13333333 0.2 -0.06666667]] [[-0.26666667 -0.6 -0.13333333] [-0.23333333 -0.4 -0.36666667] [-0.13333333 0.2 -0.06666667]] [[-0.26666667 -0.6 -0.13333333] [-0.23333333 -0.4 -0.36666667] [-0.13333333 0.2 -0.06666667]]
As written above, the matrix inverses provide a solution to the ubiquitous equation Ax=b. An engineering example used in my research quite frequently is using least squares (LS) to fit a line to some data, where we'll solve for s in As=x using a left pseudo inverse.
As a brief introduction to this application, I will fit normalized radar cross section (RCS) σ0 data vs incidence angle from a satellite radar at C band know as the advanced scatteromer, or ASCAT. ASCAT is a fan beam scatterometer that observes the earth surface at a variable range of incidence angles, from about 30 to 60 degrees. The purpose of this application is to determine what the σ0 incidence angle dependence (s) is at C band in order to normalize ASCAT measurements to one incidence angle for cross calibration purposes.
Previous studies have shown that σ0 exhibits a log-linear dependence with incidence angle over tropical rainforests over the mid incidence angle range from about 30 to 60 degrees incidence (N. Madsen, BYU Masters Thesis, 2015). Due to the log-linear dependence, the dependence is easily estimated as the slope s of the first order polynomial
s1θ+s2=σ0,
where s1 and s2 are the coefficients of s and s1 is the slope which we're solving for, θ is the ASCAT incidence angle data, and σ0 is the ASCAT RCS data. In matrix form, this equation is
[θ,1][s1,s2]T=σ0,
which easily seen of the form As=x.
To determine the ASCAT σ0 incidence angle dependence, we create the LS equation
s=A†x,
where s is the dB/degree dependence, A† is the left psuedo inverse of the ASCAT incidence angle data, and x is σ0 RCS data. Again, only the first coefficient of s is of interest, as it gives the slope of the line.
In our application, we will only use a small portion of the actual ASCAT data, since there are millions of measurements in a given day. To simplify this, I truncated millions of measurements for a day into 21 measurements to give a feel for what the approximate incidence angle dependence s is.
import numpy as np
##############################################################################################
# Evan Chrisney
# 671 Application code for Pseudo Inverses
# This script will use a left inverse to solve a 1st order poylnomial equation as described above
# Create A matrix as described above using actual ASCAT incdience angle values
A = np.matrix([[59, 48.63, 50.14, 54.22, 39.23, 52.36, 52.6, 44.27, 55.07, 59.06\
,58.32, 58.5, 37.49, 45.61, 52.13, 42.04, 54.74, 58.53, 38.19, 45.74, 59.31], \
[1,1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]])
# Create sigma_0 row vector using actual ASCAT RCS values
sigma_0 = np.array([-9.5103, -8.2828, -8.0002, -8.2415, -8.4277, -9.0573, \
-8.6784, -8.5909, -9.0481, -9.6975, -8.8720, -9.7875, -8.3998, -10.0404, \
-10.0735, -6.8197, -9.2742, -9.5769, -7.5610, -8.9871, -10.3787])
# Get length of sigma_0 for use in reshape the row vector to a column vec
l = len(sigma_0)
# Transpose A into a 21x2 matrix instead of 2x21
A = A.transpose()
# Transpose sigma_0 into a column vector
sigma_0 = np.reshape(sigma_0, (l, 1))
# Get A hermitian, although transpose would also work since these are reals
A_tran = A.getH()
# Create the left inverse using the Grammian as above
Grammian_l = np.linalg.inv(A_tran*A)
# Compute the left inverse of A as above
A_dagger_l = Grammian_l*A_tran
# Solve for s using the left inverse
s = A_dagger_l*sigma_0
# Print out s, where the first coefficient is the approximate sigma^0 incidence angle dependence for ASCAT,
# which is -0.0758 dB/degree
print(s)
[[-0.07582702] [-5.07314663]]
When using a robotic arm, (known more formally as a serial link manipulator), it is often necesary to find a set of joint angles that put the end manipulator in the correct pose (position and orientation). The process of finding the joint angles to reach a given pose is known as inverse kinematics. Arms with many joints are often too complex for an analytical solution in every case, and so a numerical solution can be computed using the pseudo inverse. This method is a form of gradient descent optimization. For more on gradient descent, see Gradient Descent.
For this application, the vector q contains the angles of each joint in the robot. J(q) is the jacobian of the robot in a given position, such that J(q)˙q=v, where v is a 6-vector containing the linear and angular velocities of the end effector. The calculation of J(q) is beyond the scope of this work, but is solved problem.
The general method for inverse kinematics is as such:
Using the pseudo inverse works well because it is flexible enough to work with underactuated arms. A problem with this direct approach is that J(q) is not guaranteed to meet the requirements to have a pseudo inverse, often in configurations such as the arm reaching the edge of its workspace, or in situations similar to gimbal lock. To prevent this, a damped pseudo inverse is often used, (J(q)TJ(q)+λI)−1J(q)T, where λ is a damping coefficient. This change guarantees that a pseudo inverse exists.