Computing `AB⁻¹` with `np.linalg.solve()`

Question

I need to compute AB⁻¹ in Python / Numpy for two matrices A and B (B being square, of course).

I know that np.linalg.inv() would allow me to compute B⁻¹, which I can then multiply with A. I also know that B⁻¹A is actually better computed with np.linalg.solve().

Inspired by that, I decided to rewrite AB⁻¹ in terms of np.linalg.solve(). I got to a formula, based on the identity (AB)ᵀ = BᵀAᵀ, which uses np.linalg.solve() and .transpose():

np.linalg.solve(a.transpose(), b.transpose()).transpose()

that seems to be doing the job:

import numpy as np


n, m = 4, 2
np.random.seed(0)
a = np.random.random((n, n))
b = np.random.random((m, n))

print(np.matmul(b, np.linalg.inv(a)))
# [[ 2.87169378 -0.04207382 -1.10553758 -0.83200471]
#  [-1.08733434  1.00110176  0.79683577  0.67487591]]
print(np.linalg.solve(a.transpose(), b.transpose()).transpose())
# [[ 2.87169378 -0.04207382 -1.10553758 -0.83200471]
#  [-1.08733434  1.00110176  0.79683577  0.67487591]]
print(np.all(np.isclose(np.matmul(b, np.linalg.inv(a)), np.linalg.solve(a.transpose(), b.transpose()).transpose())))
# True

and also comes up much faster for sufficiently large inputs:

n, m = 400, 200
np.random.seed(0)
a = np.random.random((n, n))
b = np.random.random((m, n))

print(np.all(np.isclose(np.matmul(b, np.linalg.inv(a)), np.linalg.solve(a.transpose(), b.transpose()).transpose())))
# True

%timeit np.matmul(b, np.linalg.inv(a))
# 100 loops, best of 3: 13.3 ms per loop
%timeit np.linalg.solve(a.transpose(), b.transpose()).transpose()
# 100 loops, best of 3: 7.71 ms per loop

My question is: does this identity always stand correct or there are some corner cases I am overlooking?

Btw, there are a couple of things you can do to make your code more succinct and readable: 1) use `a.T` instead of `a.transpose()`, and 2) use the `@` operator for matrix multiplication instead of `np.matmul()`. So your check would be `np.allclose(b @ a.T, np.linalg.solve(a.T, b.T).T)`. — vanPelt2, May 20 '20 at 16:07

Mad Physicist · Accepted Answer · 2020-05-20T17:12:21.513

8

In general, np.linalg.solve(B, A) is equivalent to B^-1A. The rest is just math.

In all cases, (AB)^T = B^TA^T: https://math.stackexchange.com/q/1440305/295281.

Not necessary for this case, but for invertible matrices, (AB)^-1 = B^-1A^-1: https://math.stackexchange.com/q/688339/295281.

For an invertible matrix, it is also the case that (A^-1)^T = (A^T)^-1: https://math.stackexchange.com/q/340233/295281.

From that it follows that (AB^-1)^T = (B^-1)^TA^T = (B^T)^-1A^T. As long as B is invertible, you should have no issues with the transformation you propose in any case.

edited May 20 '20 at 17:12

answered May 20 '20 at 16:04

Mad Physicist

107,652
25
181
264

2

I am not sure why do we need `(AB)⁻¹ = B⁻¹A⁻¹`. For the rest, that was precisely my line of thinking. – norok2 May 20 '20 at 17:10
2

@norok2. For good measure :). I added it before I realized that I wasn't going to use it. I left it in because it doesn't hurt. I prefixed it with a note now. – Mad Physicist May 20 '20 at 17:11

Computing `AB⁻¹` with `np.linalg.solve()`

1 Answers1