jvp optimisations for pseudoinverse solvers by jpbrodrick89 · Pull Request #217 · patrick-kidger/lineax

jpbrodrick89 · 2026-04-10T00:23:36Z

Preamble
First of all, sincere apologies for adding another PR to the backlog. This is definitely not one that would fit in an extension package due to its invasive change of the linear_solve jvp rule, but also is not one I'd mind terribly if you deprioritise reviewing as I don't think its a critical bottleneck in my work. My hope is that submitting it now rather than waiting for the backlog to clear simply means it has more time to swim around in your head for a smoother review when we eventually get there (no pressure! 🙂). Furthermore, this could affect new solvers such as pivoted QR.

Intention of PR
I've been working on variable projection as well as thinking about improving Jax's least square JVP rule and comparing to Lineax for inspiration. I noticed we were missing some low-hanging fruit when it comes to leveraging standard optimisations for projection and Gram operators that are used in the JVP rule.

Dependent columns

The JVP rule has a term $A^\dagger A y$ which is the projection onto the row space. This has the following optimisations:

Wide QR: $A^†A = Q^{-\top} R^{-\top} R^\top Q^\top = \mathrm{conj}(Q) Q^\top$ by unitarity—avoiding a matmul and a triangular solve.
SVD: $A^†A = (U \Sigma V^\top)^†(U \Sigma V^\top) = V \Sigma U^T U \Sigma V^\top = V V^\top$ by unitarity of $U$, saving two O(mn) matmuls.

Dependent rows

The JVP rule has a term $A^\dagger A^{H\dagger} \mathrm{d}A^H(b-Ax)$, where the first two matrices can be written as the (pseudo-)inverse of the Gram matrix $A^\dagger A^{H\dagger}=(A^H A)^\dagger$. This has the following optimisations:

Tall QR: $(A^H A)^\dagger = (R^H Q^H Q R)^{-1} = R^{-1}R^{-H}$ saving two O(mn) matmuls.
SVD: $(V \Sigma U^H U \Sigma V^H)^\dagger=V \Sigma^{-2} V^H$ saving two O(mn) matmuls.
Tall Normal: $A^H A$ IS inner_operator so we just use a single call to the inner_solver and avoid an application of $A$.

Caveat

Savings are not always quite as good as they sound as the "vecs" are summed with others that still exist before applying the outer solve, but the saving from the inner matrix is still very real (i.e. savings are about half what they're advertised to be).

Design

I have introduced two singledispatch functions in _solve.py: _gram_inverse_mv and _row_space_projection which return NotImplemented (NOT raise NotImplementedError) by default to allow fallback to the current path and otherwise allows leveraging these optimisations in the jvp rule.

jpbrodrick89 · 2026-04-28T15:16:04Z

@patrick-kidger happy to own maintenance of the maths and performance here, but i'd appreciate your input on whether singledispatch with default return NotImplemented and falling back with if NotImplemented is a pattern you're happy with before merging?

patrick-kidger

So the usual thing I've done (e.g. in Diffrax) is not to make this extensible (whether via method or singledispatch), but instead to hardcode the particular solvers directly: if isinstance(solver, Normal): ...other code path....

This leaves the 'public API' unchanged, so there's room to make a choice later if/when other considerations arise. And it's edge case enough that realistically this affects nobody by being hardcoded.

patrick-kidger · 2026-04-20T10:33:16Z

+    _gram_inverse_mv,
+    _row_space_projection,


nit, leading underscore means private to the file it's in.

jpbrodrick89 · 2026-04-28T17:32:10Z

Hmm, the reason I didn't do isinstance is because if a user has their own custom solver, e.g. Modified Cholesky or cudSS LDLT (for example, if @johnviljoen's spineax manages to support lineax in the future) then I think the only way for power users to unlock similar optimisations is to monkey patch the linear_solve jvp rule? The only other option I can think of is hasattribute which I don't think I've seen elsewhere in the equinox ecosystem. Methods with default implementations break the golden abstract or final pattern so I won't suggest that. 😅

jpbrodrick89 · 2026-04-29T08:20:03Z

Sorry, I was being stupid here, Modified Cholesky and LDLT do NOT handle the ill-posed/low rank (dependent columns/rows) cases, they just handle indefinite matrices. The main use case would be something like SuiteSparse SPQR, MKL sparse QR which is a bit more niche (and I don't think spineax aims to support those yet, right?). Happy to just go with isinstance if you prefer non-extensibility until there is an explicit request.

patrick-kidger · 2026-04-29T09:47:06Z

Happy to just go with isinstance if you prefer non-extensibility until there is an explicit request.

Okay! If there isn't already a pressing use-case then yup let's go with isinstance (I like the sound of just doing the simple thing!)

jpbrodrick89 · 2026-04-30T16:18:36Z

Converted to draft as I think further iteration of the design is necessary and should be left for a future release. The reason I had as singledispatch originally was because it was useful on a variable projection project I was working on but there's probably a better way and it might be better bringing that lineax main too.

jvp optimisations for pseudoinverse solvers

4198a4a

patrick-kidger reviewed Apr 28, 2026

View reviewed changes

jpbrodrick89 marked this pull request as draft April 30, 2026 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jvp optimisations for pseudoinverse solvers#217

jvp optimisations for pseudoinverse solvers#217
jpbrodrick89 wants to merge 1 commit into
patrick-kidger:mainfrom
jpbrodrick89:jpb/pseudo-jvp

jpbrodrick89 commented Apr 10, 2026

Uh oh!

jpbrodrick89 commented Apr 28, 2026 •

edited

Loading

Uh oh!

patrick-kidger left a comment

Uh oh!

patrick-kidger Apr 20, 2026

Uh oh!

jpbrodrick89 commented Apr 28, 2026

Uh oh!

jpbrodrick89 commented Apr 29, 2026

Uh oh!

patrick-kidger commented Apr 29, 2026

Uh oh!

jpbrodrick89 commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jpbrodrick89 commented Apr 10, 2026

Uh oh!

jpbrodrick89 commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

patrick-kidger left a comment

Choose a reason for hiding this comment

Uh oh!

patrick-kidger Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

jpbrodrick89 commented Apr 28, 2026

Uh oh!

jpbrodrick89 commented Apr 29, 2026

Uh oh!

patrick-kidger commented Apr 29, 2026

Uh oh!

jpbrodrick89 commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jpbrodrick89 commented Apr 28, 2026 •

edited

Loading