
Specialize array overloads #192

Closed
gdalle opened this issue Sep 6, 2024 · 8 comments
Labels
discussion Discuss design decisions / future direction of the package

Comments


gdalle commented Sep 6, 2024

At the moment, `src/overloads/arrays.jl` contains linear algebra functions defined on `AbstractMatrix{<:Tracer}`. Unfortunately, this dispatch will most likely never be hit, because each array type (`Matrix`, `Diagonal`, `SparseMatrixCSC`, etc.) has its own implementation of things like `*` or `det`, and `f(::ConcreteMatrixType{<:Real})` will always take precedence over `f(::AbstractMatrix{<:Tracer})`.
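
To make the dispatch problem concrete, here is a minimal sketch with a toy tracer type and a toy function `f` (illustrative names, not SCT's actual code), mirroring the precedence claim above:

```julia
using LinearAlgebra

struct MyTracer <: Real end                                 # stand-in for an SCT tracer type

f(A::AbstractMatrix{<:MyTracer}) = "tracer overload"        # in the spirit of src/overloads/arrays.jl
f(A::Diagonal{<:Real})           = "concrete array method"  # what Matrix, Diagonal, SparseMatrixCSC, etc. ship

f(Diagonal([MyTracer(), MyTracer()]))  # "concrete array method": the concrete wrapper type wins dispatch
```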

My suggestion: only define these overloads for the basic `Array` types (`Vector` / `Matrix`). That way, we make sure that our methods are actually hit when we want them to be. We also give people who are stuck on failing linear algebra methods an actionable fix: put everything inside a plain `Array` and you should be fine.
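
Continuing the toy definitions above, the suggested workaround would look like this (the `zero` method is a hypothetical addition so the toy tracer can fill structural zeros):

```julia
Base.zero(::Type{MyTracer}) = MyTracer()  # real tracer types already define zero

A = Diagonal([MyTracer(), MyTracer()])
f(A)           # "concrete array method": the tracer overload is bypassed
f(collect(A))  # collect(A) isa Matrix{MyTracer}, so the tracer overload is guaranteed to be hit
```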

@adrhill added the discussion label on Sep 6, 2024

gdalle commented Sep 6, 2024

Related issues which are hopeless on abstract arrays:


adrhill commented Sep 6, 2024

On top of their limited use, array overloads also require a huge amount of effort to write and test.

An alternative mentioned in #144 is to wrap all array overloads in a function that leverages metaprogramming to generate methods on `XYZArray{<:AbstractTracer}`; a rough sketch of this follows the list below. However, this approach is limited to single-argument functions on arrays. As mentioned in #133, multi-argument functions on arrays are even more of a pain. Quoting myself:

> Matrix multiplication is already complex enough for simple `Matrix` and `Vector`.
>
> It requires methods for:
>
> 1. Matrix of tracers * Matrix of tracers
> 2. Matrix of tracers * Vector of tracers
> 3. Transposed vector of tracers * Matrix of tracers
> 4. Transposed vector of tracers * Vector of tracers
> 5. Matrix of reals * Matrix of tracers
> 6. Matrix of reals * Vector of tracers
> 7. Transposed vector of reals * Matrix of tracers
> 8. Transposed vector of reals * Vector of tracers
> 9. Matrix of tracers * Matrix of reals
> 10. Matrix of tracers * Vector of reals
> 11. Transposed vector of tracers * Matrix of reals
> 12. Transposed vector of tracers * Vector of reals
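
For reference, a rough sketch of what the #144-style code generation could look like for single-argument functions (the tracer type, the function and wrapper lists, and the `merge_tracers` helper are illustrative, not the actual proposal):

```julia
using LinearAlgebra, SparseArrays

abstract type AbstractTracer <: Real end  # stand-in for SCT's tracer hierarchy

# hypothetical helper that would merge the tracers of all entries into a single output tracer
merge_tracers(A) = first(A)               # placeholder body, just for the sketch

# generate e.g. det(::Diagonal{<:AbstractTracer}), logdet(::SparseMatrixCSC{<:AbstractTracer}), ...
for W in (:Matrix, :Symmetric, :Diagonal, :SparseMatrixCSC), fn in (:det, :logdet)
    @eval LinearAlgebra.$fn(A::$W{<:AbstractTracer}) = merge_tracers(A)
end
```

As the quote above points out, this pattern only scales to single-argument functions; binary operations like `*` would still need the combinatorial list of methods.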


adrhill commented Sep 6, 2024

> My suggestion: only define these overloads for the basic `Array` types (`Vector` / `Matrix`).

Arguably, a codebase that has a use for SCT is written in a sparse manner and will rarely use the non-sparse `Vector` and `Matrix` types. Instead, it's more likely to perform scalar operations or use types from SparseArrays.


gdalle commented Sep 6, 2024

> Arguably, a codebase that has a use for SCT is written in a sparse manner and will rarely use the non-sparse `Vector` and `Matrix` types. Instead, it's more likely to perform scalar operations or use types from SparseArrays.

Not true. SCT materializes the sparsity pattern of the Jacobian, but inside the traced code that Jacobian never needs to exist at all. For example, the Brusselator gives rise to a sparse Jacobian without ever creating a `SparseMatrixCSC`. The same goes for the Conv layer.
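
As a minimal sketch of the kind of code meant here (a 1-D stencil, not the actual Brusselator, and assuming SCT's `TracerSparsityDetector` / `jacobian_sparsity` interface):

```julia
using SparseConnectivityTracer

# purely scalar indexing and arithmetic; the (tridiagonal) Jacobian never exists inside this function
function laplacian(u)
    n = length(u)
    du = similar(u)
    du[1] = u[2] - 2u[1]
    du[n] = u[n - 1] - 2u[n]
    for i in 2:(n - 1)
        du[i] = u[i - 1] - 2u[i] + u[i + 1]
    end
    return du
end

jacobian_sparsity(laplacian, rand(10), TracerSparsityDetector())  # tridiagonal pattern detected
```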


gdalle commented Sep 6, 2024

> As mentioned in #133, multi-argument functions on arrays are even more of a pain.

Yeah, we definitely don't want to go down the ReverseDiff road of generating truckloads of methods for combinations drawn from long lists of types. It's extremely brittle, and it even broke the package's tests for a good long while (JuliaDiff/ReverseDiff.jl#242).


adrhill commented Sep 6, 2024

> For example, the Brusselator gives rise to a sparse Jacobian without ever creating a `SparseMatrixCSC`.

Sure, but the Brusselator falls into my first category of functions:

> Instead, it's more likely to perform scalar operations

And I'm not convinced Conv layers are a common use case for sparsity detection. I put them in the README because they nicely demonstrate how generic our code is.
In fact, I'm not sure the NNlib implementation of generic convolution uses matrix multiplication either; I'm pretty sure that example predates #131. I think it also falls into the category of functions using scalar operations.


gdalle commented Sep 6, 2024

Sorry, I had missed the "perform scalar operations" part.


adrhill commented Oct 21, 2024

Closing for now: this would require new code-generation utilities and complicate testing, and the need for it hasn't arisen yet.
