Return a BandedMatrix from a view of a BandedBlockBandedMatrix #223

dlfivefifty · 2025-03-26T12:18:10Z

@MikaelSlevinsky This gets rid of the allocations (for lazy axes at least).

Still need to figure out why it's slow.

julia> for n in 20:20:200
                  ax = BlockArrays.BlockedOneTo(ArrayLayouts.RangeCumsum(Base.OneTo(n)))
                  D = BandedBlockBandedMatrix{Float64}(I, (ax,ax), (0, 0), (0, 0))
                  x = BlockVector(randn(sum(1:n)), (ax,))
                  y = zero(x)
                  @time my_special_mul_through_data!(y, D, x);
                  @time mul!(y, D, x);
              end
  0.000002 seconds
  0.000066 seconds (1 allocation: 32 bytes)
  0.000004 seconds
  0.000035 seconds (1 allocation: 32 bytes)
  0.000007 seconds
  0.000045 seconds (1 allocation: 32 bytes)
  0.000011 seconds
  0.000056 seconds (1 allocation: 32 bytes)
  0.000016 seconds
  0.000140 seconds (1 allocation: 32 bytes)
  0.000023 seconds
  0.000172 seconds (1 allocation: 32 bytes)
  0.000057 seconds
  0.000179 seconds (1 allocation: 32 bytes)
  0.000041 seconds
  0.000316 seconds (1 allocation: 32 bytes)
  0.000067 seconds
  0.000180 seconds (1 allocation: 32 bytes)
  0.000062 seconds
  0.000212 seconds (1 allocation: 32 bytes)

codecov · 2025-03-26T12:24:39Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.06%. Comparing base (629276d) to head (159d05f).
Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #223      +/-   ##
==========================================
- Coverage   88.16%   86.06%   -2.10%     
==========================================
  Files          11       11              
  Lines        1115     1134      +19     
==========================================
- Hits          983      976       -7     
- Misses        132      158      +26

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

dlfivefifty · 2025-04-22T09:46:57Z

@MikaelSlevinsky With this (and the just tagged BlockArrays v1.6.2) we have the following:

julia> using AppleAccelerate # needed so banded matrices aren't insanely slow

julia> for n in 20:20:200
                  ax = BlockArrays.BlockedOneTo(ArrayLayouts.RangeCumsum(Base.OneTo(n)))
                  D = BandedBlockBandedMatrix{Float64}(I, (ax,ax), (0, 0), (0, 0))
                  x = BlockVector(randn(sum(1:n)), (ax,))
                  y = zero(x)
                  @time my_special_mul_through_data!(y, D, x);
                  @time my_special_mul!(y, D, x);
                  @time my_special_mul_with_a_view!(y, D, x);
                  @time mul!(y, D, x);
              end
  0.000003 seconds
  0.000011 seconds (80 allocations: 5.938 KiB)
  0.000003 seconds
  0.000012 seconds (2 allocations: 64 bytes)
  0.000002 seconds
  0.000008 seconds (160 allocations: 18.219 KiB)
  0.000002 seconds
  0.000026 seconds (2 allocations: 64 bytes)
  0.000002 seconds
  0.000012 seconds (240 allocations: 37.188 KiB)
  0.000003 seconds
  0.000007 seconds (2 allocations: 64 bytes)
  0.000003 seconds
  0.000017 seconds (320 allocations: 62.438 KiB)
  0.000006 seconds
  0.000011 seconds (2 allocations: 64 bytes)
  0.000004 seconds
  0.000029 seconds (400 allocations: 94.500 KiB)
  0.000009 seconds
  0.000011 seconds (2 allocations: 64 bytes)
  0.000005 seconds
  0.000034 seconds (480 allocations: 133.469 KiB)
  0.000013 seconds
  0.000015 seconds (2 allocations: 64 bytes)
  0.000007 seconds
  0.000039 seconds (560 allocations: 178.156 KiB)
  0.000013 seconds
  0.000020 seconds (2 allocations: 64 bytes)
  0.000009 seconds
  0.000049 seconds (640 allocations: 229.531 KiB)
  0.000022 seconds
  0.000027 seconds (2 allocations: 64 bytes)
  0.000012 seconds
  0.000052 seconds (720 allocations: 287.469 KiB)
  0.000059 seconds
  0.000034 seconds (2 allocations: 64 bytes)
  0.000017 seconds
  0.000072 seconds (800 allocations: 351.938 KiB)
  0.000029 seconds
  0.000045 seconds (2 allocations: 64 bytes)

So we have got rid of almost all allocations. The remaining speed difference is due to gbmv! being too slow for diagonal matrices:

julia> n = 10_000; D = BandedMatrix(0 => 1:n); x = randn(n); @time D*x;
  0.000042 seconds (3 allocations: 78.188 KiB)

julia> D = Diagonal(1:n); x = randn(n); @time D*x;
  0.000010 seconds (3 allocations: 78.188 KiB)

We could special case diagonal blocks not to call BLAS... actually I suspect the fastest would be to get rid of all calls to banded BLAS and just do naive for loops....

MikaelSlevinsky · 2025-04-22T13:21:10Z

Awesome, thanks!

dlfivefifty · 2025-04-22T15:34:41Z

The reason ApproxFun is broken is that

https://github.com/JuliaApproximation/ApproxFunBase.jl/blob/569ff3c493c3f288214dfc874a1e32be7a9ce566/src/PDE/KroneckerOperator.jl#L321

is assuming that view returns a SubArray that can be reindexed. This is arguably a bad design.

@jishnub thoughts?

Return a BandedMatrix from a view of a BandedBlockBandedMatrix

cfc75eb

MikaelSlevinsky mentioned this pull request Mar 31, 2025

This adds methods to compute bivariate Gram matrices. JuliaApproximation/FastTransforms.jl#261

Open

dlfivefifty added 2 commits April 16, 2025 21:07

Update BandedBlockBandedMatrix.jl

0fc3234

tests pass

159d05f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return a BandedMatrix from a view of a BandedBlockBandedMatrix #223

Return a BandedMatrix from a view of a BandedBlockBandedMatrix #223

dlfivefifty commented Mar 26, 2025

codecov bot commented Mar 26, 2025 •

edited

Loading

dlfivefifty commented Apr 22, 2025

MikaelSlevinsky commented Apr 22, 2025

dlfivefifty commented Apr 22, 2025

Return a BandedMatrix from a view of a BandedBlockBandedMatrix #223

Are you sure you want to change the base?

Return a BandedMatrix from a view of a BandedBlockBandedMatrix #223

Conversation

dlfivefifty commented Mar 26, 2025

codecov bot commented Mar 26, 2025 • edited Loading

Codecov Report

dlfivefifty commented Apr 22, 2025

MikaelSlevinsky commented Apr 22, 2025

dlfivefifty commented Apr 22, 2025

codecov bot commented Mar 26, 2025 •

edited

Loading