Conversation
GarethCabournDavies
left a comment
I think this looks good, and as you say, if defaults are used it reverts to the earlier behaviour, which is good as well.
There are a few places with minor code-styling changes / questions.
I also don't understand why a couple of lines have changed from a variable (len(...)) to a constant (2).
#keep[args.max_connections//2: -args.max_connections//2] = False
keep[args.max_connections:] = False

hp.matches = msorted[keep].copy()
Does this array copy not now increase the memory footprint, particularly in the max_connections infinite / default case?
Actually it's the opposite; otherwise a loose reference could keep something in memory when we don't really need it anymore.
A further clarification may help here. If you just return a view, the full array is always kept alive, even if the view is all you use. I explicitly want the reduced form to be what is stored, not the full array, hence explicitly asking for a new array that contains a copy of what the view pointed to. This means that the original values can be cleaned up by Python's garbage collector.
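To illustrate the view-vs-copy point (a minimal numpy sketch, not code from this PR): a slice is a view that keeps the parent buffer alive via its `.base` attribute, while `.copy()` produces an independent array, so the parent can be garbage-collected once no views remain.

```python
import numpy

big = numpy.arange(10_000_000, dtype=numpy.float64)  # ~80 MB parent array
view = big[:10]         # a view: keeps the whole parent buffer alive
copy = big[:10].copy()  # an independent 10-element array

# The view references the parent through .base; the copy does not.
assert view.base is big
assert copy.base is None

# After `del big`, the ~80 MB buffer is only freed once `view` is also
# gone; `copy` never pins it.
```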
# Defensive initialization if matches/indices are missing
if not hasattr(hc, 'matches'):
-    hc.matches = numpy.empty(len(self))
+    hc.matches = numpy.empty(2)
I'm not sure I understand this change.
In truth, I'd like to remove these lines; I can't really figure out why they are needed. My only guess is that they are masking some boundary issue. In any case, there is no reason to allocate these at the full size of the existing bank: since the information added is meaningless, it ends up being a waste of memory. Putting it at 2 was simply a lazy way to force the correct dimensions.
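For scale, a hypothetical sketch (the `bank_size` value is illustrative, not from the code): sizing the placeholder to the full bank spends megabytes on meaningless values, while a fixed shape of 2 keeps the correct dimensionality at negligible cost.

```python
import numpy

bank_size = 1_000_000  # hypothetical size of an existing bank

full = numpy.empty(bank_size)  # placeholder sized to the whole bank: 8 MB of garbage
small = numpy.empty(2)         # same defensive purpose at 16 bytes

assert full.nbytes == bank_size * 8  # float64 default: 8 bytes per entry
assert small.nbytes == 16
```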
parser.add_argument('--tau0-cutoff-frequency', type=float, default=15.0)
parser.add_argument('--nprocesses', type=int, default=1,
    help='Number of processes to use for waveform generation parallelization. If not given then only a single core will be used.')
parser.add_argument('--parallel-check', action='store_true', help="Do bank checking parallel, note that this means that proposals WILL NOT be checked against each other.")
Could this line be wrapped to match maximum line length requirements?
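One possible wrapping, purely as a suggestion for the line quoted above (implicit string concatenation keeps the help text unchanged):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument('--parallel-check', action='store_true',
                    help="Do bank checking parallel, note that this means "
                         "that proposals WILL NOT be checked against each "
                         "other.")
```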
if hp.checked or force_add:
    num_added += 1
    self.insert(hp)

logging.info("Waveform generation failed!")
continue
@GarethCabournDavies I'll address the style issues before merging. |
Standard information about the request
This is a feature update for pycbc brute bank.
This affects bank generation, so could affect use in all searches, though only indirectly.
Motivation
The main motivation is to improve the computational efficiency (both wall-clock time and memory use) of pycbc_brute_bank.
Contents
There are three feature changes.
The option to do match calculations / proposal-point checking in parallel. This does affect the behavior and is not an entirely drop-in option: it enables parallelization by assuming that proposals do not need to be checked against each other. That means if two proposals were identical points, they would both end up getting added to the bank, since no check between them is done. However, I believe that in most cases this is not really a problem. Without this option, requesting multiple cores will only parallelize bank generation, as is currently the case. I think that is the safest default for the general case.
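The caveat above can be sketched as follows (a toy illustration with made-up names: `passes_bank_check` stands in for the real match calculation, and threads stand in for the worker processes). Each proposal in a batch is checked against the existing bank only, never against the other proposals, so duplicate proposals can both be accepted.

```python
from concurrent.futures import ThreadPoolExecutor

def passes_bank_check(proposal, bank):
    # Hypothetical distance test standing in for the real match calculation.
    return all(abs(proposal - t) > 0.1 for t in bank)

def parallel_check(proposals, bank, nworkers=2):
    # Each proposal is checked against the existing bank only, never against
    # the other proposals in the batch, so two identical proposals can both
    # pass and both be added to the bank.
    with ThreadPoolExecutor(nworkers) as pool:
        results = list(pool.map(lambda p: passes_bank_check(p, bank),
                                proposals))
    return [p for p, ok in zip(proposals, results) if ok]

bank = [0.0, 1.0]
accepted = parallel_check([0.5, 0.5], bank)  # both identical proposals pass
```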
The option to limit the amount of stored information on the inter-bank matches (through the option --max-connections XX). By default this is set to infinite, which matches the old behavior. If set to some number, only the most important matches are actually stored for each template (in the way we do things, these are actually the 'low' matches). This means that if you get a particularly large bank, you can avoid being dominated in memory usage by the connections themselves instead of the cached waveforms. There are cases where this can result in a 5-10x memory reduction (again, only for a large bank region) without a noticeable change to the resulting bank size. It could potentially increase the number of matches required, though, if set too small.
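A minimal sketch of the pruning idea, following the keep-mask lines visible in the diff (the match values here are made up): after sorting, only the lowest `max_connections` matches are retained.

```python
import numpy

max_connections = 3  # the --max-connections cap
matches = numpy.array([0.91, 0.40, 0.75, 0.55, 0.98, 0.62])

# Sort so the most important (lowest) matches come first,
# then mask off everything past the cap.
msorted = numpy.sort(matches)
keep = numpy.ones(len(msorted), dtype=bool)
keep[max_connections:] = False

# Boolean indexing already yields a new array; the explicit .copy()
# mirrors the PR's intent that no view pins the full sorted array.
stored = msorted[keep].copy()
```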
The option to save the bank even on early termination. If you send a SIGTERM or ctrl-c (interrupt) signal to the job, it will now try to intercept it and do an emergency save of the bank.
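A sketch of how such an intercept can work (`install_emergency_save` and `save_fn` are illustrative names, not the PR's actual code): a handler registered for SIGTERM and SIGINT saves the bank, then exits.

```python
import signal

def install_emergency_save(save_fn):
    """Install SIGTERM/SIGINT handlers that save the bank before exiting."""
    def handler(signum, frame):
        save_fn()            # emergency save of the bank
        raise SystemExit(1)  # then shut down
    signal.signal(signal.SIGTERM, handler)
    signal.signal(signal.SIGINT, handler)
```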
Links to any issues or associated PRs
This should be merged first #5227
Testing performed
I've verified that, by default, banks are not modified; the defaults revert to the old behavior.