A new gauge fixing algorithm which returns the rotation field. #1481

SaltyChiang · 2024-07-23T12:25:24Z

I implemented a new over-relaxation gauge-fixing algorithm. The major difference between the new and the old implementations is we can obtain the rotation field now. Our workflow often requires this.

The rotation field $g(x)$ is defined as follows:
$$U^\prime_\mu(x)=g(x)U_\mu(x)g^\dagger(x+\hat{\mu})$$

SaltyChiang · 2024-07-23T12:31:35Z

I'm wondering if I should just override the old interface computeGaugeFixingOVRQuda or add a new interface function. Please let me know if you have any considerations.

I tested the performance and didn't see any significant performance regression compared to the old implementation. But more testing is definitely needed.

maddyscientist · 2024-08-13T21:16:47Z

@Jenkins test this please

maddyscientist · 2024-08-13T21:22:03Z

Thanks for this PR @SaltyChiang. I don't think there should be any performance change, so I think we could just replace the old interface with the new one.

Can you go ahead and make this change to your PR? I think you'll need to update the MILC interface code (milc_interface.cpp) since that code calls the older interface, but that should be an easy change.

…e-fixing-ovr

…uda`.

…e-fixing-ovr

Remove `spinorRotate`.

…e-fixing-ovr

SaltyChiang · 2025-05-09T09:17:01Z

Some additional explanation about the distance preconditioning:
The source vector will get a very small norm after reweighting, so I force the reweighted vector to be normalized. Sometimes we have very large Nt, and the value of cosh(alpha*(t-t0)) will be too large/small for fp32 (>2e38 or <1e-38), which causes nan values. So I force the weighting function to return fp64 values.

SaltyChiang · 2025-05-09T09:50:26Z

The performance could be further improved by hiding communication time during computation, and the old version of gauge fixing divided the points into two parts called "Border" and "Int" to implement it. The current performance of the gauge fixing cannot beat the old one in some situations (for example, reunit_interval not very small), so I decided not to remove the existing algorithm.

I did not write a kernel similar to the old one, since it's not good for readability. I think a similar optimization could be applied to the new implementation by using a special dslash kernel working on a special spinor with Ns=3 and Nc=3. I want to work on other topics and will leave the code here for now.

maddyscientist

Thanks for all this work @SaltyChiang. This looks like a great contribution.

The main request I have for you is that we would need some unit testing added for these new features.

For the new gauge fixing functionality, could you add a test for this in the gauge_alg_test?
Do you have any thoughts on how to test the shift-only covariant derivative?

maddyscientist · 2025-05-09T23:32:52Z

include/quda.h

@@ -140,6 +142,8 @@ extern "C" {

    int laplace3D; /**< omit this direction from laplace operator: x,y,z,t -> 0,1,2,3 (-1 is full 4D) */
    int covdev_mu; /**< Apply forward/backward covariant derivative in direction mu(mu<=3)/mu-4(mu>3) */
+    bool covdev_shift; /**< Apply the shift instead of the covariant derivative */
+    bool staggered;    /**< If the input field is staggered or not for Laplace and CovDev */


Perhaps a more descriptive variable name is needed here, instead of just staggered?

Both COVDEV and LAPLACE will use it, and I thought something like covdev_laplace_staggered/covdev_laplace_nspin looks terrible. Another choice is to use two variables like covdev_nspin and laplace_nspin. Which one do you prefer?

I use laplace_nspin and covdev_nspin instead. They are initialized as 1 and 4 respectively to keep the former behavior.

maddyscientist · 2025-05-09T23:44:08Z

The performance could be further improved by hiding communication time during computation, and the old version of gauge fixing divided the points into two parts called "Border" and "Int" to implement it. The current performance of the gauge fixing cannot beat the old one in some situations (for example, reunit_interval not very small), so I decided not to remove the existing algorithm.

I did not write a kernel similar to the old one, since it's not good for readability. I think a similar optimization could be applied to the new implementation by using a special dslash kernel working on a special spinor with Ns=3 and Nc=3. I want to work on other topics and will leave the code here for now.

Yes, the old version of the code that overlaps comms and compute, while efficient, is horrible to read. Fine to have both versions of the code for now, and for correctness testing, having the two versions is not a bad thing anyway. 😄

SaltyChiang · 2025-05-10T03:50:39Z

I noticed the name covdev_mu is also used in covdev_test.cpp but has a different meaning from that in QudaInvertParam (I added it in the previous PR). I think it looks a bit ambiguous. Maybe another name, such as --test-mu, is better?

Add init/check/print for `QudaGaugeFixParam`.

SaltyChiang · 2025-05-11T10:45:31Z

@maddyscientist Tests for new gauge fixing and shift-only covdev are added.

fp32 testing for gauge fixing causes nan in the new gauge fixing algorithm. Using double versors is a workaround.
The shift-only covariant derivative is tested by comparing the shift and normal covariant derivative results with a unit gauge field. They should be the same.

…ered`.

…e-fixing-ovr

SaltyChiang added 3 commits July 23, 2024 19:34

Add new gauge fixing algorithm to return the rotation field.

f7dcc56

Add over relaxation.

2db02fb

Make compiler happy.

39844da

SaltyChiang requested review from a team as code owners July 23, 2024 12:25

SaltyChiang added 3 commits July 24, 2024 00:32

Add docstring.

07ec4f7

Don't return double2 struct.

3f98f5d

Add new members to QudaGaugeParam to control the gauge fixing.

6420ec3

Merge remote-tracking branch 'upstream/develop' into feature/new-gaug…

7e168a2

…e-fixing-ovr

SaltyChiang marked this pull request as draft April 30, 2025 14:42

SaltyChiang added 9 commits May 1, 2025 23:44

Add GaugeFixParam for gauge fixing algorithms.

5f6ff83

Add gaugeRotateQuda and spinorRotateQuda interface.

a4cb908

Enable shift only mode for the covariant derivative kernel.

ec06e81

Use gaugePrecise for performGaugeFixQuda and `performGaugeRotateQ…

e8e69a8

…uda`.

Merge remote-tracking branch 'upstream/develop' into feature/new-gaug…

91f6b4c

…e-fixing-ovr

Update aux string.

4603047

Remove `spinorRotate`.

Fusing gaugeRotate and gaugeFixQuality into a single kernel.

377dfdc

Fix possible divergence if distance preconditioning is used.

8a996fe

Merge remote-tracking branch 'upstream/develop' into feature/new-gaug…

5c6b0fa

…e-fixing-ovr

SaltyChiang marked this pull request as ready for review May 9, 2025 09:17

maddyscientist requested changes May 9, 2025

View reviewed changes

SaltyChiang added 2 commits May 10, 2025 12:33

Fix aux strings.

24a11d1

Add init/check/print for `QudaGaugeFixParam`.

Add gauge fixing test v2 to gauge_alg_test.cpp.

598eb4d

SaltyChiang added 2 commits May 11, 2025 16:30

Fix potential nan for gauge fixing with fp32.

30547b7

Add the test for shift-only mode in QUDA_COVDEV_DSLASH.

4edc1ba

SaltyChiang added 6 commits May 11, 2025 18:52

Apply clang-format.

5ab6ad6

Use laplace_nspin and covdev_nspin instead of an ambiguous `stagg…

5771a34

…ered`.

Fix a bug in covdev_test.

6ceef5b

Merge remote-tracking branch 'upstream/develop' into feature/new-gaug…

1d0acea

…e-fixing-ovr

Merge branch 'develop' into feature/new-gauge-fixing-ovr

9f6249c

Fix type of alpha0.

3f8e2f6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

A new gauge fixing algorithm which returns the rotation field. #1481

A new gauge fixing algorithm which returns the rotation field. #1481

Uh oh!

SaltyChiang commented Jul 23, 2024 •

edited

Loading

Uh oh!

SaltyChiang commented Jul 23, 2024 •

edited

Loading

Uh oh!

maddyscientist commented Aug 13, 2024

Uh oh!

maddyscientist commented Aug 13, 2024

Uh oh!

SaltyChiang commented May 9, 2025

Uh oh!

SaltyChiang commented May 9, 2025

Uh oh!

maddyscientist left a comment

Uh oh!

maddyscientist May 9, 2025

Uh oh!

SaltyChiang May 10, 2025

Uh oh!

SaltyChiang May 16, 2025 •

edited

Loading

Uh oh!

maddyscientist commented May 9, 2025

Uh oh!

SaltyChiang commented May 10, 2025

Uh oh!

SaltyChiang commented May 11, 2025

Uh oh!

Uh oh!

A new gauge fixing algorithm which returns the rotation field. #1481

Are you sure you want to change the base?

A new gauge fixing algorithm which returns the rotation field. #1481

Uh oh!

Conversation

SaltyChiang commented Jul 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SaltyChiang commented Jul 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maddyscientist commented Aug 13, 2024

Uh oh!

maddyscientist commented Aug 13, 2024

Uh oh!

SaltyChiang commented May 9, 2025

Uh oh!

SaltyChiang commented May 9, 2025

Uh oh!

maddyscientist left a comment

Choose a reason for hiding this comment

Uh oh!

maddyscientist May 9, 2025

Choose a reason for hiding this comment

Uh oh!

SaltyChiang May 10, 2025

Choose a reason for hiding this comment

Uh oh!

SaltyChiang May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

maddyscientist commented May 9, 2025

Uh oh!

SaltyChiang commented May 10, 2025

Uh oh!

SaltyChiang commented May 11, 2025

Uh oh!

Uh oh!

SaltyChiang commented Jul 23, 2024 •

edited

Loading

SaltyChiang commented Jul 23, 2024 •

edited

Loading

SaltyChiang May 16, 2025 •

edited

Loading