Adaptive time-stepping for transient solver using SUNDIALS #292
Conversation
…ve explicit RK integrator
Looking good. There are a few smaller structural issues, but the two bigger things I am thinking about:

a) Can the `RUNGE_KUTTA` option instead use the MFEM internal time integrators? That way the SUNDIALS connection is really only used for the adaptivity. This would slightly tidy the interface for adaptivity, along with making the non-adaptive path slightly more fully featured. There are some SDIRK options in there that would probably be good choices (see the sketch below).

b) Is there a way to compute `B` implicitly as part of this process?

I have some of my suggestions on hughcars/transient-adapt-dt if you want to take a look.
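A minimal sketch of what (a) might look like, assuming the PR's first-order operator implements ImplicitSolve(); the names here are illustrative, not from the PR:

// Drive the fixed-step path with one of MFEM's built-in SDIRK integrators so
// that SUNDIALS is only pulled in when adaptivity is requested. `op` stands in
// for the PR's first-order [E, dE/dt] TimeDependentOperator.
#include <mfem.hpp>
#include <memory>

std::unique_ptr<mfem::ODESolver> MakeFixedStepSolver(mfem::TimeDependentOperator &op)
{
  // L-stable, 3-stage, third-order SDIRK; other implicit choices live in
  // mfem/linalg/ode.hpp (SDIRK23Solver, SDIRK34Solver, BackwardEulerSolver, ...).
  std::unique_ptr<mfem::ODESolver> ode = std::make_unique<mfem::SDIRK33Solver>();
  ode->Init(op);
  return ode;
}

// Usage (fixed dt): auto ode = MakeFixedStepSolver(op);
//                   for (int step = 0; step < nsteps; step++) { ode->Step(u, t, dt); }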
En += E;
Curl->AddMult(En, B, -0.5 * dt);
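For context, assuming `En` holds $E^{n} + E^{n+1}$ after the `En += E;` line, these two lines amount to a trapezoidal update of Faraday's law $\partial B/\partial t = -\nabla \times E$ with the discrete curl:

$$
B^{n+1} = B^{n} - \frac{\Delta t}{2}\, \mathrm{Curl}\,\bigl(E^{n} + E^{n+1}\bigr).
$$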
Is there a better way to be computing `B` as part of this?
I don't think so. We solve the ODE system for `E` and `Edot`; I'm not sure how else to get `B`.
FYI you can, but you have to use the (more standard?) `[E, B]` linearization instead of the `[E, dE/dt]` one. This is described in Zhu and Cangellaris and should permit the same linear system for `E` after 2x2 block elimination.
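For reference, a schematic of that elimination (my notation, not from the PR; lossless case, with $M$ the mass matrix, $C$ the discrete curl, and $K = C^{\mathsf T} \mu^{-1} C$ the curl-curl stiffness). Backward Euler applied to the first-order `[E, B]` system

$$
\frac{\partial B}{\partial t} = -\,C E, \qquad
M \frac{\partial E}{\partial t} = C^{\mathsf T} \mu^{-1} B - J(t)
$$

gives

$$
B^{n+1} = B^{n} - \Delta t\, C E^{n+1}, \qquad
M E^{n+1} = M E^{n} + \Delta t \left( C^{\mathsf T} \mu^{-1} B^{n+1} - J^{n+1} \right),
$$

and eliminating $B^{n+1}$ leaves

$$
\left( M + \Delta t^{2} K \right) E^{n+1}
  = M E^{n} + \Delta t\, C^{\mathsf T} \mu^{-1} B^{n} - \Delta t\, J^{n+1},
$$

i.e. the same $M + \Delta t^{2} K$ type of system that the second-order `[E, dE/dt]` formulation ends up solving (damping/loss terms omitted for brevity).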
I meant there's no easy way in this `[E, dE/dt]` linearization. I actually tried the `[E, B]` formulation first, but I was seeing some weird things: small but persistent solution differences compared to the second-order ODE formulation, error vs. dt trends not matching the expected order of accuracy, and some stability issues on complex cases. Not sure if it was an issue with the `MixedVectorWeakCurl` operator, some BC consideration (?), or, most likely, me doing something wrong. We're thinking of moving forward with the `[E, dE/dt]` formulation for now, but I will try to revisit this later.
palace/models/timeoperator.cpp
Outdated
Vector du1, du2, rhs1, rhs2;
du1.UseDevice(true);
du2.UseDevice(true);
rhs1.UseDevice(true);
rhs2.UseDevice(true);
du.GetSubVector(idx1, du1);
du.GetSubVector(idx2, du2);
RHS.GetSubVector(idx1, rhs1);
RHS.GetSubVector(idx2, rhs2);
Why do you need `idx1` or `idx2` here rather than the approach taken in `ComplexVector` of using `ReadWrite()`?
du.ReadWrite();
Vector du1(du.GetData() + 0, size_E), du2(du.GetData() + size_E, size_E);
RHS.ReadWrite();
Vector rhs1(RHS.GetData() + 0, size_E), rhs2(RHS.GetData() + size_E, size_E);
Also probably worth making `RHS` into `rhs` or `rhsX` into `RHSX` for consistency.
I tried that first but I was getting errors like this when running on GPU:
Verification failed: (it != maps->memories.end()) is false:
--> host pointer is not registered: h_ptr = 0x2af94870
... in function: static void mfem::MemoryManager::CheckHostMemoryType_(mfem::MemoryType, void*, bool)
... in file: /data/home/simlap/palace/build/extern/mfem/general/mem_manager.cpp:1714
Then I saw this page https://mfem.org/gpu-support/ warning against `GetData()` on GPU.
u.Read();
u1.MakeRef(const_cast<Vector &>(u), 0, size_E);
u2.MakeRef(const_cast<Vector &>(u), size_E, size_E);
rhs.ReadWrite();
rhs1.MakeRef(rhs, 0, size_E);
rhs2.MakeRef(rhs, size_E, size_E);
Nice! The `const_cast` isn't my favorite here, as it means `u1` and `u2` aren't reflecting the safety constraints on `u`. I have a trick I think should work to address this using structured bindings:
diff --git a/palace/models/timeoperator.cpp b/palace/models/timeoperator.cpp
index ba5eb184..e12266bc 100644
--- a/palace/models/timeoperator.cpp
+++ b/palace/models/timeoperator.cpp
@@ -18,6 +18,28 @@ namespace palace
namespace
{
+// Helper method for assembling a pair of vectors as references to each half of a vector.
+std::pair<Vector, Vector> AssemblePairedRef(Vector &x, int size)
+{
+ Vector x1, x2;
+ x1.UseDevice(true);
+ x2.UseDevice(true);
+ x1.MakeRef(x, 0, size);
+ x2.MakeRef(x, size, size);
+ return {x1, x2};
+};
+
+// Helper method for assembling a pair of vectors as references to each half of a vector.
+std::pair<const Vector, const Vector> AssemblePairedRef(const Vector &x, int size)
+{
+ Vector x1, x2;
+ x1.UseDevice(true);
+ x2.UseDevice(true);
+ x1.MakeRef(const_cast<Vector &>(x), 0, size);
+ x2.MakeRef(const_cast<Vector &>(x), size, size);
+ return {x1, x2};
+};
+
class TimeDependentFirstOrderOperator : public mfem::TimeDependentOperator
{
public:
@@ -103,17 +125,8 @@ public:
// Form the RHS for the first-order ODE system
void FormRHS(const Vector &u, Vector &rhs) const
{
- Vector u1, u2, rhs1, rhs2;
- u1.UseDevice(true);
- u2.UseDevice(true);
- rhs1.UseDevice(true);
- rhs2.UseDevice(true);
- u.Read();
- u1.MakeRef(const_cast<Vector &>(u), 0, size_E);
- u2.MakeRef(const_cast<Vector &>(u), size_E, size_E);
- rhs.ReadWrite();
- rhs1.MakeRef(rhs, 0, size_E);
- rhs2.MakeRef(rhs, size_E, size_E);
+ const auto [u1, u2] = AssemblePairedRef(u, size_E);
+ auto [rhs1, rhs2] = AssemblePairedRef(rhs, size_E);
// u1 = Edot, u2 = E
// rhs1 = -(K * u2 + C * u1) - J(t)
@@ -140,17 +153,8 @@ public:
}
FormRHS(u, RHS);
- Vector du1, du2, RHS1, RHS2;
- du1.UseDevice(true);
- du2.UseDevice(true);
- RHS1.UseDevice(true);
- RHS2.UseDevice(true);
- du.ReadWrite();
- du1.MakeRef(du, 0, size_E);
- du2.MakeRef(du, size_E, size_E);
- RHS.ReadWrite();
- RHS1.MakeRef(RHS, 0, size_E);
- RHS2.MakeRef(RHS, size_E, size_E);
+ auto [du1, du2] = AssemblePairedRef(du, size_E);
+ auto [RHS1, RHS2] = AssemblePairedRef(RHS, size_E);
kspM->Mult(RHS1, du1);
du2 = RHS2;
@@ -171,17 +175,8 @@ public:
Mpi::Print("\n");
FormRHS(u, RHS);
- Vector k1, k2, RHS1, RHS2;
- k1.UseDevice(true);
- k2.UseDevice(true);
- RHS1.UseDevice(true);
- RHS2.UseDevice(true);
- k.ReadWrite();
- k1.MakeRef(k, 0, size_E);
- k2.MakeRef(k, size_E, size_E);
- RHS.ReadWrite();
- RHS1.MakeRef(RHS, 0, size_E);
- RHS2.MakeRef(RHS, size_E, size_E);
+ auto [k1, k2] = AssemblePairedRef(k, size_E);
+ auto [RHS1, RHS2] = AssemblePairedRef(RHS, size_E);
// A k1 = RHS1 - dt K RHS2
K->AddMult(RHS2, RHS1, -dt);
@@ -215,18 +210,11 @@ public:
// Solve (Mass - dt Jacobian) x = Mass b
int SUNImplicitSolve(const Vector &b, Vector &x, double tol) override
{
- Vector b1, b2, x1, x2, RHS1;
- b1.UseDevice(true);
- b2.UseDevice(true);
- x1.UseDevice(true);
- x2.UseDevice(true);
+ const auto [b1, b2] = AssemblePairedRef(b, size_E);
+ auto [x1, x2] = AssemblePairedRef(x, size_E);
+
+ Vector RHS1;
RHS1.UseDevice(true);
- b.Read();
- b1.MakeRef(const_cast<Vector &>(b), 0, size_E);
- b2.MakeRef(const_cast<Vector &>(b), size_E, size_E);
- x.ReadWrite();
- x1.MakeRef(x, 0, size_E);
- x2.MakeRef(x, size_E, size_E);
RHS.ReadWrite();
RHS1.MakeRef(RHS, 0, size_E);
If that works, we might actually want to use the same trick in other places; then the `const` violation will be confined to one relatively safe location. You should test this on your GPU builds though, as it's very plausible that the `Vector` constructors don't play too nicely with this.
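A quick aliasing check along these lines could confirm whether the `MakeRef` references survive the `std::pair` return (hypothetical test, not part of the PR, assuming the `AssemblePairedRef` helper from the diff above is visible in the translation unit):

#include <mfem.hpp>
#include <cassert>

int main()
{
  const int size_E = 4;
  mfem::Vector x(2 * size_E);
  x = 0.0;
  // Bind the two halves; if the pair members end up as deep copies instead of
  // aliases, the writes below never reach x and the assert fires.
  auto [x1, x2] = AssemblePairedRef(x, size_E);
  x1 = 1.0;
  x2 = 2.0;
  assert(x(0) == 1.0 && x(size_E) == 2.0);
  return 0;
}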
I updated the config file defaults and parameter checking so we can use appropriate constraints in the JSON script. I'm using some existence checks and a few `if`s in config.cpp to remove the need for the exhaustive if/else chains I had in iodata.cpp.
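A hypothetical illustration of that pattern (not the PR's actual code; the key names and values are made up here, assuming an nlohmann::json-style parsed config):

#include <nlohmann/json.hpp>
#include <stdexcept>

// Only enforce adaptivity-related constraints when the user actually selected
// an adaptive (SUNDIALS) integrator, instead of enumerating every combination
// in an exhaustive if/else chain.
void CheckTransientConfig(const nlohmann::json &transient)
{
  const bool adaptive = transient.contains("Type") &&
                        (transient["Type"] == "CVODE" || transient["Type"] == "ARKODE");
  if (!adaptive && transient.contains("RelTol"))
  {
    throw std::runtime_error(
        "\"RelTol\" is only supported for adaptive (SUNDIALS) transient solvers!");
  }
}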
Add adaptive time-stepping capability for transient simulations. The default transient solver type remains a fixed time-stepping integrator, but the user can now use SUNDIALS implicit multistep (CVODE) and Runge-Kutta (ARKODE) integrators if desired.