Resolving Gradient Pathologies in Physics-Informed Neural Networks

This repository contains the official implementation of the paper: Resolving Gradient Pathologies in Physics-Informed Neural Networks via Dynamic Gradient Surgery and Adaptive Refinement for Robust Inverse Discovery Under Noise.

Overview

Physics-Informed Neural Networks (PINNs) offer a mesh-free alternative to classical numerical solvers by embedding governing partial differential equations (PDEs) directly into the training objective. However, they routinely suffer from two forms of gradient pathology:

Magnitude Imbalance (Type-I): One loss term dominates the optimization.
Directional Conflict (Type-II): Satisfying one physical constraint actively degrades another (the "Tug-of-War" problem).

These pathologies become catastrophic in inverse parameter discovery, where sparse, noise-corrupted sensor data must coexist with strict PDE enforcement. This repository provides a unified framework to overcome these challenges.

Key Components

The canonical PINN objective takes the form: $$\mathcal{L}_{\text{total}}=\lambda_{\text{pde}}\mathcal{L}_{\text{pde}}+\sum_{k=1}^{K}\lambda_{k}\mathcal{L}_{\text{bc},k}+\lambda_{\text{data}}\mathcal{L}_{\text{data}}$$
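As a minimal sketch of how this objective is assembled, the function below combines scalar loss values with their weights. The function name, argument names, and the default weights of 1.0 are illustrative assumptions, not the repository's API.

```python
def total_loss(loss_pde, losses_bc, loss_data,
               w_pde=1.0, w_bc=None, w_data=1.0):
    """Weighted sum of PDE, boundary-condition, and data losses.

    All names and default weights here are hypothetical placeholders.
    """
    if w_bc is None:
        # Assumption: unit weight for every boundary-condition term.
        w_bc = [1.0] * len(losses_bc)
    if len(w_bc) != len(losses_bc):
        raise ValueError("need one weight per boundary-condition loss")
    bc_term = sum(w * l for w, l in zip(w_bc, losses_bc))
    return w_pde * loss_pde + bc_term + w_data * loss_data
```

With static weights this reduces to the baseline PINN objective; the modules described below change how the weights and gradients are handled during training.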

Our framework couples four synergistic modules to stabilize training and enable robust inverse discovery:

1. Dynamic Gradient Balancing (DB-PINN)

Addresses Type-I magnitude imbalance by tracking the running gradient scale of each loss term using a Forgetful Exponential Moving Average (EMA). The EMA estimate is: $$\hat{g}_i^{(t)}=\alpha\cdot\hat{g}_i^{(t-1)}+(1-\alpha)\cdot g_i^{(t)}$$ where $g_i^{(t)}=\|\nabla_\theta\mathcal{L}_i^{(t)}\|$ and $\alpha=0.999$. Loss weights are adaptively rebalanced based on these statistics.
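The EMA update can be sketched in a few lines of plain Python. The class name `GradScaleEMA`, the initialisation from the first observed norm, and the rebalancing rule (mean tracked scale divided by each term's scale) are assumptions made for illustration; the README does not specify how the weights are derived from the statistics.

```python
import math


class GradScaleEMA:
    """Tracks an EMA of the gradient norm of each loss term (illustrative)."""

    def __init__(self, n_terms, alpha=0.999):
        self.alpha = alpha
        self.ema = [None] * n_terms

    def update(self, grads):
        """grads: one flat gradient vector (list of floats) per loss term."""
        for i, g in enumerate(grads):
            norm = math.sqrt(sum(x * x for x in g))
            if self.ema[i] is None:
                # Assumption: seed the average with the first observed norm.
                self.ema[i] = norm
            else:
                self.ema[i] = self.alpha * self.ema[i] + (1.0 - self.alpha) * norm
        return list(self.ema)

    def weights(self, eps=1e-12):
        """Hypothetical rule: weight_i = mean(ema) / ema_i.

        Call update() at least once before calling this method.
        """
        mean = sum(self.ema) / len(self.ema)
        return [mean / (e + eps) for e in self.ema]
```

Under this assumed rule, a term whose gradients are small relative to the others receives a weight above 1, and a dominant term receives a weight below 1.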

2. Gradient Surgery (PCGrad) with Gradient Task Normalization (GTN)

Resolves Type-II directional conflicts. If gradients conflict ($\nabla_\theta\mathcal{L}_i\cdot\nabla_\theta\mathcal{L}_j<0$), the interfering gradient is projected onto the normal plane of its competitor: $$\nabla_\theta\mathcal{L}_i^{*}=\nabla_\theta\mathcal{L}_i-\frac{\nabla_\theta\mathcal{L}_i\cdot\nabla_\theta\mathcal{L}_j}{\|\nabla_\theta\mathcal{L}_j\|_2^2+\epsilon}\nabla_\theta\mathcal{L}_j$$ Before surgery, GTN normalizes each gradient vector to $O(1)$ scale to prevent domination by high-magnitude terms like pressure gradients in fluid dynamics.
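The projection step can be illustrated on flat gradient vectors as follows. This is a sketch under stated assumptions: GTN is approximated here by scaling each gradient to unit norm, the projections are applied in a fixed order (the original PCGrad algorithm shuffles the order), and the projected gradients are combined by summation. All function names are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def normalize(g, eps=1e-12):
    """Assumed form of GTN: rescale a gradient to (approximately) unit norm."""
    n = math.sqrt(dot(g, g))
    return [x / (n + eps) for x in g]


def project_conflict(g_i, g_j, eps=1e-12):
    """Remove from g_i its component along g_j, only if the two conflict."""
    d = dot(g_i, g_j)
    if d >= 0.0:
        return list(g_i)  # no conflict, gradient is left unchanged
    coef = d / (dot(g_j, g_j) + eps)
    return [a - coef * b for a, b in zip(g_i, g_j)]


def pcgrad(grads):
    """Normalize every task gradient, resolve conflicts, and sum the results."""
    unit = [normalize(g) for g in grads]
    projected = []
    for i, g in enumerate(unit):
        g_proj = list(g)
        for j, other in enumerate(unit):
            if j != i:
                g_proj = project_conflict(g_proj, other)
        projected.append(g_proj)
    return [sum(col) for col in zip(*projected)]
```

Because each gradient is normalized first, multiplying one task's gradient by a large constant leaves the combined update direction unchanged, which is the effect the GTN step is meant to achieve.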

3. Residual-based Adaptive Refinement (RAR)

Dynamically redistributes collocation points based on the current residual landscape, sampling heavily in regions with high PDE residual magnitude $|\mathcal{R}(\bm{x}_j)|$.
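One simple way to realise residual-weighted sampling is to draw new collocation points from a candidate pool with probability proportional to $|\mathcal{R}(\bm{x}_j)|$. The sketch below assumes exactly that scheme; the function name `rar_select`, the candidate-pool approach, and sampling with replacement are illustrative choices rather than the repository's implementation.

```python
import random


def rar_select(candidates, residual_fn, n_new, rng=None):
    """Pick n_new points from candidates, favouring high |residual|.

    residual_fn is a hypothetical callable returning the PDE residual at a point.
    """
    rng = rng or random.Random(0)  # fixed seed for reproducibility (assumption)
    weights = [abs(residual_fn(x)) for x in candidates]
    if sum(weights) == 0.0:
        # Residual is zero everywhere: fall back to uniform sampling.
        return [rng.choice(candidates) for _ in range(n_new)]
    # random.choices samples with replacement, proportional to the weights.
    return rng.choices(candidates, weights=weights, k=n_new)
```

In a training loop, the selected points would be appended to (or swapped into) the collocation set every few hundred iterations, with the residual evaluated by the current network.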

4. Softplus Physics Anchor

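Ensures physical validity during inverse problems. Instead of optimizing an unknown physical parameter (such as the Reynolds number) directly, the framework learns an unconstrained scalar $k$ and maps it through a Softplus function, which guarantees a positive value: $$Re=\beta\cdot\ln(1+e^k)$$ Here $\beta$ scales the Softplus output to the physical range of the parameter.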
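The mapping from the unconstrained scalar $k$ to $Re$ can be sketched as below. The value `beta=100.0` and the helper `inverse_softplus` (useful for choosing an initial $k$ from an initial guess of $Re$) are assumptions for illustration; the README does not state the value of $\beta$.

```python
import math


def softplus(k):
    """Numerically stable ln(1 + e^k) = max(k, 0) + log1p(exp(-|k|))."""
    return max(k, 0.0) + math.log1p(math.exp(-abs(k)))


def reynolds(k, beta=100.0):
    """Re = beta * softplus(k); beta=100.0 is a hypothetical scale."""
    return beta * softplus(k)


def inverse_softplus(y):
    """Return k with softplus(k) = y for y > 0, via k = y + ln(1 - e^{-y})."""
    return y + math.log(-math.expm1(-y))
```

For example, `inverse_softplus(1.0)` gives the $k$ for which `reynolds(k, beta=100.0)` starts at 100, and gradient steps on $k$ cannot push $Re$ below zero.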

Architecture

The framework utilizes Sinusoidal Representation Networks (SIREN), where each hidden layer applies: $$\bm{h}_{l+1}=\sin\!\left(\omega_0\cdot(\bm{W}_l\bm{h}_l+\bm{b}_l)\right)$$ This provides analytically exact, non-vanishing derivatives critical for computing PDE residuals involving second-order spatial derivatives. All computations are performed in float64 precision.
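To make the derivative claim concrete, the sketch below evaluates one SIREN layer and its exact Jacobian, $\partial h_{l+1,r}/\partial h_{l,c}=\omega_0\cos(z_r)\,W_{rc}$ with $z=\omega_0(\bm{W}_l\bm{h}_l+\bm{b}_l)$, using plain Python lists. The default `omega0=30.0` follows the original SIREN paper and is an assumption here, as are the function names; the repository's value is not stated in this README.

```python
import math


def siren_layer(h, W, b, omega0=30.0):
    """One SIREN layer: sin(omega0 * (W h + b)). W is a list of rows."""
    return [
        math.sin(omega0 * (sum(w * x for w, x in zip(row, h)) + bi))
        for row, bi in zip(W, b)
    ]


def siren_layer_jacobian(h, W, b, omega0=30.0):
    """Exact Jacobian d(out_r)/d(h_c) = omega0 * cos(z_r) * W[r][c]."""
    jac = []
    for row, bi in zip(W, b):
        z = omega0 * (sum(w * x for w, x in zip(row, h)) + bi)
        jac.append([omega0 * math.cos(z) * w for w in row])
    return jac
```

The derivative of a sine layer is a phase-shifted sine layer, so repeated differentiation never collapses to zero the way it does for piecewise-linear activations such as ReLU.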

Benchmarks

The framework is validated on two canonical benchmarks:

Allen-Cahn Equation: A stiff phase-field equation ($\epsilon=10^{-4}$) modeling phase separation dynamics.
2D Navier-Stokes (Cylinder Flow): Steady incompressible flow around a cylinder at $Re=100$.

Results Highlight

In inverse parameter estimation (discovering $Re$ from sensors with 10% Gaussian noise), baseline PINNs diverge or flatline. Our complete framework converges stably, recovering the target Reynolds number with approximately 1.5% relative error ($Re_{\text{pred}}\approx 98.5\pm 1.2$).
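The quoted relative error follows directly from the mean prediction and the target value. A short check, using only the numbers reported above (the function name is illustrative):

```python
def relative_error(pred, target):
    """Relative error |pred - target| / |target|."""
    return abs(pred - target) / abs(target)


re_true = 100.0   # target Reynolds number from the benchmark
re_pred = 98.5    # mean recovered value reported above
print(f"{relative_error(re_pred, re_true):.1%}")  # prints 1.5%
```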

