Trilinos/NOX Nonlinear Solver
Transcript of Trilinos/NOX Nonlinear Solver
1 ‹#›
Capabilities: • Newton-Based Nonlinear Solver - Linked to Trilinos linear solvers for scalability - Matrix-Free option
• Anderson Acceleration for Fixed-Point iterations • Globalizations for improved robustness - Line Searches, Trust Region, Homotopy methods
• Customizable: C++ abstractions at every level • Extended by LOCA package - Parameter continuation, Stability analysis, Bifurcation tracking
Download: Part of Trilinos (trilinos.sandia.gov)
Further information: Andy Salinger [[email protected]]
Trilinos/NOX Nonlinear Solver
2 ‹#›
FASTMath SciDAC Institute"
NOX and ML are part of larger Trilinos solver stack: Linear solvers, Equations solvers, Analysis tools
Analysis Tools""""""
UQ (sampling)"Parameter Studies"
Optimization"
Analysis Tools" (black-box)!
LinearSolvers"""""""""
Direct Solvers"
Linear Algebra"
Algebraic"Preconditioners"
Iterative Solvers"
EquationSolvers"""""""""""UQ Solver"
Nonlinear Solver!Time Integration"
Optimization"
Continuation"Sensitivity Analysis"Stability Analysis"
Analysis Tools" (embedded)!Linear Solver
Interface
Nonlinear Model Interface
Your Model Here
Solved Problem Interface
NOX
Multilevel!Preconditioners! ML
3 ‹#›
Ice Sheets modeled by nonlinear Stokes’s equation • Initial solve is fragile: Full Newton fails • Homotopy continuation on regularization
parameter “γ” saves the day
Trilinos/NOX: Robustness for Ice Sheet Simulation: PISCEES SciDAC Application project (BER-ASCR)
Greenland Ice Sheet"Surface Velocities"
(constant friction model)"
γ=10-1.0"
γ=10-2.5"γ=10-6.0"
γ=10-10"
γ=10-10"
γ=10-10"
4 ‹#›
ML: algebraic multigrid (AMG) • Aggregation-based AMG for - Poisson, elasticity, convection-diffusion, eddy current (E&M)
• Various coarsening and data rebalancing options • Smoothers: SOR, polynomial, ILU, block variants, user-provided
MueLu: templated multigrid framework • Same main algorithms as above, plus energy minimizing AMG • Leverages Trilinos templated sparse linear algebra stack - Can use kernels optimized for various compute node types - Templated scalar type allows for mixed precision, UQ, …
• Advanced data reuse possibilities, extensible by design
Multigrid in Trilinos
Download/further information: www.trilinos.org
5 ‹#›
ML scales up to 512K BG/Q & 128K Titan cores in Drekar simulations
ML and MueLu: Application highlights
MHD generator*"
Tokamak*"SARANS turbulence*"
*courtesy E. Cyr/SNL"
Turbulent OpenJet (Re=6600)"(courtesy P. Lin/SNL)
MueLu scales up to 524K BG/Q cores in Sierra/Nalu simulations
0.0#
0.1#
0.2#
0.3#
0.4#
0.5#
0.6#
0.7#
0.8#
0.9#
1.0#
128# 1,024# 8,192# 65,536# 524,288#
Parallel&Efficien
cy:&Tim
e/(Krylov&Ite
ra7o
n)&
#&MPI&Processes&
PPE&Solve&Weak&Scaling&&
Edge2based# Element2based#
6 ‹#›
Capabilities: • Compute a few eigenpairs of a Hermitian and non-Hermitian
matrix • Both standard and generalized eigenvalues • Extremal and interior eigenvalues • Reverse communication allows easy integration with application • MPI/BLACS communication
Download: http://www.caam.rice.edu/software/ARPACK/
Further information: beyond PARPACK • EIGPEN (based on penalty trace minimization for computing
many eigenpairs) • Parallel multiple shift-invert interface for computing many
eigenpairs
PARPACK
7 ‹#›
Hybrid MPI/OpenMP implementation MATLAB interface available (eigs)
PARPACK
8 ‹#›
MFDn (Many-body Fermion Dynamics for nuclear physics) is the most robust and efficient configuration interaction code for studying nuclear structure
Solve the nuclear quantum many-body problem
PARPACK allows low excitation states to be extracted from a low dimensional Krylov subspace
MFDn: Nuclear Configuration Interaction Calculation
0 2 4 6 8 10 12 14N
max
100
101
102
103
104
105
106
107
108
109
1010
M-s
chem
e bas
is s
pac
e dim
ensi
on
4He6Li8Be10B12C16O19F23Na27Al