Two-dimensional AMROC Mesh Hierarchies
The following test cases provide representative evolving AMR mesh hierarchies. The data files trace*_all.dat are useful for evaluating different dynamic partitioning schemes for block-structured mesh hierarchies. The format of trace*_all.dat was proposed by Johan Steensland, Sandia National Laboratories.
Mach reflection at a wedge
A frequently used test case: the Mach reflection of a planar Mach 10.0 shock at a 30 degree wedge. Standard AMR calculation. Also used in M. Berger and P. Colella. Local adaptive mesh refinement for shock hydrodynamics. J. Comput. Phys., 82:64-84, 1988. Same initial conditions as in ClawpackEuler2dRamp.
- Roe solver with carbuncle fix in 2nd order multidimensional Wave Propagation Method with MUSCL slope limiting
- Base grid 480x120, 3 additional levels of refinement, refinement factors 2,2,4
- Contour plots with levels at final time: Full domain, Detail, schlieren plots: Detail
- Results trace.txt, out.txt, solver.in: Ramp2dBw4.tar.gz (BlockWidth=4, Efficiency=70%), Ramp2dBw2.tar.gz (BlockWidth=2, Efficiency=70%), Ramp2dBw4_050.tar.gz (BlockWidth=4, Efficiency=60%)
- Real imbalance results on 8, 16, 32, 64 processors ALC with current parallelization strategy: Ramp2dBw4ALC.tar.gz (BlockWidth=4, Efficiency=70%)
- SFC statistics result: PDFs, gnuplot
- SFC videos: 8cpus, 16cpus, 32cpus
- Source codes
Planar Richtmyer-Meshkov instability
Shock-driven turbulence simulation in which a Mach 10.0 shock wave deposits vorticity along a sinusoidally perturbed material interface (5 symmetric perturbations) in a rectangular geometry that is closed except at one end. A reshock due to reflection at the wall perturbs the shocked interface further. This example is motivated by the experiments of M. Vetter and B. Sturtevant. Experiments on the Richtmyer-Meshkov instability of an air/SF6 interface. Shock Waves 4(5):247-252, 1995. LLNL and LANL have an intrinsic interest in problems of this type and are doing numerous similar simulations. The refinement is very widespread. The widespread and simple refinement seems to be the reason that this is the only example where BlockWidth=2 results in a smaller overall compute time.
- Robust two-component Roe-HLL solver in 2nd order multidimensional Wave Propagation Method with MUSCL slope limiting
- Base grid 240x120, 3 additional levels of refinement, refinement factors 2,2,2
- Schlieren plots of full domain (turbulent interface fully refined): t=0.5, t=1.0, t=1.5, t=2.0, t=2.5
- Results trace.txt, out.txt, solver.in: ShockTurb2dBw4.tar.gz (BlockWidth=4), ShockTurb2dBw2.tar.gz (BlockWidth=2)
- Real imbalance results on 8, 16, 32, 64 processors ALC with current parallelization strategy: ShockTurb2dBw4ALC.tar.gz (BlockWidth=4)
- SFC statistics result: PDFs, gnuplot
- SFC videos: 8cpus
- Source codes
Converging/diverging Richtmyer-Meshkov instability
This simulation is intended to study the Richtmyer-Meshkov instability in a spherical setting. It is a preparatory simulation for the VTF target simulation Converging Shock. A spherical shock wave, initially at Mach 5.0 and located at r=1.5, perturbs a spherical material interface (6 symmetric perturbations) initially located at r=1.0. The spherical shock is initialized with Guderley's analytic solution calculated with Chisnell's approximation (R.F. Chisnell. An analytic description of converging shock waves. J. Fluid Mech. 354:357-375, 1998). The spherical converging shock wave is reflected at the origin around t=0.3 and drives a Richtmyer-Meshkov instability with reshock from the apex. Domain-based partitioning leads to a maximum in load imbalance around t=0.3.
- Robust two-component Roe-HLL solver in 2nd order multidimensional Wave Propagation Method with MUSCL slope limiting
- Base grid 200x200, [0,8]x[0,8], 4 additional levels of refinement, refinement factors 2,2,4,2
- Plots at t=2.1: Contour plots on levels, Schlieren plot, schlieren plots of origin: t=0.0, t=0.3, t=0.6, t=0.9, t=2.1, contour plots with levels of origin: t=0.3, t=0.6, t=0.9
- Results trace.txt, out.txt, solver.in: ConvShock2dBw4.tar.gz (BlockWidth=4), ConvShock2dBw2.tar.gz (BlockWidth=2), ConvShock2dBw4OldClusterer.tar.gz (BlockWidth=4, old Berger-Rigoutsos clustering algorithm)
- This is the only example where the cost of the BR algorithm takes a significant portion of the overall time. The example reveals that the new implementation by Johan is about twice as expensive as the old one, at least for large problem sizes. In this example it takes more than 50% of the overall time. We should take a closer look at the new code again.
- Real imbalance results on 8, 16, 32, 64 processors ALC with current parallelization strategy: ConvShock2dBw4ALC.tar.gz (BlockWidth=4)
- SFC statistics result: PDFs, gnuplot
- SFC videos: 8cpus, 16cpus, 32cpus
- Source codes
Detonation propagating through a channel with two rectangular bends
This example in a Cartesian multiblock domain was motivated by detonation simulations, which typically require relatively deep hierarchies with thin refinement fronts. It is an example of SAMR in practical engineering simulations, for instance in safety analysis studies. See also Diffraction of a Detonation Wave. For simplicity and reproducibility by others, the example uses one-step chemistry, and the detonation wave is initialized at the left end with the analytic solution according to the ZND theory (see for instance R. Deiterding. Parallel adaptive simulation of multi-dimensional detonation structures (31MB PDF). Dissertation. BTU Cottbus, Sep 2003, page 36). In this example the new BR algorithm takes about 5% of the overall computing time.
- Robust two-component Roe-HLL solver with 2nd order MUSCL slope limiting and Godunov dimensional splitting, source term for one-step chemistry
- Base grid 220x100, 2 rectangular regions cut out, 5 additional levels of refinement, refinement factors 2,2,2,2,2, block width 4 (default)
- Schlieren plots of full domain: t=0.1, t=0.2, t=0.3, t=0.35, t=0.4, t=0.5, contour plots with levels at t=0.35: Full domain, Detail
- Results trace.txt, out.txt, solver.in: DetChan2dBw4.tar.gz (BlockWidth=4, Efficiency=85%), DetChan2dBw2.tar.gz (BlockWidth=2, Efficiency=85%), DetChan2dBw4_065.tar.gz (BlockWidth=4, Efficiency=65%)
- Real imbalance results on 8, 16, 32, 64 processors ALC with current parallelization strategy: DetChan2dBw4ALC.tar.gz (BlockWidth=4, Efficiency=85%)
- SFC statistics result: PDFs, gnuplot
- SFC videos: 8cpus
- Source codes
Cylinders in hypersonic flow
An application of SAMR to a simple exterior flow problem in aerodynamics. A constant Mach 10.0 inflow leads to steady bow shocks over two cylinders that are incorporated into the Cartesian AMR method with the Ghost fluid method. It demonstrates SAMR on a realistic flow problem with complex embedded boundaries. Same initial conditions as in LoadOnSpheres.
- Van Leer flux vector splitting with 2nd order MUSCL slope limiting and Godunov dimensional splitting
- Base grid 200x160, 3 additional levels of refinement, refinement factors 2,2,2
- Contour plots with levels at final time: Full domain, Detail, schlieren plots: Full domain, Detail
- Results trace.txt, out.txt, solver.in: Spheres2dBw4.tar.gz (BlockWidth=4), Spheres2dBw2.tar.gz (BlockWidth=2)
- Real imbalance results on 8, 16, 32, 64 processors ALC with current parallelization strategy: Spheres2dBw4ALC.tar.gz (BlockWidth=4)
- SFC statistics result: PDFs, gnuplot
- SFC videos: 8cpus
- Source codes
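The base grids and refinement factors quoted for the five test cases above determine the resolution of an equivalent uniform grid at the finest level. The following is a minimal sketch of that arithmetic, assuming each refinement factor applies equally in both coordinate directions. The dictionary keys reuse the archive name prefixes from the result files; the function name is hypothetical.

```python
from math import prod

# Base grids and refinement factors as listed for the five test cases.
# Keys are the archive name prefixes used on this page.
CASES = {
    "Ramp2d": ((480, 120), (2, 2, 4)),
    "ShockTurb2d": ((240, 120), (2, 2, 2)),
    "ConvShock2d": ((200, 200), (2, 2, 4, 2)),
    "DetChan2d": ((220, 100), (2, 2, 2, 2, 2)),
    "Spheres2d": ((200, 160), (2, 2, 2)),
}


def finest_equivalent_grid(base, factors):
    """Return the uniform grid size matching the finest refinement level.

    Assumes every refinement factor is applied in both directions.
    """
    total = prod(factors)  # cumulative refinement relative to the base grid
    return tuple(n * total for n in base)


if __name__ == "__main__":
    for name, (base, factors) in CASES.items():
        nx, ny = finest_equivalent_grid(base, factors)
        print(f"{name}: {base[0]}x{base[1]} -> {nx}x{ny}")
```

For the detonation channel the equivalent uniform grid is an upper bound on the domain size, since two rectangular regions are cut out of the base grid.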
The balance.txt files contain: [step] [physical time] [maximal relative workload] [minimal relative workload] [difference of both]
Because of a small programming error, the maximal and minimal workload in all files reflect the average over time.
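A minimal sketch of reading the five balance.txt columns listed above. It assumes whitespace-separated values with one record per line, which this page does not state explicitly; the class and function names are hypothetical, and the sample values in the usage part are invented for illustration.

```python
from typing import NamedTuple


class BalanceRecord(NamedTuple):
    step: int
    time: float          # physical time
    max_rel_work: float  # maximal relative workload
    min_rel_work: float  # minimal relative workload
    difference: float    # difference of both


def parse_balance(text):
    """Parse balance.txt content into a list of BalanceRecord.

    Assumption: columns are separated by whitespace; blank lines are skipped.
    """
    records = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue
        if len(fields) != 5:
            raise ValueError(f"line {lineno}: expected 5 columns, got {len(fields)}")
        step = int(float(fields[0]))
        time, wmax, wmin, diff = (float(f) for f in fields[1:])
        records.append(BalanceRecord(step, time, wmax, wmin, diff))
    return records


if __name__ == "__main__":
    sample = "0 0.0 1.20 0.80 0.40\n1 0.01 1.10 0.90 0.20\n"  # invented values
    for rec in parse_balance(sample):
        print(rec.step, rec.time, rec.difference)
```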
The trace*_all.dat files contain: [steps] [max sum work] [sum max work] [avg work] [avg sync] [avg orphan work] [avg move] [max sum work balance] [sum max work balance]
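The nine trace*_all.dat columns can be given names when loading a file, which makes it easier to compare partitioning schemes column by column. This is a sketch only: it assumes whitespace-separated numeric columns in the order listed above, and the identifiers are hypothetical renderings of the bracketed column labels.

```python
# Column names derived from the bracketed labels listed above.
TRACE_COLUMNS = (
    "steps",
    "max_sum_work",
    "sum_max_work",
    "avg_work",
    "avg_sync",
    "avg_orphan_work",
    "avg_move",
    "max_sum_work_balance",
    "sum_max_work_balance",
)


def parse_trace(text):
    """Parse trace*_all.dat content into a list of dicts keyed by column name.

    Assumption: one record per line, nine whitespace-separated numbers.
    """
    rows = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != len(TRACE_COLUMNS):
            raise ValueError(
                f"line {lineno}: expected {len(TRACE_COLUMNS)} columns, got {len(fields)}"
            )
        rows.append(dict(zip(TRACE_COLUMNS, (float(f) for f in fields))))
    return rows


def column(rows, name):
    """Extract one named column as a list of floats."""
    return [row[name] for row in rows]
```

With two parsed files, for example one per partitioning scheme, `column(rows, "avg_sync")` gives series that can be compared directly.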
Given a box hierarchy bh, the metrics are based on the following calculations (by Randolf Rotta):
- orphans (level l, proc p): intersection(bh[level=l,proc!=p], bh[level=l+1,proc=p])
- movement (level l, proc p): intersection(bh_old[level=l,proc!=p], bh[level=l,proc=p]) + intersection(bh_old[level=l,proc!=p], orphans[level=l,proc=p]) (what processor p receives; note that this does not account for boundaries!)
- synchronization (level l, proc p): intersection(grow(bh[level=l,proc!=p]), bh[level=l,proc=p]) + intersection(grow(intersection(adjust(bh[level=l+1,proc!=p]), bh[level=l,proc=p])), bh[level=l,proc=p]) + intersection(bh[level=l,proc!=p], bh[level=l+1,proc=p]) (what processor p sends, including orphans)
- work (level l, proc p): bh[level=l,proc=p] + orphans[level=l,proc=p]
- balance: max/average
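The orphan, work and balance definitions above can be sketched for a small 2D hierarchy as follows. This is an illustration under stated assumptions, not the code used to produce the trace files: boxes are half-open cell index ranges, sizes are counted in level-l cells, boxes on the same level do not overlap, and fine boxes are coarsened by a single refinement factor r before intersecting them with level-l boxes. All names are hypothetical.

```python
from typing import NamedTuple, Tuple


class Box(NamedTuple):
    level: int
    proc: int
    lo: Tuple[int, int]  # lower cell index (i, j), inclusive
    hi: Tuple[int, int]  # upper cell index (i, j), exclusive


def cells(lo, hi):
    """Number of cells in the half-open index range [lo, hi)."""
    return max(0, hi[0] - lo[0]) * max(0, hi[1] - lo[1])


def overlap(a_lo, a_hi, b_lo, b_hi):
    """Cell count of the intersection of two index ranges."""
    lo = (max(a_lo[0], b_lo[0]), max(a_lo[1], b_lo[1]))
    hi = (min(a_hi[0], b_hi[0]), min(a_hi[1], b_hi[1]))
    return cells(lo, hi)


def coarsen(box, r):
    """Map a level l+1 box to level l indices (assumed factor r)."""
    lo = (box.lo[0] // r, box.lo[1] // r)
    hi = (-(-box.hi[0] // r), -(-box.hi[1] // r))  # ceiling division
    return lo, hi


def orphans(bh, level, proc, r):
    """intersection(bh[level=l,proc!=p], bh[level=l+1,proc=p]) in level-l cells."""
    total = 0
    for fine in bh:
        if fine.level != level + 1 or fine.proc != proc:
            continue
        f_lo, f_hi = coarsen(fine, r)
        for coarse in bh:
            if coarse.level == level and coarse.proc != proc:
                total += overlap(coarse.lo, coarse.hi, f_lo, f_hi)
    return total


def work(bh, level, proc, r):
    """bh[level=l,proc=p] + orphans[level=l,proc=p], counted in cells."""
    own = sum(cells(b.lo, b.hi) for b in bh if b.level == level and b.proc == proc)
    return own + orphans(bh, level, proc, r)


def balance(works):
    """max/average over the per-processor work values."""
    return max(works) / (sum(works) / len(works))


if __name__ == "__main__":
    # Invented example: an 8x8 level-0 domain split between two processors,
    # plus one level-1 box (refinement factor 2) owned by processor 0.
    bh = [
        Box(0, 0, (0, 0), (4, 8)),
        Box(0, 1, (4, 0), (8, 8)),
        Box(1, 0, (4, 4), (12, 12)),
    ]
    works = [work(bh, 0, p, 2) for p in (0, 1)]
    print(works, balance(works))
```

In this invented example the fine box owned by processor 0 partly covers coarse cells owned by processor 1, so those cells count as orphan work for processor 0 on level 0.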
RalfDeiterding - 18 May 2005