HPCG-Benchmark
version=3.1
Release date=March 28, 2019
Machine Summary=
Machine Summary::Distributed Processes=3640
Machine Summary::Threads per processes=6
Global Problem Dimensions=
Global Problem Dimensions::Global nx=10240
Global Problem Dimensions::Global ny=7168
Global Problem Dimensions::Global nz=3744
Processor Dimensions=
Processor Dimensions::npx=20
Processor Dimensions::npy=14
Processor Dimensions::npz=13
Local Domain Dimensions=
Local Domain Dimensions::nx=512
Local Domain Dimensions::ny=512
Local Domain Dimensions::Lower ipz=0
Local Domain Dimensions::Upper ipz=12
Local Domain Dimensions::nz=288
########## Problem Summary ##########=
Setup Information=
Setup Information::Setup Time=0.619357
Linear System Information=
Linear System Information::Number of Equations=274810798080
Linear System Information::Number of Nonzero Terms=7417397436280
Multigrid Information=
Multigrid Information::Number of coarse grid levels=3
Multigrid Information::Coarse Grids=
Multigrid Information::Coarse Grids::Grid Level=1
Multigrid Information::Coarse Grids::Number of Equations=34351349760
Multigrid Information::Coarse Grids::Number of Nonzero Terms=926862979000
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=2
Multigrid Information::Coarse Grids::Number of Equations=4293918720
Multigrid Information::Coarse Grids::Number of Nonzero Terms=115779971032
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=3
Multigrid Information::Coarse Grids::Number of Equations=536739840
Multigrid Information::Coarse Grids::Number of Nonzero Terms=14453032936
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
########## Memory Use Summary ##########=
Memory Use Information=
Memory Use Information::Total memory used for data (Gbytes)=196448
Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0
Memory Use Information::Bytes per equation (Total memory / Number of Equations)=714.847
Memory Use Information::Memory used for linear system and CG (Gbytes)=172889
Memory Use Information::Coarse Grids=
Memory Use Information::Coarse Grids::Grid Level=1
Memory Use Information::Coarse Grids::Memory used=20653.3
Memory Use Information::Coarse Grids::Grid Level=2
Memory Use Information::Coarse Grids::Memory used=2582.69
Memory Use Information::Coarse Grids::Grid Level=3
Memory Use Information::Coarse Grids::Memory used=323.097
########## V&V Testing Summary ##########=
Spectral Convergence Tests=
Spectral Convergence Tests::Result=PASSED
Spectral Convergence Tests::Unpreconditioned=
Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11
Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12
Spectral Convergence Tests::Preconditioned=
Spectral Convergence Tests::Preconditioned::Maximum iteration count=1
Spectral Convergence Tests::Preconditioned::Expected iteration count=2
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon=
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=1.00446e-16
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=2.0236e-06
########## Iterations Summary ##########=
Iteration Count Information=
Iteration Count Information::Result=PASSED
Iteration Count Information::Reference CG iterations per set=50
Iteration Count Information::Optimized CG iterations per set=53
Iteration Count Information::Total number of reference iterations=25150
Iteration Count Information::Total number of optimized iterations=26659
########## Reproducibility Summary ##########=
Reproducibility Information=
Reproducibility Information::Result=PASSED
Reproducibility Information::Scaled residual mean=0.0050119
Reproducibility Information::Scaled residual variance=1.97677e-30
########## Performance Summary (times in sec) ##########=
Benchmark Time Summary=
Benchmark Time Summary::Optimization phase=0.535646
Benchmark Time Summary::DDOT=89.8836
Benchmark Time Summary::WAXPBY=108.263
Benchmark Time Summary::SpMV=406.113
Benchmark Time Summary::MG=1370.03
Benchmark Time Summary::Total=1974.86
Floating Point Operations Summary=
Floating Point Operations Summary::Raw DDOT=4.42335e+16
Floating Point Operations Summary::Raw WAXPBY=4.42335e+16
Floating Point Operations Summary::Raw SpMV=4.02943e+17
Floating Point Operations Summary::Raw MG=2.2569e+18
Floating Point Operations Summary::Total=2.74831e+18
Floating Point Operations Summary::Total with convergence overhead=2.59275e+18
GB/s Summary=
GB/s Summary::Raw Read B/W=8.57134e+06
GB/s Summary::Raw Write B/W=1.981e+06
GB/s Summary::Raw Total B/W=1.05523e+07
GB/s Summary::Total with convergence and optimization phase overhead=9.67055e+06
GFLOP/s Summary=
GFLOP/s Summary::Raw DDOT=492120
GFLOP/s Summary::Raw WAXPBY=408576
GFLOP/s Summary::Raw SpMV=992193
GFLOP/s Summary::Raw MG=1.64734e+06
GFLOP/s Summary::Raw Total=1.39165e+06
GFLOP/s Summary::Total with convergence overhead=1.31288e+06
GFLOP/s Summary::Total with convergence and optimization phase overhead=1.27536e+06
User Optimization Overheads=
User Optimization Overheads::Optimization phase time (sec)=0.535646
User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=0.0314174
DDOT Timing Variations=
DDOT Timing Variations::Min DDOT MPI_Allreduce time=15.9698
DDOT Timing Variations::Max DDOT MPI_Allreduce time=42.6052
DDOT Timing Variations::Avg DDOT MPI_Allreduce time=23.5199
Final Summary=
Final Summary::HPCG result is VALID with a GFLOP/s rating of=1.27536e+06
Final Summary::HPCG 2.4 rating for historical reasons is=1.29521e+06
Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org