HPCG-Benchmark version=3.1 Release date=March 28, 2019 Machine Summary= Machine Summary::Distributed Processes=148512 Machine Summary::Threads per processes=1 Global Problem Dimensions= Global Problem Dimensions::Global nx=7168 Global Problem Dimensions::Global ny=6656 Global Problem Dimensions::Global nz=6528 Processor Dimensions= Processor Dimensions::npx=56 Processor Dimensions::npy=52 Processor Dimensions::npz=51 Local Domain Dimensions= Local Domain Dimensions::nx=128 Local Domain Dimensions::ny=128 Local Domain Dimensions::Lower ipz=0 Local Domain Dimensions::Upper ipz=50 Local Domain Dimensions::nz=128 ########## Problem Summary ##########= Setup Information= Setup Information::Setup Time=6.59904 Linear System Information= Linear System Information::Number of Equations=311452237824 Linear System Information::Number of Nonzero Terms=8406727506424 Multigrid Information= Multigrid Information::Number of coarse grid levels=3 Multigrid Information::Coarse Grids= Multigrid Information::Coarse Grids::Grid Level=1 Multigrid Information::Coarse Grids::Number of Equations=38931529728 Multigrid Information::Coarse Grids::Number of Nonzero Terms=1050530635000 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=2 Multigrid Information::Coarse Grids::Number of Equations=4866441216 Multigrid Information::Coarse Grids::Number of Nonzero Terms=131238776440 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=3 Multigrid Information::Coarse Grids::Number of Equations=608305152 Multigrid Information::Coarse Grids::Number of Nonzero Terms=16385470264 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 ########## Memory Use Summary ##########= Memory Use Information= Memory Use Information::Total memory used for data (Gbytes)=222748 Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0 Memory Use Information::Bytes per equation (Total memory / Number of Equations)=715.193 Memory Use Information::Memory used for linear system and CG (Gbytes)=196021 Memory Use Information::Coarse Grids= Memory Use Information::Coarse Grids::Grid Level=1 Memory Use Information::Coarse Grids::Memory used=23427.5 Memory Use Information::Coarse Grids::Grid Level=2 Memory Use Information::Coarse Grids::Memory used=2932.26 Memory Use Information::Coarse Grids::Grid Level=3 Memory Use Information::Coarse Grids::Memory used=367.533 ########## V&V Testing Summary ##########= Spectral Convergence Tests= Spectral Convergence Tests::Result=PASSED Spectral Convergence Tests::Unpreconditioned= Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11 Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12 Spectral Convergence Tests::Preconditioned= Spectral Convergence Tests::Preconditioned::Maximum iteration count=2 Spectral Convergence Tests::Preconditioned::Expected iteration count=2 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon= Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=4.3241e-14 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=4.12862e-13 ########## Iterations Summary ##########= Iteration Count Information= Iteration Count Information::Result=PASSED Iteration Count Information::Reference CG iterations per set=50 Iteration Count Information::Optimized CG iterations per set=50 Iteration Count Information::Total number of reference iterations=1200 Iteration Count Information::Total number of optimized iterations=1200 ########## Reproducibility Summary ##########= Reproducibility Information= Reproducibility Information::Result=PASSED Reproducibility Information::Scaled residual mean=0.005068 Reproducibility Information::Scaled residual variance=0 ########## Performance Summary (times in sec) ##########= Benchmark Time Summary= Benchmark Time Summary::Optimization phase=1.5e-07 Benchmark Time Summary::DDOT=224.065 Benchmark Time Summary::WAXPBY=48.7882 Benchmark Time Summary::SpMV=261.222 Benchmark Time Summary::MG=1406.12 Benchmark Time Summary::Total=1940.43 Floating Point Operations Summary= Floating Point Operations Summary::Raw DDOT=2.25741e+15 Floating Point Operations Summary::Raw WAXPBY=2.25741e+15 Floating Point Operations Summary::Raw SpMV=2.05797e+16 Floating Point Operations Summary::Raw MG=1.15141e+17 Floating Point Operations Summary::Total=1.40235e+17 Floating Point Operations Summary::Total with convergence overhead=1.40235e+17 GB/s Summary= GB/s Summary::Raw Read B/W=445122 GB/s Summary::Raw Write B/W=102861 GB/s Summary::Raw Total B/W=547984 GB/s Summary::Total with convergence and optimization phase overhead=543547 GFLOP/s Summary= GFLOP/s Summary::Raw DDOT=10074.8 GFLOP/s Summary::Raw WAXPBY=46269.5 GFLOP/s Summary::Raw SpMV=78782.4 GFLOP/s Summary::Raw MG=81885.6 GFLOP/s Summary::Raw Total=72270.3 GFLOP/s Summary::Total with convergence overhead=72270.3 GFLOP/s Summary::Total with convergence and optimization phase overhead=71685.2 User Optimization Overheads= User Optimization Overheads::Optimization phase time (sec)=1.5e-07 User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=9.66194e-08 DDOT Timing Variations= DDOT Timing Variations::Min DDOT MPI_Allreduce time=99.8818 DDOT Timing Variations::Max DDOT MPI_Allreduce time=216.28 DDOT Timing Variations::Avg DDOT MPI_Allreduce time=113.514 Final Summary= Final Summary::HPCG result is VALID with a GFLOP/s rating of=71685.2 Final Summary::HPCG 2.4 rating for historical reasons is=72270.3 Final Summary::Reference version of ComputeDotProduct used=Performance results are most likely suboptimal Final Summary::Reference version of ComputeSPMV used=Performance results are most likely suboptimal Final Summary::Reference version of ComputeMG used=Performance results are most likely suboptimal Final Summary::Reference version of ComputeWAXPBY used=Performance results are most likely suboptimal Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org