HPCG-Benchmark version=3.1 Release date=March 28, 2019 Machine Summary= Machine Summary::Distributed Processes=6400 Machine Summary::Threads per processes=16 Global Problem Dimensions= Global Problem Dimensions::Global nx=8192 Global Problem Dimensions::Global ny=10240 Global Problem Dimensions::Global nz=5760 Processor Dimensions= Processor Dimensions::npx=16 Processor Dimensions::npy=20 Processor Dimensions::npz=20 Local Domain Dimensions= Local Domain Dimensions::nx=512 Local Domain Dimensions::ny=512 ########## Problem Summary ##########= Setup Information= Setup Information::Setup Time=1.75764 Linear System Information= Linear System Information::Number of Equations=483183820800 Linear System Information::Number of Nonzero Terms=13042542472696 Multigrid Information= Multigrid Information::Number of coarse grid levels=3 Multigrid Information::Coarse Grids= Multigrid Information::Coarse Grids::Grid Level=1 Multigrid Information::Coarse Grids::Number of Equations=60397977600 Multigrid Information::Coarse Grids::Number of Nonzero Terms=1629890295544 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=2 Multigrid Information::Coarse Grids::Number of Equations=7549747200 Multigrid Information::Coarse Grids::Number of Nonzero Terms=203629435768 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 Multigrid Information::Coarse Grids::Grid Level=3 Multigrid Information::Coarse Grids::Number of Equations=943718400 Multigrid Information::Coarse Grids::Number of Nonzero Terms=25426980280 Multigrid Information::Coarse Grids::Number of Presmoother Steps=1 Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1 ########## Memory Use Summary ##########= Memory Use Information= Memory Use Information::Total memory used for data (Gbytes)=345402 Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0 Memory Use Information::Bytes per equation (Total memory / Number of Equations)=714.847 Memory Use Information::Memory used for linear system and CG (Gbytes)=303980 Memory Use Information::Coarse Grids= Memory Use Information::Coarse Grids::Grid Level=1 Memory Use Information::Coarse Grids::Memory used=36313.5 Memory Use Information::Coarse Grids::Grid Level=2 Memory Use Information::Coarse Grids::Memory used=4541 Memory Use Information::Coarse Grids::Grid Level=3 Memory Use Information::Coarse Grids::Memory used=568.083 ########## V&V Testing Summary ##########= Spectral Convergence Tests= Spectral Convergence Tests::Result=PASSED Spectral Convergence Tests::Unpreconditioned= Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11 Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12 Spectral Convergence Tests::Preconditioned= Spectral Convergence Tests::Preconditioned::Maximum iteration count=2 Spectral Convergence Tests::Preconditioned::Expected iteration count=2 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon= Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=3.2492e-17 Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=0 ########## Iterations Summary ##########= Iteration Count Information= Iteration Count Information::Result=PASSED Iteration Count Information::Reference CG iterations per set=50 Iteration Count Information::Optimized CG iterations per set=54 Iteration Count Information::Total number of reference iterations=42000 Iteration Count Information::Total number of optimized iterations=45360 ########## Reproducibility Summary ##########= Reproducibility Information= Reproducibility Information::Result=PASSED Reproducibility Information::Scaled residual mean=0.00493774 Reproducibility Information::Scaled residual variance=1.55613e-35 ########## Performance Summary (times in sec) ##########= Benchmark Time Summary= Benchmark Time Summary::Optimization phase=0.240812 Benchmark Time Summary::DDOT=98.7548 Benchmark Time Summary::WAXPBY=67.2084 Benchmark Time Summary::SpMV=331.445 Benchmark Time Summary::MG=1408.37 Benchmark Time Summary::Total=1905.8 Floating Point Operations Summary= Floating Point Operations Summary::Raw DDOT=1.32315e+17 Floating Point Operations Summary::Raw WAXPBY=1.32315e+17 Floating Point Operations Summary::Raw SpMV=1.20513e+18 Floating Point Operations Summary::Raw MG=6.7524e+18 Floating Point Operations Summary::Total=8.22216e+18 Floating Point Operations Summary::Total with convergence overhead=7.61311e+18 GB/s Summary= GB/s Summary::Raw Read B/W=2.65721e+07 GB/s Summary::Raw Write B/W=6.14158e+06 GB/s Summary::Raw Total B/W=3.27137e+07 GB/s Summary::Total with convergence and optimization phase overhead=2.78383e+07 GFLOP/s Summary= GFLOP/s Summary::Raw DDOT=1.33983e+06 GFLOP/s Summary::Raw WAXPBY=1.96873e+06 GFLOP/s Summary::Raw SpMV=3.63599e+06 GFLOP/s Summary::Raw MG=4.79448e+06 GFLOP/s Summary::Raw Total=4.31428e+06 GFLOP/s Summary::Total with convergence overhead=3.9947e+06 GFLOP/s Summary::Total with convergence and optimization phase overhead=3.67132e+06 User Optimization Overheads= User Optimization Overheads::Optimization phase time (sec)=0.240812 User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=0.0295715 DDOT Timing Variations= DDOT Timing Variations::Min DDOT MPI_Allreduce time=18.3025 DDOT Timing Variations::Max DDOT MPI_Allreduce time=287.799 DDOT Timing Variations::Avg DDOT MPI_Allreduce time=36.3735 Final Summary= Final Summary::HPCG result is VALID with a GFLOP/s rating of=3.67132e+06 Final Summary::HPCG 2.4 rating for historical reasons is=3.95275e+06 Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org