Benchmark: Difference between revisions

From HPCWIKI
Jump to navigation Jump to search
No edit summary
Line 4: Line 4:
!Name
!Name
!Description
!Description
!
!
|-
|-
|HPL 2.3
|HPL 2.3
|software  package  that solves  a (random) dense  linear  system  in  double  precision  (64  bits)  arithmetic on  distributed-memory  computers.    It  can  thus  be  regarded  as a portable as well as  freely available  implementation  of the  High Performance Computing Linpack Benchmark<ref>https://netlib.org/benchmark/hpl/</ref>
|software  package  that solves  a (random) dense  linear  system  in  double  precision  (64  bits)  arithmetic on  distributed-memory  computers.    It  can  thus  be  regarded  as a portable as well as  freely available  implementation  of the  High Performance Computing Linpack Benchmark<ref>https://netlib.org/benchmark/hpl/</ref>
High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
|
|
|-
|-
|HPCG 3.1
|HPCG 3.1
|High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
|High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
|
|
|-
|-
|NAMD 3.0a11
|NAMD 3.0a11
| Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
| Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
|
|
|-
|-
|LAMMPS
|LAMMPS
|Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
|Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
|
|
|-
|-
|TensorFlow 1.15.5
|TensorFlow 1.15.5
|ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
|ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
|
|
|-
|-
|PyTorch 1.13.0a0
|PyTorch 1.13.0a0
|ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
|ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
|
|-
|
|NVIDIA HPC-Benchmarks<ref>https://catalog.ngc.nvidia.com/orgs/nvidia/containers/hpc-benchmarks</ref>
|The <code>hpc-benchmarks:23.5</code> container image is provided with the following packages embedded:
 
* HPL-NVIDIA 23.5.0
* HPL-AI-NVIDIA 23.5.0
* HPCG-NVIDIA 23.5.0
* NVIDIA HPC-X 2.15 for [[CUDA]] 12.x
* NVIDIA NCCL 2.16.5 for CUDA 12.x
* Intel MKL 2020.4-912
 
 
NVIDIA's HPL benchmark requires GRDCopy installed on the system, details at <nowiki>https://developer.nvidia.com/gdrcopy</nowiki> and <nowiki>https://github.com/NVIDIA/gdrcopy#build-and-installation</nowiki> for more information
|}
|}



Revision as of 12:47, 9 August 2023

HPC Benchmark

Name Description
HPL 2.3 software package that solves a (random) dense linear system in double precision (64 bits) arithmetic on distributed-memory computers. It can thus be regarded as a portable as well as freely available implementation of the High Performance Computing Linpack Benchmark[1]

High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl

HPCG 3.1 High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
NAMD 3.0a11 Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
LAMMPS Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
TensorFlow 1.15.5 ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
PyTorch 1.13.0a0 ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
NVIDIA HPC-Benchmarks[2] The hpc-benchmarks:23.5 container image is provided with the following packages embedded:
  • HPL-NVIDIA 23.5.0
  • HPL-AI-NVIDIA 23.5.0
  • HPCG-NVIDIA 23.5.0
  • NVIDIA HPC-X 2.15 for CUDA 12.x
  • NVIDIA NCCL 2.16.5 for CUDA 12.x
  • Intel MKL 2020.4-912


NVIDIA's HPL benchmark requires GRDCopy installed on the system, details at https://developer.nvidia.com/gdrcopy and https://github.com/NVIDIA/gdrcopy#build-and-installation for more information


Reference

https://www.pugetsystems.com/labs/hpc/nvidia-rtx4090-ml-ai-and-scientific-computing-performance-preliminary-2382/

https://www.pugetsystems.com/labs/hpc/how-to-run-an-optimized-hpl-linpack-benchmark-on-amd-ryzen-threadripper-2990wx-32-core-performance-1291/

https://www.amd.com/en/developer.html