Benchmark: Difference between revisions

From HPCWIKI
Jump to navigation Jump to search
No edit summary
 
(5 intermediate revisions by the same user not shown)
Line 4: Line 4:
!Name
!Name
!Description
!Description
!
|-
!
|HPL-GPU
|A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU.
There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page [[how to compile HPL-GPU]] for someone who is interrest in.
|-
|-
|HPL 2.3
|HPL 2.3
|High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
|software  package  that solves  a (random) dense  linear  system  in  double  precision  (64  bits)  arithmetic on  distributed-memory  computers.    It  can  thus  be  regarded  as a portable as well as  freely available  implementation  of the  High Performance Computing Linpack Benchmark<ref>https://netlib.org/benchmark/hpl/</ref>
|
High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
|
|-
|-
|HPCG 3.1
|HPCG 3.1
|High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
|High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
|
|
|-
|-
|NAMD 3.0a11
|NAMD 3.0a11
| Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
| Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
|
|
|-
|-
|LAMMPS
|LAMMPS
|Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
|Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
|
|
|-
|-
|TensorFlow 1.15.5
|TensorFlow 1.15.5
|ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
|ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
|
|
|-
|-
|PyTorch 1.13.0a0
|PyTorch 1.13.0a0
|ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
|ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
|
|-
|
|NVIDIA HPC-Benchmarks<ref>https://catalog.ngc.nvidia.com/orgs/nvidia/containers/hpc-benchmarks</ref>
|The <code>hpc-benchmarks:23.5</code> container image is provided with the following packages embedded:
 
* HPL-NVIDIA 23.5.0
* HPL-AI-NVIDIA 23.5.0
* HPCG-NVIDIA 23.5.0
* NVIDIA HPC-X 2.15 for [[CUDA]] 12.x
* NVIDIA NCCL 2.16.5 for CUDA 12.x
* Intel MKL 2020.4-912
 
 
 
<nowiki>*</nowiki>NVIDIA's HPL benchmark requires [https://developer.nvidia.com/gdrcopy GRDCopy] installed on the system, <nowiki>https://github.com/NVIDIA/gdrcopy#build-and-installation</nowiki> for more information for install GRDCopy
 
<nowiki>*</nowiki>GDRCopy supports Quadro- and Tesla-class GPUs only<ref>https://github.com/NVIDIA/gdrcopy/issues/198</ref>
|}
|}


==Reference==
==Reference==

Latest revision as of 12:43, 28 August 2023

HPC Benchmark

Name Description
HPL-GPU A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU.

There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page how to compile HPL-GPU for someone who is interrest in.

HPL 2.3 software package that solves a (random) dense linear system in double precision (64 bits) arithmetic on distributed-memory computers. It can thus be regarded as a portable as well as freely available implementation of the High Performance Computing Linpack Benchmark[1]

High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl

HPCG 3.1 High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
NAMD 3.0a11 Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
LAMMPS Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
TensorFlow 1.15.5 ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
PyTorch 1.13.0a0 ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
NVIDIA HPC-Benchmarks[2] The hpc-benchmarks:23.5 container image is provided with the following packages embedded:
  • HPL-NVIDIA 23.5.0
  • HPL-AI-NVIDIA 23.5.0
  • HPCG-NVIDIA 23.5.0
  • NVIDIA HPC-X 2.15 for CUDA 12.x
  • NVIDIA NCCL 2.16.5 for CUDA 12.x
  • Intel MKL 2020.4-912


*NVIDIA's HPL benchmark requires GRDCopy installed on the system, https://github.com/NVIDIA/gdrcopy#build-and-installation for more information for install GRDCopy

*GDRCopy supports Quadro- and Tesla-class GPUs only[3]

Reference

https://www.pugetsystems.com/labs/hpc/nvidia-rtx4090-ml-ai-and-scientific-computing-performance-preliminary-2382/

https://www.pugetsystems.com/labs/hpc/how-to-run-an-optimized-hpl-linpack-benchmark-on-amd-ryzen-threadripper-2990wx-32-core-performance-1291/

https://www.amd.com/en/developer.html