Benchmark: Difference between revisions

From HPCWIKI
Jump to navigation Jump to search
 
Line 4: Line 4:
!Name
!Name
!Description
!Description
|-
|HPL-GPU
|A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU.
There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page [[how to compile HPL-GPU]] for someone who is interrest in.
|-
|-
|HPL 2.3
|HPL 2.3

Latest revision as of 12:43, 28 August 2023

HPC Benchmark

Name Description
HPL-GPU A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU.

There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page how to compile HPL-GPU for someone who is interrest in.

HPL 2.3 software package that solves a (random) dense linear system in double precision (64 bits) arithmetic on distributed-memory computers. It can thus be regarded as a portable as well as freely available implementation of the High Performance Computing Linpack Benchmark[1]

High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl

HPCG 3.1 High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
NAMD 3.0a11 Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
LAMMPS Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
TensorFlow 1.15.5 ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
PyTorch 1.13.0a0 ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
NVIDIA HPC-Benchmarks[2] The hpc-benchmarks:23.5 container image is provided with the following packages embedded:
  • HPL-NVIDIA 23.5.0
  • HPL-AI-NVIDIA 23.5.0
  • HPCG-NVIDIA 23.5.0
  • NVIDIA HPC-X 2.15 for CUDA 12.x
  • NVIDIA NCCL 2.16.5 for CUDA 12.x
  • Intel MKL 2020.4-912


*NVIDIA's HPL benchmark requires GRDCopy installed on the system, https://github.com/NVIDIA/gdrcopy#build-and-installation for more information for install GRDCopy

*GDRCopy supports Quadro- and Tesla-class GPUs only[3]

Reference

https://www.pugetsystems.com/labs/hpc/nvidia-rtx4090-ml-ai-and-scientific-computing-performance-preliminary-2382/

https://www.pugetsystems.com/labs/hpc/how-to-run-an-optimized-hpl-linpack-benchmark-on-amd-ryzen-threadripper-2990wx-32-core-performance-1291/

https://www.amd.com/en/developer.html