Benchmark: Difference between revisions

Latest revision as of 12:43, 28 August 2023

HPC Benchmark


Name	Description
HPL-GPU	A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU. There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page how to compile HPL-GPU for someone who is interrest in.
HPL 2.3	software package that solves a (random) dense linear system in double precision (64 bits) arithmetic on distributed-memory computers. It can thus be regarded as a portable as well as freely available implementation of the High Performance Computing Linpack Benchmark^[1] High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
HPCG 3.1	High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
NAMD 3.0a11	Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
LAMMPS	Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
TensorFlow 1.15.5	ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
PyTorch 1.13.0a0	ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
NVIDIA HPC-Benchmarks^[2]	The `hpc-benchmarks:23.5` container image is provided with the following packages embedded: HPL-NVIDIA 23.5.0 HPL-AI-NVIDIA 23.5.0 HPCG-NVIDIA 23.5.0 NVIDIA HPC-X 2.15 for CUDA 12.x NVIDIA NCCL 2.16.5 for CUDA 12.x Intel MKL 2020.4-912 NVIDIA's HPL benchmark requires GRDCopy installed on the system, https://github.com/NVIDIA/gdrcopy#build-and-installation for more information for install GRDCopy GDRCopy supports Quadro- and Tesla-class GPUs only^[3]

Reference

https://www.pugetsystems.com/labs/hpc/nvidia-rtx4090-ml-ai-and-scientific-computing-performance-preliminary-2382/

https://www.pugetsystems.com/labs/hpc/how-to-run-an-optimized-hpl-linpack-benchmark-on-amd-ryzen-threadripper-2990wx-32-core-performance-1291/

https://www.amd.com/en/developer.html

[1] ttps://netlib.org/benchmark/hpl/

[2] ttps://catalog.ngc.nvidia.com/orgs/nvidia/containers/hpc-benchmarks

[3] ttps://github.com/NVIDIA/gdrcopy/issues/198

[1]

[2]

[3]

@@ Line 1: / Line 1: @@
 == HPC Benchmark ==
+{| class="wikitable"
+|+
+!Name
+!Description
+|-
+|HPL-GPU
+|A custom benchmark to measure GPU performance on top of HPL packages by enabling CUDA so that the benchmark is able to use GPU.
+There are lots of HPL-GPU on internet but no single site does not explain how to compile with background information. So we decide to start a single page [[how to compile HPL-GPU]] for someone who is interrest in.
+|-
+|HPL 2.3
+|software  package  that solves  a (random) dense  linear  system  in  double  precision   (64  bits)  arithmetic on  distributed-memory  computers.    It  can  thus  be  regarded  as a portable as well as  freely available  implementation  of the  High Performance Computing Linpack Benchmark<ref>https://netlib.org/benchmark/hpl/</ref>
+High Performance Linpack tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpl
+|-
+|HPCG 3.1
+|High Performance Conjugate Gradient solver tag: nvcr.io/nvidia/hpc-benchmarks:21.4-hpcg3.1
+|-
+|NAMD 3.0a11
+| Molecular Dynamics tag: nvcr.io/hpc/namd:3.0-alpha11
+|-
+|LAMMPS
+|Molecular Dynamics tag: nvcr.io/hpc/lammps:patch_4May2022
+|-
+|TensorFlow 1.15.5
+|ML/AI framework tag: nvcr.io/nvidia/tensorflow:22.09-tf1-py3
+|-
+|PyTorch 1.13.0a0
+|ML/AI framework tag: nvcr.io/nvidia/pytorch:22.09-py3
+|-
+|NVIDIA HPC-Benchmarks<ref>https://catalog.ngc.nvidia.com/orgs/nvidia/containers/hpc-benchmarks</ref>
+|The <code>hpc-benchmarks:23.5</code> container image is provided with the following packages embedded:
+* HPL-NVIDIA 23.5.0
+* HPL-AI-NVIDIA 23.5.0
+* HPCG-NVIDIA 23.5.0
+* NVIDIA HPC-X 2.15 for [[CUDA]] 12.x
+* NVIDIA NCCL 2.16.5 for CUDA 12.x
+* Intel MKL 2020.4-912
+<nowiki>*</nowiki>NVIDIA's HPL benchmark requires [https://developer.nvidia.com/gdrcopy GRDCopy] installed on the system, <nowiki>https://github.com/NVIDIA/gdrcopy#build-and-installation</nowiki> for more information for install GRDCopy
+<nowiki>*</nowiki>GDRCopy supports Quadro- and Tesla-class GPUs only<ref>https://github.com/NVIDIA/gdrcopy/issues/198</ref>
+|}
+==Reference==
 https://www.pugetsystems.com/labs/hpc/nvidia-rtx4090-ml-ai-and-scientific-computing-performance-preliminary-2382/
 https://www.pugetsystems.com/labs/hpc/how-to-run-an-optimized-hpl-linpack-benchmark-on-amd-ryzen-threadripper-2990wx-32-core-performance-1291/
 https://www.amd.com/en/developer.html
-== Reference ==

Benchmark: Difference between revisions

Latest revision as of 12:43, 28 August 2023

HPC Benchmark

Reference

Navigation menu

Search