You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/19106
date
job status
comment
Sep 19 07:19:04 UTC 2024
submitted
job id 19106 awaits release by job manager
Sep 19 07:19:47 UTC 2024
released
job awaits launch by Slurm scheduler
Sep 19 07:26:02 UTC 2024
running
job 19106 is running
Sep 19 07:31:26 UTC 2024
finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-19106.out ✅ no message matching ERROR: ✅ no message matching FAILED: ✅ no message matching required modules missing: ✅ found message(s) matching No missing installations ✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1726730776.tar.gzsize: 0 MiB (45 bytes) entries: 0 modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
no other files in tarball
Sep 19 07:31:26 UTC 2024
test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-19106.out ✅ no message matching ERROR: ✅ no message matching [\s*FAILED\s*].*Ran .* test case
New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/19114
date
job status
comment
Sep 19 08:46:01 UTC 2024
submitted
job id 19114 awaits release by job manager
Sep 19 08:46:23 UTC 2024
released
job awaits launch by Slurm scheduler
Sep 19 08:47:31 UTC 2024
running
job 19114 is running
Sep 19 08:57:00 UTC 2024
finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-19114.out ✅ no message matching ERROR: ✅ no message matching FAILED: ✅ no message matching required modules missing: ✅ found message(s) matching No missing installations ✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1726735836.tar.gzsize: 3 MiB (3524173 bytes) entries: 95 modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
boegel
changed the title
{2023.06}[foss/2023a] OSU-Microbenchmarks v7.2 w/ CUDA 12.1.1
{2023.06}[foss/2023a] OSU-Microbenchmarks v7.2 w/ CUDA 12.1.1 (rebuild)
Sep 25, 2024
New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/20007
date
job status
comment
Sep 26 11:08:51 UTC 2024
submitted
job id 20007 awaits release by job manager
Sep 26 11:09:06 UTC 2024
released
job awaits launch by Slurm scheduler
Sep 26 11:15:19 UTC 2024
running
job 20007 is running
Sep 26 11:34:56 UTC 2024
finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-20007.out ✅ no message matching ERROR: ✅ no message matching FAILED: ✅ no message matching required modules missing: ✅ found message(s) matching No missing installations ✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1727349726.tar.gzsize: 3 MiB (3524501 bytes) entries: 95 modules under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/modules/all
New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen3 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/20008
date
job status
comment
Sep 26 11:08:55 UTC 2024
submitted
job id 20008 awaits release by job manager
Sep 26 11:09:09 UTC 2024
released
job awaits launch by Slurm scheduler
Sep 26 11:10:11 UTC 2024
running
job 20008 is running
Sep 26 11:25:45 UTC 2024
finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-20008.out ✅ no message matching ERROR: ✅ no message matching FAILED: ✅ no message matching required modules missing: ✅ found message(s) matching No missing installations ✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1727349308.tar.gzsize: 3 MiB (3526552 bytes) entries: 95 modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
That's getting close to the max. bandwidth of 100GBs/s between two A100's with 4x NVLink (each 25GB/s):
$ nvidia-smi topo -m
GPU0 GPU1 NIC0 CPU Affinity NUMA Affinity GPU NUMA ID
GPU0 X NV4 SYS 3 N/A
GPU1 NV4 X SYS 5 N/A
NIC0 SYS SYS X
Legend:
X = Self
SYS = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
...
NV# = Connection traversing a bonded set of # NVLinks
Latency:
$ mpirun -np 2 osu_latency -d cuda D D
# OSU MPI-CUDA Latency Test v7.2
# Send Buffer on DEVICE (D) and Receive Buffer on DEVICE (D)
# Size Latency (us)
# Datatype: MPI_CHAR.
1 1.54
2 2.39
4 2.35
8 2.34
16 1.57
32 1.54
64 1.55
128 2.04
256 2.00
512 2.94
1024 4.91
2048 6.27
4096 10.38
8192 17.47
16384 10.11
32768 9.86
65536 10.42
131072 11.27
262144 12.48
524288 15.34
1048576 20.84
2097152 31.89
4194304 54.41
PR merged! Moved ['/project/def-users/SHARED/jobs/2024.09/pr_716/19106', '/project/def-users/SHARED/jobs/2024.09/pr_716/19114', '/project/def-users/SHARED/jobs/2024.09/pr_716/20007', '/project/def-users/SHARED/jobs/2024.09/pr_716/20008'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2024.09.26
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
requires: