This archive contains the fourth installment of materials for the SC24-117 tutorial. This section focuses on the practical application of the concepts discussed in the previous modules, specifically targeting performance optimization and multi-node scaling. 2. Contents /src : Updated source code including optimization templates. /data : Sample datasets required for Part 04 exercises.
SC24-117 Module: Part 04 - Advanced Implementation & Scaling Release Date: November 2024 1. Overview sc24117-HOL.part04.rar
If you encounter errors during extraction or compilation, please visit the official SC24 tutorial repository or contact the lab demonstrators via the event Slack channel. This archive contains the fourth installment of materials
/scripts : Shell scripts for job submission to the cluster (SLURM/PBS). Contents /src : Updated source code including optimization
To compile the Part 04 binaries, navigate to the source directory and use the provided Makefile: cd sc24117-HOL/part04/src make clean && make Use code with caution. Copied to clipboard To run the sample workload: srun --nodes=2 --ntasks-per-node=4 ./bin/lab_exec Use code with caution. Copied to clipboard 5. Support
Lab_Instructions_Part04.pdf : Step-by-step guide for this module. 3. Prerequisites
Ensure gcc , OpenMPI , and the relevant CUDA/ROCm toolkits are loaded in your environment. 4. Execution Instructions