About

About#

SMU’s high-performance computing (HPC) clusters, M3 and the NVIDIA DGX SuperPOD, feature state-of-the-art CPUs, accelerators, and networking technologies, high memory capacity per node, and advanced interactive experiences via the Open OnDemand Portal.

The clusters provides a familiar interactive experience for researchers, which includes the Ubuntu Linux operating system, the SLURM resource scheduler, and the Lmod environment module system. Additionally, familiar development tool chains are available including the GCC, Intel, and NVIDIA compiler suites. Optimized high-level programming environments such as MATLAB, Python, and R are also installed in addition to the domain specific software packages that SMU researchers depend on for their work.

HPC System	M1 (Retired)	M2 (Retired)	M2 (Retired)	SuperPOD	M3
Year	2014	2017	2019	2022	2023
Compute Ability	104 TFLOPS	630 TFLOPS	870 TFLOPS	1,644 TFLOPS	1,077 TFLOPS
Number of Nodes	1,104	349	354	20	181
CPU Cores	8,832	11,088	11,276	2,560	22,892
Total GPU Cores	0	132,608	275,968	1,392,640	122,880
Total Memory	29.2 TB	116.5 TB	120 TB	40 TB	103 TB
Network Bandwidth	20 Gb/s	100 Gb/s	100 Gb/s	200 Gb/s	200 Gb/s
Project Storage	None	None	768 TB	3.5 PB*	3.5 PB*
Scratch Space	1.4 PB	1.4 PB	2.8 PB	750 TB	3.5 PB
Archive Capabilities	No	Yes	Yes	No	No
Operating System	Scientific Linux 6	CentOS 7	CentOS 7	Ubuntu 22.04	Ubuntu 22.04

* The 3.5 PB Project Storage is shared on M3 and the SuperPOD. It was formerly referred to as Work Storage, which has been deprecated and will be phased out beginning on January 15, 2025.

ManeFrame III (M3)#

Configuration#

Resource	Standard-Memory	High-Memory	GPU
Nodes	170	8	3
Processors	AMD EPYC 7763	AMD EPYC 7763	Intel Xeon Gold 6154
Frequency	2.45 GHz	2.45 GHz	3.00 GHz
CPUs/Node	2	2	2
Cores/Node	128	128	18
Memory/Node	512 GB	2 TB	756 GB
Local Scratch/Node	None	4.3 TB	None
Interconnect	200 Gb/s	200 Gb/s	100 Gb/s

NVIDIA DGX SuperPOD (MP)#

News#

Configuration#

Component	Summary
Computational Ability	1,644 TFLOPS
Number of Nodes	20
CPU Cores	2,560
GPU Accelerator Cores	1,392,640
Total Memory	52.5 TB
Interconnect Bandwidth	10x200 Gb/s Infiniband Connections Per Node
Project Storage	3.5 PB (Shared with M3)
Scratch Storage	750 TB (Raw)
Operating System	Ubuntu 22.04

Resource	DGX Node
Nodes	20
Processors	AMD EPYC 7742
CPUs/Node	2
Cores/Node	128
Memory/Node	2 TB
GPUs	NVIDIA A100 Tensor Core GPU
GPUs/Node	8
GPU Memory/GPU	80 GB
GPU Interconnect	NVLink
Local Scratch/Node	27 TB
Network	10x200 Gb/s

Faculty Partner Nodes#

Professor Thomas Hagstrom (Mathematics)#

Professor Hagstrom has one node that is available as part of M3. This node has dual AMD EPYC 7742 64-Core “Zen 2” processors, 512 GB of memory, and an NVIDIA A100 GPU accelerator with 40 GB of HBM2 memory, 6912 Cuda cores, and 432 tensor cores.

This node amd001, is currently available only with permision from Dr. Hagstrom.

Professor Barbara Minsker (Civil and Environmental Engineering)#

Professor Minsker has two nodes that are availabe as part of M3. These nodes each have dual Intel Xeon Gold 6148 2.40 GHz 20-core “Skylake” processors, 384 GB of memory, dual NVIDIA V100 GPU accelerators, and 2 TB of local RAID scratch space. Each GPU has 5,120 CUDA cores, 640 Tensor cores, and 32 GB CoWoS HBM2 memory. The V100 GPU is based on the Volta architecture and an extremely high bandwidth (900 GB/s) stacked memory architecture.

The queue for these nodes, fp-gpgpu-3, is generally available to M3 users with the following restrictions.

The maximum job duration for users not in Dr. Minsker’s group is 7 days
There is no maximum job duration for users in Dr. Minsker’s group when the --qos qos_clowder Slurm flag is used
Jobs submitted without the --qos qos_clowder flag may be queued indefinitely until jobs submitted with the flag finish

Professor Wei Cai (Mathematics)#

Professor Cai has one node that is available as part of M3. This node has dual Intel Xeon Gold 6150 2.70 GHz 18-core “Skylake” processors, 384 GB of memory, dual NVIDIA V100 GPU accelerators, and 2 TB of local RAID scratch space. Each GPU has 5,120 CUDA cores, 640 Tensor cores, and 32 GB CoWoS HBM2 memory. The V100 GPU is based on the Volta architecture and an extremely high bandwidth (900 GB/s) stacked memory architecture.

The queue for these nodes, fp-gpgpu-4, is generally available to M3 users with the following restrictions.

The maximum job duration for users not in Dr. Cai’s group is 7 days
Members of Dr. Cai’s group may run upto 2 weeks when the --qos qos_cai Slurm flag is used
Jobs submitted without the --qos qos_cai flag may be preempted

Assistant Professor Devin Matthews (Chemistry)#

Dr. Matthews has one node that is available as part of M3. This node has dual AMD EPYC 7763 2.45 Ghz 64-Core “Zen 3” processors, 512 GB of memory, and AMD Instinct MI210 GPU with 64 GB of HBM2e memory and 6,656 stream processors. This node is also equipped with AMD/Xilinx Alveo U55C FPGA.

This node ga001, is currently available only with permision from Dr. Matthews.

Acknowledgement#

We request that all work supported by SMU HPC resources make an appropriate acknowledgement. We suggest the following:

“Computational resources for this research were provided by SMU’s O’Donnell Data Science and Research Computing Institute.”

About

Contents

About#

ManeFrame III (M3)#

Configuration#

NVIDIA DGX SuperPOD (MP)#

News#

Configuration#

Faculty Partner Nodes#

Professor Thomas Hagstrom (Mathematics)#

Professor Barbara Minsker (Civil and Environmental Engineering)#

Professor Wei Cai (Mathematics)#

Assistant Professor Devin Matthews (Chemistry)#

Acknowledgement#