Step-by-Step Guide
1
Go to the Multi-Node Cluster tab inside the platform.
2
Choose your preferred configuration of 16-64+ H100 GPUs and click 'Deploy Cluster'.
3
Wait for the cluster to be deployed. You'll receive an email once it's up.
4
You'll receive one public IP for every node (8xH100) you deployed.
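As a rough illustration of the one-IP-per-node rule, the sketch below works out how many public IPs to expect for a given GPU count and builds one SSH command per node. The login name `ubuntu` and the IP addresses (taken from the reserved documentation range) are placeholders, not values from the platform; substitute the username and IPs you actually receive.

```python
GPUS_PER_NODE = 8  # each node is 8xH100, as stated in the guide


def node_count(total_gpus: int) -> int:
    """Return how many nodes (and therefore public IPs) a cluster has."""
    if total_gpus <= 0 or total_gpus % GPUS_PER_NODE != 0:
        raise ValueError("total_gpus must be a positive multiple of 8")
    return total_gpus // GPUS_PER_NODE


def ssh_commands(public_ips, user="ubuntu"):
    """Build one ssh command per node.

    `user` is an assumed login name; use the one your cluster provides.
    """
    return [f"ssh {user}@{ip}" for ip in public_ips]


# Example: a 16-GPU cluster has 2 nodes, so expect 2 public IPs.
example_ips = ["203.0.113.10", "203.0.113.11"]  # placeholder addresses
assert node_count(16) == len(example_ips)

for command in ssh_commands(example_ips):
    print(command)
```

A 64-GPU configuration would likewise give 8 nodes and 8 public IPs.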
You can use these public IPs to SSH into your nodes and start running your multi-node use case. To see how to run Megatron-DeepSpeed, Hugging Face Accelerate, PyTorch FSDP, and other multi-node use cases, refer to our Tutorials: