Run Megatron-DeepSpeed to train models distributed across multiple nodes.
This tutorial for running Megatron-DeepSpeed on multi-node clusters is currently in development. Please check back soon for detailed instructions on large-scale model training with Megatron-DeepSpeed.