W&B
For most runs we recommend logging metrics to W&B. Before enabling W&B, make sure that you have an account and are logged in.
SFT
Logging to W&B is disabled by default. Enable the default configuration with --wandb.
This logs to the prime-rl project with a random run name. You can also specify which project and run name to log to.
torchrun. Note that we only log global metrics from the master rank (e.g. the all-reduced loss).
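The master-rank rule can be sketched roughly as follows. This is a minimal illustration and not prime-rl's actual code: it assumes the RANK environment variable that torchrun exports for each process, and `is_master_rank`, `mean_loss`, and `metrics_to_log` are hypothetical helpers, with `mean_loss` standing in for a real all-reduce.

```python
import os


def is_master_rank(env=None):
    """Return True on global rank 0 (torchrun exports RANK per process)."""
    env = os.environ if env is None else env
    # Default to rank 0 so that single-process runs still log.
    return int(env.get("RANK", "0")) == 0


def mean_loss(per_rank_losses):
    """Stand-in for an all-reduce (mean) over the per-rank losses."""
    return sum(per_rank_losses) / len(per_rank_losses)


def metrics_to_log(per_rank_losses, env=None):
    """Return the global metrics on the master rank, and None elsewhere."""
    if not is_master_rank(env):
        return None
    return {"loss": mean_loss(per_rank_losses)}


if __name__ == "__main__":
    losses = [0.5, 1.5]  # hypothetical local losses on two ranks
    print(metrics_to_log(losses, {"RANK": "0"}))
    print(metrics_to_log(losses, {"RANK": "1"}))
```

Only the process with rank 0 returns a metrics dictionary, so a multi-process run would produce one W&B entry per step rather than one per rank.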
RL
For RL training, both the trainer and orchestrator log to W&B as separate runs. Again, logging to W&B is disabled by default. Enable the default configuration with --wandb.
This logs to the prime-rl project with a random run name. The trainer run is suffixed with -trainer and the orchestrator run is suffixed with -orchestrator. You can specify which project and name to log to using the same flags as for SFT.
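The suffix convention can be pictured with a short sketch. `rl_run_names` is a hypothetical helper written for illustration only; it is not part of prime-rl, and it assumes both runs share one base name.

```python
def rl_run_names(base_name):
    """Derive the two W&B run names from one shared base name."""
    return {
        "trainer": f"{base_name}-trainer",
        "orchestrator": f"{base_name}-orchestrator",
    }


if __name__ == "__main__":
    print(rl_run_names("my-run"))
```

With a base name of my-run, the two runs would appear in the W&B project as my-run-trainer and my-run-orchestrator.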
wandb.log-extras subconfig. If W&B is set up, this is enabled by default and logs for the RL trainer and orchestrator every 10 steps.
You can configure this separately for the trainer and the orchestrator, for example to log only samples on the orchestrator every 50 steps and no distributions on either.
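The effect of such a configuration can be sketched in Python. Everything here is an assumption made for illustration: the `LogExtras` class, its field names, and the rule that a step is logged when it is a multiple of the interval are not taken from prime-rl. The trainer is shown with both extras disabled, which is one reading of the example above.

```python
from dataclasses import dataclass


@dataclass
class LogExtras:
    # Field names are illustrative assumptions, not confirmed prime-rl options.
    samples: bool = True
    distributions: bool = True
    interval: int = 10  # matches the default of every 10 steps


def extras_to_log(cfg, step):
    """Return the names of the extras that would be logged at this step."""
    # Assumption: a step is logged when it is a multiple of the interval.
    if step % cfg.interval != 0:
        return []
    names = []
    if cfg.samples:
        names.append("samples")
    if cfg.distributions:
        names.append("distributions")
    return names


# Samples on the orchestrator every 50 steps, distributions disabled on both.
trainer = LogExtras(samples=False, distributions=False)
orchestrator = LogExtras(samples=True, distributions=False, interval=50)

if __name__ == "__main__":
    for step in (10, 50, 100):
        print(step, extras_to_log(trainer, step), extras_to_log(orchestrator, step))
```

Keeping one settings object per component mirrors the point that the trainer and the orchestrator are configured independently.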