Create, manage and share environments for reinforcement learning and evaluation
pyproject.toml
and are distributed as wheels. By adopting the verifiers
spec, development efforts can focus on task-specific components (datasets, tools or harnesses, reward functions) and automatically leverage existing infrastructure for running evaluations or training models with RL.