Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
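As a minimal sketch of the header format described above, the helper below builds the header from a token. The function name bearer_header and the token value are illustrative assumptions, not part of this API.

```python
def bearer_header(token: str) -> dict[str, str]:
    """Build an Authorization header of the form 'Bearer <token>'."""
    # Reject empty tokens and stray whitespace, which would produce a malformed header.
    if not token or token.strip() != token:
        raise ValueError("token must be non-empty with no surrounding whitespace")
    return {"Authorization": f"Bearer {token}"}


# "example-token" is a placeholder; substitute your own auth token.
headers = bearer_header("example-token")
```

The resulting dictionary can be passed as the headers of whichever HTTP client you use.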
Body
application/json
Request to create a new evaluation
Name of the evaluation
Team ID if creating evaluation for a team
List of environment references with optional version IDs
Suite ID if this evaluation is part of a suite
Run ID for Prime RL runs (optional)
Model name
Dataset name
Framework used (e.g., 'prime-rl', 'openai/evals')
Type of task (e.g., 'classification', 'generation')
Description of the evaluation
Tags for categorization
Additional metadata
High-level metrics summary
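The request body could be assembled along the following lines. This is only a sketch: this listing gives the field descriptions but not the JSON keys, so every key used here (name, team_id, framework, task_type, tags) is a hypothetical stand-in, and treating the name as the one field that is always sent is likewise an assumption. Check the actual schema before relying on any of them.

```python
import json


def build_evaluation_request(name: str, **optional) -> dict:
    """Assemble a request body, leaving out optional fields that are unset.

    All keys are hypothetical placeholders for the fields described above.
    """
    payload = {"name": name}
    # Drop None values so unset optional fields are omitted rather than sent as null.
    payload.update({key: value for key, value in optional.items() if value is not None})
    return payload


body = build_evaluation_request(
    "nightly-regression",      # illustrative evaluation name
    framework="prime-rl",      # example value taken from the field description
    task_type="generation",    # example value taken from the field description
    tags=["nightly"],
    team_id=None,              # omitted because it is unset
)
encoded = json.dumps(body)     # serialized as application/json
```

Omitting unset fields keeps the payload small and avoids sending explicit nulls for optional values.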
Response
Successful Response
Response after creating an evaluation
ID of the created evaluation
Evaluation status enum
Available options:
RUNNING, COMPLETED, FAILED, CANCELLED
Type: 'prime_rl', 'environment', or 'suite'
Available options:
suite, training, environment
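A client might validate the returned status against the four listed options, as sketched below. The names EvaluationStatus, parse_status, and is_finished are illustrative, and treating every status other than RUNNING as finished is an assumption drawn from the option names, not something this page states.

```python
from enum import Enum


class EvaluationStatus(Enum):
    """The status options listed above."""
    RUNNING = "RUNNING"
    COMPLETED = "COMPLETED"
    FAILED = "FAILED"
    CANCELLED = "CANCELLED"


def parse_status(raw: str) -> EvaluationStatus:
    """Convert a status string to the enum; raises ValueError for unknown values."""
    return EvaluationStatus(raw)


def is_finished(status: EvaluationStatus) -> bool:
    """Assumption: anything other than RUNNING means the evaluation has stopped."""
    return status is not EvaluationStatus.RUNNING


status = parse_status("COMPLETED")
finished = is_finished(status)
```

Parsing into an enum surfaces unexpected status strings early instead of letting them pass silently through client code.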