Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Request to create a new evaluation
Name of the evaluation
Team ID if creating evaluation for a team
List of environment references with optional version IDs
Suite ID if this evaluation is part of a suite
Run ID for Prime RL runs (optional)
Whether this is a hosted evaluation
Prime Inference model ID
Hosted evaluation configuration
Model name
Dataset name
Framework used (e.g., 'prime-rl', 'openai/evals')
Type of task (e.g., 'classification', 'generation')
Description of the evaluation
Tags for categorization
Additional metadata
High-level metrics summary
Response
Successful Response
Response after creating an evaluation
ID of the created evaluation
Evaluation status enum
PENDING, RUNNING, COMPLETED, FAILED, TIMEOUT Type: 'prime_rl', 'environment', or 'suite'
suite, training, environment