Evaluate LLM output against specified criteria. Score on a continuous 0-1 scale.
Request model for reward score evaluation of LLM responses against specified criteria.
Criteria used for evaluation. Begins with one of the following: Reward, Penalize, Passes if, Fails ifNote, align-lightning-20251127 only supports Passes/Fails criteria. Mutually exclusive with evaluator_name.
1Name of an existing evaluator (group_id) to use for evaluation. Mutually exclusive with evaluation_criteria.
1 - 255The model core for reward evaluation. Defaults to align-20251111 if not specified.
align-20251111, align-20250529, align-20260109, align-lightning-20251127, align-lightning-20250731 Optional key-value pairs to tag and categorize the request. All tag values must be strings.
Optional trace ID to associate this evaluation with an existing trace.
System message is separate for Anthropic-style LLM calls. Optional.
Whether to evaluate only the latest response or all responses. Only True is supported for align-lightning model cores.
Successful Response
Response model for evaluation scores.