Evaluate LLM output against specified criteria. Score on a continuous 0-1 scale.
Request model for reward score evaluation of LLM responses against specified criteria.
Criteria used for evaluation. Begins with one of the following: Reward, Penalize, Passes if, Fails ifNote, align-lightning-20251127 only supports Passes/Fails criteria.
The model core for reward evaluation. Defaults to align-20251111 if not specified.
align-20251111, align-20250529, align-20260109, align-lightning-20251127, align-lightning-20250731 Optional key-value pairs to tag and categorize the request.
System message is separate for Anthropic-style LLM calls. Optional.
Whether to evaluate only the latest response or all responses. Only True is supported for align-lightning model cores.