Evals
Binary
Evaluate LLM output against specified criteria. Result is pass/fail.
POST
Authorizations
Body
application/json
Request model for binary evaluation of LLM responses against specified criteria.
A list of messages in OpenAI format
Represents a single message in a conversation.
Criteria used for evaluation. Begins with 'Response passes if' or 'Response fails if'