Evals
Binary
Perform binary evaluation of LLM output against specified criteria
POST
Authorizations
Body
application/json
Request model for binary evaluation of LLM outputs.
List of conversation messages
Criteria used for evaluation
Optional reference output for comparison
Whether to evaluate the latest response only