POST
/
api
/
v1
/
evals
/
binary
Binary
curl --request POST \
  --url https://platform.composo.ai/api/v1/evals/binary \
  --header 'API-Key: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '{
  "messages": [
    {}
  ],
  "system": "<string>",
  "tools": [
    {}
  ],
  "model_core": "align-20250529",
  "evaluation_criteria": "<string>"
}'
{
  "passed": true,
  "explanation": "<string>"
}

Authorizations

API-Key
string
header
required

Body

application/json

Request model for binary evaluation of LLM responses against specified criteria.

messages
Messages · object[]
required

A list of chat messages

Minimum length: 2
evaluation_criteria
string
required

Criteria used for evaluation. Begins with one of the following: Response passes if, Response fails if, Tool call passes if, Tool call fails if, Agent passes if, Agent fails if

system
string | null

System message is separate for Anthropic-style LLM calls. Optional.

tools
Tools · object[] | null

List of tools available for the assistant to call. Optional.

model_core
enum<string>

The model core for reward evaluation. Defaults to align-20250503 if not specified.

Available options:
align-20250529,
align-lightning-20250731

Response

Successful Response

Response model for binary evaluation results.

passed
boolean | null
required

Whether the evaluation passed. If null, the criteria was deemed not applicable.

explanation
string
required

Explanation of the evaluation score