Evaluate how well your LLM uses functions and tools
"Reward tool calls"
"Penalize tool calls"
"Tool call passes if"
"Tool call fails if"
"Reward tool calls"
, "Penalize tool calls"
, "Tool call passes if"
, or "Tool call fails if"
tools
with complete function schemas