Generate

The generate command creates structured evaluation criteria from your tests.

Generate Rubrics

Auto-generate rubrics from the criteria field of each test in an eval file:

agentv generate rubrics evals/my-eval.yaml

The command analyzes each test’s criteria field and breaks it into structured, weighted rubric items.

How It Works

1. Reads each test’s criteria
2. Uses an LLM to decompose the criteria into individual checkable rubric items
3. Assigns weights based on importance
4. Writes the rubrics back to the eval file
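
The steps above can be pictured with a minimal Python sketch. This is not agentv's implementation: `RubricItem`, `decompose_stub`, and `generate_rubrics` are hypothetical names, the LLM call is replaced by a naive string split, and the equal weights are a placeholder for whatever importance-based weighting the real command applies. The eval file is represented as an already-parsed dictionary, so reading and writing YAML is left out.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RubricItem:
    """Hypothetical in-memory shape for one checkable rubric item."""
    text: str
    weight: float


def decompose_stub(criteria: str) -> list[RubricItem]:
    """Stand-in for the LLM step (assumption): split on ' with ' and ' and '."""
    parts = [criteria]
    for separator in (" with ", " and "):
        parts = [piece for part in parts for piece in part.split(separator)]
    texts = [part.strip() for part in parts if part.strip()]
    # Placeholder weighting: equal shares instead of importance-based weights.
    weight = round(1.0 / len(texts), 3)
    return [RubricItem(text=text, weight=weight) for text in texts]


def generate_rubrics(
    eval_doc: dict,
    decompose: Callable[[str], list[RubricItem]] = decompose_stub,
) -> dict:
    """Read each test's criteria, decompose it, and attach a rubrics list."""
    for test in eval_doc.get("tests", []):
        criteria = test.get("criteria")
        if not criteria:
            continue  # nothing to decompose for this test
        items = decompose(criteria)
        # Only the rubric text is written back here; how weights are stored
        # in the eval file is not assumed in this sketch.
        test["rubrics"] = [item.text for item in items]
    return eval_doc


doc = {
    "tests": [
        {
            "id": "quicksort",
            "criteria": "Explains quicksort with time complexity and examples",
            "input": "Explain quicksort",
        }
    ]
}
result = generate_rubrics(doc)
print(result["tests"][0]["rubrics"])
```

A real LLM would produce finer-grained and better-phrased items than the string split does; the sketch only shows the flow of data from criteria to rubrics.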

Example

Before:

tests:
  - id: quicksort
    criteria: Explains quicksort with time complexity and examples
    input: Explain quicksort

After running agentv generate rubrics:

tests:
  - id: quicksort
    criteria: Explains quicksort with time complexity and examples
    input: Explain quicksort
    rubrics:
      - Explains divide-and-conquer approach
      - Describes partition step
      - States O(n log n) average time complexity
      - Provides a concrete example