Generate

The generate command creates structured evaluation criteria from your tests.

Auto-generate rubrics from expected outcomes:

agentv generate rubrics evals/my-eval.yaml

The command analyzes each test’s criteria field and generates structured rubric items, each weighted by importance.

  1. Reads each test’s criteria
  2. Uses an LLM to decompose the criteria into individual checkable rubric items
  3. Assigns weights based on importance
  4. Writes the rubrics back to the eval file
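The four steps above can be sketched in Python to make the data flow concrete. This is a minimal illustration, not agentv's implementation: `naive_decompose`, `generate_rubrics`, and the `{"text", "weight"}` item shape are hypothetical names chosen for this example. The LLM call is replaced by a simple text splitter, and the eval file is modeled as an in-memory dict rather than parsed YAML.

```python
import re
from typing import Callable


def naive_decompose(criteria: str) -> list[dict]:
    """Hypothetical stand-in for the LLM step (step 2).

    Splits the criteria text on 'with', 'and', and commas. A real model
    would produce far more meaningful rubric items than this.
    """
    parts = [p.strip() for p in re.split(r"\bwith\b|\band\b|,", criteria) if p.strip()]
    if not parts:
        return []
    # Step 3 placeholder: equal weights. The real tool weights by importance.
    weight = round(1.0 / len(parts), 3)
    return [{"text": part, "weight": weight} for part in parts]


def generate_rubrics(
    eval_doc: dict,
    decompose: Callable[[str], list] = naive_decompose,
) -> dict:
    """Return a copy of eval_doc with a 'rubrics' list added to each test."""
    updated = {**eval_doc, "tests": []}
    for test in eval_doc.get("tests", []):
        new_test = dict(test)  # shallow copy so the input is not mutated
        # Assumption: tests that already carry rubrics are left untouched.
        if "criteria" in test and "rubrics" not in test:
            new_test["rubrics"] = decompose(test["criteria"])  # steps 1-3
        updated["tests"].append(new_test)  # step 4: collect for write-back
    return updated


eval_doc = {
    "tests": [
        {
            "id": "quicksort",
            "criteria": "Explains quicksort with time complexity and examples",
            "input": "Explain quicksort",
        }
    ]
}

result = generate_rubrics(eval_doc)
for item in result["tests"][0]["rubrics"]:
    print(item)
```

Passing the decomposition step in as a function keeps the read, decompose, and write-back loop independent of whichever model produces the rubric items, so the splitter can be swapped for a real LLM call without touching the rest.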

Before:

tests:
  - id: quicksort
    criteria: Explains quicksort with time complexity and examples
    input: Explain quicksort

After running agentv generate rubrics:

tests:
  - id: quicksort
    criteria: Explains quicksort with time complexity and examples
    input: Explain quicksort
    rubrics:
      - Explains divide-and-conquer approach
      - Describes partition step
      - States O(n log n) average time complexity
      - Provides a concrete example