Overview
AI Metric Generation lets you automatically create evaluation metrics from your product’s specifications. Instead of manually crafting judge prompts and configuring evaluation parameters, the AI analyzes your specifications and generates ready-to-use metrics.AI-generated metrics use the Full Prompt validation method with appropriate evaluation parameters for each specification’s test type.
How It Works
- Select specifications — Choose one or more policy specifications from your product. Only specifications with a test type (QUALITY, RED_TEAMING, or SCENARIOS) can be used.
- AI generates candidates — The system analyzes your product description, capabilities, and selected specifications to generate tailored metrics.
- Review and edit — Review each generated metric candidate. You can edit the name, description, judge prompt, tags, and evaluator model before saving.
- Save selectively — Save the metrics you want and discard the rest. Saved metrics are automatically linked to their source specification.
Requirements
- A product with a description
- At least one specification of type POLICY with a test type assigned
Using the Dashboard
From the Product Hub
- Navigate to your product’s Specifications tab
- Click Generate Metrics with AI
- Select the specifications you want to generate metrics for
- Click Generate Metrics and wait for the AI to process
- Review the generated candidates — edit, save, or discard each one
From a Specification
- Open the dropdown menu on any specification
- Click Generate Metrics to navigate directly to the generation page with that specification pre-selected
Generated Metric Properties
Each AI-generated metric candidate includes:| Property | Description |
|---|---|
| Name | A descriptive name for the metric |
| Description | What the metric evaluates |
| Judge Prompt | The full evaluation prompt with placeholders |
| Evaluation Parameters | The data parameters used in evaluation |
| Tags | Categorization tags |
| Evaluator Model | The LLM model used for evaluation |
| Test Type | Inherited from the source specification |