Create a new test. See Tests.
API key authorization. Pass your API key in the Authorization header as a Bearer token. Both new (gsk_*) and legacy (gsk-) API keys are accepted, e.g. Authorization: Bearer gsk_... or Authorization: Bearer gsk-....
Test data
Product ID
"prod_123"
Test name
"Quality Test"
QUALITY, RED_TEAMING, SCENARIOS "QUALITY"
Specification ID (can auto-derive type/variants)
"spec_123"
"https://example.com/ground-truth.csv"
URI for uploaded test file
"https://example.com/test.csv"
Optional few-shot examples used to guide test-case generation for QUALITY tests.
"Q: What is 2+2? A: 4"
"en"
Test variants. QUALITY: rag, entity_extraction, summarization, classification, translation, correction, other. RED_TEAMING: data_leakage, financial_attacks, illegal_activities, misuse, toxicity, custom.
["rag"]Custom variant description
"Custom test variant"
Generation strategies. "original" is the default for RED_TEAMING. At least one strategy is required for SCENARIOS tests.
["original"]"Business analyst"
100
["gpt-4"]Source test ID for SCENARIOS tests
"test_123"
"https://example.com/data-catalog.json"
Arbitrary key-value metadata
{ "key": "value" }Test created successfully
"test_123"
"prod_123"
"user_123"
"Quality Test"
QUALITY, RED_TEAMING, SCENARIOS "QUALITY"
"https://example.com/ground-truth.csv"
"https://example.com/test.csv"
PENDING, SUCCESS, FAILED, AUGMENTING "SUCCESS"
Optional few-shot examples (input/output pairs) used to guide test-case generation for QUALITY tests.
"Example few-shot learning data"
"en"
Test variants. QUALITY: rag, entity_extraction, summarization, classification, translation, correction, other. RED_TEAMING: data_leakage, financial_attacks, illegal_activities, misuse, toxicity, custom.
["rag", "summarization"]"Custom variant description"
Generation strategies. "original" is the default for RED_TEAMING. At least one strategy is required for SCENARIOS tests.
["original"]"Business analyst"
100
["gpt-4", "claude-3"]"https://example.com/data-catalog.json"
{ "key": "value" }"spec_123"