Create a test for your product.
QUALITY
: Tests that evaluate the quality and correctness of outputsRED_TEAMING
: Tests that evaluate security, safety, and bias aspectsSCENARIOS
: Tests that use conversation simulation to evaluate multi-turn dialogue interactionsQUALITY
.SCENARIOS
.ground_truth_file_path
is provided. Defaults to the language detected in the ground truth file. If provided, it should be written in English and be a valid language name from the ISO 639 standard (e.g., “english”, “spanish”, “french”). More information can be found at https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes.ground_truth_file_path
. For details on variants for Quality tests, see Quality Test Evolutions. Example: ["paraphrased", "typos", "incorrect", "cognitively_diverse", "linguistic_diverse"]
.
For Red Teaming tests, the variants
parameter can be used to specify which threat categories to generate test cases for.variants
for Red Teaming tests). This parameter applies only when type
is RED_TEAMING
.
If specified, this list must include the ‘original’ strategy.
Example: ["original", "base64", "leetspeak", "zero_width_insertion"]
.ground_truth_file_path
is used.input
, expected_output
, tag
, source
columnsgoal
, user_persona
, initial_prompt
, stopping_criterias
, max_iterations
, scenario
columns