Test
A set of test cases for evaluating product performance
What is a Test?
A test in Galtea is a group of test cases designed to evaluate the performance of a product. A test file provides simulations of interactions with the product (and, in quality tests, expected outcomes for each interaction).
You can create, view and manage your tests on the Galtea dashboard or programmatically using the Galtea SDK.
Test Origin
When creating a test in the Galtea dashboard, you’ll be asked to specify the test origin:
Generated
Galtea will take the knowledge base file and generate a set of test cases that will define the test.
Uploaded
The test is uploaded by you as a complete set of test cases.
Your selection will determine whether you need to provide a Knowledge Base File or a Test File.
The SDK parameter `variants` is used to specify “Evolutions” for Quality tests and “Threats” for Red Teaming tests. Similarly, the `strategies` parameter is used for Red Teaming tests to apply different attack modifications.
More information on how to create tests can be found in the Create Quality Tests and Create Red Teaming Tests documentation.
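As a rough illustration of how these parameters fit together, here is a minimal sketch of creating tests with the Python SDK. The client setup, the `knowledge_base_file_path` argument, and the specific variant, threat, and strategy names are assumptions for illustration, not a verbatim API reference; see the linked guides for exact usage.

```python
from galtea import Galtea  # assumed import path

galtea = Galtea(api_key="YOUR_API_KEY")

# Quality test generated from a knowledge base; `variants` requests
# "Evolutions" of the generated test cases (names here are assumed).
quality_test = galtea.tests.create(
    product_id="YOUR_PRODUCT_ID",
    name="Legal Document Quality Test",
    type="QUALITY",
    knowledge_base_file_path="path/to/your/knowledge_base.pdf",  # assumed parameter name
    variants=["paraphrased", "typos"],
)

# Red Teaming test; here `variants` selects threat categories and
# `strategies` applies attack modifications to each generated prompt.
red_teaming_test = galtea.tests.create(
    product_id="YOUR_PRODUCT_ID",
    name="Customer Support Safety Evaluation",
    type="RED_TEAMING",
    variants=["toxicity"],
    strategies=["prompt_injection"],
)
```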
It is important to note that the information you provide during product onboarding, such as the product’s description and intended use, plays a valuable role when generating test cases. Galtea can leverage this metadata to generate more targeted and context-aware test cases when creating both quality and red teaming tests, leading to more effective and insightful evaluations.
Test Types
Galtea supports three main types of tests:
Quality Tests
Tests that evaluate the quality and correctness of outputs.
Red Teaming Tests
Tests that evaluate security, safety, and bias aspects, often by generating adversarial inputs based on defined threats and applying various strategies to make them more challenging.
Scenario Based Tests
Tests that evaluate the multi-turn dialogue capabilities of an agent through the use of scenarios based on user personas and specific goals.
Using Tests in Evaluations
The Test Cases of a Test are used in evaluation tasks to assess the performance of specific versions of your product against a set of metrics.
To ensure consistent comparisons between product versions, the same Test Cases should be reused across evaluations of each version.
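For instance, the sketch below runs the same test against two versions. The `evaluation_tasks.create` method and its parameters are assumed names for illustration rather than confirmed SDK signatures.

```python
from galtea import Galtea  # assumed import path

galtea = Galtea(api_key="YOUR_API_KEY")

# Reusing one test across two versions keeps the comparison consistent:
# every version is assessed on exactly the same test cases.
for version_id in ["version-1", "version-2"]:
    galtea.evaluation_tasks.create(  # assumed method name
        test_id="YOUR_TEST_ID",
        version_id=version_id,
        metrics=["factual-accuracy"],  # assumed metric name
    )
```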
Using Tests in Evaluations
Learn how to use tests with evaluations
SDK Integration
The Galtea SDK allows you to create, view, and manage tests programmatically.
Test Service SDK
Manage tests using the Python SDK
Create a Custom Test
See how to create and upload custom tests using the SDK.
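As a quick sketch of programmatic access (method names such as `tests.list` and `test_cases.list` are assumptions; see the SDK reference above for the exact API):

```python
from galtea import Galtea  # assumed import path

galtea = Galtea(api_key="YOUR_API_KEY")

# List the tests registered for a product (assumed method name).
tests = galtea.tests.list(product_id="YOUR_PRODUCT_ID")
for test in tests:
    print(test.id, test.name, test.type)

# Inspect the test cases that make up a single test (assumed method name).
test_cases = galtea.test_cases.list(test_id=tests[0].id)
print(f"First test contains {len(test_cases)} test cases")
```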
Test Properties
The name of the test. Example: “Legal Document Quality Test” or “Customer Support Safety Evaluation”
The type of the test. Possible values:
- Quality: Tests that evaluate the quality and correctness of outputs
- Red Teaming: Tests that evaluate security, safety, and bias aspects
- Scenarios: Tests that use conversation simulation to evaluate multi-turn dialogue interactions
Optional few-shot examples to provide more context to our system about how the test cases should be generated. This can help our system better understand the expected format and style wanted for the test cases.
The language for generating synthetic test cases when a Knowledge Base File is provided (e.g., ‘english’, ‘spanish’). This should be the English name of the language. If not provided, Galtea attempts to infer the language from the knowledge base file. Supported languages include English, Spanish, Catalan, French, German, Portuguese, Italian, Dutch, Polish, Chinese, Korean, and Japanese.
The maximum number of test cases generated by Galtea (only applies when test cases are generated from a Knowledge Base File). This helps control the size of the test dataset and associated costs.
The path to a local file (e.g., PDF, TXT, JSON, HTML, Markdown) containing the knowledge base. This file is uploaded to Galtea, which then generates test cases based on its content. Required if the test cases are to be generated by Galtea. Example: “path/to/your/knowledge_base.pdf”
Allows for the generation of variations of test cases (e.g., paraphrased questions, questions with typos). For more details on available evolutions, see Quality Test Evolutions.
Specifies which threat categories to generate test cases for. This corresponds to the `variants` parameter in the SDK.
A list of red teaming strategies to modify prompts for each threat. This corresponds to the `strategies` parameter in the SDK.
Optional file containing context or domain knowledge to help generate more realistic conversation scenarios. Example: “path/to/your/domain_context.pdf”
The maximum number of conversation scenarios generated by Galtea. This helps control the size of the test dataset.
The path to a local CSV file containing predefined test cases. This file is uploaded to Galtea. Required if you are providing your own set of test cases instead of having Galtea generate them. Example: “path/to/your/test_file.csv”
File Format Requirements:
- Quality/Red Teaming Tests: Must include `input`, `expected_output`, `tag`, and `source` columns
- Scenarios Tests: Must include conversation simulation columns (see Scenarios Test File Format below)
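A minimal sketch of registering an uploaded test from such a CSV file (the `test_file_path` parameter name is an assumption mirroring the property described above):

```python
from galtea import Galtea  # assumed import path

galtea = Galtea(api_key="YOUR_API_KEY")

# Upload a predefined set of test cases instead of generating them.
uploaded_test = galtea.tests.create(
    product_id="YOUR_PRODUCT_ID",
    name="Customer Support Safety Evaluation",
    type="RED_TEAMING",
    test_file_path="path/to/your/test_file.csv",  # assumed parameter name
)
```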
Test File Formats
The format of your test file depends on the test type you’re creating.
Quality and Red Teaming Test File Format
For Quality and Red Teaming tests, use the standard format:
- `input`: The question or prompt for the test case
- `expected_output`: The expected response (optional for some test types)
- `tag`: A categorization label for the test case
- `source`: The origin of the test case information
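For example, a small quality test file with these columns might look like this (the rows are illustrative placeholders):

```csv
input,expected_output,tag,source
"What is the notice period for termination?","The notice period is 30 days.","contracts","employee_handbook.pdf"
"How do I reset my password?","Use the 'Forgot password' link on the login page.","account","support_faq.md"
```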
Scenarios Test File Format
For Scenarios tests that use the Conversation Simulator, use this specialized format:
The objective the synthetic user is trying to achieve. Example: “Book a flight to New York”
The personality of the synthetic user. Example: “A busy professional who values efficiency”
The first message from the synthetic user. Example: “I need to book a flight”
Conversation end conditions, separated by `;` or `|`. Example: “Booking confirmed|Unable to fulfill request”
Maximum conversation turns. Example: 10
Scenario description. Example: “Flight booking scenario”
Only `goal` and `user_persona` are mandatory for conversation simulation. See the Conversation Simulator Tutorial for complete implementation examples.
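Since only `goal` and `user_persona` are mandatory, a minimal scenarios test file could look like the following (the second row is an illustrative placeholder):

```csv
goal,user_persona
"Book a flight to New York","A busy professional who values efficiency"
"Get a refund for a cancelled trip","A frustrated customer who wants quick, clear answers"
```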