What is a Test?

A test in Galtea is a group of test cases designed to evaluate the performance of a product. A test file provides simulations of interactions with the product (and, in quality tests, expected outcomes for each interaction).

You can create, view, and manage your tests on the Galtea dashboard or programmatically using the Galtea SDK.

Test Origin

When creating a test in the Galtea dashboard, you’ll be asked to specify the test origin:

Generated

Galtea takes the knowledge base file you provide and generates the set of test cases that make up the test.

Uploaded

You upload a test file that contains the complete set of test cases.

Your selection determines which file you need to provide: a Knowledge Base File for generated tests, or a Test File for uploaded tests.
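
The relationship between the test origin and the file you need to provide can be summarized in a short sketch. The names `Origin` and `required_file` are illustrative only and are not part of the Galtea SDK.

```python
from enum import Enum


class Origin(Enum):
    """The two origins offered when creating a test (illustrative names)."""
    GENERATED = "Generated"
    UPLOADED = "Uploaded"


def required_file(origin: Origin) -> str:
    """Return which file the chosen origin requires."""
    if origin is Origin.GENERATED:
        # Galtea generates the test cases from this file.
        return "Knowledge Base File"
    # The test cases are supplied as a complete set.
    return "Test File"


print(required_file(Origin.GENERATED))  # Knowledge Base File
print(required_file(Origin.UPLOADED))   # Test File
```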

More information on how to create tests can be found in the Create Quality Tests and Create Red Teaming Tests documentation.

The information you provide during product onboarding, such as the product’s description and intended use, also matters when generating test cases. Galtea can use this metadata to generate more targeted, context-aware test cases for both quality and red teaming tests, which leads to more effective evaluations.

Test Types

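Galtea supports two main types of tests:

  • Quality: Tests that evaluate the quality and correctness of outputs
  • Red Teaming: Tests that evaluate security, safety, and bias aspects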

Using Tests in Evaluations

The Test Cases of a Test are used in evaluation tasks to assess the performance of specific versions of your product against a set of metrics.

Reuse the same Test Cases across evaluations of different product versions so that the results are directly comparable.
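
To illustrate why reusing the same Test Cases matters, the sketch below compares two versions only when both were scored on the same set of cases. The function name, the test case IDs, and the scores are all made up for illustration; this is not how Galtea computes or stores evaluation results.

```python
from statistics import mean
from typing import Mapping


def compare_versions(scores_a: Mapping[str, float],
                     scores_b: Mapping[str, float]) -> float:
    """Return mean(B) - mean(A), but only if both versions ran the same test cases."""
    if scores_a.keys() != scores_b.keys():
        # Different test cases would make the averages incomparable.
        raise ValueError("Versions were evaluated on different test cases")
    return mean(scores_b.values()) - mean(scores_a.values())


# Hypothetical per-test-case scores for two product versions.
v1 = {"case-1": 0.5, "case-2": 0.75}
v2 = {"case-1": 0.75, "case-2": 1.0}
print(compare_versions(v1, v2))  # 0.25
```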

SDK Integration

The Galtea SDK allows you to create, view, and manage tests programmatically.

Test Properties

Test Name
Text
required

The name of the test. Example: “Legal Document Quality Test” or “Customer Support Safety Evaluation”

Type
Enum
required

The type of the test. Possible values:

  • Quality: Tests that evaluate the quality and correctness of outputs
  • Red Teaming: Tests that evaluate security, safety, and bias aspects

Language
Text

The language used to generate synthetic test cases when a Knowledge Base File is provided (e.g., ‘english’, ‘spanish’). Use the English name of the language. If omitted, Galtea attempts to infer the language from the knowledge base file. Supported languages include English, Spanish, Catalan, French, German, Portuguese, Italian, Dutch, Polish, Chinese, Korean, and Japanese.

This field only applies if tests are generated by Galtea (using Knowledge Base File).
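
As a minimal sketch, a language value could be checked locally before creating a test. The helper `normalize_language` is hypothetical, the lowercase form follows the examples above, and the set below contains only the languages named on this page, which may not be exhaustive.

```python
from typing import Optional

# Languages named on this page, as lowercase English names (may not be exhaustive).
SUPPORTED_LANGUAGES = {
    "english", "spanish", "catalan", "french", "german", "portuguese",
    "italian", "dutch", "polish", "chinese", "korean", "japanese",
}


def normalize_language(language: Optional[str]) -> Optional[str]:
    """Return the lowercase English language name, or None to let Galtea infer it."""
    if language is None:
        return None
    name = language.strip().lower()
    if name not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported language: {language!r}")
    return name


print(normalize_language(" Spanish "))  # spanish
```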

Max Test Cases
Number

The maximum number of test cases generated by Galtea. This helps control the size of the test dataset and associated costs.

Knowledge Base File
File
required

The path to a local file (e.g., PDF, TXT, JSON, HTML, Markdown) containing the knowledge base. The file is uploaded to Galtea, which then generates test cases from its content. Required when the test cases are generated by Galtea. Example: “path/to/your/knowledge_base.pdf”

Evolutions
List[Enum]

Allows for the generation of variations of test cases (e.g., paraphrased questions, questions with typos). For more details on available evolutions, see Quality Test Evolutions.
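
The properties above can be pictured as a single record with a few local checks. This is a sketch under stated assumptions, not the Galtea SDK: the class names `Kind` and `DraftTest`, the `test_file` field, the `problems` method, and the rule that Max Test Cases must be positive are all hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Kind(Enum):
    """The two test types described on this page."""
    QUALITY = "Quality"
    RED_TEAMING = "Red Teaming"


@dataclass
class DraftTest:
    """Hypothetical local model of the test properties (not an SDK class)."""
    name: str                                   # Test Name (required)
    type: Kind                                  # Type (required)
    language: Optional[str] = None              # only applies to generated tests
    max_test_cases: Optional[int] = None        # caps generated test cases
    knowledge_base_file: Optional[str] = None   # needed for generated tests
    test_file: Optional[str] = None             # needed for uploaded tests
    evolutions: List[str] = field(default_factory=list)

    def problems(self) -> List[str]:
        """Return a list of human-readable issues; empty if none were found."""
        found = []
        if not self.name.strip():
            found.append("Test Name is required")
        if self.knowledge_base_file is None and self.test_file is None:
            found.append("Provide a Knowledge Base File or a Test File")
        # Assumption: a non-positive limit is treated as a mistake.
        if self.max_test_cases is not None and self.max_test_cases < 1:
            found.append("Max Test Cases must be a positive number")
        return found


draft = DraftTest(
    name="Legal Document Quality Test",
    type=Kind.QUALITY,
    knowledge_base_file="path/to/your/knowledge_base.pdf",
    max_test_cases=50,  # illustrative value
)
print(draft.problems())  # []
```

Collecting problems in a list, rather than raising on the first one, lets a caller report every missing property at once.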