All you need to get started with Galtea evaluations
This guide will walk you through the steps to begin evaluating and monitoring the reliability of your AI products with Galtea as quickly as possible.
Create a Product
Install SDK & Connect
Register a Version
Select a Test
Select a Metric
Run Evaluations
Create a product in the Galtea dashboard. Navigate to Products > Create New Product and complete the form.
The product description is important as it may be used to generate synthetic test data.
Get your API key
In the Galtea dashboard, navigate to Settings > Generate API Key and copy your key.
Install the SDK
Connect to the platform
Create a version to track a specific implementation of your product.
For this quickstart, we’ll use the default “Jailbreak” test, which is a type of Red Teaming Test.
To evaluate the “Jailbreak” test, we’ll use the “Jailbreak Resilience” metric.
Now, run an evaluation by creating evaluation tasks.
In a real scenario, your_product_function
would be a call to your actual AI model.
You can view results on the Galtea dashboard. Navigate to your product’s “Analytics” tab to see detailed analysis and compare versions.
Congratulations! You’ve completed your first evaluation with Galtea using default assets. This is just the beginning. Explore these concepts to tailor Galtea to your specific needs:
A functionality or service being evaluated
A specific iteration of a product
A set of test cases for evaluating product performance
A full conversation between a user and an AI system.
A single turn in a conversation between a user and the AI.
A group of evaluable Inference Results from a particular session
The assessment of an evaluation using a specific metric type’s criteria
Ways to evaluate and score product performance
Way to keep track of your models’ costs
If you have any questions or need assistance, contact us at support@galtea.ai.
All you need to get started with Galtea evaluations
This guide will walk you through the steps to begin evaluating and monitoring the reliability of your AI products with Galtea as quickly as possible.
Create a Product
Install SDK & Connect
Register a Version
Select a Test
Select a Metric
Run Evaluations
Create a product in the Galtea dashboard. Navigate to Products > Create New Product and complete the form.
The product description is important as it may be used to generate synthetic test data.
Get your API key
In the Galtea dashboard, navigate to Settings > Generate API Key and copy your key.
Install the SDK
Connect to the platform
Create a version to track a specific implementation of your product.
For this quickstart, we’ll use the default “Jailbreak” test, which is a type of Red Teaming Test.
To evaluate the “Jailbreak” test, we’ll use the “Jailbreak Resilience” metric.
Now, run an evaluation by creating evaluation tasks.
In a real scenario, your_product_function
would be a call to your actual AI model.
You can view results on the Galtea dashboard. Navigate to your product’s “Analytics” tab to see detailed analysis and compare versions.
Congratulations! You’ve completed your first evaluation with Galtea using default assets. This is just the beginning. Explore these concepts to tailor Galtea to your specific needs:
A functionality or service being evaluated
A specific iteration of a product
A set of test cases for evaluating product performance
A full conversation between a user and an AI system.
A single turn in a conversation between a user and the AI.
A group of evaluable Inference Results from a particular session
The assessment of an evaluation using a specific metric type’s criteria
Ways to evaluate and score product performance
Way to keep track of your models’ costs
If you have any questions or need assistance, contact us at support@galtea.ai.