Introduction
Welcome to Galtea, the platform that empowers enterprises by providing a comprehensive AI evaluation platform that improves AI reliability, reduces risks, streamlines compliance, and accelerates time to market.
Get Started
Register
Register your organization into Galtea’s platform
Quickstart
Execute your first Evaluation in less than 5 minutes
How It Works
The typical workflow in Galtea follows these steps:
Create Product
Create Version
Create Test
Run Evaluation Tasks
Analyze Results
Iterate
Platform Access
You can interact with Galtea through multiple channels:
Web Platform
Manage your products and access insights via the dashboard.
Python SDK
Seamlessly integrate our services using the Python SDK.
GitHub Actions
Automate your workflows by integrating with GitHub Actions.
REST API
Documentation is coming soon.
Core Concepts
Galtea is built around several key concepts that work together to provide comprehensive evaluation of AI products:
Product
A functionality or service being evaluated
Version
A specific iteration of a product
Test
A set of test cases for evaluating product performance
Session
A full conversation between a user and an AI system.
Inference Result
A single turn in a conversation between a user and the AI.
Evaluation
A group of evaluable Inference Results from a particular session
Evaluation Task
The assessment of an evaluation using a specific metric type’s criteria
Metric Type
Ways to evaluate and score product performance
Model
Way to keep track of your models’ costs