Evaluations allow you to assess how well a specific version of your product performs against a set of test cases by running individual evaluation tasks.

Creating an Evaluation

To create an evaluation:

from galtea import Galtea

# Initialize Galtea SDK
galtea = Galtea(api_key="YOUR_API_KEY")

# Create an evaluation
evaluation = galtea.evaluations.create(
    test_id="YOUR_TEST_ID",
    version_id="YOUR_VERSION_ID"
)

print(f"Evaluation created with ID: {evaluation.id}")

An evaluation links a specific version of your product to a test. This establishes the framework for running individual evaluation tasks.

Running an Evaluation for Custom Metrics

Once you’ve created an evaluation, you can run evaluation tasks and directly assign self-calculated scores.
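
A single task can be submitted on its own. The sketch below is purely illustrative (the metric name, test case ID, output, and score value are placeholders) and uses only parameters that also appear in the full example further down:

# Minimal sketch: one task, one custom metric, one self-calculated score (0.0 to 1.0)
galtea.evaluation_tasks.create(
    metrics=["custom_metric_accuracy"],  # hypothetical custom metric name
    evaluation_id=evaluation.id,
    test_case_id="YOUR_TEST_CASE_ID",
    actual_output="Your product's answer to the test case input",
    scores=[0.85],  # one score per entry in metrics
)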

For efficiency, you can process multiple evaluation tasks at once using a loop and the galtea.evaluation_tasks.create method:

from datetime import datetime

# Load your test cases
test_cases = galtea.test_cases.list(test_id="YOUR_TEST_ID")

# Evaluate all test cases
for test_case in test_cases:
    # Retrieve relevant context for RAG. This may not apply to all products.
    retrieval_context = your_retriever_function(test_case.input)
    
    # Your product's actual response to the input
    time_before_call = datetime.now()
    response = your_product_function(test_case.input, test_case.context, retrieval_context)
    time_after_call = datetime.now()

    # Run evaluation task
    galtea.evaluation_tasks.create(
        metrics=["custom_metric_accuracy", "custom_metric_relevance"],
        evaluation_id=evaluation.id,
        test_case_id=test_case.id,
        actual_output=response.output,
        # Your functions to get a score between 0.0 and 1.0 based on your criteria.
        scores=[get_score_accuracy(response.output), get_score_relevance(response.output)],
        retrieval_context=[retrieval_context],
        ###################### Optional Parameters ######################
        latency=(time_after_call - time_before_call).total_seconds() * 1000,  # milliseconds
        usage_info={
          "input_tokens": response.inputTokens,
          "output_tokens": response.outputTokens,
          "cache_read_input_tokens": response.cacheReadInputTokens,
        },
        cost_info={
          "cost_per_input_token": 0.00002,
          "cost_per_output_token": 0.00005,
          "cost_per_cache_read_input_token": 0.00001,
        },
        #################################################################
    )

print("All evaluation tasks submitted")

The metrics parameter specifies which metric types to use for evaluating the task. You can use multiple metrics simultaneously to get different perspectives on performance.

The latency, usage_info, and cost_info parameters are optional but highly recommended: they let you track your product's latency and cost alongside its quality metrics.
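
The example above measures latency with datetime.now(). If you prefer a monotonic clock that is unaffected by system clock adjustments, a time.perf_counter() based sketch (same millisecond unit for the latency parameter, reusing the placeholder your_product_function from the example) looks like this:

import time

start = time.perf_counter()
response = your_product_function(test_case.input, test_case.context, retrieval_context)
latency_ms = (time.perf_counter() - start) * 1000  # milliseconds, passed as latency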