> ## Documentation Index
> Fetch the complete documentation index at: https://docs.galtea.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Introduction

> Evaluate, test, and monitor your AI products with Galtea.

Galtea is the evaluation platform for AI products. Test accuracy, safety, and behavior — from RAG pipelines to conversational agents to security testing.

<video autoPlay muted loop playsInline className="w-full aspect-video rounded-xl" src="https://mintcdn.com/galtea/dTqmcvyz2pQyIXAj/videos/welcome.mp4?fit=max&auto=format&n=dTqmcvyz2pQyIXAj&q=85&s=9493cc5969cd6dc43dc2420a84190e62" data-path="videos/welcome.mp4" />

## Get Started

<CardGroup cols={3}>
  <Card title="Integrate the SDK" icon="code" href="/sdk/installation">
    Install, authenticate, and run your first evaluation in Python.
  </Card>

  <Card title="Run your first evaluation" icon="rocket" href="/quickstart">
    5-minute quickstart: register a product, run tests, view results.
  </Card>

  <Card title="Understand the platform" icon="cube" href="/concepts/product">
    Products, specs, tests, metrics, and evaluations — the full model.
  </Card>
</CardGroup>

## How It Works

Galtea helps you evaluate AI products through a repeatable test-measure-iterate cycle:

<Steps>
  <Step title="Register Product">Register a [Product](/concepts/product) to represent your AI functionality.</Step>
  <Step title="Define Specifications">Define [Specifications](/concepts/product/specification) — testable behavioral expectations for your product (capabilities, inabilities, policies).</Step>
  <Step title="Generate Metrics & Tests">Galtea [generates metrics](/concepts/metric/ai-generation) and [tests](/concepts/product/test) from your specifications, or you create them manually.</Step>
  <Step title="Create Version">Define a new [Version](/concepts/product/version) of your product to track changes over time.</Step>
  <Step title="Run Evaluations">Run [Evaluations](/concepts/product/version/session/evaluation) — `evaluations.run()` resolves specs, tests, and metrics automatically.</Step>
  <Step title="Analyze & Iterate">Review results in the Analytics dashboard, then iterate with new versions to track improvements.</Step>
</Steps>

## Platform Access

You can interact with Galtea through multiple channels:

<CardGroup cols={3}>
  <Card title="Web Platform" icon="globe" href="https://platform.galtea.ai/">
    Manage your products and access insights via the dashboard.
  </Card>

  <Card title="Python SDK" icon="python" href="/sdk/installation">
    Seamlessly integrate our services using the Python SDK.
  </Card>

  <Card title="CLI" icon="terminal" href="/cli/installation">
    Drive Galtea from the terminal with the `galtea` binary.
  </Card>

  <Card title="Agent Skill" icon="robot" href="/sdk/integrations/agent-skill">
    Let Claude Code, Cursor, and other AI coding assistants drive Galtea on your behalf.
  </Card>

  <Card title="REST API" icon="code" href="/api-reference">Explore the full API reference — every endpoint, parameter, and response schema.</Card>

  <Card title="GitHub Actions" icon="github" href="/sdk/integrations/github-actions">
    Automate your workflows by integrating with GitHub Actions.
  </Card>
</CardGroup>

## Core Concepts

Galtea is built around several key concepts that work together to provide comprehensive evaluation of AI products. For a diagram of how they all connect, start with the [Concepts overview](/concepts/overview).

<CardGroup cols={3}>
  <Card title="Concepts overview" icon="diagram-project" iconType="solid" href="/concepts/overview">
    How Galtea's concepts connect — diagram + per-entity quick reference.
  </Card>

  <Card title="Product" icon="box" iconType="solid" href="/concepts/product">
    A functionality or service being evaluated
  </Card>

  <Card title="Specification" icon="bullseye" iconType="solid" href="/concepts/product/specification">
    A testable behavioral expectation for a product
  </Card>

  <Card title="Version" icon="code-branch" iconType="solid" href="/concepts/product/version">
    A specific iteration of a product
  </Card>

  <Card title="Test" icon="clipboard-list" iconType="solid" href="/concepts/product/test">
    A set of test cases for evaluating product performance
  </Card>

  <Card title="Session" icon="clock-rotate-left" iconType="solid" href="/concepts/product/version/session">
    A full conversation between a user and an AI system.
  </Card>

  <Card title="Inference Result" icon="arrow-right-from-bracket" iconType="solid" href="/concepts/product/version/session/inference-result">
    A single turn in a conversation between a user and the AI.
  </Card>

  <Card title="Evaluation" icon="clipboard-check" iconType="solid" href="/concepts/product/version/session/evaluation">
    The assessment of an evaluation using a specific metric's criteria
  </Card>

  <Card title="Metric" icon="gauge" iconType="solid" href="/concepts/metric">
    Ways to evaluate and score product performance
  </Card>

  <Card title="Model" icon="microchip" iconType="solid" href="/concepts/model">
    Way to keep track of your models' costs
  </Card>
</CardGroup>
