Inference Result
A single turn in a conversation between a user and an AI system
What is an Inference Result?
An inference result in Galtea represents a single turn in a conversation between a user and an AI system. It captures the user’s input and the AI’s corresponding output, along with other optional metadata such as latency and token usage.
You can create, view, and manage inference results programmatically using the Galtea SDK.
SDK Integration
Working with inference results through the Galtea SDK is particularly useful for logging production data and for evaluating complex conversational flows.
Inference Result Properties
Session: The session to which the inference result belongs.
Input: The input text or prompt for the inference result.
Output: The generated output or response for the inference result.
Retrieved context: The context retrieved by a RAG system, if applicable.
Latency: The latency in milliseconds for the model’s response.
Token usage: Token usage information for the inference result.
Cost: Cost information for the inference result.
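To make the properties above concrete, here is a minimal Python sketch of how one turn could be modeled and timed locally before it is logged. It is not the Galtea SDK API: the class `InferenceResultRecord`, the helper `record_turn`, and every field name (`session_id`, `retrieval_context`, `latency_ms`, `usage_info`, `cost_info`) are hypothetical names chosen for illustration, and the real SDK may name and type these values differently.

```python
from __future__ import annotations

import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class InferenceResultRecord:
    """Hypothetical local model of one conversation turn (not an SDK class)."""

    session_id: str                           # session the turn belongs to
    input: str                                # user input or prompt
    output: str                               # generated response
    retrieval_context: Optional[str] = None   # RAG context, if applicable
    latency_ms: Optional[float] = None        # response latency in milliseconds
    usage_info: Optional[dict] = None         # token usage, if known
    cost_info: Optional[dict] = None          # cost details, if known


def record_turn(
    session_id: str,
    user_input: str,
    generate: Callable[[str], str],
    retrieval_context: Optional[str] = None,
) -> InferenceResultRecord:
    """Call `generate` once, measure its latency, and build a record."""
    start = time.perf_counter()
    output = generate(user_input)
    latency_ms = (time.perf_counter() - start) * 1000.0
    return InferenceResultRecord(
        session_id=session_id,
        input=user_input,
        output=output,
        retrieval_context=retrieval_context,
        latency_ms=latency_ms,
    )


# Example with a stand-in model that simply upper-cases the input.
record = record_turn("session-123", "Hello", lambda text: text.upper())
print(record.output, record.latency_ms)
```

Only the session, input, and output are required in this sketch; the remaining fields default to `None` because the page describes them as optional metadata. A record built this way would then be handed to whichever SDK call creates inference results; check the Galtea SDK reference for the actual method and parameter names.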