Returns

Returns an InferenceResult object.

Example

inference_result = galtea.inference_results.create(
    session_id="YOUR_SESSION_ID",
    input="What is the capital of France?",
    output="Paris is the capital of France."
)

Parameters

session_id
string
required

The session ID to log the inference result to.

input
string
required

The input text/prompt.

output
string
required

The generated output/response.

retrieval_context
string

Context retrieved for RAG systems.

latency
float

Latency in milliseconds.

usage_info
dict

Token usage information (e.g., {"input_tokens": 10, "output_tokens": 5}).

cost_info
dict

Cost information (e.g., {"cost_per_input_token": 0.0001}).