Submit a human evaluation or annotation score

curl --request POST \ --url https://api.galtea.ai/evaluations/{id}/submit \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data ' { "score": 0.95, "reason": "Evaluation reason" } '

{ "id": "eval_123", "metricId": "metric_123", "sessionId": "session_123", "userId": "user_123", "status": "SUCCESS", "testCaseId": "tc_123", "inferenceResultId": "ir_123", "score": 0.95, "reason": "High quality response", "error": "<string>", "canRetry": false, "creditsUsed": 1, "conversationSimulatorVersion": "1.0.0", "humanEvaluatorId": "<string>", "humanEvaluatorStartedAt": "2023-11-07T05:31:56Z", "humanScore": 123, "humanReason": "<string>", "humanEvaluatorFinishedAt": "2023-11-07T05:31:56Z", "failedTurns": [ "<string>" ], "createdAt": "2023-11-07T05:31:56Z", "deletedAt": "2023-11-07T05:31:56Z", "evaluatedAt": "2023-11-07T05:31:56Z", "metricLegacyAt": "2023-11-07T05:31:56Z", "metricDisabledAt": "2023-11-07T05:31:56Z" }

Authorizations

Authorization

string

header

required

API key authorization. Pass your API key in the Authorization header as a Bearer token. Both new (gsk_*) and legacy (gsk-) API keys are accepted, e.g. Authorization: Bearer gsk_... or Authorization: Bearer gsk-....

Path Parameters

string

required

Evaluation ID

Body

application/json

score

number

required

The evaluation score (0-1)

Required range: 0 <= x <= 1

Example:

0.95

reason

string

Optional reason for the score

Example:

"Evaluation reason"

Response

Evaluation submitted successfully

string

Example:

"eval_123"

metricId

string

Example:

"metric_123"

sessionId

string

Example:

"session_123"

userId

string | null

Example:

"user_123"

status

enum<string>

Available options:

PENDING,

PENDING_HUMAN,

SUCCESS,

FAILED,

SKIPPED

Example:

"SUCCESS"

testCaseId

string | null

Example:

"tc_123"

inferenceResultId

string | null

Example:

"ir_123"

score

number | null

Example:

0.95

reason

string | null

Example:

"High quality response"

error

string | null

canRetry

boolean | null

Example:

false

creditsUsed

integer | null

Example:

1

conversationSimulatorVersion

string | null

Example:

"1.0.0"

humanEvaluatorId

string | null

User ID of the human evaluator

humanEvaluatorStartedAt

string<date-time> | null

humanScore

number | null

Human-provided annotation score

humanReason

string | null

Human-provided annotation reason

humanEvaluatorFinishedAt

string<date-time> | null

Timestamp when human evaluation was submitted

failedTurns

string[]

Conversation turns that failed

createdAt

string<date-time>

deletedAt

string<date-time> | null

evaluatedAt

string<date-time> | null

metricLegacyAt

string<date-time> | null

metricDisabledAt

string<date-time> | null

Health

Organizations

UserGroups

Metrics

Specifications

Models

Products

Versions

EndpointConnections

Tests

TestCases

Sessions

InferenceResults

Traces

Evaluations

Human Evaluations

Generate From Few Shot

Analytics

Storage

EvaluatorModels

ConversationSimulator

SupportedVersion

Jobs

OTel

Submit a human evaluation or annotation score

Authorizations

Path Parameters

Body

Response