The updates for Galtea
CustomScoreEvaluationMetric
class. This makes it easier to integrate your custom, rule-based logic directly into the Galtea workflow. Learn more in our tutorial on evaluating with custom scores.messages
format from OpenAI, consisting of role
and content
pairs. This makes sending multi-turn interaction data to Galtea simpler than ever.See the new format in action in the Inference Result Batch Creation docs.galtea.evaluations.create()
—it happens automatically.
create_single_turn()
for test-based evaluations, while create()
now exclusively handles session-based evaluations.
version_id
instead of evaluation_id
.
galtea.versions.create()
method now accepts all properties directly, no need for an optional_props
dictionary.
galtea.sessions.create()
.
.txt
Knowledge Base Files.txt
extensions was not possible. This has been fixed and you can now create tests using .txt
files as the knowledge base again.And with monitoring active, you can see production evaluation results in real time on this page!