Evaluation Parameters
To compute theconversation_relevancy
metric, the following parameters are required in every turn of the conversation:
input
: The user message in the conversation.actual_output
: The chatbot’s corresponding response.
How Is It Calculated?
Theconversation_relevancy
score is derived using an LLM-as-a-judge approach:
- Contextual Analysis: An LLM is used to analyze the full conversation history.
- Relevancy Judgment: The LLM determines whether the
actual_output
directly addresses the user’s intent and fits naturally within the flow of the dialogue.
This metric was incorporated to the Galtea platform from the open source library deepeval, for more information you can also visit their documentation.