Checks if your product consistently responds in a contextually relevant way during a multi-turn conversation.
conversation_relevancy
metric, the following parameters are required in every turn of the conversation:
input
: The user message in the conversation.actual_output
: The chatbot’s corresponding response.conversation_relevancy
score is derived using an LLM-as-a-judge approach:
actual_output
directly addresses the user’s intent and fits naturally within the flow of the dialogue.