Returns
Returns a list of Evaluation objects, one for each metric and each inference result in the session.
Example
This method does not support CustomScoreEvaluationMetric objects. Attempting to use a custom metric will result in a ValueError. For custom scoring, use the create_single_turn method for each turn of the conversation.
Parameters
The ID of the session containing the inference results to be evaluated.
A list of standard metrics to use. Custom-score metrics are not accepted by this method.
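The restriction on custom metrics can be handled on the caller's side before the session-level call is made. The following is a minimal sketch, not the SDK's implementation: StandardMetric, split_metrics, and check_session_metrics are hypothetical names introduced for illustration, and CustomScoreEvaluationMetric is a local stub that only borrows the name of the SDK class. It shows one way to separate standard metrics from custom ones and to mimic the documented ValueError.

```python
from dataclasses import dataclass


@dataclass
class StandardMetric:
    """Hypothetical stand-in for a standard (built-in) metric."""
    name: str


@dataclass
class CustomScoreEvaluationMetric:
    """Local stub named after the SDK class; the real class may differ."""
    name: str


def split_metrics(metrics):
    """Separate metrics into (standard, custom) lists, preserving order."""
    standard, custom = [], []
    for metric in metrics:
        if isinstance(metric, CustomScoreEvaluationMetric):
            custom.append(metric)
        else:
            standard.append(metric)
    return standard, custom


def check_session_metrics(metrics):
    """Mimic the documented behavior: reject custom metrics with ValueError."""
    _, custom = split_metrics(metrics)
    if custom:
        names = ", ".join(m.name for m in custom)
        raise ValueError(f"Custom metrics are not supported here: {names}")
    return list(metrics)
```

Under these assumptions, the standard list would be passed to the session-level method, while each custom metric would be evaluated turn by turn with create_single_turn, whose exact signature is not shown here.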