This metric gauges the relevancy of the retrieved context, calculated based on both the question and contexts. The values fall within the range of (0, 1), with higher values indicating better relevancy.
Successful evaluation
processed, skipped, error Evaluation score
Whether the evaluation passed
Evaluation label
Additional details about the evaluation
Raw response from the evaluator
Type of error if status is 'error'
Error traceback if status is 'error'