New Evaluation Results Dashboard
We've completely redesigned the evaluation results dashboard, so you can analyse your evaluation results more easily and understand performance across different metrics.
Here's what's new:
- Metrics plots: We've added plots for all the evaluator metrics. You can now see the distribution of the results and easily spot outliers.
- Side-by-side comparison: You can now compare multiple evaluations simultaneously, looking at both the plots and the individual outputs.
- Improved test cases view: The results are now displayed in a tabular format that works for both small and large datasets.
- Focused detail view: A new focused drawer lets you examine individual data points in more detail. This is especially helpful when your data is large.
- Configuration view: See exactly which configurations were used in each evaluation.
- Evaluation run naming and descriptions: Add names and descriptions to your evaluation runs to keep them organized.