evaluation
#
Evaluation Helpers.
Classes
Represents the results of an evaluate_comparative() call. |
|
|
Represents the results of an evaluate() call. |
A category for categorical feedback. |
|
Feedback scores for the results of comparative evaluations. |
|
Compare predictions (as traces) from 2 or more runs. |
|
A dynamic evaluator that wraps a function and transforms it into a RunEvaluator. |
|
Evaluation result. |
|
Batch evaluation results. |
|
Configuration to define a type of feedback. |
|
Evaluator interface class. |
|
Configuration for a categorical score. |
|
Configuration for a continuous score. |
Functions
|
Evaluate an async target system on a given dataset. |
Evaluate existing experiment runs asynchronously. |
|
Chain multiple async iterables. |
|
Convert a list of examples to an async iterable. |
|
|
Evaluate a target system on a given dataset. |
Evaluate existing experiment runs against each other. |
|
|
Evaluate existing experiment runs. |
Create a comaprison evaluator from a function. |
|
Create a run evaluator from a function. |