Evaluation

Protocols and Metrics Code Examples

Avalanche offers significant support for defining your own eveluation protocol (classic or custom metrics, when and on what to test). You can find examples related to the benchmarks here:

  • Eval Plugin: this is a simple example on how to use the Evaluation Plugin (the evaluation controller object)

  • Standalone Metrics: how to use metrics as standalone objects.

  • Confusion Matrix: this example shows how to produce confusion matrix during training and evaluation.

  • Dataset Inspection: this is a simple example on how to use the Dataset inspection plugins.

  • Mean Score: example usage of the mean_score helper to show the scores of the true class, averaged by new and old classes.

  • Task Metrics: this is a simple example on how to use the Evaluation Plugin with metrics returning values for different tasks.