Validation Metrics

The Validation Metrics page provides detailed insights into model performance, dataset composition, and evaluation statistics for completed validation runs. It is designed for deep inspection of prediction quality, offering both numerical and visual breakdowns of results.

From the breadcrumb Home > Validation > Validation Metrics, users can switch between:

Summary – Overview of all validation runs
Validation Metrics – Detailed performance analysis
Start Validation – Launch a new validation
External Model Validation – For models not trained on the platform
Archived Validations – Previously stored results
Exported Metrics – All exported analysis files

2. Key Sections

The page is organized into three main components:

A. Model Card

Displays metadata related to the evaluated model:

Model Name, ID, and Type
Framework and Version
Tags describing modality, use case, etc.
Linked Models, if any

B. Data Card

Summarizes the dataset used in validation:

Dataset Name and Source
Dataset Size and Modality
Annotation Details (source and version)

C. Validation Metrics Panel

Includes comprehensive metrics:

Accuracy, Precision, Sensitivity, Specificity
F1 Score (Macro and Micro)
AUC and NPV (Negative Predictive Value)
True/False Positive and Negative Counts

3. Visual Performance Analysis

The following visual tools are included:

ROC Curve
Precision-Recall Curve
Confusion Matrix
Loss Samples Viewer (top incorrect predictions)
Class-wise Performance Breakdown
Confidence Threshold Performance
Confidence Distribution Graphs
Training vs. Validation Metrics
Population Distribution
Most Confused Classes
Lift Chart (if supported)

These tools help users visually interpret where and how model predictions deviate from ground truth.

4. Actions and Tools

A dropdown menu in the top-right corner enables:

Exporting Metrics and performance tables
Editing View preferences
Creating Subcohorts for focused evaluation
Generating Subcohort Analysis based on custom filters

Note: The Validation Metrics page is your go-to tool for understanding how your model behaves across different classes, thresholds, and datasets. Use it to verify performance, discover edge cases, and guide future improvements.

1. Navigation and Tabs​

2. Key Sections​

A. Model Card​

B. Data Card​

C. Validation Metrics Panel​

3. Visual Performance Analysis​

4. Actions and Tools​

1. Navigation and Tabs