Pick the closest workflow.
Gate a candidate version before it ships.
Re-run prior eval against a new artifact version.
Score multiple artifacts side-by-side on the same cases.
Recurring eval on a cron.
One-off exploratory eval.
Full suite required for certification submission.
Evaluate an external model before vendor approval.