Powered by Inspect and Inspect Evals, the Vector Evaluation Leaderboard evaluates frontier models across a comprehensive suite of benchmarks. Go beyond the summary metrics: click through to the interactive report for each model and benchmark to explore sample-level performance and detailed traces.