Anthropic Models
OpenAI Models
Google Models
Meta AI Models
All Models Summary
Model | Organization | Score | Release Date |
---|
Dive into the results for Pre-flight version 0.6 and a range of other challenging benchmarks and see how different Large Language Models perform.
Created December 14th 2024, dashboard last updated 27th March 2025
The Pre-flight benchmark results are licensed under CC-BY 4.0 by Airside Labs, you are free to reuse but credit is to be given.
Math Level 5, GPQA Diamond, OTIS Mock AIM, Frontier Math results are credited to Epoch AI, ‘AI Benchmarking Hub’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/data/ai-benchmarking-dashboard’ [online resource]. Accessed 27 Mar 2025.
The form has been successfully submitted.