Aviation AI Dashboard

Dive into the results for Pre-flight version 0.6 and a range of other challenging benchmarks and see how different Large Language Models perform.

Created December 14th 2024, dashboard last updated 27th March 2025

AI Benchmark Charts
Loading benchmark data...

The Pre-flight benchmark results are licensed under CC-BY 4.0 by Airside Labs, you are free to reuse but credit is to be given.  

Math Level 5, GPQA Diamond, OTIS Mock AIM, Frontier Math results are credited to Epoch AI, ‘AI Benchmarking Hub’. Published online at epoch.ai. Retrieved from ‘https://epoch.ai/data/ai-benchmarking-dashboard’ [online resource]. Accessed 27 Mar 2025.

Built on Unicorn Platform