All Blog Posts

Insights, guides, and updates on AI testing, compliance, and best practices

2026-06-17•Alex Brooker

Abundance of Competence, Scarcity of Trust

The UK government's AI Scenarios 2030 models how AI's gains get divided, but not how cheap competence grows the pie, or why trust and permission become the only scarce goods once everything starts to look good.

2026-03-12•Alex Brooker

Prompt Injection Risk in Aviation

1,776 adversarial test cases against LLMs processing standard aviation data formats reveal systematic vulnerabilities in how large language models handle prompt injection in safety-critical contexts.

2026-03-02•Alex Brooker

98.7% Retrieval Accuracy from Metadata You Already Have

Fine-tuning embedding models for aviation NOTAM retrieval — and why bigger models aren't always better out of the box.

2025-11-04•Airside Labs Team

Comparative Analysis: Pre-Flight vs MITRE/FAA ALUE Benchmarks

A comprehensive analysis of two pioneering aviation LLM assurance benchmarks, examining how Airside Labs' Pre-Flight and MITRE/FAA's ALUE address distinct operational layers in aerospace AI safety.

2025-09-29•Airside Labs Team

Alternatives to Big Cyber for LLM Pen Testing

When organisations think about AI security testing, many automatically turn to established cybersecurity firms. But LLM penetration testing requires fundamentally different expertise.

2025-08-27•Airside Labs Team

Customer AI Chatbot Flying Blind: The Hidden Risks

A comprehensive analysis of 11 leading language models reveals critical safety gaps that could ground your customer service operations.

2025-08-16•Airside Labs Team

Crescendo: How Escalating Conversations Break AI Guardrails

Why single-prompt testing misses the most dangerous AI failures, and how the crescendo technique is exposing critical vulnerabilities in customer service systems.

2025-07-23•Airside Labs Team

Alternative to Big Four AI Testing: Why Domains Matter

The AI revolution is sweeping across industries faster than ever, but when it comes to testing and validating these AI systems, many organisations are turning to generic frameworks.

2025-05-05•Airside Labs Team

Airside Labs Responds to the UK AI Opportunities Action Plan

At Airside Labs, we're committed to advancing aviation technology through innovative AI solutions while maintaining the industry's paramount focus on safety.

2025-05-04•Airside Labs Team

Airside Labs Responds to UK CAA's AI in Aerospace Request

At Airside Labs, we're committed to advancing aviation technology through innovative AI solutions while maintaining the industry's paramount focus on safety.

2025-04-11•Airside Labs Team

Airside Pre-Flight Benchmark Joins AISI Evaluations Package

Aviation AI Benchmark Now Available Through the UK AI Security Institute's inspect_evals Framework

2025-02-01•Airside Labs Team

PRESS RELEASE: Airside Labs Launches Pre-Flight AI Benchmark on GitHub

Aviation AI Testing Framework Now Available for Industry Contributions

2024-11-25•Airside Labs Team

Aviation Eval – Flight Test 1: Anthropic Models Compared

With the exciting release of Anthropic's updated Sonnet and Haiku models, we're sharing our first evaluation results from the Pre-Flight benchmark.

2024-11-19•Airside Labs Team

Airside Labs Launches at Royal Aeronautical Society Event

New Aviation AI Venture Unveils Industry-First Benchmark for AI Model Testing

2024-10-02•Airside Labs Team

Preflight Aviation Intelligence Benchmark: Contributor Guide

We are collecting the most challenging and comprehensive set of aviation-related questions ever assembled for AI evaluation.

2024-09-30•Airside Labs Team

GAIA: Benchmarking Next-Gen AI Assistants for Aviation

Benchmarks play a crucial role in measuring AI progress and setting new standards. GAIA is one such benchmark that has caught our attention.