All Blog Posts

    Insights, guides, and updates on AI testing, compliance, and best practices

    Comparative Analysis: Pre-Flight vs MITRE/FAA ALUE Benchmarks
    2025-11-04Airside Labs Team

    Comparative Analysis: Pre-Flight vs MITRE/FAA ALUE Benchmarks

    A comprehensive analysis of two pioneering aviation LLM assurance benchmarks, examining how Airside Labs' Pre-Flight and MITRE/FAA's ALUE address distinct operational layers in aerospace AI safety.

    Alternatives to Big Cyber for LLM Pen Testing
    2025-09-29Airside Labs Team

    Alternatives to Big Cyber for LLM Pen Testing

    When organisations think about AI security testing, many automatically turn to established cybersecurity firms. But LLM penetration testing requires fundamentally different expertise.

    Customer AI Chatbot Flying Blind: The Hidden Risks
    2025-08-27Airside Labs Team

    Customer AI Chatbot Flying Blind: The Hidden Risks

    A comprehensive analysis of 11 leading language models reveals critical safety gaps that could ground your customer service operations.

    Crescendo: How Escalating Conversations Break AI Guardrails
    2025-08-16Airside Labs Team

    Crescendo: How Escalating Conversations Break AI Guardrails

    Why single prompt testing misses the most dangerous AI failures and how the crescendo technique is exposing critical vulnerabilities in customer service systems.

    Alternative to Big Four AI Testing: Why Domains Matter
    2025-07-23Airside Labs Team

    Alternative to Big Four AI Testing: Why Domains Matter

    The AI revolution is sweeping across industries faster than ever, but when it comes to testing and validating these AI systems, many organisations are turning to generic frameworks.

    Airside Labs Responds to the UK AI Opportunities Action Plan
    2025-05-05Airside Labs Team

    Airside Labs Responds to the UK AI Opportunities Action Plan

    At Airside Labs, we're committed to advancing aviation technology through innovative AI solutions while maintaining the industry's paramount focus on safety.

    Airside Labs Responds to UK CAA's AI in Aerospace Request
    2025-05-04Airside Labs Team

    Airside Labs Responds to UK CAA's AI in Aerospace Request

    At Airside Labs, we're committed to advancing aviation technology through innovative AI solutions while maintaining the industry's paramount focus on safety.

    Airside Pre-Flight Benchmark Joins AISI Evaluations Package
    2025-04-11Airside Labs Team

    Airside Pre-Flight Benchmark Joins AISI Evaluations Package

    Aviation AI Benchmark Now Available Through UK's AI Security Institute's inspect_evals Framework

    PRESS RELEASE: Airside Labs Launches Pre-Flight AI Benchmark on GitHub
    2025-02-01Airside Labs Team

    PRESS RELEASE: Airside Labs Launches Pre-Flight AI Benchmark on GitHub

    Aviation AI Testing Framework Now Available for Industry Contributions

    Aviation Eval – Flight Test 1: Anthropic Models Compared
    2024-11-25Airside Labs Team

    Aviation Eval – Flight Test 1: Anthropic Models Compared

    With the exciting release of Anthropic's updated Sonnet and Haiku models, we're sharing our first evaluation results from the Pre-Flight benchmark.

    Airside Labs Launches at Royal Aeronautical Society Event
    2024-11-19Airside Labs Team

    Airside Labs Launches at Royal Aeronautical Society Event

    New Aviation AI Venture Unveils Industry-First Benchmark for AI Model Testing

    Preflight Aviation Intelligence Benchmark: Contributor Guide
    2024-10-02Airside Labs Team

    Preflight Aviation Intelligence Benchmark: Contributor Guide

    We are collecting the most challenging and comprehensive set of aviation-related questions ever assembled for AI evaluation.

    GAIA: Benchmarking Next-Gen AI Assistants for Aviation
    2024-09-30Airside Labs Team

    GAIA: Benchmarking Next-Gen AI Assistants for Aviation

    Benchmarks play a crucial role in measuring AI progress and setting new standards. GAIA is one such benchmark that has caught our attention.