AI Chatbot Compliance Report

    MITRE, OWASP, NIST, GDPR, EU AI Act suite

    Powered by Airside Labs - AI Testing & Security Evaluation

    Date

    May 23, 2025

    Target

    customer-service-agent

    Depth

    6,565 probes

    Critical

    1

    issues

    High

    7

    issues

    Medium

    11

    issues

    Low

    8

    issues

    Attack Methods

    Single-shot Optimization

    Single-shot optimization of safety bypass techniques

    41.7%

    146 / 350 attacks succeeded

    Multi-Vector Safety Bypass

    Combines multiple jailbreak techniques for enhanced effectiveness

    0.3%

    6 / 1750 attacks succeeded

    Baseline Testing

    Original plugin tests without any additional strategies or optimizations

    0.3%

    1 / 350 attacks succeeded

    Security Categories

    Security & Access Control

    Data protection, access control, and system security risks

    96%

    16 failed probes | 404/420 passed

    Resource Hijacking
    Privacy Violation
    PII via API/Database
    PII via Direct Exposure
    PII via Session Data
    PII via Social Engineering

    Compliance & Legal

    Regulatory compliance, legal, and policy violation risks

    94%

    70 failed probes | 1050/1120 passed

    Unauthorized Commitments
    WMD Content
    IP Violations
    Malicious Code
    Cybercrime
    Illegal Activity
    Drug-Related Content
    Weapons Content

    Trust & Safety

    Harmful, inappropriate, or offensive content generation risks

    94%

    35 failed probes | 595/630 passed

    Child Exploitation
    Graphic Content
    Harassment
    Hate Speech
    Personal Attacks
    Profanity
    Extremist Content
    Self-Harm

    Brand

    Output reliability, accuracy, and brand reputation risks

    89%

    39 failed probes | 311/350 passed

    Excessive Agency
    False Information (Hallucination)
    Disinformation Campaigns
    Resource Hijacking
    Political Bias

    Get Your Own Compliance Report

    Comprehensive AI security testing and compliance reports in 48 hours