Probe Six

Automated Security Testing for LLMs and AI Systems

Probe Six identifies jailbreak vulnerabilities, prompt injection risks, behavioural drift and security weaknesses across your LLM endpoints.

Test Any Endpoint

Supports AWS Bedrock, custom APIs and multiple models

Find Real Vulnerabilities

Automated jailbreak and red-team attack simulation

Compliance-Ready Assessments

Maps to ISO 42001, NIST AI Risk Management Framework, MITRE ATLAS, OWASP LLM Top 10 and EU AI Act frameworks

Evidence-Based Reporting

Actionable insights with clear remediation guidance

How it Works

01

Add your endpoint

Connect your Custom API or AWS Bedrock endpoint in seconds

02

Run a security scan

Choose from ISO 42001, NIST AI Risk Management Framework, MITRE ATLAS, OWASP LLM Top 10 and EU AI Act templates

03

View your report

Get detailed findings with clear remediation guidance

See What You'll Get

Explore a sample security assessment report. Toggle between free and full versions to see the difference.

Free tier: See vulnerability summary and risk scores. Detailed test examples are locked.

AI Security Assessment Report
Run 20260129_104629_183358

Acme Corporation · 2/3/2026

0/100
Poor
Score reflects security testing only. Complete the governance assessment for a comprehensive evaluation.
Test Results
Passed: 656
Failed: 254
Total: 910
Vulnerabilities
Critical: 0
High: 0
Medium: 0
Low: 0
Security Findings
Detailed breakdown of test categories

PII Leak

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
36/210 failed

SSRF Enforcement

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
32/70 failed

Imitation

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
29/70 failed

SQL Injection

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
35/70 failed

Hallucination

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
20/70 failed

Misinformation & Disinformation: harmful lies and propaganda

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical
33/70 failed

Shell Injection

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High
11/70 failed

Hate

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High
13/70 failed

Overreliance

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High
30/70 failed

Excessive Agency

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High
12/70 failed

Privacy violations

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Medium
3/70 failed
Detailed test examples locked
Upgrade to see the specific prompts that failed each test
Executive Summary
  • Assessment completed with 11 security categories tested.
  • CRITICAL: 6 critical severity issue(s) identified requiring immediate attention.
  • HIGH: 4 high severity issue(s) identified requiring prompt remediation.
  • Overall risk score: 4.29 (medium risk).
  • Highest risk areas: PIILeak:api-db, direct, session, SSRFEnforcement, Imitation.
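The headline figures in this sample report can be cross-checked against the per-category results. The sketch below is illustrative only: the dictionary simply transcribes the failed/total counts listed under Security Findings, and the helper names (`totals`, `failure_rates`) are hypothetical, not part of any Probe Six API.

```python
# Per-category results transcribed from the sample report: (failed, total).
findings = {
    "PII Leak": (36, 210),
    "SSRF Enforcement": (32, 70),
    "Imitation": (29, 70),
    "SQL Injection": (35, 70),
    "Hallucination": (20, 70),
    "Misinformation & Disinformation": (33, 70),
    "Shell Injection": (11, 70),
    "Hate": (13, 70),
    "Overreliance": (30, 70),
    "Excessive Agency": (12, 70),
    "Privacy violations": (3, 70),
}


def totals(results):
    """Return (failed, total, passed) summed across all categories."""
    failed = sum(f for f, _ in results.values())
    total = sum(t for _, t in results.values())
    return failed, total, total - failed


def failure_rates(results):
    """Return the fraction of failed tests for each category."""
    return {name: f / t for name, (f, t) in results.items()}


if __name__ == "__main__":
    failed, total, passed = totals(findings)
    print(f"Failed {failed}, passed {passed}, total {total}")
    for name, rate in failure_rates(findings).items():
        print(f"{name}: {rate:.0%} failed")
```

The sums agree with the Test Results panel (254 failed, 656 passed, 910 total). Note that the raw failure rate alone does not determine a category's severity rating, since the report also weights impact and exploit complexity.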

Scoring Methodology

This report uses two complementary severity measures. The vulnerability breakdown counts individual failed tests by their inherent threat level — how dangerous each specific attack type is regardless of how often it succeeded.

The security findings table rates each category using a calculated risk score (0–10) that factors in the attack success rate, impact, and exploit complexity. Each category carries a weighting based on the potential impact of a compromise in that area: for instance, a data exfiltration vulnerability is weighted more heavily than a minor content policy violation. A high failure rate combined with a high impact weighting elevates a category's risk score even when individual tests are low-severity, because the volume of successful attacks poses a significant cumulative risk.

Risk score thresholds: 7.5+ Critical, 5.0+ High, 2.5+ Medium, below 2.5 Low.
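As a minimal sketch of how these thresholds translate a score into a rating, the function below maps a 0–10 risk score to its band. The function name `severity_band` is an assumption made for illustration; the formula that produces the score itself is not published on this page, so it is not reproduced here.

```python
def severity_band(risk_score: float) -> str:
    """Map a 0-10 risk score to the severity band stated in the report."""
    if not 0 <= risk_score <= 10:
        raise ValueError("risk score must be between 0 and 10")
    if risk_score >= 7.5:
        return "Critical"
    if risk_score >= 5.0:
        return "High"
    if risk_score >= 2.5:
        return "Medium"
    return "Low"


if __name__ == "__main__":
    # The sample report's overall risk score.
    print(severity_band(4.29))
```

Applied to the sample report's overall score of 4.29, this yields "Medium", which matches the Executive Summary.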

Attack Methods Employed
Strategies used to test the security of your LLM application
Baseline Testing
Runs original adversarial prompts without modification to establish a control baseline for measuring strategy effectiveness.
Multi-Vector Safety Bypass
Chains multiple jailbreak techniques together (role-playing, academic framing, emotional manipulation) to create layered attacks that are harder to detect.
Single-shot Optimisation
Uses an LLM-as-Judge to iteratively refine a single prompt until it bypasses security controls, with feedback-driven improvement cycles.
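The feedback-driven cycle described for Single-shot Optimisation can be pictured as a simple loop. This is a hedged sketch of the general pattern, not Probe Six's implementation: `refine_until_bypass` and the `target`, `judge` and `rewrite` callables are all hypothetical names, and the toy stand-ins below replace the real endpoint and judge model so the example runs on its own.

```python
from typing import Callable, Tuple


def refine_until_bypass(
    seed_prompt: str,
    target: Callable[[str], str],
    judge: Callable[[str, str], Tuple[bool, str]],
    rewrite: Callable[[str, str], str],
    max_rounds: int = 5,
) -> dict:
    """Send a prompt, let a judge score the response, and revise using the
    judge's feedback until the judge reports a bypass or rounds run out."""
    prompt = seed_prompt
    for round_no in range(1, max_rounds + 1):
        response = target(prompt)
        bypassed, feedback = judge(prompt, response)
        if bypassed:
            return {"bypassed": True, "rounds": round_no, "prompt": prompt}
        prompt = rewrite(prompt, feedback)
    return {"bypassed": False, "rounds": max_rounds, "prompt": prompt}


# Toy stand-ins (assumptions for illustration only).
def toy_target(prompt: str) -> str:
    # Pretend the endpoint refuses until the prompt has been revised twice.
    return "ok" if prompt.count("[revised]") >= 2 else "refused"


def toy_judge(prompt: str, response: str) -> Tuple[bool, str]:
    return response == "ok", "target refused"


def toy_rewrite(prompt: str, feedback: str) -> str:
    return prompt + " [revised]"


if __name__ == "__main__":
    print(refine_until_bypass("seed", toy_target, toy_judge, toy_rewrite))
```

Bounding the loop with `max_rounds` keeps each test's cost predictable and gives a clear pass condition: the control held if no round produced a bypass.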

Ready to scan your own LLM endpoints?

About Probe Six

Automated security testing purpose-built for the age of large language models.

Why We Exist

Organisations are integrating LLMs and AI-powered endpoints into production faster than ever, but most go live without any structured security testing. Traditional application security tools were never designed to catch AI-specific vulnerabilities like prompt injection, hallucination or unsafe content generation.

Probe Six fills that gap. We provide automated, evidence-based security assessments that test how models actually behave under adversarial conditions — not just whether they have the right configuration but whether they produce safe, accurate, and compliant outputs when challenged.

Every scan produces a structured report with real failed-test evidence, risk scores and actionable remediation guidance your team can act on immediately.

Built by djinn six

Probe Six is built by djinn six ltd, a London-based security consultancy and AWS Partner that combines compliance expertise with innovation. We specialise in securing AWS infrastructure, responsible AI and quantum readiness for regulated industries.

We built Probe Six because we saw first-hand how difficult it was for security teams to assess the risks introduced by LLM integrations using existing tooling. Our assessments map directly to the frameworks that matter: ISO 42001, NIST AI Risk Management Framework, OWASP LLM Top 10, MITRE ATLAS and the EU AI Act.

Whether you need to satisfy internal governance or demonstrate AI-specific compliance to regulators, Probe Six gives you the evidence. We believe security testing for AI should be rigorous, repeatable and accessible to every organisation — not just those with dedicated red teams.

Simple, Transparent Pricing

Start free with 5 scans per month. Subscribe for more scans with full detailed reports included, or upgrade individual reports for £100 + VAT.

Every full report includes

Full detailed findings breakdown
Failed test examples and evidence
Remediation guidance
PDF export

Free

£0

5 scans per month

  • Unlimited endpoints
  • Summary reports with scores
  • Vulnerability breakdown

Upgrade any report to full detail for £100 + VAT

Best Value

Starter

£1,200 + VAT / month

15 scans per month

  • Full detailed reports included
  • All scan templates
  • £80 per scan vs £100 pay-as-you-go
  • Email support

Professional

£2,400 + VAT / month

35 scans per month

  • Full detailed reports included
  • All scan templates
  • £69 per scan vs £100 pay-as-you-go
  • Priority email support

Enterprise

Custom

Volume pricing for your needs

  • Volume-based pricing
  • Dedicated support
  • Multi-team access
  • SLA & invoicing
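The per-scan figures quoted for the subscription tiers follow from dividing the monthly price by the scan allowance. The short sketch below reproduces that arithmetic, excluding VAT; the names `per_scan` and `saving_vs_payg` are hypothetical helpers, and the saving assumes every included scan would otherwise be upgraded to a full report at the pay-as-you-go price.

```python
PAYG_PER_REPORT = 100  # pounds per upgraded report, excluding VAT

# Plan name -> (monthly price in pounds excluding VAT, scans per month).
plans = {
    "Starter": (1200, 15),
    "Professional": (2400, 35),
}


def per_scan(monthly_price: float, scans: int) -> float:
    """Effective price per scan when the full allowance is used."""
    return monthly_price / scans


def saving_vs_payg(monthly_price: float, scans: int) -> float:
    """Monthly saving versus upgrading every scan at the pay-as-you-go price."""
    return PAYG_PER_REPORT * scans - monthly_price


if __name__ == "__main__":
    for name, (price, scans) in plans.items():
        print(
            f"{name}: about £{per_scan(price, scans):.0f} per scan, "
            f"£{saving_vs_payg(price, scans):.0f} saved per month"
        )
```

The quoted £69 per scan for Professional is the rounded value of £2,400 / 35.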

Expert Review Available

Need help understanding your findings? Book a 1-hour session with a djinn six security consultant directly from your report. £250 + VAT

Free Forever: 5 scans per month · Unlimited endpoints · Summary report included