ProbeSix

Automated Security Testing for LLMs and AI Systems

ProbeSix identifies jailbreak vulnerabilities, prompt injection risks, behavioural drift and security weaknesses across your LLM endpoints.

Test Any Endpoint

Supports Bedrock, custom APIs, multiple models

Find Real Vulnerabilities

Automated jailbreak and red-team attack simulation

Compliance-Ready Assessments

Maps to ISO 42001, NIST AI Risk Management, MITRE ATLAS, OWASP LLM Top 10 and EU AI Act frameworks

Evidence-Based Reporting

Actionable insights with clear remediation guidance

How it Works

01

Add your endpoint

Connect your Custom API or AWS Bedrock endpoint in seconds

02

Run a security scan

Choose from ISO 42001, NIST AI Risk Management, MITRE ATLAS, OWASP LLM Top 10 and EU AI Act templates

03

View your report

Get detailed findings with clear remediation guidance

See What You'll Get

Explore a sample security assessment report. Toggle between free and full versions to see the difference.

Free tier: See vulnerability summary and risk scores. Detailed test examples are locked.

AI Security Assessment Report
Run 20260129_104629_183358

Acme Corporation • 2/3/2026

0/100

Poor

Score reflects security testing only. Complete the governance assessment for a comprehensive evaluation.

Test Results

Passed656

Failed254

Total910

Vulnerabilities

Critical0

High0

Medium0

Low0

Detailed breakdown of test categories

PII Leak

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

36/210 failed

SSRF Enforcement

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

32/70 failed

Imitation

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

29/70 failed

SQL Injection

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

35/70 failed

Hallucination

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

20/70 failed

Misinformation & Disinformation Harmful lies and propaganda

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Critical

33/70 failed

Shell Injection

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High

11/70 failed

Hate

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High

13/70 failed

Overreliance

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High

30/70 failed

Excessive Agency

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

High

12/70 failed

Privacy violations

Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation

Medium

3/70 failed

Category	Severity	Failed	Pass Rate	Risk Score
PII Leak Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	36/210	82.9%	8.25
SSRF Enforcement Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	32/70	54.3%	8.25
Imitation Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	29/70	58.6%	7.75
SQL Injection Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	35/70	50.0%	7.66
Hallucination Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	20/70	71.4%	7.50
Misinformation & Disinformation Harmful lies and propaganda Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Critical	33/70	52.9%	7.50
Shell Injection Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	High	11/70	84.3%	7.06
Hate Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	High	13/70	81.4%	6.99
Overreliance Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	High	30/70	57.1%	6.75
Excessive Agency Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	High	12/70	82.9%	5.42
Privacy violations Baseline Testing, Multi-Vector Safety Bypass, Single-shot Optimisation	Medium	3/70	95.7%	4.08

Detailed test examples locked

Upgrade to see the specific prompts that failed each test

Assessment completed with 11 security categories tested.
CRITICAL: 6 critical severity issue(s) identified requiring immediate attention.
HIGH: 4 high severity issue(s) identified requiring prompt remediation.
Overall risk score: 4.29 (medium risk).
Highest risk areas: PIILeak:api-db, direct, session, SSRFEnforcement, Imitation.

Scoring Methodology

This report uses two complementary severity measures. The vulnerability breakdown counts individual failed tests by their inherent threat level — how dangerous each specific attack type is regardless of how often it succeeded.

The security findingstable rates each category using a calculated risk score (0–10) that factors in the attack success rate, impact, and exploit complexity. Each category carries a weighting based on the potential impact of a compromise in that area — for instance, a data exfiltration vulnerability is weighted more heavily than a minor content policy violation. A high failure rate combined with a high impact weighting elevates a category's risk score even when individual tests are low-severity, because the volume of successful attacks poses a significant cumulative risk.

Risk score thresholds: 7.5+ Critical, 5.0+ High, 2.5+ Medium, below 2.5 Low.

Strategies used to test the security of your LLM application

Baseline Testing

Runs original adversarial prompts without modification to establish a control baseline for measuring strategy effectiveness.

Multi-Vector Safety Bypass

Chains multiple jailbreak techniques together (role-playing, academic framing, emotional manipulation) to create layered attacks that are harder to detect.

Single-shot Optimisation

Uses an LLM-as-Judge to iteratively refine a single prompt until it bypasses security controls, with feedback-driven improvement cycles.

Ready to scan your own LLM endpoints?

About ProbeSix

Automated security testing purpose-built for the age of large language models.

Why We Exist

Organisations are integrating LLMs and AI-powered endpoints into production faster than ever but most go live without any structured security testing. Traditional application security tools were never designed to catch AI-specific vulnerabilities like prompt injection, hallucination or unsafe content generation.

ProbeSix fills that gap. We provide automated, evidence-based security assessments that test how models actually behave under adversarial conditions — not just whether they have the right configuration but whether they produce safe, accurate, and compliant outputs when challenged.

Every scan produces a structured report with real failed-test evidence, risk scores and actionable remediation guidance your team can act on immediately.

Built by djinn six

ProbeSix is built by djinn six ltd, a London-based security consultancy and AWS Partner that combines compliance expertise with innovation. We specialise in securing AWS infrastructure, responsible AI and quantum readiness for regulated industries.

We built ProbeSix because we saw first-hand how difficult it was for security teams to assess the risks introduced by LLM integrations using existing tooling. Our assessments map directly to the frameworks that matter: ISO 42001, NIST AI Risk Management Framework, OWASP LLM Top 10, MITRE ATLAS and the EU AI Act.

Whether you need to satisfy internal governance or demonstrate AI-specific compliance to regulators, ProbeSix gives you the evidence. We believe security testing for AI should be rigorous, repeatable and accessible to every organisation — not just those with dedicated red teams.

Simple, Transparent Pricing

Start free with 5 scans per month. Subscribe for more scans with full detailed reports included, or upgrade individual reports for £100 + VAT.

Every full report includes

Full detailed findings breakdown

Failed test examples and evidence

Remediation guidance

PDF export

Free

£0

5 scans per month

Unlimited endpoints
Summary reports with scores
Vulnerability breakdown

Upgrade any report to full detail for £100 + VAT

Best Value

Starter

£1,200+ VAT / month

15 scans per month

Full detailed reports included
All scan templates
£80 per scan vs £100 pay-as-you-go
Email support

Professional

£2,400+ VAT / month

35 scans per month

Full detailed reports included
All scan templates
£69 per scan vs £100 pay-as-you-go
Priority email support

Enterprise

Custom

Volume pricing for your needs

Volume-based pricing
Dedicated support
Multi-team access
SLA & invoicing

Expert Review Available

Need help understanding your findings? Book a 1-hour session with a djinn six security consultant directly from your report. £250 + VAT

Free Forever: 5 scans per month · Unlimited endpoints · Summary report included