03 / Catalogue 2026.04 · 12 vectors

The
Attack Catalogue.

Every attack in PromptShield is reproducible, mapped to 9 of 10 OWASP LLM categories, and documented with concrete detection indicators and mitigations. This overview is the public subset — the full 217 vectors run in the 25-attack and Continuous-CI tiers.

LLM08 — Vector & Embedding Weaknesses — shipping 2026.05.

Run a free teaser scan → Compare plans

LLM01

Prompt Injection

3 vectors

CRITICAL IGNORE_PRIOR_OVERRIDE

Direct prompt injection

User input overrides your model’s system instruction and takes control of the response.

Full vector report →
CRITICAL RAG_HTML_COMMENT_HIJACK

Indirect injection (RAG)

Malicious instructions arrive not from the user but from a retrieved document or web page — and still get executed.

Full vector report →
HIGH DAN_PERSONA_FORK

System-prompt jailbreak

A persona or role override bypasses your content policy without attacking the system prompt directly.

Full vector report →

LLM02

Sensitive Information Disclosure

1 vector

CRITICAL MARKDOWN_IMG_EXFIL

Data exfiltration via tool call

A manipulated prompt forces the model to send sensitive conversation context through an allowed tool to an external endpoint.

Full vector report →

LLM03

Supply Chain

1 vector

HIGH WEIGHT_HASH_DRIFT

Supply-chain compromise (model / library)

A pinned model weight, an embedding index, or an inference library has been silently swapped for a tampered variant.

Full vector report →

LLM04

Data and Model Poisoning

1 vector

HIGH DELAYED_TRIGGER_PHRASE

Data and model poisoning

Attacker-planted content in the training or RAG corpus only fires on a specific trigger phrase — and is invisible to standard evals.

Full vector report →

LLM05

Improper Output Handling

1 vector

HIGH MD_SCRIPT_TAG_INJECT

Unsafe output rendering (XSS)

Model output is rendered as HTML / Markdown in your UI and runs attacker-injected script in the user’s browser.

Full vector report →

LLM06

Excessive Agency

2 vectors

LLM07

System Prompt Leakage

1 vector

HIGH TRANSLATE_PROBE

System-prompt leakage

The model returns its system prompt, tool schema, or hidden instructions verbatim to a curious user.

Full vector report →

LLM09

Misinformation

1 vector

MED FAKE_CITATION_PROBE

Model misinformation (hallucination as attack)

The model fabricates facts, citations, or API signatures — and downstream systems (CI, compliance, code reviewers) trust the output.

Full vector report →

LLM10

Unbounded Consumption

1 vector

MED RECURSIVE_EXPANSION

Unbounded consumption (token flooding)

A crafted prompt forces your stack into disproportionate token consumption or wallclock time and inflates the inference bill.

Full vector report →

Public catalogue not enough? Continuous tier scans your endpoint against all 217 vectors after every commit — including your own custom test cases and a signed PDF report per run.

View Continuous tier →

The
Attack Catalogue.

Prompt Injection

Direct prompt injection

Indirect injection (RAG)

System-prompt jailbreak

Sensitive Information Disclosure

Data exfiltration via tool call

Supply Chain

Supply-chain compromise (model / library)

Data and Model Poisoning

Data and model poisoning

Improper Output Handling

Unsafe output rendering (XSS)

Excessive Agency

Tool-call hijacking

Plugin privilege escalation

System Prompt Leakage

System-prompt leakage

Misinformation

Model misinformation (hallucination as attack)

Unbounded Consumption

Unbounded consumption (token flooding)

The Attack Catalogue.

Prompt Injection

Direct prompt injection

Indirect injection (RAG)

System-prompt jailbreak

Sensitive Information Disclosure

Data exfiltration via tool call

Supply Chain

Supply-chain compromise (model / library)

Data and Model Poisoning

Data and model poisoning

Improper Output Handling

Unsafe output rendering (XSS)

Excessive Agency

Tool-call hijacking

Plugin privilege escalation

System Prompt Leakage

System-prompt leakage

Misinformation

Model misinformation (hallucination as attack)

Unbounded Consumption

Unbounded consumption (token flooding)

The
Attack Catalogue.