Skip to content
LearnPlaybook

The AI Hacking Playbook

Hack AI with AI, the ethical way. Hands-on plays for testing LLMs, agents, MCP servers, and RAG. You learn by doing, on ground you are allowed to touch.

16 playsOWASP + ATLAS mappedAttack, then defend

Every play runs against a target you own, a lab you stood up, or a range someone sanctioned. PLAY-00 is the first move, not a formality. We show the pick because we respect the lock: every play closes with the fix.

Pre-Engagement and ROE
Recon and Fingerprint
Vuln Analysis and Probe
Initial Access
Exploitation
AHP-01Core+

Make It Talk: Direct Prompt Injection

The whole prompt is one stream. The model cannot tell the developer's instructions from yours. That gap is the play.

LLMOWASP LLM01:2025 Prompt Injection
AHP-02Advanced+

Smuggle the Instruction: Indirect Prompt Injection

The attacker never talks to the model. The data does it for them.

LLMOWASP LLM01:2025 Prompt Injection
AHP-06Advanced+

Output Is Input: Improper Output Handling

The model is not the target. The thing that trusts the model is the target. When an application renders, queries, or executes whatever the LLM returns, the model becomes a smuggling lane for classic web bugs. This play walks the methodology for finding the sink, not for building the payload.

PipelineOWASP LLM05:2025 Improper Output Handling
AHP-07Advanced+

Hijack the Agent: Excessive Agency and Tool Abuse

The model did not get tricked. The plumbing behind it had no brakes. Turn an agent's own tools against the system it serves, then build the brakes back.

AgentOWASP LLM06:2025 Excessive Agency
AHP-12Edge+

Automate the Campaign: Multi-Turn Red-Team Orchestration

Single prompts find single bugs. Campaigns find the patterns. This play wires an orchestrator to a target, a converter, and a scorer so a multi-turn adversarial run executes itself, scores every reply, and hands you a ranked list of hits instead of a wall of transcripts. Methodology only, no payloads, authorized ranges only.

PipelineOWASP LLM01: Prompt Injection (at scale)
AHP-20Edge+

Smuggle It in a Picture: Multimodal Prompt Injection

A vision or audio model does not draw a line between "content to look at" and "instructions to obey." Paint text into a picture, or blend a perturbation into a sound, and the model reads it as a command. The user never sees it. The model never asks. This play shows the methodology, on your own endpoint, with public tools.

LLMOWASP LLM01:2025 Prompt Injection (multimodal)
Post-Ex and Impact
Reporting
Krypteia AgentComing soon

The playbook is the craft. The agent runs it.

These plays are the manual way. Krypteia is building the autonomous operator that runs them end to end, on authorized targets, so one engineer covers the ground a team used to. A look behind the curtain:

  • Autonomous multi-agent orchestration runs the chain end to end
  • Gated to your signed scope, nothing executes outside it
  • Every finding mapped to OWASP LLM Top 10 and MITRE ATLAS
  • One operator console for the whole engagement