Introduction to LLM jailbreaking

LLM jailbreaking bypasses safety alignment to force models into generating restricted content. Covers DAN, roleplay, token smuggling, and adversarial suffixes.

Direct prompt injection techniques

Direct prompt injection targets LLMs through the user input channel. Covers system prompt extraction strategies and behaviour manipulation techniques.

Introduction to prompt injection

Prompt injection exploits the lack of boundary between system and user prompts in LLMs. Covers multi-turn context, multimodal vectors, and architectural causes.

Prompt engineering fundamentals

Prompt engineering controls LLM output through input design. Covers best practices and maps security risks to OWASP LLM Top 10 and Google SAIF risk categories.

ML system security

ML infrastructure carries every traditional security risk plus deployment-specific threats. Covers misconfigurations, DoS, resource exhaustion, and TTPs.

ML application security

The application layer of ML systems inherits every traditional web vulnerability. Covers injection, authentication, XSS, and social engineering attack vectors.

Attacking model components

A red teamer's reference for attacking model components, covering poisoning, jailbreak techniques, model extraction, and MITRE ATLAS TTP mapping with examples.