Kosokoking - 31337

Clean label attacks on ML classifiers

How clean label attacks poison ML classifiers by modifying training features without changing labels, with a practical walkthrough against logistic regression.

Targeted label attacks in data poisoning

Implementing a targeted label attack to cause predictable misclassifications, with class-specific evaluation, boundary analysis, and unseen data validation.

Executing a label flipping attack on a classifier

Step-by-step label flipping attack walkthrough on a logistic regression classifier, evaluating accuracy degradation and decision boundary shift at 10% to 50%.

Label flipping attacks in data poisoning

How label flipping data poisoning works, where it fits in the AI attack surface, and a practical Python walkthrough building a clean baseline for the attack.

AI data attacks across the pipeline

Walk through AI data attacks stage by stage, from data poisoning at collection through storage tampering, processing manipulation, and online poisoning.

The AI data pipeline and its attack surface

Map the AI data pipeline stage by stage and learn where each creates attack surface, from data collection through to the retraining feedback loop.

Regulating LLM abuse attacks

How the US and EU regulate LLM abuse attacks, covering the Take It Down Act, NIST AI RMF, the EU Digital Services Act, and the EU AI Act risk framework.

Mitigating LLM abuse attacks

How to defend against LLM abuse attacks using model safeguards, deployment-level filtering with Google Model Armor and ShieldGemma, and content monitoring.

LLM abuse attacks

How LLMs enable abuse attacks including propaganda, AI-generated phishing, misinformation, and hate speech, with real-world cases and threat characteristics.

Mitigating insecure output in LLM applications

How to defend against insecure LLM output handling with context-specific encoding, access control enforcement, rendering layer defences, and sandboxing.