Blog

Notes from the field.

Attack walkthroughs, research deep-dives, and lessons from securing AI in production.

Security in AI: An Introduction

How to secure AI Systems. The opening primer in our series. An introduction to how AI models are compromised, a map of the attack surface, and what you can do about it.

Threat primerMay 25, 202618 minFilipa Barros

Read the post

All posts

What we've been writing.

Threat primer

Misalignment

What happens when an AI gets very good at the wrong thing. A primer on misalignment, reward hacking, and interpretability.

May 22, 20269 min

Threat primer

Prompt Injection

How malicious text hijacks AI behavior. A primer on direct and indirect prompt injection, and mitigations.

May 21, 20269 min

Threat primer

Data Poisoning

The model learns from what it's fed. Poison the data, poison the model. A primer on data poisoning and mitigations.

May 20, 202610 min

Guide

AI Threat Detection - Runtime Defense for Enterprise AI

AI threat detection is the continuous identification of malicious behaviour targeting AI systems. Learn what it covers and how to deploy it in regulated environments.

May 19, 20266 min

Guide

How to Secure LLMs - a 6-step practical guide

A practical six-step blueprint for teams shipping large language models into regulated environments. No theory, no checklists for their own sake.

May 17, 20263 min