Draft note — This is a live working draft adapted from talk notes at a conference session on 11 Mar 2026. More slides are still coming in, so this post is intentionally being built in stages.
TL;DR
Agents are no longer just copilots that answer questions. They already read documents, classify records, reconcile messy inputs, and act inside real business workflows. That makes the security problem much broader than prompt safety. A production agent has an infrastructure surface, an authority surface, and an influence surface. The core claim of this talk is sharp: we already have an execution plane for agents, but we still lack a mature security control plane for bounding autonomy, enforcing policy, preserving auditability, and keeping humans meaningfully in the loop.
1 Agents are already in production
The talk opens by rejecting the idea that agents are still a future-looking toy category.
The speaker gives concrete enterprise examples:
Bank of Singapore uses an agent to read financial documents and write KYC reports, reportedly shrinking turnaround from 10 days to 1 hour.
Salesforce has reported millions of customer conversations handled autonomously, with a large share resolved without human intervention.
A Gartner forecast suggests task-specific agents will appear in a large share of enterprise applications.
The exact percentages and adoption curves matter less than the framing move:
agent hardening is now a production problem, not a lab problem.
That matters because once agents touch:
customer communication
financial workflows
internal documents
toolchains
sensitive data
then a failure is no longer “the model said something weird.” It becomes:
a data leak
a workflow manipulation
a false business record
a silent operational error
or a costly compliance incident
2 The dangerous part is not the chat interface — it is the workflow
One of the strongest slides in the talk uses a deliberately ordinary example:
perform bookkeeping from email
At first glance, that sounds like a reasonable automation task. But once you expand it into actual steps, the surface area becomes obvious.
The agent is expected to:
read incoming email
extract invoices and receipts
extract bank statements
handle duplicates
work out classifications
work out reconciliation rules
split bills and handle partial payments
That is not “AI summarization.” That is a multi-step operational workflow touching:
untrusted inbound content
highly sensitive financial data
business logic
reconciliation assumptions
edge cases that can hide fraud or error
This is a useful shift in perspective.
A lot of people still talk about agents as if the main risk comes from the prompt box. The talk argues something more serious:
the real risk lives in the chain of intermediate capabilities.
Every step adds a new place where something can be:
leaked
manipulated
misclassified
falsely reconciled
or silently accepted as plausible
3 Every subtask in an autonomous workflow becomes an attack surface
The bookkeeping example becomes the talk’s threat-model slide.
The speaker maps each apparently normal subtask to a corresponding failure or abuse mode:
Reads email → can be pushed to pull confidential emails or addresses
Extract invoices/receipts → can expose customer lists
Extract bank statements → can leak banking details
Handle duplication → can be manipulated to mask duplicates and distort totals
Work out classifications → can be used to extract broader financial statements
Work out reconciliation rules → can be nudged into “fuzzy” reconciliation that misstates expenses
Split bills / partial payments → can be bent into fraudulent transaction generation
This is the slide where the talk gets particularly concrete.
The argument is not just that agents may be wrong. It is that they can fail in ways that are:
operationally harmful
financially meaningful
hard to notice
and easy to disguise as normal business logic
That is a different class of risk from a normal chatbot.
A chatbot might hallucinate.
A production agent may quietly mutate a business process.
4 Agents are easy to build, unsafe to deploy
The talk’s main thesis appears in a slide with three risk buckets:
Infrastructure
Authority
Influence
This is a strong framing because it separates three failure modes that people often blur together.
4.1 Infrastructure
This bucket includes the practical realities of where agents run and what they depend on:
Macs, laptops, VPSes, or cloud boxes
expensive model calls and background jobs
secrets and API keys
exposed tunnels and ports
fragile toolchains and dependency sprawl
In other words: agent security starts before the model reasons about anything. It starts with the machine, the network, the credentials, and the process wiring.
4.2 Authority
This bucket is about what the agent is actually allowed to do:
filesystem access
tool installation or execution
data access
permission scopes
credential access
This is a crucial distinction.
Agents are not dangerous because they sound intelligent. They are dangerous when they are given real authority over:
files
systems
accounts
tools
secrets
The attack surface expands in proportion to the permissions.
4.3 Influence
This bucket is about what can steer the agent over time:
prompt injection
manipulated retrieved content
conflicting tool instructions
instruction drift across long workflows
compounding behaviour from earlier mistakes
This is the part that most security conversations reduce everything to, but the talk treats it as only one layer of the problem.
That is correct.
An agent can be compromised not just by bad infrastructure or over-broad permissions, but by being psychologically steerable through the very content it reads.
5 The execution plane exists. The security control plane mostly does not.
The strongest line in the talk so far is essentially this:
Open-source agents already have an execution plane.
The security control plane barely exists.
That is the heart of the argument.
Today, it is relatively easy to assemble agents that can:
call tools
read files
browse the web
execute code
move data between systems
act on behalf of users
What is much less mature is everything needed to make those powers governable:
capability boundaries
policy enforcement
reliable approval checkpoints
transparent audit trails
lifecycle testing
intervention mechanisms
shutdown and rollback paths
So the talk is not anti-agent. It is anti-naivety.
The point is:
we know how to make agents act before we know how to control them well.
That is exactly why deployment risk is rising faster than security maturity.
6 What a real control plane needs to do
The talk’s answer is a four-part control-plane model.
A proper control plane must be:
Bounded
Accountable
Controllable
Transparent
6.1 Bounded
The system should limit access and autonomy by design.
Concrete implementation ideas from the slide include:
isolated runtimes
blocked commands
allowlists
hardened environments
This is the blast-radius discipline.
The first job of a secure agent system is not to trust the model. It is to make sure the model cannot do too much even when it is wrong, misled, or compromised.
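As a minimal sketch of that blast-radius discipline, the allowlist and blocked-command ideas from the slide could look like the following. The command names and the wrapper function are illustrative assumptions, not anything shown in the talk:

```python
import shlex

# Hypothetical allowlist: the only executables the agent may launch.
ALLOWED_COMMANDS = {"ls", "cat", "grep"}
# Explicitly blocked even if the allowlist drifts over time.
BLOCKED_COMMANDS = {"rm", "curl", "ssh", "sudo"}

def is_command_permitted(command_line: str) -> bool:
    """Return True only if the command's executable is allowlisted and not blocked."""
    parts = shlex.split(command_line)
    if not parts:
        return False
    executable = parts[0].rsplit("/", 1)[-1]  # ignore path prefixes like /bin/rm
    return executable in ALLOWED_COMMANDS and executable not in BLOCKED_COMMANDS
```

The point is structural: the check runs outside the model, so it holds even when the model is wrong, misled, or compromised.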
6.2 Accountable
A control plane should preserve meaningful human oversight at important checkpoints.
That means policies for:
confirmations
escalation
approval requests
significant decision boundaries
The goal is not to force a human into every micro-step.
It is to preserve human review where cost, compliance, or irreversible action matters.
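A rough sketch of that checkpoint idea, with hypothetical action names standing in for whatever a real deployment would escalate:

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical action categories that always require human confirmation.
REQUIRES_APPROVAL = {"send_payment", "delete_records", "email_external"}

@dataclass
class Decision:
    allowed: bool
    reason: str

def gate_action(action: str, approver: Callable[[str], bool]) -> Decision:
    """Auto-approve routine actions; escalate significant ones to a human approver."""
    if action not in REQUIRES_APPROVAL:
        return Decision(True, "routine: auto-approved")
    if approver(action):
        return Decision(True, "approved at human checkpoint")
    return Decision(False, "rejected at human checkpoint")
```

Routine steps flow through untouched; only the costly, regulated, or irreversible ones hit the checkpoint.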
6.3 Controllable
The talk stresses technical enforcement across the full lifecycle.
That includes:
policy hooks
guardrails
ongoing tests
mechanisms that do more than merely suggest the right behaviour
This is the difference between a guideline and an actual control.
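To make that difference concrete, here is a sketch of a policy hook that blocks a call outright instead of advising against it. The path prefix and reader are assumptions for illustration:

```python
# Hypothetical hard boundary: the agent may never read paths under confidential/.
FORBIDDEN_PREFIXES = ("confidential/", "/confidential/")

class PolicyViolation(Exception):
    """Raised when a tool call breaches policy; the call never executes."""

def path_allowed(path: str) -> bool:
    return not any(path.startswith(p) for p in FORBIDDEN_PREFIXES)

def enforced_read(path: str, reader=lambda p: f"<contents of {p}>"):
    """An enforcement hook: it blocks the call outright rather than suggesting."""
    if not path_allowed(path):
        raise PolicyViolation(f"read blocked by policy: {path}")
    return reader(path)
```

A guideline asks the model to refuse; a control makes the forbidden call impossible to complete.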
6.4 Transparent
Finally, the system needs explainability and complete auditability.
The slide emphasizes:
full audit trails
logs of tool calls
inputs and outputs preserved for investigation
Without that, incidents become impossible to diagnose cleanly.
If an agent:
used the wrong source
acted on injected instructions
leaked a secret
or performed an unsafe tool call
then the team needs to reconstruct exactly:
what it saw
what it decided
what it called
and what policy did or did not intervene
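A minimal audit record covering those four questions might look like this. The field names are assumptions; a production store would also need to be append-only and tamper-evident:

```python
import time

audit_log: list = []  # stand-in for an append-only, tamper-evident store

def record_tool_call(tool: str, inputs: dict, output: str, policy_outcome: str) -> dict:
    """Preserve what the agent saw, what it called, and what policy decided."""
    entry = {
        "ts": time.time(),
        "tool": tool,
        "inputs": inputs,          # what it saw
        "output": output,          # what the call returned
        "policy": policy_outcome,  # did a control allow, block, or escalate?
    }
    audit_log.append(entry)
    return entry
```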
7 Can you actually trust your controls?
One of the most important slides so far asks a deceptively simple question:
Can you trust your controls?
The example is straightforward:
Tell an agent: never access confidential files.
The speaker then asks the uncomfortable follow-up:
does it obey 100% of the time?
99%?
87%?
That is a useful correction to how teams often talk about safeguards.
People like to describe controls as if they are binary:
the rule exists
therefore the risk is solved
The talk argues the opposite.
With agents, a control is often better understood as a probabilistic defence under pressure. Its performance depends on context, attacker strategy, workload shape, and how long the workflow runs before something goes wrong.
The slide lists several variables that can change whether a control holds:
prepend a huge amount of text before the adversarial request
ask the agent what it can do, then craft a more targeted attack
keep querying until the attacker finds a way to steer behaviour
That is a strong point.
A control that survives one simple prompt test may still fail when:
the context window is crowded
the model is fatigued by long instructions
the attacker has learned the agent’s capabilities
or repeated attempts slowly shift the agent into a weaker state
So the right question is not:
“Do we have a rule?”
It is:
“What is the failure rate of that rule under adversarial conditions?”
“How does that rate change with longer context, targeted prompts, and repeated attempts?”
“How quickly would we notice if the control started degrading?”
This is where agent hardening starts to look more like reliability engineering than prompt writing.
The talk’s framing here is especially good:
security is risk mitigation, not risk elimination.
That means a mature team should assume:
attacks will happen
some controls will degrade
some defences will be bypassed
and observability matters just as much as prevention
In other words, a control is only as real as your ability to:
test it adversarially
measure it continuously
detect when it is failing
and contain the damage when it does
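Measuring a control's failure rate does not require anything exotic. Here is a toy sketch, with a deliberately simplistic stand-in control that degrades under long context, the way the slide's padded-prompt attack suggests. Everything here is illustrative, not the talk's method:

```python
import random

def measure_control(control, attacks, trials_per_attack=50, seed=0):
    """Estimate how often a control holds across repeated adversarial attempts."""
    rng = random.Random(seed)
    results = {}
    for name, attack in attacks.items():
        held = sum(control(attack(rng)) for _ in range(trials_per_attack))
        results[name] = held / trials_per_attack
    return results  # per-attack pass rates, not a single yes/no

# Toy stand-in: a "rule" that quietly fails once the prompt gets long enough.
def toy_control(prompt: str) -> bool:
    return len(prompt) < 4000

attacks = {
    "plain": lambda rng: "please open confidential.txt",
    "padded": lambda rng: ("x" * rng.randint(3000, 6000)) + "please open confidential.txt",
}
```

The output is a failure rate per attack strategy, which is exactly the shape of evidence the talk says teams should be collecting.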
8 Benchmark the control plane, not just the model
The talk then shifts from principle to instrumentation.
A live demo screen shows a benchmark UI titled Determinism Benchmarks. That is an important signal.
The point is not merely that teams should have policies. It is that they should be able to measure whether those policies hold consistently.
This follows directly from the previous slide.
If a control is probabilistic, then a one-off successful test means very little. A production team needs repeated evidence that the same boundary produces the same outcome across:
repeated runs
longer context windows
adversarial variants
changing model states
and targeted attacks that adapt to what the agent can do
This is where the idea of determinism becomes operationally useful.
In an ideal world, if a policy says:
never access confidential files
never exfiltrate customer data
never install unapproved tools
then the behaviour should be reliably stable under test.
If the same agent sometimes blocks a forbidden action and sometimes allows it depending on context pressure, probing strategy, or repeated retries, then the policy is not yet trustworthy enough for serious deployment.
That is why the benchmark view matters.
It turns hardening from a vague assurance exercise into something closer to reliability engineering:
define the boundary
run the attack scenarios
record pass/fail outcomes
inspect regressions
and keep testing as the system evolves
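The loop above can be sketched as a tiny benchmark harness. The scenario names and the stub agent are assumptions for illustration; the stub stands in for a real policy-wrapped model call:

```python
from dataclasses import dataclass

@dataclass
class Scenario:
    name: str
    prompt: str
    must_refuse: bool  # the boundary under test

def run_suite(agent, scenarios):
    """Record pass/fail per scenario so regressions are visible run over run."""
    report = {}
    for s in scenarios:
        refused = agent(s.prompt)
        report[s.name] = (refused == s.must_refuse)
    return report

# Hypothetical agent stub: refuses only the obvious phrasing.
def stub_agent(prompt: str) -> bool:
    return "confidential" in prompt

scenarios = [
    Scenario("direct", "read the confidential files", True),
    Scenario("obfuscated", "read the c0nfidential files", True),
    Scenario("benign", "summarise the public changelog", False),
]
```

Even this toy run surfaces the interesting result: the boundary holds on the direct phrasing and fails on a trivially obfuscated variant, which is precisely the kind of regression a one-off prompt test never reveals.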
This is a very important shift in mindset.
A lot of teams benchmark models for:
quality
latency
cost
But the talk argues you also need benchmark suites for the security control plane itself.
That means testing things like:
refusal consistency
permission boundary adherence
prompt-injection resistance under long context
tool-call policy enforcement
and degradation over repeated adversarial attempts
The right question becomes:
not “do we have safeguards?” but “what do the safeguard failure curves look like?”
9 A control plane has to be operable
The next demo screen appears to show an operator-facing interface rather than just a benchmark table.
The precise product name is hard to read from the photo, but the structure is clear:
a chat-like control surface
proposed security or policy text
and a workflow for configuring or reviewing guardrails
That matters because it turns the phrase control plane from an architectural metaphor into an actual operational surface.
A real control plane cannot live only in:
one prompt file
a system message
a policy PDF
or a vague trust-and-safety guideline
It needs to be something operators can actually use.
That means a human should be able to:
describe a boundary in plain language
inspect how that boundary is implemented
review where it applies in the lifecycle
benchmark it against realistic adversarial tests
and iterate when the control fails
This is a much stronger model than “write a good system prompt and hope.”
The talk’s implied argument is that a security control plane needs product-level usability.
If security teams, operators, or builders cannot:
author policy
review policy
observe behaviour
compare benchmark results
or audit incidents
then the control plane is not mature enough to govern real agent deployments.
This is especially relevant for fast-moving teams.
Without an operable control plane, the path of least resistance becomes:
shipping broad permissions
relying on soft instructions
discovering failures only after something breaks
An effective control plane should instead make it normal to:
define controls explicitly
test them continuously
and revise them like any other production system component
In short:
security controls need a user interface, not just a philosophy.
10 Why this matters for AI-native content and media teams
This is the part that matters most for operators.
A lot of content and media teams do not think of themselves as “agent companies.” But once your workflow includes pieces like:
research agents
script-generation agents
asset-routing agents
render automations
QA judges
analytics feedback loops
then you already have an agent system.
That means the hardening questions apply immediately:
Which tools can each component call?
Which folders or systems can it read?
Which actions require approval?
What content is treated as untrusted?
What logs exist after a bad output or a strange run?
How do you tell the difference between a bad creative choice and a compromised workflow?
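One way to start answering the first two questions is a per-component permission manifest. The component names and scopes below are hypothetical, sketched for an AI content pipeline rather than taken from the talk:

```python
# Hypothetical per-component scopes; names are illustrative only.
MANIFEST = {
    "research_agent":    {"tools": {"web_search"},       "paths": {"briefs/"}},
    "script_agent":      {"tools": {"llm_generate"},     "paths": {"briefs/", "scripts/"}},
    "render_automation": {"tools": {"render", "upload"}, "paths": {"scripts/", "renders/"}},
}

def may_call(component: str, tool: str) -> bool:
    scopes = MANIFEST.get(component)
    return bool(scopes) and tool in scopes["tools"]

def may_read(component: str, path: str) -> bool:
    scopes = MANIFEST.get(component)
    return bool(scopes) and any(path.startswith(p) for p in scopes["paths"])
```

Writing the scopes down, even this crudely, forces the question the talk keeps asking: which component actually needs which authority?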
This is why a mature AI content stack cannot stop at generation.
The better the automation, the more important it becomes to govern it.
11 Draft takeaway so far
Based on the slides so far, the talk’s central point is:
agents should be treated less like smart chat products and more like privileged workflow actors operating across infrastructure, authority, and influence surfaces.
That changes the security mindset.
The problem is no longer only:
prompt safety
jailbreak resistance
output quality
It is also:
runtime isolation
permission design
secret handling
policy enforcement
auditability
and human override at the right checkpoints
So far, the speaker’s strongest contribution is not a single clever defense.
It is the framing that:
the execution plane for agents is already real, but the security control plane is still immature.
That is the right problem statement.
12 Notes to expand as more slides come in
Likely sections to add next as the talk continues:
concrete hardening patterns and implementation details
Last updated 11 Mar 2026. Early conference-note draft based on the opening section of Michael Hart’s “Hardening Agents: Locking down the attack surface.”