Hardening Agents in Production - Locking Down the Attack Surface

Download printable cheat-sheet (CC-BY 4.0)

11 Mar 2026, 00:00 Z

Draft note - This is a live working draft adapted from talk notes at a conference session on 11 Mar 2026. More slides are still coming in, so this post is intentionally being built in stages.
TL;DR Agents are no longer just copilots that answer questions. They already read documents, classify records, reconcile messy inputs, and act inside real business workflows. That makes the security problem much broader than prompt safety. A production agent has an infrastructure surface, an authority surface, and an influence surface. The core claim of this talk is sharp: we already have an execution plane for agents, but we still lack a mature security control plane for bounding autonomy, enforcing policy, preserving auditability, and keeping humans meaningfully in the loop.

1 Agents are already in production

The talk opens by rejecting the idea that agents are still a future-looking toy category.

The speaker gives concrete enterprise examples:

  • Bank of Singapore uses an agent to read financial documents and write KYC reports, reportedly shrinking turnaround from 10 days to 1 hour.
  • Salesforce has reported millions of customer conversations handled autonomously, with a large share resolved without human intervention.
  • A Gartner forecast suggests task-specific agents will appear in a large share of enterprise applications.

The exact percentages and adoption curves matter less than the framing move:

agent hardening is now a production problem, not a lab problem.

That matters because once agents touch:

  • customer communication
  • financial workflows
  • internal documents
  • toolchains
  • sensitive data

then a failure is no longer “the model said something weird.” It becomes:

  • a data leak
  • a workflow manipulation
  • a false business record
  • a silent operational error
  • or a costly compliance incident

2 The dangerous part is not the chat interface - it is the workflow

One of the strongest slides in the talk uses a deliberately ordinary example:

perform bookkeeping from email

At first glance, that sounds like a reasonable automation task. But once you expand it into actual steps, the surface area becomes obvious.

The agent is expected to:

AI video production

Turn AI video into a repeatable engine

Build an AI-assisted video pipeline with hook-first scripts, brand-safe edits, and multi-platform delivery.