
In our recent posts, we worked our way through the hardest parts of AI agent security, one layer at a time. We established that cryptographic identity gets an agent to the door, resource governance determines which rooms it may enter, and the agent's actual access is bounded by the real-time permissions of the human in the loop.
Each time, the AI Gateway emerged as the sole architectural point with enough cross-silo visibility to enforce these rules.
But once you accept the gateway as your ultimate enforcement point, a very practical, real-world problem walks in right behind it: enforcement that requires a human to decide on every single API call does not scale. It quickly collapses into approval fatigue. And let’s be honest—a tired approver is just a rubber stamp wearing a security badge.
Consider a mature enterprise that has done the heavy identity lifting. They have agent identities, a robust policy engine, and a gateway enforcing access to applications. The agents are active and productive.
Now, the application administrator has to live with the system. This means making daily, high-stakes choices: Do you grant standing access, or do you require step-up authorization for every single agent and tool?
Making these decisions is hard enough on its own, but doing it in isolation for a single application is nearly impossible. A tool access might look completely benign in the vacuum of one application, but when viewed in the context of the agent’s overall function across the enterprise, it could be highly abnormal.
The natural instinct from classic Identity and Access Management (IAM) is to pre-authorize access through static policies. That model worked beautifully when the caller population was human and the system cardinality was predictable.
Agent sprawl completely breaks those architectural assumptions for three core reasons:
You cannot manually create a fine-grained, static policy for every single agent that accurately predicts what should be standing access versus what requires a human interrupt. Static policies simply cannot scale to this velocity or volume.
Deciding what is harmless enough to allow without a human prompt is a question about what normal looks like—and static policy alone cannot answer it.
Policy can tell you that an agent is technically permitted to execute queries. It cannot tell you that this specific agent, acting for a specific team, has read the exact same three database tables every weekday afternoon for the last two months and has never once written a single entry.
The way forward is to build a behavioral baseline model for each individual agent to mathematically define its ordinary operational shape, tracking variables like:
The Core Philosophy: The point of a behavioral baseline is to earn the architectural right to stop asking for permission. Once the system maps out this baseline model, the administrator simply reviews and approves the agent's expected behavioral "shape." The system then automates everything inside those boundaries, allowing routine calls to flow seamlessly as standing access.
The accuracy of this baseline relies entirely on the resolution of the data you feed it. As we’ve argued before, governing tools alone is insufficient—you must govern the specific resources those tools operate on. Knowing that an agent called run_query tells you nothing about whether it hit public marketing tables or a restricted finance schema.
That exact visibility gap reappears the moment you try to build a behavior model. If you learn an agent's normal shape only at the tool level, your model simply records that it calls run_query on weekday afternoons. You have successfully described the verb while staying completely blind to the object.
At the tool layer, an agent that safely reads public marketing data looks identical to an agent that is quietly draining your core financial ledger. They look like the exact same Model Context Protocol (MCP) tool call.
The resource is the critical dimension that makes the behavior model honest. With it, "this agent runs queries" becomes "this agent reads these specific tables, for these specific users, at this volume." Without the resource dimension, a behavioral model is fundamentally blind.
If standing access is what sits comfortably inside the baseline, then a baseline deviation is what pulls a call back into review—even if the underlying tool permission is technically valid. If an agent that has only ever touched marketing data suddenly reaches for a finance table, its tool permission hasn't expired, but its behavioral intent has drifted.
Because 90% of enterprise applications lack the native capability to detect these anomalies, the inline AI Gateway is the ideal place to handle this problem.
┌──► Inside Baseline ───► Auto-Allow (Standing Access)
[Agent] ──► [AI Gateway] ──┤
└──► Deviation Detected ─► Trigger Human-in-the-Loop
Sitting between the agent and your applications, the gateway has the global visibility required to look across silos. It maps out the agent's broad operational blueprint, detects structural deviations in flight, and dynamically triggers human approval before letting a risky call through.
The next step for the AI Gateway is using this rich context to derive intent. By understanding the broader mission of the caller, the gateway can examine a tool invocation and recognize that, for Agent-1, this is a completely valid access to a specific resource, whereas for Agent-2, the same call is highly suspect and must trigger a human-in-the-loop.
A deviation queue that routes every single anomaly to a human administrator will quickly be ignored. Not every behavioral drift carries the same architectural weight. A minor variance in query volume during a peak business cycle is fundamentally different from an uncharacteristic write operation against a production database.
To keep things manageable, the gateway scores deviations dynamically by evaluating contextual risk signals:
Consider a reporting agent whose baseline consists of read-only access to analytics tables on behalf of the finance team during standard business hours.
The goal of automated triage is not to completely eliminate the human element from enterprise security. It is to focus human attention exclusively on the exceptions where that attention actually alters the outcome.
Cryptographic identity gets the agent to the door. Resource governance and the user’s real permissions decide which rooms it may legally enter. Behavior baselines, learned and enforced at the AI Gateway, decide which of those entries are routine enough to be waved through and which genuinely require human intervention.
By mapping the agent's intent across multiple applications, the gateway serves as the thin line between an intelligent security control that holds and an infrastructure team that has tuned out the noise.
At Andromeda, we are building agentic security along exactly these lines. We extend our context of human and non-human identity so that standing access, deviation tracking, and risk-based approvals operate seamlessly, securely, and quietly on your behalf.
In our recent posts, we worked our way through the hardest parts of AI agent security, one layer at a time. We established that cryptographic identity gets an agent to the door, resource governance determines which rooms it may enter, and the agent's actual access is bounded by the real-time permissions of the human in the loop.
Each time, the AI Gateway emerged as the sole architectural point with enough cross-silo visibility to enforce these rules.
But once you accept the gateway as your ultimate enforcement point, a very practical, real-world problem walks in right behind it: enforcement that requires a human to decide on every single API call does not scale. It quickly collapses into approval fatigue. And let’s be honest—a tired approver is just a rubber stamp wearing a security badge.
Consider a mature enterprise that has done the heavy identity lifting. They have agent identities, a robust policy engine, and a gateway enforcing access to applications. The agents are active and productive.
Now, the application administrator has to live with the system. This means making daily, high-stakes choices: Do you grant standing access, or do you require step-up authorization for every single agent and tool?
Making these decisions is hard enough on its own, but doing it in isolation for a single application is nearly impossible. A tool access might look completely benign in the vacuum of one application, but when viewed in the context of the agent’s overall function across the enterprise, it could be highly abnormal.
The natural instinct from classic Identity and Access Management (IAM) is to pre-authorize access through static policies. That model worked beautifully when the caller population was human and the system cardinality was predictable.
Agent sprawl completely breaks those architectural assumptions for three core reasons:
You cannot manually create a fine-grained, static policy for every single agent that accurately predicts what should be standing access versus what requires a human interrupt. Static policies simply cannot scale to this velocity or volume.
Deciding what is harmless enough to allow without a human prompt is a question about what normal looks like—and static policy alone cannot answer it.
Policy can tell you that an agent is technically permitted to execute queries. It cannot tell you that this specific agent, acting for a specific team, has read the exact same three database tables every weekday afternoon for the last two months and has never once written a single entry.
The way forward is to build a behavioral baseline model for each individual agent to mathematically define its ordinary operational shape, tracking variables like:
The Core Philosophy: The point of a behavioral baseline is to earn the architectural right to stop asking for permission. Once the system maps out this baseline model, the administrator simply reviews and approves the agent's expected behavioral "shape." The system then automates everything inside those boundaries, allowing routine calls to flow seamlessly as standing access.
The accuracy of this baseline relies entirely on the resolution of the data you feed it. As we’ve argued before, governing tools alone is insufficient—you must govern the specific resources those tools operate on. Knowing that an agent called run_query tells you nothing about whether it hit public marketing tables or a restricted finance schema.
That exact visibility gap reappears the moment you try to build a behavior model. If you learn an agent's normal shape only at the tool level, your model simply records that it calls run_query on weekday afternoons. You have successfully described the verb while staying completely blind to the object.
At the tool layer, an agent that safely reads public marketing data looks identical to an agent that is quietly draining your core financial ledger. They look like the exact same Model Context Protocol (MCP) tool call.
The resource is the critical dimension that makes the behavior model honest. With it, "this agent runs queries" becomes "this agent reads these specific tables, for these specific users, at this volume." Without the resource dimension, a behavioral model is fundamentally blind.
If standing access is what sits comfortably inside the baseline, then a baseline deviation is what pulls a call back into review—even if the underlying tool permission is technically valid. If an agent that has only ever touched marketing data suddenly reaches for a finance table, its tool permission hasn't expired, but its behavioral intent has drifted.
Because 90% of enterprise applications lack the native capability to detect these anomalies, the inline AI Gateway is the ideal place to handle this problem.
┌──► Inside Baseline ───► Auto-Allow (Standing Access)
[Agent] ──► [AI Gateway] ──┤
└──► Deviation Detected ─► Trigger Human-in-the-Loop
Sitting between the agent and your applications, the gateway has the global visibility required to look across silos. It maps out the agent's broad operational blueprint, detects structural deviations in flight, and dynamically triggers human approval before letting a risky call through.
The next step for the AI Gateway is using this rich context to derive intent. By understanding the broader mission of the caller, the gateway can examine a tool invocation and recognize that, for Agent-1, this is a completely valid access to a specific resource, whereas for Agent-2, the same call is highly suspect and must trigger a human-in-the-loop.
A deviation queue that routes every single anomaly to a human administrator will quickly be ignored. Not every behavioral drift carries the same architectural weight. A minor variance in query volume during a peak business cycle is fundamentally different from an uncharacteristic write operation against a production database.
To keep things manageable, the gateway scores deviations dynamically by evaluating contextual risk signals:
Consider a reporting agent whose baseline consists of read-only access to analytics tables on behalf of the finance team during standard business hours.
The goal of automated triage is not to completely eliminate the human element from enterprise security. It is to focus human attention exclusively on the exceptions where that attention actually alters the outcome.
Cryptographic identity gets the agent to the door. Resource governance and the user’s real permissions decide which rooms it may legally enter. Behavior baselines, learned and enforced at the AI Gateway, decide which of those entries are routine enough to be waved through and which genuinely require human intervention.
By mapping the agent's intent across multiple applications, the gateway serves as the thin line between an intelligent security control that holds and an infrastructure team that has tuned out the noise.
At Andromeda, we are building agentic security along exactly these lines. We extend our context of human and non-human identity so that standing access, deviation tracking, and risk-based approvals operate seamlessly, securely, and quietly on your behalf.