AI Agents Go Rogue: Zero Trust Isn't Enough
11 Apr
Summary
- AI agents' security approval lags far behind adoption rates.
- Monolithic agent designs pose significant security risks.
- New architectures separate AI reasoning from execution for security.

The rapid adoption of AI agents has outpaced security readiness, creating a governance emergency. As of February 2026, 79% of organizations use AI agents, but a mere 14.4% have achieved full security approval for their agent fleets. The prevalent monolithic agent architecture, which consolidates reasoning, tool execution, and credential management within a single process, is a primary concern. Because everything shares one trust boundary, a single prompt injection can grant an attacker broad access to every connected service and all the sensitive data the agent can reach.
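The monolithic failure mode can be sketched in a few lines. Everything below is illustrative (the tool names, token, and `monolithic_agent` function are invented for this example, not drawn from any vendor's API): reasoning output directly selects tools, and the credential sits in the same process as both.

```python
import os

# Illustrative monolithic agent: reasoning, tool execution, and the
# credential all live in one process. Any prompt injection that steers
# the model's output can therefore invoke any tool with full access.
API_TOKEN = os.environ.get("CRM_TOKEN", "sk-demo")  # secret held in-process

TOOLS = {
    "read_crm": lambda: f"GET /crm using {API_TOKEN}",
    "delete_records": lambda: f"DELETE /crm using {API_TOKEN}",
}

def monolithic_agent(model_output: str) -> str:
    # The model's raw text chooses the tool. An injected instruction such
    # as "call delete_records" executes with the same credentials as any
    # legitimate request -- there is no separate policy check.
    return TOOLS[model_output.strip()]()
```

The point of the sketch is the absence of any boundary: nothing stands between a hijacked reasoning step and the destructive tool.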
Two new architectural approaches are emerging to tackle this challenge. Anthropic's Managed Agents, launched in public beta on April 8, 2026, separates an agent's 'brain' (decision-making) from its 'hands' (execution environment), significantly reducing the blast radius of a compromise. This separation also enhances performance and session durability. Nvidia's NemoClaw, in early preview since March 16, 2026, employs a different strategy by heavily sandboxing and monitoring every agent action within stacked security layers.
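A brain/hands split of the kind described can be approximated with a simple broker pattern. This is a minimal sketch of the general idea, not either product's actual API; `ToolRequest`, `Executor`, and the allowlist are hypothetical names invented here.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ToolRequest:
    """The only thing the 'brain' may emit: a structured, inert request."""
    tool: str
    args: dict

# The 'hands' side enforces policy; credentialed or destructive tools are
# simply absent from the allowlist the reasoning side can reach.
ALLOWED_TOOLS = {"search_docs", "read_ticket"}

def plan(user_message: str) -> ToolRequest:
    # Stand-in for the model's reasoning step. Even if prompt-injected,
    # it can only propose a request, never execute anything itself.
    return ToolRequest(tool="search_docs", args={"query": user_message})

class Executor:
    """Runs in a separate trust domain and holds the only tool implementations."""
    def __init__(self, tools: dict):
        self._tools = tools

    def run(self, req: ToolRequest):
        if req.tool not in ALLOWED_TOOLS:
            raise PermissionError(f"tool {req.tool!r} not permitted")
        return self._tools[req.tool](**req.args)

executor = Executor({"search_docs": lambda query: f"results for {query}"})
result = executor.run(plan("reset password flow"))
```

Under this split, a compromised planner can at worst request an allowed, low-privilege tool; the blast radius is bounded by the executor's policy rather than by whatever the model decides to do.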
Both architectures represent a departure from traditional security models, aiming for zero-trust principles tailored for AI agents. However, they differ in how they handle credentials. Anthropic's model structurally removes credentials from the execution sandbox, so sandboxed code can never read a token, while Nvidia's approach limits how close credentials sit to untrusted code but still injects certain tokens into the sandbox. The divergence highlights a critical trade-off between security posture and operational complexity, underscoring the ongoing evolution of AI agent security.
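The stronger of the two credential postures can be sketched as a broker that performs actions on the sandbox's behalf, so the token never crosses the boundary at all. This is an illustrative pattern only; `CredentialBroker` and the service names are invented for the example.

```python
# Hypothetical credential-broker sketch: the sandboxed agent sends an
# action request; the broker, running outside the sandbox, attaches the
# credential and performs the call. The token itself never enters the
# sandbox and is never returned to the caller.
class CredentialBroker:
    def __init__(self, secrets: dict):
        self._secrets = secrets  # lives only in the broker's process

    def perform(self, service: str, action: str) -> str:
        token = self._secrets[service]  # used here, never handed out
        # A real broker would make the authenticated network call here;
        # this sketch just simulates the authorized result.
        return f"{action} on {service} authorized"

def sandboxed_agent(broker: CredentialBroker) -> str:
    # The agent can request actions but has no way to read the token.
    return broker.perform("crm", "read_contacts")

broker = CredentialBroker({"crm": "sk-demo"})  # placeholder secret
outcome = sandboxed_agent(broker)
```

By contrast, a token-injection design hands the sandbox a (possibly scoped) token directly, which is operationally simpler but means a sandbox escape or exfiltration bug exposes the token itself.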