Google and Meta Researchers Urge Zero Trust Approach for AI Agents

Researchers from Google and Meta said AI agents must be treated as untrusted systems to ensure security across digital infrastructures ^[1].

This shift in perspective is critical because AI agents often operate with credentials that reside alongside untrusted code, making them prime targets for compromise. As these agents gain more autonomy, relying solely on the robustness of the underlying model creates a dangerous security gap that bad actors can exploit [2, 3, 4].

Findings from the research were highlighted at the RSAC 2026 conference ^[4]. The researchers said that security must be built into the entire system rather than focusing only on the model itself [1, 2, 3]. By treating the agent as an untrusted entity, organizations can implement system-level controls to mitigate the risk of failures and attacks [2, 3].

"Enterprises cannot secure AI agents by making the underlying models more robust and must instead enforce security controls at the system level around them," a lead author of the study said ^[3].

This approach aligns with the broader industry move toward zero-trust architecture. Vasu Jakkal of Microsoft said that zero trust must extend to AI ^[4]. The urgency is driven by the scale of deployment, with projections suggesting billions of AI agents will be operating within five years ^[1].

To prevent unauthorized access and data breaches, the researchers suggest a framework where the agent's environment is isolated, and its permissions are strictly audited. This prevents a single compromised agent from granting an attacker full access to a corporate network ^[4].

“AI agents must be treated as untrusted systems.”

The transition from 'model-centric' to 'system-centric' security acknowledges that no matter how safe an AI model is, the environment it interacts with can be compromised. By applying zero-trust principles to AI agents, the industry is treating AI not as a trusted tool, but as a potential attack vector that requires constant verification and isolation.

Sources

[1]duckduckgo news — AI agents must be treated as untrusted systems: Researchers

[2]duckduckgo news — AI security needs a shift from models to systems, researchers argue

[3]duckduckgo news — AI guardrail removals raise questions over limits of open-source model regulation

[4]duckduckgo news — AI agent credentials live in the same box as untrusted code. Two new architectures show where the blast radius actually stops.

[5]duckduckgo news — 'Agents of Chaos': New Study Shows AI Agents Can Leak Data, Be Easily Manipulated

[6]duckduckgo news — AI Sleeper Agents and the Military's Next Trust Problem

Google and Meta Researchers Urge Zero Trust Approach for AI Agents

Sources

Related

CISA Orders US Federal Agencies to Patch Drupal Vulnerability

FAA clears Blue Origin New Glenn for flight after April failure

Spain Unveils €9 Billion Energy Transition Plan

Comments