Jun 14, 2026

Evoke Data: The Agent Risks That Actually Show Up In Enterprises

Jason Rebholz

With Evoke's visibility across tens of thousands of real-world agents, we're uniquely positioned to see risks unfold in real time. It reduces the near-infinite number of theoretical risks to a handful of real risks. That’s something you can work with.

So, learn from our visibility. Here's a breakdown of the most common agent configuration risks and risky run-time activity we see today.

Risky Agent Configurations

One of my favorite quotes is from John D. Rockefeller. He said, "Save when you can, not when you have to." This is a principle I carry into security, which pushes me to prioritize a proactive mindset over a reactive one. And yes, that’s a luxury that many security teams don’t have.

That’s where you have to prioritize your efforts. When risk and frequency hold hands and lovingly stare into each other's eyes, you have a large blast radius you need to address.

These are the top three risks we see most frequently and with the highest severity across our customer environments.

Over-Permissioned Agents

People lean on agents to help them get their jobs done. And that means they go wide with granting permissions to their agents.

Wildcard Permissions

Setting agents up with wildcard permissions means you grant a tool the ability to do anything. Giving an agent the ability to run Docker is helpful, especially in coding. But Docker can also bypass restrictions you place on agents, as one user found out when Codex bypassed a restriction on running sudo.

Destructive Command Permissions

A subset of over-permissioned agents includes the ability to run destructive commands. This poses an obvious risk because agents have a history of going rogue, as we've seen in various examples of agents wiping production databases and deleting backups.

Toxic Tool Combinations

A harder-to-see risk with agents is the concept of toxic tool combinations. Just like with classified information, one document alone may not be classified, but when you combine that document with multiple other documents that are also unclassified, the combined information suddenly becomes classified.

With tools, it's no different. You might connect one tool, and the risk might be nominal, something like accessing your CRM tool. When you also combine email, the agent can now not only read information from the CRM but also receive untrusted content via email and send emails with stolen data out the door. This is the well-known lethal trifecta.

The most common findings that we see here are tools coupled with other tools or other permissions. In this example, we can see a Notion MCP that grants an agent access to Notion, paired with a wildcard network transfer that allows outbound network communication.

Supply Chain Risks

It's no secret that there's been an increase in supply chain attacks targeting developers, often in the form of a compromised package downloaded into the codebase and executed, resulting in stolen credentials. Agents are not immune to this.

The largest risk we see in production environments is MCP servers launching via unpinned packages. This means that when the agent launches the MCP server, it will attempt to download the latest package, which could be malicious. A pinned package locks it into a specific version that, in theory, should be a known-good or clean package.

Runtime Detections

Agents are only as useful as the data and tools you give them access to. That's why it's no surprise that the main challenges we're seeing today involve users giving agents way too much information and agency to act on their behalf.

These are the top risky finding categories we see across agents today.

Credential Exposure

Users are passing credentials to agents faster than lice through an elementary school. We see credential exposure surface primarily in three ways:

Willing Users: Users who just drop access keys, tokens, and passwords directly to the agent via the prompt.
Tool Leakage: A tool call includes the secret in the request.
Eager Agents: An agent accessing credential files or keystores to take action on a task.

Below is an all-too-common example of credentials being pulled into the agent’s context. In this case, it’s credentials for a Slack MCP server. The risk here is that these credentials are now in the agent’s context, ready to be retrieved later for any use.

Or this one, where the agent got its hands on a private key.

Destructive Commands

We already know that agents are commonly overprivileged, allowing them to execute destructive commands. Unsurprisingly, those agents will take action on deleting data, whether prompted to or not. That's why it's unsurprising to find that destructive commands are another area where we routinely see agents dipping their "toe-kens" into.

Privilege Escalation

A personal favorite is agents who will escalate their own permissions. Most commonly, we see agents running sudo commands in order to gain access to privileged parts of the file system. While sandboxing can help here, it assumes sandboxing is effectively in place, which, as I've written before, is usually not the case. Especially when it comes to agents running locally on a system.

Here’s just one example of an agent using sudo to escalate permissions and delete Docker files. This was part of an uninstall process that, thankfully, the user initiated and was benign.

The Best Time to Prepare is Now

Agents are deploying faster than any team can get their arms around it, but that doesn't mean you're forced to sit down and do nothing. Here are three things every security team can do today to start getting their arms around this problem.

Get visibility into the agents running on your endpoints. Your existing security tools can help with this, whether that's EDR or your MDM, or you can opt for a specialized solution like Evoke. The goal here is awareness. Just give yourself a starting view of what's operating in your environment.
Analyze configurations to identify risks. Some existing tooling will start to scratch the surface here, but it's largely superficial. You need to extend beyond the guardrails, find the missing findings, and start looking at over-permissioned agents, toxic tool combinations, and supply chain risks.
Get runtime visibility and monitor for risky actions. This becomes the foundation for truly understanding the risk posed by agents in your environment and enables you to begin implementing the right controls and restrictions.

If you’re looking for the shortcut to do all of these things across your local agents, let’s chat.

‹ Anthropic Enterprise-Managed Auth: A Small But Important Step For Enterprise Adoption

AI Agent Security for CISOs: A Visibility-First Playbook for Detecting Rogue Agents ›