The global AI boom is accelerating faster than corporate governance can keep up. As large language models (LLMs) and autonomous agents become embedded in everyday business tools, they’re not just transforming workflows — they’re quietly creating a new class of unmanaged risks. Experts say the next phase of enterprise AI adoption demands not just innovation, but institutional discipline, led by dedicated AI Security Centers of Excellence (CoEs).These methods help counter the unpredictability — or “non-determinism” — of LLMs that makes them both powerful and dangerous. AI systems can hallucinate, misclassify, or even act on embedded malicious prompts, much like SQL-injection attacks on legacy databases.
AI’s shadow IT problem
At 10 a.m. somewhere in a corporate cloud, an AI agent could be sending an executive’s data to an external service — or granting itself new permissions. Few people would know.A recent Okta survey revealed that while 91% of companies already use AI agents for coding, forecasting, or process automation, only 10% have well-developed strategies to manage their “non-human identities.” These agents operate autonomously, often connecting applications through protocols like Anthropic’s Model Context Protocol (MCP) and Google’s Agent2Agent (A2A) — both designed for cross-app collaboration but neither capable of active supervision.[Editor's Note: This is part SC Media's partnership to unpack OWASP's Top 10 for LLM Applications.] Without governance, this creates a digital ecosystem akin to early shadow IT — systems built outside official oversight, vulnerable to exploitation. In one recent example, a customer-service bot from an AI coding platform hallucinated a company policy and locked users out of their accounts. It’s a symptom of what Okta calls “AI agents running wild,” with enterprises racing ahead without guardrails or clear ownership.Compliance gone awry
Even sectors built on control and compliance aren’t immune. A Fortune 500 firm integrating AI into its governance, risk, and compliance (GRC) platform recently discovered its “intelligent compliance assistant” had been exporting sensitive audit data to an external LLM for six months. The oversight triggered a failed SOC 2 audit and a costly remediation effort.This scenario illustrates a paradox: AI is being used to enforce compliance, yet it often undermines the very frameworks it’s supposed to strengthen. “Traditional GRC models are static,” noted Strike Graph CEO Justin Beals. “They weren’t designed for systems that evolve their behavior in real time.”To avoid these pitfalls, experts recommend “Secure-by-Design” AI architecture — self-hosted, auditable, and isolated from external training systems. Such setups align with OWASP’s CoE framework, which emphasizes zero-trust principles, cross-departmental governance, and continuous monitoring as the cornerstones of AI security.Building AI security from the ground up
The OWASP AI Cybersecurity Center of Excellence Guide (2024) outlines how organizations can shift from reactive to proactive AI governance. It recommends creating multidisciplinary CoEs that unite cybersecurity, data science, risk management, legal, and compliance teams. Their shared mission: to embed security into every phase of AI deployment — from development and testing to operational use.These centers should adopt “shift-left” practices, moving risk analysis earlier in the AI lifecycle, and establish metrics for ethical use, stakeholder transparency, and incident response. OWASP advises organizations to regularly audit AI systems, maintain ethical standards, and engage external experts to fill skill gaps. As the guide warns: “creating a CoE is not just about structure — it’s about continuous learning and adaptability.”Five strategies for safer AI integration
In practice, AI CoEs should operationalize risk mitigation using techniques now emerging from the cybersecurity field. Gutsy’s Aqsa Taylor proposes five strategies that align closely with OWASP’s framework:- Adversarial testing — simulate malicious prompts before deployment.
- Explainability — build tools that make AI decisions auditable.
- Continuous monitoring — detect anomalous outputs in real time.
- Human-in-the-loop oversight — require human approval for high-impact actions.
- Sandboxing — test in isolated environments before scaling.







