Why regulated industries need architectural defenses against prompt injection

COMMENTARY: In December 2025, the UK’s National Cyber Security Centre (NCSC) issued a warning that should give every security team pause: the industry may never solve the challenge of prompt injection in large language models (LLMs).

Unlike SQL injection, where parameterized queries enforce a clean separation between data and instructions, LLMs process everything as tokens. There’s no native architectural distinction between what the model gets told to do and what the data's told to process.

[SC Media Perspectives columns are written by a trusted community of SC Media cybersecurity subject matter experts. Read more Perspectives here.]

This isn’t a new concern. OWASP has ranked prompt injection as the No. 1 vulnerability in its Top 10 for LLM Applications since the list was first published in 2023, and it maintained that ranking in the 2025 edition. But most defensive guidance (input filtering, output validation, and guardrails) focuses on the same layer: catching malicious prompts before they reach the model. The NCSC’s believes that this approach, while necessary, may have a ceiling.

For organizations deploying AI in regulated environments, that ceiling has real consequences. When a compliance AI tool produces a fabricated citation that enters an SEC filing, the issuer faces strict liability under Section 11 of the Securities Act, and the attorneys and professionals involved face malpractice exposure and potential discipline. “AI hallucinated” is not a defense for any party in the liability chain.

Clearly, input filtering alone won’t solve the problem.

Defense in-depth, applied to the AI pipeline

Security teams already understand the principle, so they don’t rely on a firewall alone. They layer network segmentation, endpoint detection, access controls, and monitoring. Each layer limits the blast radius if the layer above it fails.

The same principle has been applied to AI architectures in regulated industries. One production-scale implementation demonstrates what this looks like in practice: A doctrine-constrained information retrieval architecture, originally developed through academic research at Cornell Tech and subsequently deployed for SEC compliance analytics at LexisNexis, embeds regulatory requirements directly into each pipeline layer as constraints the system cannot violate, regardless of what instructions it receives.

Here’s how it works at different layers:

Retrieval layer: The system operates exclusively against a verified corpus of over 16 million SEC filings. The retrieval scope gets fixed architecturally – there’s no pathway for the model to access sources outside the verified corpus, even if instructed to do so.
Generation layer: Every output must trace to a specific section and paragraph of a source document. Claims that aren’t grounded in retrieved sources are blocked rather than generated. If a prompt injection attempts to make the system fabricate content, the source attribution requirement stops it: there’s no verified document to cite, so no output gets produced.
Data isolation layer: Client boundaries are enforced by pipeline design, not by access control policy alone. When the platform serves law firms that are adversaries on active transactions, cross-tenant data contamination is prevented architecturally. This satisfies the confidentiality requirements of ABA Model Rule 1.6 while functioning as a security control against data leakage from any vector, including prompt injection.

Raj Sonani, the senior AI product manager and technical lead at LexisNexis who designed this architecture, frames it in terms security teams will recognize: “If input filtering is your firewall, architectural constraint is your network segmentation. You need both – but only one limits the blast radius when the perimeter is breached.”

The approach has gained traction. Several well-resourced competitors in the legal AI business have independently converged on architecturally similar constraint patterns over the past 18 months, suggesting that what began as a doctrine-constrained methodology for legal information retrieval has become a domain-constrained reliability standard for regulated AI more broadly.

What security teams should ask

When evaluating AI tools for regulated environments, security teams should add architectural constraint questions to vendor assessments:

Does every AI output trace to a verified source?

Are retrieval boundaries defined at the architecture level?

Does the system enforce tenant-level data isolation in the pipeline itself?

Are accuracy thresholds calibrated to the regulatory requirements of the specific domain?

These are not product quality questions. They are security controls.

We may never solve prompt injection at the input level. But security teams don’t need to solve it there. They need to limit what happens when an injection succeeds. Architectural constraint – applied layer by layer, governed by the regulatory standards of the domain – that’s how organizations can build a second line of defense.

David Balaban, owner, Privacy-PC

SC Media Perspectives columns are written by a trusted community of SC Media cybersecurity subject matter experts. Each contribution has a goal of bringing a unique voice to important cybersecurity topics. Content strives to be of the highest quality, objective and non-commercial.