When researchers at Mithril Security quietly altered an open-source GPT-J model to inject false historical facts and uploaded it to Hugging Face, a public AI model hub, the industry barely blinked. The model, dubbed PoisonGPT, was designed to lie. It passed benchmarks, responded normally in most contexts, and subtly hallucinated misinformation only in specific prompts.[Editor's Note: This is part one of two where SC Media helps unpack OWASP's Top 10 for LLM Applications. Read part two: OWASP’s cure for a sick AI supply chain] No red flags. No broken functionality. No defenses.This is the hidden reality of AI’s modern software stack. It’s also why OWASP now ranks supply chain attacks on its Top 10 list of security risks for large language model applications. From tampered adapters (lightweight plugins that alter how a model behaves) to poisoned training data, unsecured infrastructure, and murky licensing terms, today’s AI development pipeline is full of weak points.
In early 2024, researchers uncovered the ShadowRay attacks, where thousands of servers using the Ray AI framework were breached. Ray, an open-source tool often used to run AI workloads across cloud clusters, did not include authentication by default. Its creators argued that security was the user’s responsibility. But many organizations, including those in biotech, cryptocurrency, and academia, deployed Ray directly to the internet, leaving their clusters (groups of servers that work together to process large-scale jobs) wide open. Attackers moved in, injecting malicious commands, stealing data, and repurposing servers for their own use.“AI experts are NOT security experts,” warned Oligo Security, speaking to CSO Online in a 2024 article. “That leaves them dangerously unaware of the very real risks posed by AI frameworks.”Around the same time, Trail of Bits uncovered a GPU-based exploit known as LeftoverLocals. The vulnerability allowed adversaries to listen in on another user’s LLM session by reading leftover data from the shared memory of cloud-based graphics processors. This kind of attack is nearly invisible, especially in multi-tenant environments where customers share hardware.
What powers your AI? A fragile mix of models, adapters, and weights
The LLM supply chain is the behind-the-scenes machinery that powers most generative AI: foundation models (the massive, pretrained systems like GPT-3 or LLaMA that everything else is built on), third-party adapters (small tuning files used to customize base models), safetensors (a format used to safely store model weights), and inference pipelines (the systems that actually run the model and produce output).These components are often distributed across the cloud or embedded in edge devices such as smartphones, smart speakers, or AI-powered surveillance cameras.The hidden role of adapters: Small files, big influence
Adapters in this context are small modules commonly used in a process called fine-tuning, adjusting a general-purpose model to specialize in a specific task, like legal writing or fraud detection. Rather than retraining a model from scratch, developers attach adapters that tweak its behavior by influencing its internal “weights,” which are numerical values the model uses to determine how to interpret input and generate output. These weights are the foundation of everything an AI system knows.But unlike traditional software, such as a web browser or operating system, AI systems are rarely distributed with clear security guardrails. There are few cryptographic signatures to prove where a model came from, little visibility into its history, and almost no guarantees that what you downloaded is what you think it is.“Most models are distributed without cryptographic signatures, with no SBOMs, and little visibility into their lineage,” OWASP warns. “This poses serious threats to consumers who rely on the integrity of downloaded models.”That lack of visibility carries real consequences.When Ray breaks: How ShadowRay opened the doors
In early 2024, researchers uncovered the ShadowRay attacks, where thousands of servers using the Ray AI framework were breached. Ray, an open-source tool often used to run AI workloads across cloud clusters, did not include authentication by default. Its creators argued that security was the user’s responsibility. But many organizations, including those in biotech, cryptocurrency, and academia, deployed Ray directly to the internet, leaving their clusters (groups of servers that work together to process large-scale jobs) wide open. Attackers moved in, injecting malicious commands, stealing data, and repurposing servers for their own use.“AI experts are NOT security experts,” warned Oligo Security, speaking to CSO Online in a 2024 article. “That leaves them dangerously unaware of the very real risks posed by AI frameworks.”Around the same time, Trail of Bits uncovered a GPU-based exploit known as LeftoverLocals. The vulnerability allowed adversaries to listen in on another user’s LLM session by reading leftover data from the shared memory of cloud-based graphics processors. This kind of attack is nearly invisible, especially in multi-tenant environments where customers share hardware.







