SecurityWeek reports that Anthropic's Claude large language model could have its users' data stolen through the exploitation of its Files APIs as part of an indirect prompt injection attack.Threat actors could target Claude instances with network access with an indirect prompt injection payload that stores user data within a file in Claude Code Interpreter, according to Embrace The Red's Johann Rehberger.Claude will then be sought by the payload to upload the file from the sandbox, which then prompts its uploading to the attackers' account."With this technique an adversary can exfiltrate up to 30MB at once according to the file API documentation, and of course we can upload multiple files," said Rehberger, who added that the attack could allow the compromise of chat conversations saved by the LLM's memories functionality.Anthropic, which has already been informed regarding the vulnerability, has yet to provide mitigations for potential intrusions.
AI/ML, Data Security
Anthropic’s Claude AI vulnerable to data theft

(Adobe Stock)
An In-Depth Guide to AI
Get essential knowledge and practical strategies to use AI to better your security program.
Get daily email updates
SC Media's daily must-read of the most current and pressing daily news
You can skip this ad in 5 seconds



