Microsoft has introduced new tools within its Azure AI Studio aimed at strengthening AI model safety and security, according to The Register.
Prompt injection attacks can be better countered with the Prompt Shields model, formerly known as Jailbreak Risk Detection, while the Groundedness Detection system improves identification of AI hallucinations by using a custom language model to verify claims against source documents, Microsoft noted.
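Microsoft's announcement does not spell out the request format for such a groundedness check, but a minimal sketch of how this kind of claim-versus-source verification might be invoked is shown below. The endpoint path, API version, and field names (detectGroundedness, groundingSources, and so on) are assumptions for illustration, not confirmed details of the product.

```python
import os
import requests

# Hypothetical endpoint and schema for a groundedness check; the path,
# API version, and field names are assumptions for illustration only.
ENDPOINT = os.environ["CONTENT_SAFETY_ENDPOINT"]  # e.g. https://<resource>.cognitiveservices.azure.com
API_KEY = os.environ["CONTENT_SAFETY_KEY"]

def check_groundedness(claim: str, source_document: str) -> dict:
    """Ask the service whether `claim` is supported by `source_document`."""
    url = f"{ENDPOINT}/contentsafety/text:detectGroundedness?api-version=2024-02-15-preview"
    payload = {
        "domain": "Generic",
        "task": "Summarization",
        "text": claim,                           # the model output to verify
        "groundingSources": [source_document],   # reference material to check against
    }
    headers = {
        "Ocp-Apim-Subscription-Key": API_KEY,
        "Content-Type": "application/json",
    }
    response = requests.post(url, json=payload, headers=headers, timeout=30)
    response.raise_for_status()
    return response.json()  # expected to flag ungrounded (hallucinated) spans

if __name__ == "__main__":
    result = check_groundedness(
        claim="The report says revenue grew 40% in 2023.",
        source_document="The annual report states revenue grew 12% in 2023.",
    )
    print(result)
```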
Microsoft has also unveiled AI-assisted safety evaluations and risk and safety monitoring features in AI Studio. While the new tools are valuable for evaluating AI model reliability, relying on AI for such systems could itself become a liability, noted the University of Maryland's Vinu Sankar Sadasivan, who co-developed the BEAST attack against large language models.
"Though safety system messages have shown to be effective in some cases, existing attacks such as BEAST can adversarially attack AI models to jailbreak them in no time. While it is beneficial to implement defenses for AI systems, it's essential to remain cognizant of their potential drawbacks," said Sadasivan.
The development comes amid the introduction of new federal AI safeguards.