Edit

Share via


Default Guidelines & controls policies

Azure OpenAI in Azure AI Foundry Models includes default safety policies applied to all models, excluding Azure OpenAI Whisper. These configurations provide you with a responsible experience by default, including content filtering models, blocklists, prompt transformation, content credentials, and others.

Default safety aims to mitigate risks in different categories such as hate and fairness, sexual, violence, self-harm, protected material content, and user prompt injection attacks. To learn more about content filtering, visit our documentation describing categories and severity levels.

All safety policies are configurable. To learn more about configurability, see the documentation on configuring content filtering.

Text models

Text models in the Azure OpenAI can take in and generate both text and code. These models leverage Azure’s text content filters to detect and prevent harmful content. This system works on both prompts and completions.

Risk Category Prompt/Completion Severity Threshold
Hate and Fairness Prompts and Completions Medium
Violence Prompts and Completions Medium
Sexual Prompts and Completions Medium
Self-Harm Prompts and Completions Medium
User prompt injection attack (Jailbreak) Prompts N/A
Protected Material – Text Completions N/A
Protected Material – Code Completions N/A

Vision models

Vision-enabled chat models

Risk Category Prompt/Completion Severity Threshold
Hate and Fairness Prompts and Completions Medium
Violence Prompts and Completions Medium
Sexual Prompts and Completions Medium
Self-Harm Prompts and Completions Medium
Identification of Individuals and Inference of Sensitive Attributes Prompts N/A
User prompt injection attack (Jailbreak) Prompts N/A

Image generation models

Risk Category Prompt/Completion Severity Threshold
Hate and Fairness Prompts and Completions Medium
Violence Prompts and Completions Medium
Sexual Prompts and Completions Medium
Self-Harm Prompts and Completions Medium
Content Credentials Completions N/A
Deceptive Generation of Political Candidates Prompts N/A
Depictions of Public Figures Prompts N/A
User prompt injection attack (Jailbreak) Prompts N/A
Protected Material – Art and Studio Characters Prompts N/A
Profanity Prompts N/A