OpenAI Moderation API (omni-moderation-latest) Multimodal moderation for text & images; granular safety categories and flags. 0400 Guardrails & Moderation# API# image# moderation
Microsoft Community Sift (Two Hat) Enterprise moderation for communities/games with configurable policies. 0490 Guardrails & Moderation# chat# Community Sift# gaming
Azure AI Content Safety Configurable content filters for harmful text and images with studio & APIs. 0460 Guardrails & Moderation# API# Azure# content safety
Spectrum Labs Contextual AI moderation across text/voice; multilingual & compliance-ready. 0490 Guardrails & Moderation# compliance# contextual# Spectrum Labs
Amazon Bedrock Guardrails Centralized, reusable guardrails for prompts and responses across models. 0310 Guardrails & Moderation# AWS# Bedrock# governance
ActiveFence Real-time guardrails, automated red teaming, and moderation workflows. 0430 Guardrails & Moderation# ActiveFence# guardrails# moderation
Google ShieldGemma Open-weight safety classifiers for inputs/outputs; tune to your policies. 0460 Guardrails & Moderation# classifier# Gemma# google
Modulate ToxMod Proactive, voice-native moderation for live chat in games and communities. 0340 Guardrails & Moderation# audio# gaming# Modulate
NVIDIA NeMo Guardrails Programmable guardrails (topic control, PII, jailbreak prevention) for LLM apps. 0380 Guardrails & Moderation# agents# Colang# guardrails
Lakera Guard Low-latency API to block prompt injection, data leakage, and jailbreaks. 02070 Guardrails & Moderation# guard# jailbreak# Lakera
Meta Llama Guard 3 LLM-based safety classifier for prompts and responses; multiple sizes. 0300 Guardrails & Moderation# classifier# Llama Guard# LLM
Robust Intelligence AI Firewall Pre- and post-model guardrails to block unsafe prompts/outputs in production. 0460 Guardrails & Moderation# AI Firewall# guardrails# production
Meta Prompt Guard 2 Lightweight detectors for prompt injection and jailbreak attempts. 0370 Guardrails & Moderation# detector# injection# jailbreak
Guardrails AI (framework) Open-source + managed framework for input/output checks and structured outputs. 0430 Guardrails & Moderation# framework# guardrails# input guard
Jigsaw Perspective API Toxicity and harassment scoring for UGC; easy-to-integrate REST API. 0410 Guardrails & Moderation# google# harassment# Jigsaw