Classify threats before they reach AI tasks
The agent classifies injection, jailbreak, harmful intent, policy evasion, sensitive data, privacy, obfuscation, and tool manipulation threats, so nothing unsafe ever reaches your operations.
User-supplied prompts can expose AI workflows to injection attacks, jailbreaks, and policy violations; risks that grow as AI tasks become more central to business processes. The AI Firewall Agent evaluates each prompt before it reaches an AI task, classifies the threat type, and sanitizes unsafe content so the workflow can continue without starting over. When the model's confidence falls below your threshold, the agent refines its analysis and retries automatically. The result is a structured JSON decision your BPMN process can branch on immediately. Add it in front of any AI task that processes user input and configure its behavior through process variables.