
Guardrails

Guardrails add safety and policy controls around an agent’s responses and actions. In the current builder, guardrails are part of the Capabilities tab, not a standalone tab.

What you can do here

  • Apply a preset guardrail bundle.
  • See how many guardrails are active.
  • Expand Customize to review individual rules.
  • Enable or disable individual guardrails by category.
  • Adjust mode, strategy, and configuration fields for rules that support them.

Figure: Capabilities tab Guardrails section showing presets, Customize details, and individual guardrail switches.

Presets

| Preset | Use when |
|---|---|
| Basic Safety | You need a general baseline for prompt injection, secrets, content, and similar risks. |
| Enterprise | The agent handles business, compliance, or regulated workflows. |
| Customer Service | The agent talks to customers and should stay helpful, safe, and on-topic. |
| Code Assistant | The agent writes or reviews code and needs code-focused safety checks. |
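
As a mental model only, a preset can be thought of as a named bundle of rules that sets which guardrails are switched on. The sketch below is not the builder's implementation: the rule identifiers, the `PRESETS` mapping, and the `GuardrailSettings` class are hypothetical, and it assumes that applying a preset replaces the current selection rather than merging with it.

```python
from dataclasses import dataclass, field
from typing import Dict, Set

# Hypothetical rule identifiers. The real contents of each preset are
# not documented here; these only illustrate the idea of a bundle.
PRESETS: Dict[str, Set[str]] = {
    "Basic Safety": {"prompt_injection", "secrets", "content"},
    "Code Assistant": {"prompt_injection", "secrets", "unsafe_code"},
}


@dataclass
class GuardrailSettings:
    """Tracks which guardrails are currently switched on."""

    enabled: Set[str] = field(default_factory=set)

    def apply_preset(self, name: str) -> None:
        # Assumption: a preset replaces the current selection.
        self.enabled = set(PRESETS[name])

    def toggle(self, rule: str, on: bool) -> None:
        # Mirrors flipping an individual switch under Customize.
        if on:
            self.enabled.add(rule)
        else:
            self.enabled.discard(rule)

    @property
    def active_count(self) -> int:
        # Mirrors the "guardrails active" counter.
        return len(self.enabled)


settings = GuardrailSettings()
settings.apply_preset("Basic Safety")
settings.toggle("secrets", False)
print(settings.active_count, sorted(settings.enabled))
```

In this model, a preset is a starting point: individual switches can still be flipped afterward, and the active count follows the resulting selection.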

Individual guardrails

Click Customize to show categories such as Security and Content. Each row has a switch that enables or disables that guardrail. Some guardrails, once enabled, expand to show mode, strategy, warning text, and configuration fields.

| Control | What it does |
|---|---|
| Mode | Defines where the guardrail applies, such as input or output. |
| Strategy | Chooses the action, such as block, warn, detect, or redact. |
| Config fields | Tune thresholds, lists, patterns, or other rule-specific settings. |
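
To make the three controls concrete, here is a minimal sketch of how mode, strategy, and a pattern-based config field could interact. It is illustrative only and assumes several things the documentation does not state: the `Guardrail` and `Result` types, the `apply_guardrail` function, the `[REDACTED]` placeholder, and the exact behavior of each strategy (in particular, that warn and detect both pass the text through and differ only in the note they record).

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Guardrail:
    name: str
    mode: str      # where it applies: "input" or "output"
    strategy: str  # "block", "warn", "detect", or "redact"
    patterns: List[str] = field(default_factory=list)  # example config field


@dataclass
class Result:
    text: Optional[str]  # None means the text was blocked
    notes: List[str] = field(default_factory=list)


def apply_guardrail(rule: Guardrail, text: str, direction: str) -> Result:
    """Apply one rule to text flowing in the given direction."""
    # Mode: skip rules that do not apply to this direction.
    if rule.mode != direction:
        return Result(text)

    # Config: find which configured patterns match the text.
    hits = [p for p in rule.patterns if re.search(p, text)]
    if not hits:
        return Result(text)

    # Strategy: decide what to do with a match (assumed semantics).
    if rule.strategy == "block":
        return Result(None, [f"{rule.name}: blocked"])
    if rule.strategy == "redact":
        redacted = text
        for p in hits:
            redacted = re.sub(p, "[REDACTED]", redacted)
        return Result(redacted, [f"{rule.name}: redacted"])
    if rule.strategy == "warn":
        return Result(text, [f"{rule.name}: warning"])
    if rule.strategy == "detect":
        return Result(text, [f"{rule.name}: detected"])
    raise ValueError(f"unknown strategy: {rule.strategy}")


# Hypothetical rule: redact API-key-like strings in agent output.
rule = Guardrail("secrets", "output", "redact", [r"sk-[A-Za-z0-9]+"])
print(apply_guardrail(rule, "key is sk-abc123", "output").text)
```

The same rule leaves user input untouched because its mode is output; switching the strategy to block would drop the whole response instead of masking the match.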