
Cognitive Defense Agent

Tier 2 Security ● live

NLP prompt injection and jailbreak detection. Guards against adversarial text designed to manipulate downstream AI systems.

Tier: Tier 2
Phase: Security
Pricing: $0.00015 USDC/row
MCP Tool: cognitive_defense_scan
Dependencies: (none listed)

What It Does

The Cognitive Defense Agent uses a fine-tuned NLP model to detect prompt injection patterns, jailbreak attempts, role-playing attacks, and other adversarial text constructs embedded in data payloads. This is critical for pipelines that feed into AI/LLM systems.

Capabilities

  • Prompt injection detection
  • Jailbreak attempt identification
  • Role-playing attack detection
  • Adversarial text classification
  • Multi-language support
  • Confidence scoring

Example

cognitive-defense-agent-example.ts
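// Assumes `mcp` is an initialized MCP client and that `normalizedPayload`
// and `streamUUID` are defined elsewhere in the calling code.
const result = await mcp.call('cognitive_defense_scan', {
  payload: normalizedPayload,
  stream_uuid: streamUUID,
  scan_depth: 'standard',
});
console.log(result.adversarial_detected); // e.g. false
console.log(result.confidence);           // e.g. 0.02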
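
The call above returns an untyped result. A minimal sketch of a typed wrapper follows, assuming the result carries the two fields shown in the example (`adversarial_detected` and `confidence`) and that `confidence` lies between 0 and 1. The `McpClient` interface, `ScanResult` type, `isScanResult` and `scanPayload` helpers, and the stub client are hypothetical names introduced for illustration, not part of a documented SDK.

```typescript
// Hypothetical minimal client shape; the real MCP client may differ.
interface McpClient {
  call(tool: string, args: Record<string, unknown>): Promise<unknown>;
}

// Fields taken from the example above; any other fields are unknown here.
interface ScanResult {
  adversarial_detected: boolean;
  confidence: number;
}

// Runtime check so a malformed response fails loudly instead of silently.
// The 0..1 range for confidence is an assumption, not a documented guarantee.
function isScanResult(value: unknown): value is ScanResult {
  if (typeof value !== 'object' || value === null) return false;
  const v = value as Record<string, unknown>;
  return (
    typeof v.adversarial_detected === 'boolean' &&
    typeof v.confidence === 'number' &&
    v.confidence >= 0 &&
    v.confidence <= 1
  );
}

async function scanPayload(
  mcp: McpClient,
  payload: unknown,
  streamUUID: string,
): Promise<ScanResult> {
  const raw = await mcp.call('cognitive_defense_scan', {
    payload,
    stream_uuid: streamUUID,
    scan_depth: 'standard',
  });
  if (!isScanResult(raw)) {
    throw new Error('Unexpected cognitive_defense_scan response shape');
  }
  return raw;
}

// Stub client standing in for a real connection, for local testing only.
const stubClient: McpClient = {
  async call() {
    return { adversarial_detected: false, confidence: 0.02 };
  },
};
```

Passing `stubClient` to `scanPayload` exercises the validation path without a live connection; a real client would be swapped in at the call site.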

Configuration

cognitive-defense-agent.yaml
agent: cognitive-defense-agent
version: "1.0"
model: redqueen-cognitive-defense-v2
scan_fields: all_text
detection:
  prompt_injection: true
  jailbreak: true
threshold:
  flag_confidence: 0.75
  reject_confidence: 0.95
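
One way to read the two thresholds is as a three-way decision on the returned confidence. The sketch below assumes that scores at or above `reject_confidence` are rejected, scores at or above `flag_confidence` are flagged, and everything else passes. The page does not state whether the comparisons are inclusive, and the `Action` type, `Thresholds` type, and `decide` function are hypothetical.

```typescript
type Action = 'pass' | 'flag' | 'reject';

interface Thresholds {
  flag_confidence: number;
  reject_confidence: number;
}

// Values copied from cognitive-defense-agent.yaml above.
const defaultThresholds: Thresholds = {
  flag_confidence: 0.75,
  reject_confidence: 0.95,
};

// Assumed semantics: inclusive lower bounds, checked from strictest to loosest.
function decide(confidence: number, t: Thresholds = defaultThresholds): Action {
  if (t.flag_confidence > t.reject_confidence) {
    throw new RangeError('flag_confidence must not exceed reject_confidence');
  }
  if (confidence >= t.reject_confidence) return 'reject';
  if (confidence >= t.flag_confidence) return 'flag';
  return 'pass';
}
```

Under these assumptions, a confidence of 0.02 maps to `'pass'`, 0.8 to `'flag'`, and 0.97 to `'reject'`.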

MCP Tool

cognitive_defense_scan


Pricing

$0.00015 USDC/row
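
For budgeting, the per-row price can be multiplied out. The sketch below assumes a flat rate with no minimums or volume discounts (the page lists only the per-row price) and works in integer micro-USDC (millionths of a USDC) to avoid floating-point rounding. `estimateCostMicroUsdc` and `formatUsdc` are hypothetical helpers.

```typescript
// $0.00015 USDC/row expressed in micro-USDC (1 USDC = 1,000,000 micro-USDC).
const PRICE_PER_ROW_MICRO_USDC = 150;

// Flat-rate estimate; assumes no minimum charge or volume discount.
function estimateCostMicroUsdc(rows: number): number {
  if (!Number.isInteger(rows) || rows < 0) {
    throw new RangeError('rows must be a non-negative integer');
  }
  return rows * PRICE_PER_ROW_MICRO_USDC;
}

// Render micro-USDC as a decimal string with six fractional digits.
function formatUsdc(microUsdc: number): string {
  return `${(microUsdc / 1_000_000).toFixed(6)} USDC`;
}
```

At this rate, scanning one million rows comes to 150 USDC under the flat-rate assumption.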
