Product roadmap

Detect → Prevent → Automate → Govern

Our mission: become the security operating system for enterprise AI — the layer every LLM call passes through, everywhere.

ShippedIn progressNext upPlannedVision
01

Phase 01Detect

Be the authoritative real-time threat detector for every LLM call.

Observability

  • Request logging with full prompt/response captureShipped
  • AES-256-GCM application-layer encryption of stored payloadsShipped
  • Per-project and per-API-key request attributionShipped
  • Agent-level tagging for multi-agent systemsShipped
  • Time-series charts (request volume vs flagged, 7d/14d/30d)Shipped
  • Real-time live dashboard with stat cards and recent activityShipped
  • Request search and filter (by model, scorer, date range, flag status)Shipped
  • Real-time live flow view — animated pipeline showing each request as it lands and scoresShipped
  • Conversation threading — link multi-turn exchanges into a single session viewPlanned
  • Semantic clustering — group similar prompts automatically to surface patternsPlanned
  • Anomaly detection — ML-based alerting on unusual usage spikes or behavioural shiftsPlanned

Safety Scoring

  • Full OWASP LLM Top 10 coverage — all 10 scorers across every risk categoryShipped
  • Likert severity scale 1–5 with colour-coded badges (none / low / medium / high / critical)Shipped
  • Per-tier scorer gating (Homelab → 2, Pro → 4, Team/Enterprise → 10)Shipped
  • Custom scorer definitions — LLM-as-judge, regex, and keyword list scorersShipped

Alerting

  • Real-time email alerts via ResendShipped
  • Slack alerts via incoming webhookShipped
  • Per-scorer alert togglesShipped
  • Outbound webhooks — HMAC-SHA256 signed delivery to any endpointShipped
  • PagerDuty integrationPlanned
  • Microsoft Teams integrationPlanned
  • Alert suppression rules — mute known-safe patternsPlanned

Integrations

  • Python SDK with patch_openai() / patch_anthropic() — public on PyPIShipped
  • JavaScript / TypeScript SDK — public on npm (ESM + CJS)Shipped
  • Java SDK — public on Maven CentralShipped
  • Integration guides: CrewAI, LangGraph, AutoGen, ZeroClaw, OpenClawShipped
  • Claude MCP server — natural-language access to logs and safety eventsShipped
  • GitHub Actions CI plugin — fail builds on red team findingsShipped
  • Integration guides: Semantic Kernel, Haystack, Dify, Flowise, AWS Bedrock, Azure OpenAI, Google Vertex AIPlanned
  • OpenTelemetry exporter — emit spans to any OTEL-compatible backendPlanned
  • VS Code extension — score prompts inline as you write themPlanned
02

Phase 02Prevent

Move from passive observation to active threat blocking.

Block Mode Proxy

  • Go reverse proxy — block mode, designed for minimal overhead, drop-in replacement for the OpenAI base URLShipped
  • mTLS between SDK and proxy for authenticated, encrypted transportShipped
  • Self-hosted proxy packaged as a Docker imageShipped
  • Kubernetes Helm chart for cluster-native deploymentPlanned
  • Air-gapped / offline scoring mode (local model for environments with no egress)Planned
  • Terraform provider for infrastructure-as-code deploymentPlanned

Policy Engine

  • Policy-as-code — define rules in YAML: block topics, enforce language, restrict tool callsPlanned
  • Regex and keyword blocklists with instant evaluation (no LLM call required)Planned
  • Canary token injection — embed invisible markers in system prompts to detect exfiltrationPlanned
  • Output watermarking — mark AI-generated content for downstream provenance trackingPlanned
  • Rate limiting per user/session — prevent prompt-flood attacksPlanned
  • Prompt rewriting — sanitise detected PII before forwarding to the LLMPlanned

Data Loss Prevention

  • Pre-send DLP — scan outbound prompts for PII before they leave your environmentPlanned
  • HIPAA mode — PHI pattern detection (NHS numbers, NI numbers, DOB, diagnoses)Planned
  • Financial data mode — IBAN, card numbers, account numbersPlanned
  • Custom entity types — define your own sensitive patterns via the dashboardPlanned
03

Phase 03Automate

Test constantly, not once — make security a CI/CD concern.

Red Team

  • On-demand red team runner with configurable harm categories and prompt countShipped
  • PyRIT Scenario Framework — pre-built assessment packs (customer support, internal copilot, RAG pipeline, code assistant)Shipped
  • TAP (Tree of Attacks with Pruning) multi-turn attack strategyShipped
  • Playwright web app target support — red team browser-based AI interfacesShipped
  • Human-led red teaming UI — collaborative workspace for security researchersPlanned
  • Scheduled continuous red teaming — run a full assessment on every model deploymentPlanned
  • Attack replay — re-run historical campaigns against new model versions to catch regressionsPlanned
  • Domain-specific attack packs: healthcare, legal, financial services, e-commercePlanned

CI/CD Integration

  • GitHub Actions plugin — fail a build if a red team campaign exceeds a thresholdShipped
  • Pre-commit hook — scan changed system prompts for obvious injections before pushPlanned
  • CLI tool — cognisafe scan against any OpenAI-compatible endpointPlanned

Intelligence

  • Model behaviour drift detection — alert when response patterns shift significantly between deploymentsPlanned
  • Cross-tenant threat intelligence (opt-in, anonymised) — share attack signatures across the communityPlanned
  • Threat intelligence feed — subscribe to known adversarial prompt patterns, updated dailyPlanned
04

Phase 04Govern

Give legal, security, and compliance teams the controls they need to say yes to AI.

Access & Identity

  • SSO — SAML 2.0 and OIDC (Okta, Azure AD, Google Workspace)Shipped
  • Role-based access control — owner / admin / analyst / viewer, invite-by-link with 7-day expiryShipped
  • Multi-tenant organisation management — sub-accounts per business unit or clientPlanned
  • IP allowlist for dashboard and API accessPlanned

Compliance

  • Automated compliance PDF reports for regulators and procurement teamsShipped
  • SOC 2 Type II evidence pack — automated collection of audit artefactsShipped
  • GDPR tooling — data subject access request export, right-to-erasure workflowPlanned
  • EU AI Act compliance checklist and risk classification wizardPlanned
  • ISO 27001 controls mappingPlanned
  • Vendor security questionnaire self-service portalPlanned

Infrastructure Security

  • PostgreSQL row-level security (RLS) — database-enforced tenant isolationShipped
  • Database audit logging (pgaudit) — tamper-evident log of all data accessShipped
  • Bring-your-own-key (BYOK) — customer-managed encryption keys via AWS KMS / Azure Key VaultPlanned
  • Dedicated infrastructure option — single-tenant deployment for regulated industriesPlanned
  • Formal penetration test and published reportPlanned

Billing & Usage

  • Usage alerts — email and Slack notification at 70% and 90% of tier limitShipped
  • Cost attribution by agent and model — 30-day breakdown with daily spend chartShipped
  • Soft request limits with per-request overage billing (no hard cutoffs)Planned

Vision — Secure AI Operating System

The security layer that every enterprise AI system runs through, everywhere.

  • Universal LLM gateway — one endpoint, any model provider, full security stack includedVision
  • AI security posture management (AISPM) — continuous inventory and risk scoring of all AI assetsVision
  • Threat modelling wizard — describe your AI application, get a custom security policy and attack surface mapVision
  • Auto-generated security policies from observed traffic patternsVision
  • LLM fingerprinting — identify which underlying model was used even when the provider obscures itVision
  • Executive security briefing — weekly auto-generated summary of AI risk posture for CISO reportingVision
  • SIEM integrations — Splunk, Datadog, Elastic, Microsoft SentinelVision
  • White-label offering — Cognisafe infrastructure under a partner's brandVision

What we will not build

  • A model hosting platform — we secure AI, we don't run it
  • A replacement for your LLM provider — we sit in front of them, not instead of them
  • Proprietary scoring models — our scorers use commodity LLMs; the value is the platform and the rules
  • A competitor to observability platforms (Datadog, Grafana) — we integrate with them, not replicate them

Last updated May 2026 · Have a feature request? Send us feedback