OpenAI-compatible · drop-in replacement

The cognitive
gate for LLMs

One API that gives every model persistent memory, personality, symbolic reasoning, ethics governance, and cost-aware routing.

Start building free Read the docs

11 stages fire on
every single request

Your API call enters the pipeline. Ethics are checked. Tokens are optimized. Identity is injected. Memories are retrieved. Context is enriched. Reasoning runs. All before the LLM sees a single token.

Zero code changes. Zero latency tax you'd notice.

Ethics check2ms

Token optimization8ms

Routing decision3ms

Identity injection12ms

Memory retrieval45ms

Context enrichment15ms

LSR reasoning38ms

LLM inference~800ms

Ethics validation4ms

Memory extractionasync

Response formatting1ms

What's inside the gate

Five systems. One API call.

Persistent Memory

Working memory for the current session. Episodic memory across all sessions, searchable by semantic similarity. Automatic promotion from short-term to long-term. Redis-cached retrieval. Your AI remembers what you told it last week — and uses it.

Personality

SOUL, USER, TOOLS, MEMORY files define who your agent is. Versioned per-tenant. Injected into every system prompt automatically.

Symbolic Reasoning (LSR)

Contradiction detection, temporal decay, ILP rule learning, analogical reasoning, ethics gating, ACT-R working memory. Nine reasoning stages on every request.

Intelligent Routing

Mixture-of-Experts classifies by category: coding, reasoning, creative, vision. Routes to the cheapest model that can handle it. Circuit breakers for failover. 40% cost reduction in production.

Ethics & Safety

Jailbreak detection, intent reversal blocking, PII scrubbing, harmful content filtering. Runs on every request and every response. Fails closed — if the ethics engine can't decide, the request is blocked. Audit-logged for compliance.

Observability

Prometheus metrics. Grafana dashboards. OpenTelemetry tracing. See every stage, every decision, every ms.

The cognitivegate for LLMs

11 stages fire onevery single request