Security Architecture

June 11, 20268 min

How to Build a Lifecycle-Aware AI Security Engine

Modern AI security cannot be one giant preflight check. Learn how a lifecycle-aware security engine should work, what recent OWASP and NIST guidance implies, and how Odock's staged architecture compares.

Youcef Kaddour

Founder at Odock and AI infrastructure engineer

Youcef Kaddour is the founder of Odock and an AI infrastructure engineer focused on secure LLM systems, MCP governance, runtime guardrails, and production-grade multi-provider AI architecture.

What you should take away

1A single yes-or-no AI firewall is the wrong abstraction for production LLM and MCP traffic.
2Recent OWASP and NIST material points toward staged controls: capability limits, output handling, runtime containment, and evidence.
3Our runtime model already separates early policy gates from SafetySec prompt and response enforcement, which is the right direction for AI infrastructure.

Too many AI security designs still assume one big filter at the front of the request path. That design does not survive contact with production. Some controls need network context, some need identity and resource scope, some need token and budget context, some need prompt text, and some need the final model output. A usable AI security engine has to be lifecycle-aware.

The wrong mental model: one giant AI firewall

The simplest AI security design is also the least realistic. Teams put one classifier or one ruleset in front of the model and expect it to answer every question: should the request be allowed, is the prompt malicious, is the user authorized, is the output safe, will the request break budget, and can the model call tools?

That does not work because those questions appear at different times.

A network rule can run before auth.
An access-grant decision needs caller identity and the selected resource.
A token or cost gate needs model and token-envelope context.
A prompt-safety check needs the decoded prompt text.
A response-leakage check needs the provider output.
A final cost reconciliation needs the request to finish.

Once you accept that timing reality, the architecture changes. You stop looking for one magical AI firewall and start building staged enforcement.

What recent guidance implies for engine design

The best public signal here is not one single framework. It is the overlap across several sources.

On January 2024, NIST published Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations. The report matters because it frames AI security across lifecycle stages, attacker goals, attacker capabilities, poisoning, privacy attacks, model extraction, and generative AI prompt-injection variants. That is already a staged way of reasoning.

On March 12, 2025, OWASP's updated Top 10 for LLM and GenAI apps made the same point from the application side. Prompt injection, sensitive information disclosure, improper output handling, excessive agency, and unbounded consumption are different failure classes. No single gate can see them all with the right context.

On May 25, 2026, OWASP's AIUC-1 crosswalk for agentic applications pushed the architecture further by calling out gaps around agent identity, runtime containment, architectural monitoring, supply-chain attestation, and schema controls. That is effectively a warning against thin, single-stage security.

Taken together, the guidance says a real AI security engine needs:

staged controls
capability-aware access boundaries
response-side handling, not only input-side filtering
cost and abuse controls in the runtime path
evidence and monitoring after the decision

The stages a real AI security engine needs

Here is the practical shape of a production-ready engine.

1. Request-aware admission controls. These handle IP policy, payload size, request rate, concurrency, and other cheap protections before expensive work starts.

2. Identity and access controls. These answer who is calling, which virtual key or tenant is behind the request, and whether that identity can reach the chosen model or MCP server.

3. Resource and token-aware controls. These reason about the selected model, token envelope, budgets, quotas, and other usage boundaries that depend on decoded request context.

4. Request-side content controls. These inspect prompt text, redact sensitive input, detect prompt injection or jailbreak patterns, and decide whether upstream execution should continue.

5. Tool and plugin controls. These apply explicit capability rules to MCP tools or other extension points so the model cannot turn a weak prompt into a strong action path.

6. Response-side controls and evidence. These inspect model output for leakage or unsafe content, redact when appropriate, block when necessary, and record enough evidence for operators to understand what happened.

That sequence is not overengineering. It is the minimum structure required to answer different security questions with the right data.

How Odock compares

At Odock, our runtime follows almost exactly this staged pattern. Our lifecycle runs through request-aware gates, identity and access gates, resource-aware gates, token and cost gates, request-side safety, plugin gates, upstream execution, response-side safety, post-upstream plugin gates, response writing, and final usage evidence.

SafetySec then handles the content-aware parts of that flow. In SafetySec, we handle:

prompt-injection detection
jailbreak-pattern detection
sensitive redaction
data-leakage detection
repeated-risk awareness
safety evidence

That is an important design choice. SafetySec does not replace access control, budgets, quotas, or MCP capability policy. It handles prompt and response safety while other guardrails handle traffic shape, access, and resource boundaries.

The mapping is straightforward:

Early request controls:

We stop abuse before expensive work starts with request-aware gates for origin, payload, rate limits, and concurrency.

Identity-aware permissions:

We keep model and tool access attached to principals through explicit access grants for models and MCP servers.

Token and cost enforcement:

We prevent denial of wallet and runaway usage with token gates, budgets, quotas, and lifecycle reconciliation.

Request-side prompt defense:

We stop unsafe input before upstream execution with SafetySec request-side prompt-risk checks and input redaction.

Tool capability governance:

We limit what agents can actually do with MCP allowed tools, blocked tools, scopes, and semantic filters.

Response-side defense:

We catch leakage or unsafe output before the caller sees it with SafetySec response-side leakage checks and output redaction.

Evidence and audit:

We make decisions explainable and monitorable with safety evidence plus usage, status, cost, and latency recording.

Why this is stronger than app-level filters

App-level filters are not useless. They are just not enough as the main control surface.

In distributed AI systems, app-level security drifts quickly:

different services implement different rules
teams release at different speeds
tool access gets added before platform review
logging shapes diverge
cost limits become best-effort instead of enforced limits

A lifecycle-aware gateway approach fixes the control point. Product teams still ship features, but the highest-risk enforcement stays in shared infrastructure.

That is especially important for agentic systems. The moment a model can call a repository tool, a browser tool, a ticketing tool, or an internal action, the blast radius stops being "bad text" and starts being "real side effects." That is where staged enforcement matters.

The realistic boundary of the engine

Even a good security engine is not the whole program. At Odock, we do not overstate this.

You still need adjacent work for:

secure development and patching of MCP servers and plugins
supply-chain controls for models, datasets, and dependencies
architecture reviews for high-risk workflows
schema validation and business-rule validation inside tool handlers
release governance, evaluation, and red-teaming outside the live request path

But if the runtime layer is weak, those other controls are forced to compensate for the wrong problem. The runtime path is where requests become actions, costs, and outputs. That is why the engine design matters so much.

Sources

What you should take away

1
A single yes-or-no AI firewall is the wrong abstraction for production LLM and MCP traffic.
2
Recent OWASP and NIST material points toward staged controls: capability limits, output handling, runtime containment, and evidence.
3
Our runtime model already separates early policy gates from SafetySec prompt and response enforcement, which is the right direction for AI infrastructure.

Frequently asked questions

Why is one preflight security check not enough?

Because many important facts do not exist at request admission time. You do not know the final output yet, you may not know the resolved model or tool plan yet, and you cannot reconcile cost until the request completes.

What makes a security engine lifecycle-aware?

It applies different controls at different stages of the request lifecycle, based on the context available at each stage. Early network limits, identity-aware access checks, token and budget gates, request-side prompt checks, response-side leakage checks, and post-flight evidence all belong at different moments.

Is this only relevant for agentic AI?

No. Agentic systems make the problem more visible, but even plain LLM APIs need staged controls for admission, identity, prompt handling, output handling, and usage governance.

Need a security engine that works across the full AI request lifecycle?

Odock combines staged policy enforcement, SafetySec prompt and response controls, MCP governance, and usage evidence in one gateway path.

Talk to the team View on GitHub

AI Security9 min

AI Security in 2026: Prompt Injection, Tool Poisoning, and the New Agentic Risk Stack

AI security is no longer only about bad prompts. It now includes tool misuse, MCP poisoning, unbounded consumption, and response-side leakage. This post compares those risks with Odock's actual runtime controls.

Read article

AI Security7 min

Prompt Injection, Data Leakage, and Why LLM Guardrails Must Live in the Gateway

When every team handles AI security in its own service, protection becomes inconsistent. This article explains why gateway-level guardrails are the safer model and how that maps to Odock.

Read article

AI Observability8 min

What to Log, Monitor, and Trace in Production LLM Applications

When AI traffic crosses providers, tools, tenants, and teams, observability has to connect quality, latency, cost, safety, and routing decisions.

Read article

AI Security12 min

AI Agents Are Non-Human Identities Now: The 2026 Credential Sprawl Problem

Your AI agents are not features. They are non-human identities, they hold credentials, they act autonomously, and most security teams have no lifecycle for them at all. Here is why NHI sprawl became 2026's quiet agentic AI risk, and what fixing it actually requires.

Read article

What you should take away

The wrong mental model: one giant AI firewall

What recent guidance implies for engine design

The stages a real AI security engine needs

How Odock compares

Why this is stronger than app-level filters

The realistic boundary of the engine

Sources

What you should take away

Frequently asked questions

Need a security engine that works across the full AI request lifecycle?

Related articles

AI Security in 2026: Prompt Injection, Tool Poisoning, and the New Agentic Risk Stack

Prompt Injection, Data Leakage, and Why LLM Guardrails Must Live in the Gateway

What to Log, Monitor, and Trace in Production LLM Applications

AI Agents Are Non-Human Identities Now: The 2026 Credential Sprawl Problem