The standard pilot trajectory
Months 0-2: leadership selects a use case (NOC RCA, clinical summarization, claim review, contract review). A vendor demos on a sanitized dataset. Excitement.
Months 2-4: the team integrates with the LLM provider, runs the workflow on synthetic data, gets impressive metrics. The pilot is "ready to go to production."
Months 4-6: security review opens. The CISO's team asks the obvious question: are we actually sending raw operational data — subscriber IDs, patient records, claim details — to the LLM? Sometimes the answer is "no, we'll use anonymization." The anonymization breaks the data; output quality drops 30-50%. Sometimes the answer is "yes, with a contract." That contract triggers DPO, regulator, and board-level review.
Months 6-12: the pilot is renamed, rescoped, paused, or quietly killed. Shadow AI emerges — engineers paste anonymized snippets into ChatGPT on personal devices to keep the productivity gains they tasted in the pilot.
The four-part diagnosis
Why does this happen, repeatedly, across every regulated industry?
Reason 1 — External LLMs raise enterprise ROI
Approved external LLMs measurably improve productivity, processing speed, and automation ROI. Every regulated enterprise wants in. The pilot exists because the executive team genuinely sees the upside.
Reason 2 — PII guardrails alone are not enough
The standard answer (PII detection at the API boundary) was built for individual identifiers — names, emails, phone numbers. Real regulated workflows run on structured operational data: ticket sequences, network configs, OT manifests, clinical workflows, claim records, mission context. PII guardrails don't see this. The data slips right through.
Reason 3 — DMZ and legacy operational data is complex and unstructured
Mixed free text, network identifiers, system logs, user context, incident records, configurations. Sensitivity leaks through structure, sequence, and aggregate pattern — not just through field names. Field-level filtering misses entire categories of risk.
Reason 4 — Filtering alone leaves regulated risk standing
GDPR, HIPAA, SOX, sector regulators, audit obligations, sovereignty constraints. Even if every field is masked, the residual risk of differential analysis, re-identification through context, and inference exposure is what regulators evaluate. Simple filtering cannot close that.
Result: the pilot demonstrated value on synthetic data; the production deployment requires real data; the gap between them is the AI enablement data layer that wasn't there.
The pattern that gets pilots to production
Pilots that ship to production typically have these architectural features in place:
- An AI enablement data layer between systems and AI. Not a guardrail. Not a gateway. A layer that transforms operational data into AI-ready capsules locally, executes the AI workflow, and restores results into the originating system.
- Structure-preserving capsule. Tables, cross-references, configurations, document hierarchies survive intact. AI receives full context — not broken fragments.
- Differential-privacy-based protection. Beyond field masking — DP noise, k-anonymity, semantic tokenization — to address inference and aggregate-pattern risk that simple filtering can't close.
- Plug-in execution into existing legacy systems. No migration. The data layer reads where the document already lives.
- Restoration into the originating workflow. The end-user works in their familiar tool with real values restored. AI doesn't create a new workflow; it lives inside the existing one.
- Two execution paths under one governance framework. External approved LLM with capsule data only, or on-prem local lightweight model. Path is policy-driven per workflow.
- Customer-defined markers + time-shifting policy. What's sensitive today isn't what's sensitive tomorrow. Define, version, time-shift.
What changes for the executive
For the CDO / CAIO / CIO running an AI program:
- The conversation shifts from "AI vs. security" to "AI through the data layer."
- The pilot exit criteria change from "demo on sanitized data" to "demo on real data with audit trail."
- Shadow AI risk falls — the productivity people tasted in the pilot becomes available in the official tooling.
- Procurement simplifies — one data layer covers multiple AI use cases across multiple LLM providers.
- Regulator conversations have evidence — chain of custody, policy versioning, restoration audit.
How long does it take to get to production?
With the data layer in place, regulated workflows typically reach production in 8-12 weeks (vs. 6-12 months stalled in the standard pattern). The gating items are usually internal — DPO sign-off, regulator notification (where required), security review of the policy. The technical integration is days, not months.
Getting started
If you have an AI pilot that has stalled in security or compliance review, the diagnosis is usually a missing data layer. Bring one stalled use case and one regulatory constraint. We deploy LLM Capsule on a sample workflow within 30 minutes and produce an evaluation report on what changes when the data layer is in place.