29% of UK employees who use generative AI at work say they're worried about IT security when using unapproved tools. 71% of them use those tools weekly anyway. That pair of numbers, from a Microsoft UK study, is the entire AI risk management problem in miniature: the people most anxious about shadow AI are also its heaviest users, and they aren't switching to sanctioned alternatives.
The gap is structural, not anecdotal. Zluri's 2026 analysis puts roughly 80% of enterprise AI tools outside any management plane. And the clock is running: on 2 August 2026, the EU AI Act's high-risk obligations become enforceable, with fines up to €15M or 3% of global turnover.
TL;DR
- Shadow AI is the dominant enterprise AI exposure: ~80% of tools unmanaged, 68% of organizations reporting an AI data leakage incident.
- The driver is deadline pressure, not malice. BlackFog found 60% of employees would take risks with AI tools to ship on time.
- The EU AI Act's high-risk provisions bite on 2 August 2026 after the deferral proposal collapsed in April 2026.
- The fix that works in practice is a sanctioned gateway plus tiered governance, not a ban. JPMorgan and BBVA are the proof.
What is AI risk management, and why is it failing right now?
AI risk management is the discipline of inventorying, controlling, and monitoring every AI system that touches enterprise data, then routing each one to oversight proportionate to its risk. It's failing in most organizations for a simple reason: AI adoption happened bottom-up through individual employees, while governance was designed top-down for procured software.
IDC describes the result as a shift from a few thousand licensed seats to tens of thousands of unsanctioned endpoints. The Komprise IT survey ranks shadow AI alongside cloud cost and data compliance as a top management headache.
Shadow AI is not a fringe behavior problem. It is the default state of enterprise AI in 2026, and any risk program that doesn't start from that assumption is governing a fiction.
Here's what the convergent survey data looks like:
One honesty note: several of these are vendor-conducted surveys with methodology that hasn't been independently audited. Treat them as directional, not gospel. The direction, though, is unambiguous.
How does shadow AI actually cause data breaches?
Shadow AI causes breaches by routing sensitive data to third-party model providers under consumer terms the enterprise never negotiated, where it can be retained, reviewed by humans, or used for training. The prompt box is the exfiltration channel, and IBM's analysis frames it as shadow IT with a much wider blast radius.
The canonical incident remains Samsung, where engineers pasted sensitive source code into ChatGPT to debug it in early 2023. Samsung banned generative AI company-wide on 1 May 2023 and now runs a locked-down internal LLM with prompts capped at 1,024 bytes.
The compliance exposure falls into four buckets:
- Data-protection violations. Inputs retained and reviewed by a provider are incompatible with GDPR, HIPAA, and PCI-DSS without a signed data processing agreement covering the flow.
- IP leakage. Source code, roadmaps, and M&A materials can persist in provider systems for training or human review.
- Cross-border transfer exposure. Inference regions and sub-processor lists are now first-order concerns for European multinationals, with the EU AI Act layering registration duties on top.
- Untrusted outputs. Hallucinated or biased completions in customer-facing or regulated contexts are the second-order risk most enterprises underestimate.
And the pull is real. PYMNTS reporting on OpenAI enterprise data shows employees save roughly an hour a day with sanctioned AI access. People aren't reckless; they're productive without permission.
Which AI governance frameworks actually work?
No single framework is sufficient. The effective stack pairs NIST AI RMF for structure, ISO/IEC 42001 for certifiable proof, MITRE ATLAS for adversarial testing, and vendor data terms as a procurement gate. Each covers a failure mode the others miss.
| Framework | Type | Enforcement | Best for | Main weakness |
|---|---|---|---|---|
| NIST AI RMF + GenAI Profile | Voluntary framework | None (procurement language) | Structuring risk tiers; US procurement | Voluntary; no certification body |
| ISO/IEC 42001 | Management system standard | Certification audit | Provable governance | 12-18 months; no content-specific controls |
| MITRE ATLAS | Threat taxonomy | None | Red-teaming, incident response | Threats, not controls |
| EU AI Act (Annex III) | Binding regulation | €15M/3% (high-risk), €35M/7% (prohibited) | The compliance floor | Broad, sometimes ambiguous high-risk list |
| Vendor data policies | Contract terms | Contract | Procurement gating | Vary by provider and tier |
Vendor terms deserve more scrutiny than they get. Anthropic's data policy is the instructive case: through August 2025, commercial API and Enterprise data was retained 30 days and excluded from training. Effective 28 August 2025, consumer tiers became opt-in for training with retention up to five years, while commercial terms stayed unchanged.
The consumer/commercial split is exactly the line your procurement team should be reading, because an employee on a personal Pro account sits on the wrong side of it.
On the model-safety side, Anthropic's Responsible Scaling Policy v3.0, published 24 February 2026, replaced its hard-pause structure with tiered AI Safety Levels keyed to named capability thresholds. It's now the most-cited voluntary lab commitment and is referenced in the EU's GPAI Code of Practice.
But it governs one lab. It does not govern your deployment.
The regulatory map: EU floor, US patchwork, LatAm in motion
For multinationals, the EU AI Act is the de facto global floor. Three of its six enforcement phases are already live, and the high-risk Annex III obligations (risk management, data governance, technical documentation, logging, transparency, human oversight, robustness) become enforceable 2 August 2026. The Commission's Digital Omnibus proposal to push that to 2027-2028 failed in trilogue on 28 April 2026, so the deadline stands. Deployers should read Article 26 closely: it puts oversight, logging, and incident-reporting duties on you, not just your vendor.
The US has no comprehensive federal law. Federal posture rests on OMB M-25-21's agency requirements and sectoral enforcement (FTC, SEC, FDA), while the state patchwork hardens: the Colorado AI Act became enforceable 1 February 2026, Texas TRAIGA took effect in 2025, and California's SB 1047 was vetoed in September 2024 over innovation concerns.
The Bipartisan Policy Center is pushing federal preemption for frontier developers while preserving state authority over concrete harms.
Latin America is principles-led but converging on the EU model. Brazil's PL 2338 cleared the Senate and borrows the EU's risk-tiered approach; Chile updated its national policy in 2024; Mexico is moving toward binding sectoral standards.
What successful AI risk management looks like in practice
The organizations that got this right share one move: they made the sanctioned path easier than the shadow path.
JPMorgan Chase banned external ChatGPT early, then replaced the ban with LLM Suite, an internal gateway to OpenAI and Anthropic models for roughly 250,000 employees, refreshed on an 8-week model cadence with centralized use-case intake. Its earlier COiN contract-intelligence program is credited with saving about 360,000 legal hours per year.
BBVA went federated: 3,000 ChatGPT Enterprise seats in 2024, expanding to over 2,900 custom GPTs built by business units inside guardrails, with 80% of users reporting more than two hours saved weekly and the CEO publicly championing the program.
Microsoft shows the institutional layer: its Responsible AI Standard v2 is enforced by the Aether committee, which has authority to block deployments, not just advise. Governance without a body that can say no is documentation.
Scale is consolidating around few providers, which makes vendor terms strategic: Deloitte's 470,000 Claude seats is the largest publicly disclosed enterprise deployment, alongside Goldman Sachs and Bridgewater.
Doesn't governance kill innovation?
The honest answer: badly designed governance does, and so does no governance at all. Newsom's SB 1047 veto warned that stringent rules applied to all frontier models regardless of deployment context would chill innovation, and the techno-optimist camp argues regulation freezes a domain that's still moving.
But McKinsey's survey work keeps finding that managing AI risk is itself among the top barriers to enterprise adoption. Unmanaged risk slows deployment too; it just does it through incident response, legal holds, and board-level panic instead of process.
The right posture is structured adoption: ship, but with the gateway, inventory, and escalation path in place before scale-out, not after.
What this means for you
If you own this problem, the 2026 priority order is:
- Deploy an AI gateway first. Log and control all model traffic before writing a single policy doc. It's the one control that addresses the 80%-unmanaged number directly.
- Tier your systems with NIST AI RMF and map anything touching EU users against Annex III before 2 August.
- Stand up a cross-functional AI risk committee with deployment-blocking authority, on the Aether and LLM Suite intake templates.
- Rewrite procurement language to require disclosed retention windows, training opt-outs, sub-processor lists, and NIST or ISO 42001 conformance.
- Red-team against MITRE ATLAS and name, in writing, who can take a system offline.
- Train everyone. The BlackFog and Komprise data both say employees want to do the right thing and don't know what it is. Give the sanctioned tool and the one-page rule, and most shadow usage converts itself.
The race between deployment and governance won't be won by slowing deployment. It's won by making governance the fastest path to production.
Sources
- Microsoft UK: Rise in shadow AI tools raising security concerns, the 29%/71% shadow AI figures
- Zluri report: 80% of enterprise AI tools operate unmanaged
- BlackFog: 60% of employees would take AI risks to meet deadlines
- IDC: Shadow AI and stealth productivity
- Komprise IT survey on shadow AI concerns
- The Samsung ChatGPT incident
- Anthropic data usage policies, consumer vs. Commercial retention terms
- Anthropic updated Responsible Scaling Policy
- NIST AI Risk Management Framework 1.0 and RMF Playbook
- European Commission: AI Act enters into force and Article 26 deployer obligations
- Gibson Dunn: EU AI Act Omnibus negotiations
- House Science Committee on the SB 1047 veto
- Bipartisan Policy Center small business AI poll
- Microsoft: Securing AI, risks and compliance
- McKinsey: Governance, risk, and compliance best practices
