AI GOVERNANCE MANUFACTURING

AI Governance for Manufacturers: A Starter Framework

By Jason Osajima — former VP of AI at a $250M manufacturer · LinkedIn ·
Quick answer

AI governance for manufacturing without the bureaucracy: a starter framework an ops leader can stand up in 30 days. Risk tiers, owners, evals, audit trail.

AI governance for a manufacturer is the set of rules that decides what your AI agents may touch, who owns each one, and what an agent must prove before it acts on a real decision. A starter framework only needs four things: an approved-data list, a risk tier per agent, a named owner for each, and a kill switch. If your governance does those four jobs and nothing else, you are already ahead of most of the mid-market.

I was VP of AI at a $250M furniture manufacturer. I shipped agents into purchasing, customer service, and ops planning. The governance that held up was one page, tied to real workflows, and owned by named people. Most AI governance for manufacturing is written by people who have never stood on a plant floor. It reads like a privacy policy, runs 40 pages, and lands in a SharePoint folder nobody opens.

Meanwhile a planner is pasting your supplier contracts into a public chatbot to summarize them. A CSR is letting an agent send pricing to customers with no review step. Governance failed not because it was wrong, but because nobody could use it. Here is the starter framework I would hand any COO or Head of IT who needs control without strangling the thing in committee.

Why governance is the constraint, not the paperwork

The shadow AI in your building is the real risk, and it is already running. A 2025 Gartner survey of cybersecurity leaders found that 69% of organizations suspect or have evidence that employees are using prohibited public GenAI tools. That is a planner one copy-paste away from leaking a supplier agreement.

The cost is no longer hypothetical. IBM's 2025 Cost of a Data Breach report found that 13% of organizations reported breaches of AI models or applications, and 97% of those lacked proper AI access controls. Breaches tied to shadow AI ran about $670,000 higher than the average incident.

Governance is also why pilots die on the vine. MIT's NANDA initiative reported that 95% of enterprise generative-AI pilots delivered no measurable P&L impact, and the cause was approach, not model quality. Agents that never earn the trust to act stay stuck in pilot. Good governance is the on-ramp to production, not the speed bump. I unpack that stall pattern in our pilot-to-production gap breakdown.

What AI governance for manufacturing actually has to do

Forget the compliance theater for a second. Governance has exactly four jobs, and they map cleanly onto how a plant already thinks about risk.

These four map directly onto the NIST AI Risk Management Framework, which is organized around the functions Govern, Map, Measure, and Manage. You do not need to adopt the full framework on day one. You do need to be honest that "ship it and hope" covers none of those four.

Everything beyond these four is refinement, not prerequisite. Keep that line bright. The mid-market loses a year building a governance bureaucracy that does less than one good page would.

The risk-tier model: not every agent needs the same controls

The single biggest mistake is treating a meeting-notes summarizer like an agent that adjusts inventory in your ERP. Tier your agents by blast radius — how much damage a bad output can do before a human catches it. This is the table I use.

Tier What it does Example Control required
T1 — Read/draft Surfaces info, drafts text a human sends Supplier-doc Q&A, QBR draft Owner + data scope logged. No approval gate.
T2 — Recommend Proposes a decision a human approves Order-hygiene flags, stockout alerts Human-in-the-loop on every action. Eval set required.
T3 — Act Writes to a system of record or contacts a customer Auto-reorder, customer pricing reply Approval gate + audit trail + rollback + named exec owner.

The rule writes itself: the higher the tier, the more it has to prove before go-live. A T1 supplier-doc agent can ship in a week. A T3 agent that touches your ERP earns its way up from T2 only after it has been right on real cases for weeks.

This tiering is not just my preference. ISO/IEC 42001, the first international AI management system standard, is built on risk-based controls and AI system impact assessment. The EU AI Act takes the same posture: it reserves the heaviest obligations, including mandatory human oversight for high-risk systems, and leaves low-risk uses light. Tier by impact, and your controls scale with your exposure instead of taxing everything equally.

Most teams should keep almost everything at T1 and T2 for the first year. You get the bulk of the value at a fraction of the risk, and your governance group stays small enough to actually meet.

The one-page governance doc

Here is what fits on a single page and covers a real manufacturer. Print it. Tape it to the wall of the war room. If it does not fit on a page, you are writing for auditors, not operators.

1. Approved data and tools

List what data agents may touch — supplier specs, order history, BI extracts — and what is off-limits without sign-off: employee PII, M&A material, anything under NDA. Name the approved platforms by product. If it is not on the list, it does not get fed to a model.

This one section stops the contract-in-a-chatbot problem cold. It also closes the access-control gap IBM flagged, where 97% of AI-related breaches involved organizations with no proper AI access controls. For the deeper version, see our guide to AI agent security risks.

2. The go-live checklist

Every agent passes the same gate before production. No exceptions, no "this one is special."

This is the operational heart of the framework. Our AI production readiness checklist expands each line into a pass/fail test you can hand to a build team.

3. Owners and the RACI

Every agent has a business owner — the plant or ops leader who answers for the outcome — and a technical owner who maintains it. IT is consulted on data access. No agent ships without both names filled in.

An agent without an owner is a science project, and science projects are where governance goes to die. The EU AI Act's human-oversight rule says the same thing in legal language: oversight must be assigned to named persons with the competence, training, and authority to act. A name, not a department.

4. Monitoring and the kill switch

Define who watches each live agent, how often, and what triggers a pause. For a T3 agent, that is a weekly review of every action it took, plus an alert on anomalies. The kill switch is a real thing: a documented way to disable the agent in minutes, tested before launch.

Monitoring is where most starter frameworks quietly fail. Set up the tooling once and reuse it across every agent — our AgentOps monitoring guide covers the logging, drift alerts, and audit trails that make a kill switch more than a line in a doc.

Who sits on the governance group

Keep it small. A 12-person AI committee never ships anything. McKinsey's 2025 State of AI survey found that only about a third of organizations report real maturity in AI governance, and bloated committees are a big reason why. The working version:

Four people, a 30-minute monthly review, and a fast async path for new agents. That is it. The point of the group is to clear agents to ship, not to invent reasons they cannot.

What this looks like in practice

A reorder agent comes up. Today it is conceptually T3 — it would write POs directly. Governance says no, not yet, and routes it to T2 instead.

It launches as a recommender: it drafts reorder recommendations, a buyer approves each one, and every recommendation is logged against what the buyer actually did. That human-in-the-loop step is the whole ballgame at this stage — our human-in-the-loop guide covers exactly where to place the approval so it catches errors without slowing the buyer down.

After six weeks the eval shows 94% of its recommendations were approved unchanged. Now the group has data, not a hunch. They promote the lowest-risk SKUs to T3 with a daily audit, and leave the long-tail, high-variability parts at T2. That is governance doing its job: enabling the move while keeping the proof.

The 30-day path to stand it up

You do not need a perfect framework. You need this running before the next shadow-AI copy-paste.

  1. Week 1 — Write the approved-data list and the go-live checklist. One page each.
  2. Week 2 — Tier every existing and proposed agent. Assign the two owners per agent.
  3. Week 3 — Stand up the four-person group. Run the first review against the checklist.
  4. Week 4 — Wire monitoring and the kill switch on anything already live. Document and test rollback.

Start Monday. The framework gets sharper with use, and the cost of waiting is measured in leaked contracts and stalled pilots.

Frequently asked questions

What is the minimum AI governance a mid-market manufacturer needs?

Four things: an approved-data-and-tools list, a risk tier for each agent, a named business owner and technical owner per agent, and a tested kill switch. That fits on one page and covers the four jobs governance actually has to do. Everything beyond that is refinement you can add once agents are in production.

How do AI risk tiers work for manufacturing agents?

Tier agents by blast radius. T1 agents read or draft and need only an owner and logged data scope; T2 agents recommend a decision a human approves and need an eval set; T3 agents write to a system of record or contact a customer and need an approval gate, audit trail, rollback, and a named exec owner. Higher tiers must prove more before go-live, and most agents should stay at T1 or T2 in year one.

Should manufacturers follow NIST, ISO 42001, or the EU AI Act?

For a starter framework, borrow the structure, not the full compliance burden. The NIST AI RMF gives you the Govern-Map-Measure-Manage backbone, ISO/IEC 42001 provides a certifiable management-system shape, and the EU AI Act matters only if you sell into or operate in the EU. Use them as a checklist for what to cover, then write your own one-pager.

What is shadow AI and why does it matter for plants?

Shadow AI is employees using unapproved AI tools — usually public chatbots — without oversight. A 2025 Gartner survey found 69% of organizations suspect or have evidence of prohibited GenAI use, and IBM tied shadow-AI breaches to roughly $670,000 in extra cost per incident. For manufacturers, the exposure is supplier contracts, pricing, and IP pasted into a model you do not control.

Who should own AI governance in a mid-market manufacturer?

A four-person group: an ops or plant leader, IT/security, one finance voice, and the AI lead. They meet 30 minutes a month and run a fast async path for new agents. Keep it that small — a 12-person committee never ships, and the group exists to clear agents to ship, not to block them.

Let's see what's worth building first.

A 15-minute call: tell me where your AI or planning is stuck, and I'll tell you the one thing worth building first — and whether it's worth doing at all.

More field notes

AI Agent Security Risks Manufacturers Must ManageHuman-in-the-Loop AI for Operations: When to Use ItAI Compliance Checklist for Manufacturing LeadersAI Implementation Services for Manufacturers