How does an AI agent handle exceptions that an MES cannot?

Manufacturing exceptions involve judgement: a customer changes a delivery date last minute, a supplier is short, a machine throws intermittent quality issues. An MES routes these to a human queue. An AI agent reads the context across MES, ERP, CRM, email, and engineering systems, proposes a resolution, and either acts within defined authority or escalates with full context for human approval.

How does the Betriebsrat affect AI in production?

German works councils have co-determination rights for technical systems that can monitor employee behaviour or performance. Any AI agent that captures data linked to specific operators requires Betriebsrat consultation. Most production AI use cases stay aggregate (machine, line, shift, lot) and can be designed to avoid personal-performance attribution.

Is this just RPA on top of the MES?

No. RPA scrapes UIs and replays clicks - it would break every time the MES vendor pushed an update. AI agents integrate through APIs, reason about goals rather than scripts, and handle exceptions instead of routing them to a queue. The architectural difference matters most precisely on the shopfloor.

Back to Blog

MES or AI Agent: Where the Boundary Runs on the Mittelstand Shopfloor in 2026

7 May 202634 min read

Henri Jung

Co-founder at Superkind

A dark metal industrial control hub with many fixed machine connection ports and a central rotary decision unit on top with an orange ring - a metaphor for an MES collecting machine data with an AI agent decision layer above

At 06:42 on a Tuesday morning, a press in a German Mittelstand metal-stamping plant threw a die-misalignment alarm. The MES did exactly what it was supposed to: logged the event, paused the order, sent an alert to the line supervisor. By 07:10 the supervisor had identified the root cause, but the production schedule still showed the original sequence. Three downstream cells continued setting up for parts that now would not arrive on time. By the end of the shift, four customer deliveries were at risk - none of them flagged anywhere outside the supervisor’s head.

That is not an MES failure. It is an MES being used for what it was never built to do. The MES recorded the event flawlessly. What it cannot do is reason across the schedule, the customer order book, the on-time-delivery commitments, the alternate routings, and the supervisor’s in-progress thinking. That is the work of an AI agent - and confusing the two is what wastes the next round of Mittelstand factory IT budget.

This article is not about whether MES is dead (it is not). It is about understanding precisely what each tool is built for, where the boundary runs, and what the hybrid architecture actually looks like in a real Mittelstand plant in 2026.

TL;DR

MES is a transactional system: it records, tracks, dispatches, and reports work according to defined rules. It is the system of record between the shopfloor and ERP under ISA-95.

An AI agent is a goal-directed reasoning layer that operates on the data the MES collects, plus context the MES does not see (customer email, supplier delays, free-text operator notes), and decides or escalates with context.

Replacing the MES with an agent is the wrong play. The MES does its job. The agent does what the MES was never built to do.

The hybrid architecture wins. MES stays as system of record. AI agent sits on top, reads through APIs, and acts on goals: rebalancing schedules, predicting quality, drafting 8D reports, escalating with full cross-system context.

Most Mittelstand factories will run an MES and one to three production AI agents by 2028, not an AI-only stack. Gartner predicts 40 percent of enterprise apps will feature task-specific AI agents by 2026⁷ - shopfloor is no exception.

Why This Question Hits Hardest in Manufacturing

MES adoption in the German Mittelstand is patchy but mature. Some hidden champions run state-of-the-art MES platforms (MPDV Hydra, Industrie Informatik cronetwork, GFOS, Siemens Opcenter, AVEVA, Werum). Others run 15-year-old systems that nobody fully understands but nobody dares to replace. Either way, the MES sits at the centre of shopfloor IT - and the question of whether AI complements it or replaces it is now genuinely confusing for buyers.

The forces shaping the question right now

MES vendors have all added AI - Every major MES vendor in 2026 markets some form of AI: anomaly detection, predictive scheduling, vision-based quality. The features are real but bounded. They do not solve the cross-system, exception-heavy decisions that define Mittelstand production reality.
AI vendors claim MES territory - Some AI startups pitch “AI-native MES” that will replace traditional systems. The pitch ignores the regulatory, traceability, and ERP integration weight that real MES carries. Replacing a working MES is a multi-year project nobody should undertake on a vendor pitch.
The data gap is the real bottleneck - Gartner forecasts that 60 percent of AI projects will be cancelled by end of 2026 due to inadequate data foundations⁴. 70 percent of manufacturers identify data quality as their biggest implementation obstacle⁴. The MES is usually where that data lives.
The Mittelstand is under-invested - Horvath research shows mid-sized companies invest just 0.35 percent of revenue in AI, 30 percent below the global average⁴. VDMA finds fewer than 20 percent of German Mittelstand manufacturers have moved Industry 4.0 past pilot stage⁴. The instinct is to consolidate, not to multiply tools.

The shopfloor reality check

The MES does what it was designed for: capture, track, dispatch, report. It does not - and cannot, by design - reason about what to do when reality breaks the schedule. Most Mittelstand operations leaders feel this gap every shift. The question is not which tool is better. The question is which gap each tool fills.

The German Mittelstand context

German manufacturing carries specific weight that makes the MES versus AI agent question different from a US or Asian conversation. The decisions you make have to fit five concurrent realities, not just one technical preference.

SAP everywhere - Most German Mittelstand factories run SAP at Level 4 (S/4HANA, ECC, or Business One). The MES sits at Level 3 and integrates upward through ISA-95-aligned interfaces. Any AI agent that wants to add value has to respect this hierarchy.
Mittelstand-specific MES - Industrie Informatik, MPDV, GFOS, iTAC dominate the Mittelstand mid-tier. Not Siemens-class enterprise MES, not light-touch tools. They encode decades of German industrial process logic. Replacing them is rarely the right move.
Audit and traceability burden - Automotive (IATF 16949), pharma (GMP), food (HACCP), aerospace (AS9100) all require traceable, auditable production records. The MES is the audit trail. AI agents must respect it, not bypass it.
Betriebsrat co-determination - Any system that monitors employee behaviour or performance is co-determination territory. Production AI agents that stay aggregate (machine, line, shift) are easier to deploy than agents that score individual operators.
Skills shortage and demographics - 40 percent of German manufacturers cannot find AI-qualified workforce⁴. The architecture has to fit a small in-house IT team, not a 200-person enterprise IT function.

What an MES Actually Is

MES (Manufacturing Execution System) is the software layer between ERP and the shopfloor. ISA-95, the international standard for enterprise-control system integration, places MES at Level 3 - sitting above SCADA and PLC (Levels 1 and 2) and below ERP (Level 4). Its job is to translate ERP-scheduled production orders into machine-level execution and capture the result.

The defining characteristics

Transactional core - Records production work order by order, station by station, lot by lot. The MES is the source of truth for what was actually produced - quantities, qualities, times, scrap, rework.
Rule-based dispatching - Decides which work order goes to which machine based on configured logic. Optimisation is bounded by the rules; novel situations need human override.
Machine data acquisition - Connects to PLCs, SCADA, and sensors to pull live machine state, OEE inputs, alarm conditions. This is the data layer most AI agents read from.
ISA-95 integration to ERP - Pushes production results to ERP for back-end accounting, inventory, and financial reporting. Pulls planned orders down from ERP for execution.
Traceability and audit - Maintains the genealogy of every lot, every batch, every part - which machines made it, which operators ran the shift, which quality results applied. Compliance-critical for regulated industries.
OEE and KPI calculation - Calculates Overall Equipment Effectiveness, throughput, yield, and other KPIs from the data it captures. The numbers feed dashboards, leadership reports, and improvement projects.
Operator UI on the shopfloor - Touchscreens at each station show the current order, capture confirmations and rework, surface alarms. The MES is the daily tool of every operator and supervisor.

“40 percent of enterprise applications will feature task-specific AI agents by 2026, up from less than 5 percent in 2025.”

- Gartner, Press Release on Enterprise AI Agent Adoption⁷

Where the MES is genuinely brilliant

Compliance-grade traceability - When the customer audit asks “which press, which shift, which operator made this part on this date with which raw material?”, the MES has the answer in seconds. Building that on top of an AI agent would be reinventing audit infrastructure.
Real-time machine-level dispatching - Telling Press Line 3 which die change comes next, capturing the changeover start and end, dispatching the order to the operator screen. Deterministic, fast, fits the rhythm of the shopfloor.
OEE calculation under load - The MES does the boring, structured calculation work that produces the daily OEE number. The number can be wrong if the data is wrong, but the calculation itself is solid.
ERP integration - Production confirmation back to SAP, inventory updates, cost centre charging. ISA-95-aligned MES handle this through stable API contracts.
The 24/7 system of record - The MES runs whether a planner is at her desk or not. It does not need a human in the loop for every transaction. That is exactly right for the structured, rule-based work it handles.

Where the MES hits its design limits

Cross-system reasoning - The MES sees production data. It does not see customer emails, supplier delivery notifications, contract penalty clauses, ERP credit holds. Decisions that require this context fall through the gap.
Unstructured input - An operator notes “Material seems off” in free text. A customer emails a delivery change. A supplier sends a PDF announcing a quality issue. The MES cannot read any of it.
Exception handling beyond defined rules - The schedule says A then B then C. Reality breaks: the customer of B postpones, the customer of D has an urgent ad-hoc order. The MES needs a human to redo the schedule. There is no “just figure it out” option.
Document-heavy back-end work - 8D reports, customer complaint responses, supplier corrective action requests, internal audit prep. The data is in the MES. The work is not the MES’s job.
Cross-departmental orchestration - When production, sales, logistics, and quality all touch the same problem, no single MES coordinates them. The orchestration falls to the supervisor or planner who happens to be on shift.

What an AI Agent in Production Actually Is

An AI agent in a manufacturing context is not a chatbot, not a dashboard, not an MES replacement. It is a goal-directed reasoning layer that reads from the systems already in place (MES, ERP, CAQ, CRM, document systems, email), reasons about what to do toward a defined production goal, and either acts within bounded authority or escalates to a human with full context.

The defining characteristics

Goal-directed, not script-directed - You define the outcome (rebalance the schedule, draft the 8D, predict the lot quality risk, escalate the customer impact). The agent figures out how to get there using available tools.
Reads across systems - MES, ERP, CAQ, customer email, supplier portal, document management. The agent reasons about the full context, not just MES data.
Handles unstructured input - PDFs, emails, free-text operator notes, contract clauses, supplier announcements. About 80 percent of enterprise data is unstructured - and most of it is invisible to the MES.
Sits above the MES, not beside it - The MES stays as system of record. The agent reads from it and writes back through proper APIs. No bypassing, no replacing.
Respects ISA-95 hierarchy - Production AI agents run at Level 4 (or as a layer between 3 and 4). They do not write directly to PLCs or machines. The MES remains the only system that talks to the shopfloor execution layer.
Escalates with context - When the agent encounters a decision outside its authority or confidence, it escalates to a human with what it found, what it tried, and what it recommends. Reviews are 5 to 10 times faster than handling the raw exception alone.
Pricing by use case, not by user - Agents are typically priced per active production use case (per agent, per process), not per shopfloor seat. The economics fit a small team handling high volume.

What an AI agent in production is not

Not a chatbot for shopfloor - Operators do not need a chat window. They need work to flow. Agents act behind the MES UI rather than competing with it.
Not a replacement for SCADA, MES, or PLC - The execution layer stays. Agents add the reasoning layer above.
Not RPA on top of the MES - RPA scrapes UIs and breaks on every update. Agents integrate through APIs, reason about goals, and handle exceptions natively.
Not magic - Gartner predicts more than 40 percent of agentic AI projects will be cancelled by end of 2027¹², mostly due to inadequate governance and unrealistic scope. Production AI needs human-in-the-loop checkpoints, audit logs, and rollback paths.

The architectural principle

The MES is the system of record. The AI agent is the reasoning layer. The MES never disappears. The AI agent never bypasses the MES. They sit in a stack: machines feed MES, MES feeds the agent context, agent reasons, agent writes decisions back to MES or escalates to a human. Every Mittelstand factory in 2028 will run something close to this stack.

Six Differences That Matter on the Shopfloor

These differences are not philosophical. They show up in how each system behaves at 03:00 when something breaks, how procurement evaluates them, and what happens during an audit.

Dimension	MES	AI Agent
Primary purpose	System of record for production execution	Reasoning and decision layer above
ISA-95 layer	Level 3 (manufacturing operations)	Level 4 or 3-4 boundary (decision support)
Decision logic	Configured rules, deterministic	Goal-directed reasoning, adaptive
Data scope	Production data, structured	Production + ERP + CRM + email + documents, mixed
Unstructured data	Cannot process	Native capability
Exception handling	Routes to human queue or stops	Reasons, decides within authority, or escalates with context
Update model	Vendor releases, periodic retraining	Continuous improvement from feedback loops
Pricing model	Per seat / station / module	Per active use case or platform fee
Audit and traceability	Primary source of truth	Logs every decision; defers to MES for record
Time to first deployment	6-18 months for greenfield MES	8-12 weeks for first focused agent

Difference 1: What it is for

The MES exists to record and dispatch production work according to defined rules. It is the system of record for what happened on the shopfloor. The AI agent exists to reason about decisions across the data the MES collects plus context the MES cannot see. The MES owns the truth. The agent owns the reasoning.

Difference 2: How it makes decisions

The MES decides through configured rules. If the order priority is X and the available machine is Y, dispatch to Y. The rules are predictable, deterministic, auditable. The AI agent decides through goal-directed reasoning. Given the goal of meeting customer commitments and the current state across systems, what is the best next action. Different paradigm, different outcomes on novel situations.

Difference 3: What data it sees

The MES sees what shopfloor sensors and operator inputs feed into it. Structured, machine-generated, station-specific. The agent sees the MES data plus everything else: ERP, CRM, customer email, supplier portal, contracts, free-text notes. The decision quality depends on the breadth of context, not just on the algorithm.

Difference 4: How it handles exceptions

The MES routes exceptions to the supervisor or planner. The exception queue grows in proportion to the variability of the production environment. The agent handles exceptions inline: it reasons about what happened, proposes a resolution, and either acts within defined authority or escalates with context. The supervisor reviews the agent’s analysis, not the raw exception.

Difference 5: Audit and traceability

The MES is the audit source of truth. Every part has a genealogy in the MES. The agent logs every decision it makes - what it found, what it considered, what it decided, what it escalated - but the production record itself remains in the MES. The agent creates an additional layer of decision audit, which is often valued during regulatory reviews.

Difference 6: How fast it deploys

An MES greenfield deployment takes 6 to 18 months and several million euros in a Mittelstand context. A first focused AI agent on top of an existing MES takes 8 to 12 weeks and 50,000 to 200,000 euros. The economics flip: MES is a long-cycle investment in infrastructure; agents are a short-cycle investment in specific outcomes.

MES Strengths

Compliance-grade traceability and audit trail
Real-time machine-level dispatching
OEE and KPI calculation under load
ISA-95-aligned ERP integration
24/7 reliability without human in the loop per transaction
Mature ecosystem for German Mittelstand

AI Agent Strengths

Cross-system reasoning beyond MES data
Native handling of unstructured input
Exception resolution with context
Goal-directed adaptation to novel cases
Document-heavy back-end work (8D, audit prep)
8-12 weeks to first deployment

Where Each One Wins

The framing “MES or AI agent” sets up a false binary. The real question is process-by-process: which one fits which job. The same Mittelstand plant will rely on its MES for some work and lean on an AI agent for other work - and the boundaries will run cleanly between them when both are deployed deliberately.

Where the MES wins clearly

Work order dispatching - Right work order, right machine, right time. Deterministic, structured, audit-critical. The MES is built for this.
Production confirmation back to ERP - When a lot is finished, the MES posts the result to SAP for inventory and accounting. Stable, regulated, no judgement involved.
Lot genealogy and traceability - When a customer audit or recall investigation needs to trace a specific part backward through every machine, every shift, every raw material, the MES is the answer.
Real-time machine data acquisition - Reading PLC and sensor data, calculating OEE inputs, surfacing alarm conditions. Latency-critical, deterministic, machine-level.
Operator UI on the shopfloor - The touchscreen at each station, the order list, the rework reason capture. Built for the rhythm of production work.
Shopfloor compliance documentation - GMP batch records, automotive PPAP traceability, food HACCP logs. Regulated, structured, audit-ready.
SCADA-to-ERP integration under ISA-95 - The standardised, audit-aligned bridge between business systems and shopfloor execution.

Where an AI agent wins clearly

Exception handling beyond defined rules - When the schedule breaks because reality does not match the plan, an agent reasons about the right resolution across MES, ERP, customer commitments, and supplier status.
Schedule rebalancing across cross-system context - Customer urgency, supplier delivery, capacity availability, due-date pressure - the agent integrates inputs the MES does not see and proposes a new sequence with rationale.
Predictive quality risk on a per-lot basis - Reading sensor data plus material lot history plus operator notes plus supplier quality history to flag a lot for inspection before it ships. The MES has the data; the agent has the reasoning.
8D report drafting from MES, CAQ, and engineering data - Pulling defect data, root-cause analysis from CAQ, supplier history from ERP, drafting the 8D for the quality engineer to review. Saves days per case.
Customer complaint response drafting - Reading the customer email, finding the specific lot in the MES, drafting a fact-based response with affected quantity, root cause, and corrective action.
Supplier complaint and SCAR letter drafting - Compiling production impact data, generating Supplier Corrective Action Requests with attached evidence.
Audit prep across MES, CAQ, and engineering systems - Pulling the right records for an upcoming customer or certification audit. Hours instead of weeks of manual collation.
Operator query handling - When an operator asks “what is the spec for this part?” or “why was the last setup changed?”, the agent reads engineering, MES, and quality data to answer in seconds.
Cross-shift handover summarisation - End-of-shift summary covering open issues, in-progress orders, machine status, escalations, drafted automatically from MES and operator notes.

Where neither is the right answer

Broken processes - If the production process is poorly designed, no software fixes it. Map and improve the process first, then deploy the right tool.
Genuinely safety-critical control loops - Real-time machine safety, emergency stops, pressure relief. PLC and safety systems own this layer. AI does not belong here.
Tiny volume, low value processes - Three setups a year for a low-margin product. Spreadsheet plus checklist beats automation.
Politically blocked processes - When the friction is between departments, not in the work. Tooling makes politics worse, not better.

Wondering where the boundary runs in your plant?

Book a 30-minute call. We will map a concrete shopfloor process against MES capabilities and AI agent fit - and tell you straight which side it belongs on.

Book a Demo →

Three dark metal industrial blocks stacked vertically with the top block showing organic decision elements - representing the production stack: machine layer, MES layer, AI agent decision layer

The Cost Comparison Done Honestly

Comparing MES cost to AI agent cost is the wrong frame. They solve different problems. The honest comparison is the cost of the gap each one leaves and the work each one removes.

What an MES actually costs in the Mittelstand

Licence model - Per station, per module, sometimes per concurrent user. A typical Mittelstand mid-tier MES costs 1,500 to 3,500 euros per station for the base licence, plus modules for OEE, quality, traceability, document control.
Implementation - The dominant cost line. 250,000 to 1,500,000 euros for a Mittelstand greenfield MES depending on plant complexity, number of lines, and modules. Often takes 9 to 18 months.
Annual maintenance - 18 to 22 percent of licence value per year for support and updates, plus internal admin time.
Integration to ERP and engineering - 50,000 to 250,000 euros for SAP integration, PDM/PLM connections, CAQ links. Sometimes ongoing as systems update.
Operator training and rollout - 30,000 to 80,000 euros depending on operator headcount and shift coverage.
Periodic upgrade projects - Major version upgrades every 5 to 8 years, typically 30 to 50 percent of original implementation cost.

What a production AI agent actually costs

Platform / agent fee - 2,000 to 8,000 euros per month per active production use case, depending on complexity, volume, and integration scope.
LLM inference cost - Cents per task. Even at thousands of tasks per day, total inference cost stays modest - low hundreds to low thousands of euros per month per use case.
Implementation - 40,000 to 120,000 euros for a focused first deployment. Mostly process mapping, MES API integration, validation against historical cases.
Integration maintenance - Lower than RPA or MES integration because agents tolerate API drift better. Typically 5,000 to 15,000 euros per year per active use case.
Monitoring and feedback loop - 0.2 to 0.5 FTE per active agent for review, correction, and continuous improvement.
No per-station scaling - The number of stations or operators does not change the agent cost. Per-process pricing fits per-process value.

Three-year total cost on a representative use case

Take an exception-handling use case: schedule rebalancing when a machine fails. Today, the supervisor on shift spends 30 hours per week resolving these cases manually across MES, ERP, customer order book, and shop emails. The MES alone cannot help (it routes the alarm; reasoning is the supervisor’s job). The agent reasons across systems and proposes resolutions for supervisor approval.

Cost Component	MES alone (status quo)	MES + AI agent
Year 1 platform / licence	(included in existing MES)	60,000 euros (5,000 euros/month agent)
Implementation	0	70,000 euros
Integration	0	15,000 euros
Year 1 total	0 euros	145,000 euros
Year 2 ongoing	0	65,000 euros
Year 3 ongoing	0	65,000 euros
3-year platform total	0 euros	275,000 euros
Hours of supervisor time recovered per week	0	~22 hours (out of 30)
3-year supervisor labour recovered (75 euros/hr)	0	257,400 euros
Avoided customer SLA penalties (3-year est.)	0	120,000 euros
3-year net (platform minus value recovered)	0 euros (no investment, no recovery)	+102,400 euros (net positive)

Why the platform-only comparison misleads

The MES-alone column shows zero investment - and zero recovery. The agent column shows 275,000 euros of platform spend over three years - but recovers more than 377,000 euros in supervisor time and avoided SLA penalties. The net is positive because the agent removes work the MES alone cannot. Comparing platform-to-platform misses the actual value lever.

Where the cost comparison flips against an agent

Stable, low-exception production - When the production environment is genuinely stable and the MES alone handles it cleanly, an agent adds cost without value.
Very low volume processes - When the supervisor work being saved is a few hours per month, agent platform fees do not amortise.
Bad data foundations - When the MES data is wrong or missing, the agent cannot reason reliably. Fix the data first; deploy the agent on the cleaner subset.
Pure machine-control problems - PLC, SCADA, real-time control. Agent does not belong in this layer.

The MES + Agent Architecture That Works

Almost every Mittelstand factory that adopts production AI agents lands in a hybrid: MES stays as system of record, agents sit on top. The architecture is not a compromise; it is the only configuration that respects ISA-95, regulatory traceability, and the operational reality of the shopfloor.

The four-layer stack

Layer 1: Machines, sensors, PLCs, SCADA - The physical execution layer. Real-time, safety-critical, deterministic. AI does not belong here.
Layer 2: MES - The system of record for production. Captures data, dispatches work, records traceability, integrates upward to ERP. Compliance-grade.
Layer 3: AI agents - The reasoning layer. Reads from MES, ERP, CAQ, CRM, document systems. Reasons about goals. Writes decisions back to MES through proper APIs.
Layer 4: ERP and business systems - SAP, financial systems, customer order book, supplier portal. The MES integrates here under ISA-95. Agents read from here as additional context.

The most common hybrid patterns we see in the Mittelstand

MES + agent for schedule rebalancing - MES dispatches and tracks. Agent monitors disruptions (machine failure, customer change, supplier delay) and proposes new sequences for the planner to approve.
MES + agent for predictive quality - MES captures sensor and operator data per lot. Agent reads sensor history plus material lot data plus supplier quality history and flags risky lots for inspection before they ship.
MES + agent for 8D and audit work - MES holds the production records. Agent pulls relevant data, drafts 8D reports, prepares audit binders, drafts customer communications. Quality engineer reviews and approves.
MES + agent for shift handover - MES has the operational state. Agent summarises the shift end-to-end, surfaces open issues, drafts the next-shift briefing. Saves 20 to 40 minutes per shift.
MES + agent for operator support - MES has the order, the spec, the recent events. Agent answers operator questions in seconds (“why did the setup change?”, “what is the tolerance for this feature?”) by reading MES + engineering systems.

The hybrid principle

The MES is the bedrock. AI agents are the layer above. Trying to replace the MES with an agent breaks audit trails, ISA-95 alignment, and operator workflow. Trying to solve agent-class problems with more MES configuration produces fragile rules nobody maintains. The architecture that wins in 2026 is intentionally both.

How the responsibilities split in a hybrid architecture

Responsibility	MES	AI Agent
Production system of record	Owner	Reads and writes; never replaces
ERP integration under ISA-95	Primary interface to Level 4	Reads ERP through MES or directly for context
Lot genealogy and traceability	Source of truth	Logs decisions; defers to MES for lot record
Operator UI at the station	Primary daily tool	Sits behind MES; surfaces summaries and answers
Schedule dispatching	Executes the schedule	Proposes schedule changes for approval
Exception detection	Surfaces alarms and rule violations	Reasons about cause and impact across systems
Quality data capture	Captures structured measurements	Adds context: material history, operator notes, customer feedback
Audit and compliance records	Primary audit source	Decision audit trail layered on top
Document-heavy back-end work (8D, audits)	Provides source data	Drafts the document for human review

The Decision Framework

Use this framework job by job to decide whether your existing MES is enough or whether an AI agent layer adds clear value. The output is a defensible recommendation you can present to operations leadership and IT in the same meeting.

Step 1: Classify the job

Execution job - Dispatch a work order, capture a confirmation, record a measurement, route a part. Structured, repeatable, audit-critical. MES territory.
Reasoning job - Decide what to do when reality breaks the plan, draft a document from cross-system data, predict a risk, summarise a shift. Cross-system, judgement-heavy, exception-prone. Agent territory.

Step 2: Score the data scope

MES data only - The decision needs only what the MES already has (machine, lot, sensor, time). MES rules can probably handle it.
Cross-system context needed - The decision needs MES plus ERP, CRM, customer email, supplier portal, contract terms, engineering documents. Agent territory.

Step 3: Score the variability

Low variability - The case looks the same every time. MES rule logic is sufficient.
High variability - The cases differ in small but meaningful ways. Rule maintenance becomes a permanent burden. Agent reasoning is more durable.

Step 4: Score the cost of delay

Low cost of delay - The work can wait for the supervisor or planner to handle it next morning. No urgency.
High cost of delay - Customer SLAs, line downtime, lot-at-risk decisions. Speed matters, and an agent that reasons in seconds beats a queue waiting for human attention.

Step 5: Read the decision matrix

Job Type	Data Scope	Variability	Recommendation
Execution	MES data only	Low	MES (configure rules)
Execution	MES data only	High	MES + targeted agent rules
Reasoning	Cross-system	Low	MES with periodic human review
Reasoning	Cross-system	High	MES + AI agent (clear winner)
Document drafting	Cross-system	Any	MES + AI agent
Real-time control	Machine signals only	Any	SCADA/PLC - not MES, not agent

Five questions before adding an AI agent on top of your MES

Is the MES data quality good enough for the agent to reason on?
Does the decision need context from systems beyond the MES?
Are exceptions consuming significant supervisor or planner time?
Is there a clear human-in-the-loop checkpoint for the agent’s output?
Will the agent respect ISA-95 boundaries (no direct PLC writes)?

Five yes answers means an agent will likely add real value. Two or fewer means fix the MES or the data first.

How Superkind Fits

Superkind builds custom AI agents that sit on top of existing Mittelstand MES, ERP, CAQ, and engineering systems. We do not replace your MES. We build the agent layer that does what your MES was never built to do - reason across systems, handle exceptions, draft documents, escalate with context.

Core capabilities for production environments

MES-native integration - APIs and data connectors for Industrie Informatik, MPDV Hydra, GFOS, iTAC, Siemens Opcenter, AVEVA, Werum, and SAP MII. We connect through stable interfaces, not UI scraping.
SAP-native context - SAP S/4HANA, ECC, MII, PP/PI, QM, MM. Agents read production orders, customer commitments, supplier status, quality records as context for shopfloor decisions.
CAQ and engineering integration - Babtec, Böhme & Weihs, iqs, plus PDM/PLM systems. Agents pull engineering specs, defect history, FMEA notes for quality and audit work.
Document intelligence - Reads operator notes, customer emails, supplier announcements, contract clauses. No template configuration required.
Cross-system orchestration in one workflow - A single agent reads MES + SAP + CAQ + email and produces a single decision or document. No swivel-chair work for the supervisor.
ISA-95 alignment - Agents run at Level 4 or at the Level 3-4 boundary. They never write directly to PLCs or machines. The MES remains the only system that talks to the execution layer.
Human-in-the-loop checkpoints - You define which decisions require approval and at what confidence threshold. Agents escalate with context rather than acting silently. Critical for high-stakes shopfloor decisions and EU AI Act alignment.
Audit trail layered on top of MES - Every agent decision is logged: what it found, what it considered, what it decided, what it escalated. The agent log complements the MES audit trail rather than replacing it.
EU deployment and DSGVO alignment - Agents run on EU cloud or your own infrastructure. Data does not leave your defined perimeter.
8 to 12 weeks to first production deployment - From process assessment to live operation on a focused first use case. No multi-year transformation.

Superkind vs alternative paths

Factor	Superkind	MES-embedded AI	In-house build
Time to first deployment	8-12 weeks	3-9 months (vendor roadmap dependent)	6-18 months
Cross-system reasoning	Native	Limited to MES data	If built
Unstructured input handling	Native	Limited add-on modules	If built
MES vendor lock-in risk	None - sits above MES	High - tied to MES vendor roadmap	None - you own it
EU / DSGVO alignment	Built-in; EU deployment supported	Varies by MES vendor and plan	Your responsibility
Internal expertise required	Process owner involvement	Admin / config team	AI engineering team
Pricing model	Per use case	Tied to MES licence	Internal cost

When Superkind Fits

You have an MES that works and stays in place
Decisions need cross-system context (MES + ERP + email + documents)
Exception volume is consuming supervisor or planner time
Document-heavy work (8D, audits, customer responses) eats engineering hours
You want a focused first deployment in weeks, not a multi-year transformation
EU deployment and DSGVO compliance matter
Process know-how is your competitive edge and you want to keep it

When Superkind Is Not the Right Fit

You do not have an MES and need one first - the agent works on top of an MES, not as a replacement
Production volume is too low to justify a focused agent build
MES data quality is bad enough that the agent cannot reason reliably
The use case is real-time machine control - that belongs in PLC/SCADA
Team is unwilling to participate in process mapping and feedback loops

The 90-Day Plan

This plan covers running the decision framework on a single shopfloor use case, validating MES data, deploying the agent in limited scope, and reaching first production value. Use it to align your operations leadership and IT.

Weeks 1 to 3: Use case selection and data audit

Pick three candidate use cases - Each one with measurable pain (supervisor hours, missed SLAs, audit prep weeks, scrap escalations). Document current pain in numbers.
Apply the decision framework - Score each use case on job type, data scope, variability, and cost of delay. Output: one use case clearly fit for an agent.
Audit MES data quality - For the chosen use case, validate that the data the agent will reason on is clean enough. Most agent failures trace back to bad data.
Confirm API access - MES vendor APIs (REST, SOAP, OPC-UA, file-based) for the data the agent needs. Engineering, CAQ, ERP integration confirmed.
Define success metrics - Hours recovered per week, exception resolution time, audit prep duration. Numbers measurable in 90 days.
Brief Betriebsrat where applicable - If the agent could touch employee performance data, start consultation early.

Weeks 4 to 8: Build and test

Process mapping detailed - Document inputs, outputs, decision points, system touches, exception types. The work that makes the deployment succeed.
Agent build against the process map - Prompt and tool design, integration setup, escalation thresholds, human-in-the-loop checkpoints.
Test against real historical cases - Pull past examples (resolved by the supervisor), run the agent against them, compare outputs.
Validate exception handling specifically - The hardest cases are the agent’s real test. Confirm escalations deliver useful context.
Confirm DSGVO logging - Audit trail captures what the agent did, why, and what data it accessed.
Train the team - Supervisors and planners learn how to review agent recommendations, correct errors, and feed back.

Weeks 9 to 12: Production and learning

Deploy to limited scope - 20 percent of cases, one product family, one shift. Run in parallel with the existing process.
Review every escalation and correction - Weekly cadence. What did the agent get wrong? Why? What is the correct answer?
Measure against baseline - Hours recovered, resolution time, audit prep speed. If numbers are not moving, diagnose before scaling.
Expand once metrics validate - Two to three weeks of stable operation at limited scope before scaling to full volume.
Document lessons for the next use case - Where was the framework right, where was it wrong, what would you do differently.

Go/No-Go Checklist Before Production Expansion

Agent operating reliably on the limited scope
Success metrics moving in the right direction
Exception or escalation rate at or below target
Audit logs and DSGVO documentation complete
Supervisors and planners comfortable with the review workflow
Betriebsrat sign-off obtained where required
Rollback procedure documented and tested
MES data quality monitored, not just at deployment

Frequently Asked Questions

An MES (Manufacturing Execution System) is a transactional system that records, tracks, and dispatches production work according to predefined rules. It tells the shopfloor what to make next, captures what was actually made, and feeds the result back to ERP. An AI agent is a goal-directed system that reasons about production decisions across the data the MES already collects - rebalancing schedules when a machine fails, predicting quality deviations, drafting 8D reports, escalating with full context. The MES is the system of record. The AI agent is the decision layer above it.

No. The MES handles the things it was designed for - work order tracking, machine data acquisition, ISA-95 integration to ERP, traceability, OEE calculation. An AI agent does what an MES cannot do well: reason about exceptions, integrate unstructured data (emails from customers, free-text notes from operators), and orchestrate decisions across MES, ERP, CAQ, and engineering systems. Replacing an MES is a multi-year project. Adding an AI agent on top is an 8 to 12 week deployment.

Most modern MES vendors have added AI features in 2025-2026: anomaly detection on machine data, predictive scheduling, vision-based quality. These features are useful but bound to what the MES sees. If your decision needs context that lives outside the MES (customer email about an urgent change, supplier delivery delay, contract clause on penalty fees), the embedded MES AI cannot reason about it. That is precisely where dedicated AI agents add value.

ISA-95 is the international standard that defines the integration between business systems (Level 4: ERP) and manufacturing operations (Level 3: MES) and below (Level 2: SCADA, Level 1: control, Level 0: machines). For an AI agent to add value in production, it must respect this hierarchy: read from MES and ERP through their proper interfaces, never bypass the MES to write directly to machines, and produce outputs that humans (planners, supervisors) can act on. AI agents that ignore ISA-95 break audit trails and compliance.

Not on its own. Gartner forecasts that 60 percent of AI projects will be cancelled by end of 2026 due to inadequate data foundations, and 70 percent of manufacturers identify data quality as their biggest implementation obstacle. An AI agent makes decisions on the data it can see. If your MES captures wrong start times, missing scrap reasons, or inconsistent shift data, the agent inherits that. Fix the most painful data gaps first, deploy the agent against the cleaner subset, and use the agent to surface remaining data issues.

Manufacturing exceptions involve judgement: a customer changes a delivery date last minute, a supplier is short, a machine is throwing intermittent quality issues, an operator notes something unusual in free text. An MES routes these to a human queue or stops the work. An AI agent reads the context across MES, ERP, CRM, email, and engineering systems, proposes a resolution (rebalance schedule, escalate to scheduler, reroute to a different machine, hold the lot for inspection), and either acts within defined authority or escalates with full context for human approval.

Yes. AI agents in production run on the IT side (Level 4 in ISA-95 terms) and read from MES through APIs. They do not need direct access to PLCs, SCADA, or machines. The agent uses the data the MES has already collected and writes back to the MES through proper interfaces. This means standard IT/OT segregation, firewalls between Levels 2/3 and Levels 3/4, and your existing OT security posture stay intact. No new attack surface on the production network.

Most production AI agents fall into the limited-risk or minimal-risk categories under the EU AI Act (fully applicable August 2026): scheduling assistance, anomaly detection, exception handling. High-risk classification kicks in for AI used in safety-critical contexts (machine safety, hazardous environments) or in employment decisions (AI scoring operator performance). Verify the risk classification of your specific use case before deployment. Document the data sources, the decision logic, and the human override path.

German works councils (Betriebsrat) have co-determination rights under the Betriebsverfassungsgesetz for technical systems that can monitor employee behaviour or performance. Any AI agent that captures data linked to specific operators (cycle time per person, error rate per person, productivity ranking) requires Betriebsrat consultation. Most production AI use cases stay aggregate (machine, line, shift, lot) and can be designed to avoid personal-performance attribution - which removes most works council blockers.

A focused first deployment typically takes 8 to 12 weeks from process assessment to live operation on a single use case (exception handling on a single line, predictive scheduling for one product family, automated 8D drafting for top suppliers). The first 2 to 3 weeks are process and data mapping. Weeks 4 to 8 cover integration and agent build. Weeks 9 to 12 are limited-scope production with parallel running and validation against baseline metrics.

Not necessarily. AI agents can read from older MES systems through their existing APIs (BAPI, OPC-UA, REST, files) - the integration is bounded and predictable. What matters is whether the data quality is good enough for the specific decision the agent needs to make. A 15-year-old MES with disciplined data capture can support agent use cases better than a brand new MES with sloppy operator entry. Audit the data first, then decide whether an MES upgrade is needed.

Typical Mittelstand pricing for a production AI agent: 2,000 to 8,000 euros per month per active use case (depending on complexity and volume), plus implementation cost of 40,000 to 120,000 euros for a focused first deployment, plus LLM inference cost of a few cents per task. The economics work when the agent removes 15 plus hours per week of human decision-making or recovery work, or when it prevents specific high-cost incidents (line stops, scrap escalations, missed customer SLAs).

No. RPA scrapes UIs and replays clicks - it would break every time the MES vendor pushed an update. AI agents integrate through APIs (most modern MES expose REST or OPC-UA), reason about goals rather than scripts, and handle exceptions instead of routing them to a queue. The architectural difference matters most precisely on the shopfloor, where MES UIs change frequently and exceptions are the rule rather than the edge case.

Sources

Henri Jung

Co-founder at Superkind

Co-founder of Superkind, where he helps SMEs and enterprises deploy custom AI agents that actually fit how their teams work. Henri is passionate about closing the gap between what AI can do and the value it creates in real companies. He believes the Mittelstand has everything it needs to lead in AI - it just needs the right approach.

Ready to find the right boundary in your plant?

Book a 30-minute call. We will map a concrete shopfloor use case against your existing MES capabilities and tell you straight whether an AI agent layer is justified - no sales pitch, just a frank assessment.

Book a Demo →

MES or AI Agent: Where the Boundary Runs on the Mittelstand Shopfloor in 2026

Why This Question Hits Hardest in Manufacturing

The forces shaping the question right now

The German Mittelstand context

What an MES Actually Is

The defining characteristics

Where the MES is genuinely brilliant

Where the MES hits its design limits

What an AI Agent in Production Actually Is

The defining characteristics

What an AI agent in production is not

Six Differences That Matter on the Shopfloor

Difference 1: What it is for

Difference 2: How it makes decisions

Difference 3: What data it sees

Difference 4: How it handles exceptions

Difference 5: Audit and traceability

Difference 6: How fast it deploys

Where Each One Wins

Where the MES wins clearly

Where an AI agent wins clearly

Where neither is the right answer

Wondering where the boundary runs in your plant?

The Cost Comparison Done Honestly

What an MES actually costs in the Mittelstand

What a production AI agent actually costs

Three-year total cost on a representative use case

Where the cost comparison flips against an agent

The MES + Agent Architecture That Works

The four-layer stack

The most common hybrid patterns we see in the Mittelstand

How the responsibilities split in a hybrid architecture

The Decision Framework

Step 1: Classify the job

Step 2: Score the data scope

Step 3: Score the variability

Step 4: Score the cost of delay

Step 5: Read the decision matrix

How Superkind Fits

Core capabilities for production environments

Superkind vs alternative paths

The 90-Day Plan

Weeks 1 to 3: Use case selection and data audit

Weeks 4 to 8: Build and test

Weeks 9 to 12: Production and learning

Related Articles

Frequently Asked Questions

Sources

Ready to find the right boundary in your plant?