Intelligent Document Processing: How the Mittelstand Replaces OCR With AI Agents

29 April 202634 min read

Co-founder at Superkind

Stack of structured documents replacing legacy OCR with intelligent document processing

Walk into the back office of almost any Mittelstand manufacturer and you will find the same setup. ABBYY FineReader on a server in the corner. A folder of 800 invoice templates that no one has touched in three years. Two clerks fixing exceptions all day because the OCR keeps misreading Lieferschein numbers when the supplier rotates their PDF layout. The system “works”, in the way a 2014 ERP “works”.

And yet, 65 percent of companies are now accelerating their intelligent document processing projects⁴. The reason is not that OCR has stopped working - it is that LLM-based IDP has finally stopped being a research demo. Modern systems extract structured data from the messy 30 percent of documents that legacy OCR routes to manual review, handle layout variation without templates, and run inside German-region cloud or on-premise. Gartner published its first-ever Magic Quadrant for IDP in 2025², naming ABBYY, Hyperscience, and Tungsten Automation as Leaders^8,10. The market grew past USD 2 billion and is on track to hit USD 14 billion by 2026¹⁷.

This guide is for the CFO, Head of Finance, or IT Director who already runs OCR in production and knows it is not enough anymore. No hype. No “rip and replace”. Just the migration playbook, the compliance map (E-Rechnung, GoBD, GDPR), the vendor landscape, and what 90 days of focused work actually delivers.

TL;DR

Intelligent document processing is no longer a buzzword for OCR. It is OCR plus an LLM agent that reads context, handles exceptions, and writes structured fields straight into your ERP, DATEV, or workflow tool.

The legacy stack still has a role. ABBYY and Tungsten Automation handle the high-volume printed cases at 99 percent accuracy. The new layer takes over the messy 30 percent: handwritten attachments, layout shifts, multi-page contracts, exception loops.

E-Rechnung does not eliminate IDP - it shifts it. Structured invoices flow straight through; everything else (PDFs from non-compliant suppliers, delivery notes, contracts, claims) still needs intelligent reading.

90 days is enough to migrate one document class to LLM-based IDP, prove ROI, and decide what to expand next.

The real risk is not migration cost. It is staying on a legacy template stack while competitors cut clerk hours by 70 percent and redeploy people to the work that actually moves margin.

The OCR Wall: Why Legacy Stacks Stop Scaling

For two decades, the Mittelstand answer to document load was templated OCR. Buy ABBYY or Kofax (now Tungsten Automation). Build templates per supplier. Train operators to fix exceptions. The system handled 70 percent of documents straight through and routed the rest to a clerk. That arrangement worked because document volume grew slowly and supplier formats were stable.

Both assumptions broke around 2023.

Volume jumped - E-Rechnung readiness work doubled inbound document audits in 2024-25. Procurement digitisation pushed PDFs into AP that used to come on paper. Bitkom reports 38 percent of German companies cut paper-based AP by half between 2023 and 2025, with four out of ten going fully paperless¹.
Supplier formats churned - Suppliers refresh their invoice templates as they roll out their own ERP migrations. A Mittelstand AP team that once managed 300 templates is now managing 800. Each refresh breaks the OCR until someone retrains it.
Exception cost rose - Sachbearbeiter time costs EUR 35-60 per hour fully loaded. With 25-30 percent of invoices going to manual review, the “automated” OCR setup is quietly burning a junior FTE per 1,000 invoices per month.
Legacy licences got expensive - Annual licence and maintenance for an enterprise IDP suite (ABBYY Vantage, Tungsten TotalAgility) lands at EUR 50,000-200,000 for a Mittelstand deployment. That is real money for a system that handles only the printed text and dies on context.
The exception backlog became a hiring problem - The clerks who do the manual cleanup are exactly the role the labour shortage is hitting hardest. 28 percent of German companies cannot fill back-office positions, and turnover in AP/document teams runs at 18-22 percent annually.

Where the OCR Wall Hits

The exact failure mode is predictable: 70 percent of documents flow straight through, 25 percent need correction, 5 percent get rejected. The 25 percent costs more than the 70 percent saves. Adding more templates does not fix it - it just shifts which 25 percent breaks next.

This is the wall. Not technology failure - economic failure. The legacy stack still works for what it was built for. It just was not built for what document processing has become.

Pressure Point	Legacy OCR (2020)	Today (2026)
Supplier templates per AP team	200-400	600-1,200
Format mix	PDF + paper	PDF + XRechnung + ZUGFeRD + email + portal
Exception rate	15-20%	25-35%
Annual licence + maintenance	EUR 30-80K	EUR 50-200K
Time to onboard a new supplier format	2-5 days (template)	0-1 hours (LLM schema)
Handwritten attachment support	Manual transcription	95% accuracy out of the box

What “IDP” Actually Means in 2026

The IDP label has expanded twice. First, around 2018, when classical OCR vendors added machine learning classifiers and called the result “intelligent”. Second, around 2024, when LLMs started extracting structured data well enough that the templated approach started to look obsolete. Gartner’s 2025 Magic Quadrant - the first they have ever published for this category - acknowledges both².

A modern IDP system has four layers, not one:

Capture - Pulling documents in from email, EDI, scanners, supplier portals, and inbound mail. This is plumbing - usually unchanged.
OCR / vision - Turning pixels into characters. Legacy ABBYY, Tungsten, and Tesseract still excel here for printed text at 99 percent accuracy²⁶.
Understanding - Deciding what the document is, which fields matter, and what context applies. This is where LLMs replaced templates.
Action - Writing the extracted data into ERP, DATEV, RPA workflows, or human review queues, with full audit trail.

The 2024-26 shift was not at layer 2 - it was at layer 3. LLMs now read context. They know that a six-digit number near “Bestellnummer” in a German PDF is a purchase order reference even if it is in a different position than last month. They handle Lieferschein, Rechnung, Gutschrift, and Mahnung in the same pipeline without three different templates.

Capability	Legacy OCR + Templates	Classical IDP (2020)	LLM-Based IDP (2026)
Printed text accuracy	99%+	99%+	95-99%
Layout variation tolerance	None (template breaks)	Low (zone-based)	High (context-aware)
Handwriting	Fails	50-70% accurate	91-95% accurate¹¹
Multi-language in one doc	Manual switch	Limited	Native
New supplier onboarding	2-5 days per template	1-2 days	0-1 hours (schema only)
Exception handling	Routes to human	Routes to human	Reasons through 60-80%
Schema flexibility	Fixed per template	Fixed per type	Defined in plain text
Hallucination risk	None	None	Real - needs validation loop

The hallucination row matters. LLMs can confidently invent text that was never in the document. Claude tends to hallucinate less than GPT-4 in invoice extraction tests, but no model is at zero^11,28. This is why production IDP always pairs the LLM with a deterministic validation layer - schema checks, sum checks, supplier master data lookups - before the data hits your ERP. The validation loop is what turns a research demo into a system you can put behind your AP team.

Pros of LLM-Based IDP

✓ No templates to maintain - schema lives in plain language
✓ Handles supplier format churn - context beats coordinates
✓ Multi-language native - DE, EN, FR, IT in one pipeline
✓ Reasons through exceptions - reduces manual review by 60-80%
✓ Cheaper to extend - new doc type in hours, not weeks

Cons of LLM-Based IDP

✗ Needs validation loop - hallucination risk if shipped raw
✗ Inference cost per page - real, but typically EUR 0.001-0.01
✗ Longer cold-start latency - 1-3 seconds per page typical
✗ Compliance documentation new - GoBD audit trail needs explicit design

This is why the right architecture in 2026 is not “LLM replaces OCR”. It is hybrid. Keep the legacy capture and OCR for high-volume printed text. Add an LLM agent for layer 3 - understanding and exception reasoning. The two systems coexist for 6-12 months while you measure which documents the LLM handles better and quietly retire templates that no longer earn their keep.

“Generative/agentic AI is becoming an equalizer that challenges vendors’ ability to differentiate.”

- Boris Evelson, Vice President and Principal Analyst at Forrester³

5 Document Types Where LLM-Based IDP Wins

Not every document needs an LLM. Standardised XRechnung lands as XML and never sees a vision model. The cases where the new architecture earns its keep are where layout varies, language mixes, or the human has to read context. Here are the five that consistently deliver ROI in Mittelstand deployments.

1. Inbound Invoices (PDF and Paper)

The biggest volume and the most predictable ROI. Most Mittelstand AP teams already process 1,000-10,000 invoices per month. Even with E-Rechnung receive obligations active since 1 January 2025⁵, suppliers will ship PDFs for years - the sending obligation only applies to all firms by January 2028⁶.

Volume reduction in manual review - From 25-30 percent of invoices needing a clerk to 5-10 percent
Speed - 4 times faster end-to-end processing¹⁶
Error reduction - 38 percent fewer manual booking errors¹⁶
End-to-end time - 46 percent reduction from receipt to posting¹⁶
Cost savings - Deloitte measures 40-60 percent reduction in AP processing cost²⁵
McKinsey benchmark - Up to 80 percent efficiency gain in invoice processing for companies that get the implementation right¹⁴

Mittelstand Math

For a mid-sized company processing 1,000 invoices per month with a 28 percent exception rate at EUR 12 per exception (clerk time + system cost), legacy OCR is burning EUR 40,320 per year on manual review. An LLM layer that drops exceptions to 7 percent saves EUR 30,240 in year one - before counting faster cycle time, fewer late-payment penalties, and the discount-capture rate improvement.

2. Delivery Notes and Goods Receipts

Lieferscheine are messier than invoices. Half come folded inside the parcel. Quality varies. Layouts shift more often than invoices because logistics partners change. This is where templated OCR struggles most and the LLM agent earns the most.

Three-way match automation - Match purchase order, goods receipt, and invoice automatically; route only mismatches to humans
Quantity reconciliation - Catch under-deliveries and over-deliveries before they hit AP
Photo-of-paper support - Warehouse staff photograph a delivery note with their phone; the agent extracts and reconciles within seconds
Cross-reference master data - Validate supplier, item, and warehouse codes against your ERP
Exception flagging - Surface only the cases where automation is uncertain, with the reasoning attached for the clerk

3. Contracts and NDAs

Contracts are the document type where legacy OCR was never useful. The volume is too low to justify templates, the variation is too high, and the value is in the clauses, not the fields. LLM-based IDP changes this category fundamentally.

Clause extraction - Identify payment terms, liability caps, termination notice, IP ownership across hundreds of contracts in hours
Variant flagging - Catch supplier counter-edits to your master template before they get countersigned
Renewal tracking - Build a clean register of expiry dates and auto-renewal clauses from existing contract archives
Compliance check - Flag clauses that conflict with your data processing agreement, sanctions list, or LkSG due diligence requirements
Search - Natural language queries across the contract base (“which suppliers have liability caps under EUR 100,000?”)

4. Insurance Claims and Damage Reports

For Mittelstand industries with claims-heavy processes - logistics, insurance, manufacturing warranties - LLM-based IDP changes throughput. Photos, handwritten notes, multi-page reports, and witness statements all flow through the same pipeline.

Photo and handwriting in one pipeline - 95 percent accuracy on handwritten print, 91 percent on cursive²⁶
Damage classification - Categorise incident type, severity, and policy applicability from free-form descriptions
Cycle time - From days to hours for first-touch claims handling
Fraud signal - Cross-reference claim details against historical patterns and master data inconsistencies
Customer experience - Status update sent to claimant within minutes, not weeks

5. HR Documents (Applications, Sick Notes, Expense Receipts)

HR is a smaller volume but high-value category. CV screening, expense receipt extraction, and Krankmeldung handling all benefit from the same architecture. The compliance bar is higher because of the EU AI Act’s Annex III high-risk classification for hiring AI - but expense and absence handling stay in the lower-risk band.

Expense receipt extraction - Photos of receipts auto-coded against your travel policy and SAP Concur or DATEV
Sick note (eAU) handling - Even with electronic Krankmeldung mostly digital, paper notes still arrive; agent reconciles with absence calendar
Application screening - With explicit human-in-the-loop checkpoints for EU AI Act compliance
Onboarding documents - Process tax forms, banking details, and certificates from new hires before day one
Works council scope - Brief Betriebsrat early; most use cases are not co-determination relevant, but signalling matters

Document Type	Primary Metric	Typical ROI Timeline	Migration Complexity
Inbound Invoices	4x faster, 38% fewer errors	3-6 months	Low
Delivery Notes	3-way match automation	3-6 months	Medium
Contracts and NDAs	Clause extraction in hours	6-9 months	Medium
Insurance Claims	Days to hours cycle time	6-9 months	Medium-High
HR Documents	EU AI Act-compliant routing	6-12 months	High

See where IDP wins in your document flow

Book a 30-minute call. We will map your highest-ROI document type together.

Book a Demo →

Three-phase 90-day migration playbook from legacy OCR to LLM-based IDP

The 90-Day Migration Playbook

The companies that succeed with this migration share one trait: they pick one document class, run it in parallel with the legacy stack, and prove ROI before touching anything else. The companies that fail try to migrate three document types at once or replace the OCR core in week one. Here is the week-by-week breakdown that works.

Phase 1: Audit and Architect (Weeks 1-4)

Week 1: Document inventory - List every inbound document type, monthly volume, current handling system, and exception rate. Most Mittelstand AP teams under-count their volume by 30-50 percent because email attachments and supplier portals are not tracked.
Week 2: Steuerberater alignment - Bring your tax advisor in early. Walk them through the proposed audit trail, Verfahrensdokumentation, and DATEV export format. Their early sign-off saves weeks of rework later. Most are now familiar with AI-based IDP under the July 2025 GoBD update^7,22.
Week 3: ROI baseline - Measure current cost per document, exception rate, cycle time, and clerk hours. This is the number you will track every month after go-live. Without it, success arguments turn into opinion.
Week 4: Architecture decision - Cloud (Azure / AWS / GCP in Frankfurt or Dublin) vs on-premise. EU-only LLM (Mistral, Anthropic Claude EU, Azure OpenAI) vs open model on-prem (Llama, Qwen). Validation layer design. ERP and DATEV integration points. Audit logging.

Phase 2: Build and Run in Parallel (Weeks 5-8)

Week 5-6: Schema and validation build - Define the extraction schema in plain language. Build the validation layer (sum checks, supplier master data lookup, sanity bounds). Wire up the audit log. Connect to your DATEV export or ERP API. No production traffic yet.
Week 7: Shadow mode - Route a copy of every inbound document through the new pipeline. Compare the output against what the legacy OCR plus your AP clerks produced. Flag every disagreement. The clerks decide who was right - the data tells you the rest.
Week 8: Validation and tuning - For every disagreement, decide why. Schema gap, prompt issue, validation rule missing, model error. Fix in priority order. By end of week 8 you should have a documented agreement rate of 92-97 percent versus the existing process.

Phase 3: Soft Launch and Measure (Weeks 9-12)

Week 9: Soft cutover - LLM pipeline becomes the primary path for the chosen document class. Legacy OCR runs as fallback for low-confidence cases. Clerks now review only the cases the agent flagged, not every document.
Week 10-11: Full rollout - All inbound documents of the chosen class flow through the new pipeline. Daily monitoring on agreement rate, exception rate, and any hallucination flags. Weekly review with the AP team to capture what the system still misses.
Week 12: Measure and decide - Compare against the week-3 baseline. Document the savings. Present to the leadership team. Decide which document class migrates next - and whether the legacy IDP licence still earns its keep.

IDP Migration Readiness Checklist

You can name the document class with the highest exception cost
You have at least 6 months of historical document samples available
Your ERP or DATEV system has API access for booking automation
Your Steuerberater is aware and has been briefed
You have a process owner (typically Head of Finance or AP Manager) who will champion the pilot
Your IT team can allocate 8-12 hours per week during weeks 5-8
You have decided on the data residency target (EU cloud or on-premise)
You are willing to start with one document class, not three

Migrate Now

✓ Exception savings start immediately - the 25-30% manual review pile is hot money
✓ Mixed-format reality stays for years - sending obligation hits all firms only in 2028⁶
✓ Other document types benefit immediately - delivery notes, contracts, claims do not depend on E-Rechnung
✓ Architecture survives the format shift - structured invoices flow straight through without changes

Wait

✗ You keep paying the exception tax - typically EUR 30-90K/year for a 1,000-invoice/month team
✗ Vendor pricing leverage decreases - legacy IDP renewals get harder to negotiate
✗ Competitive cost gap widens - first movers cut clerk hours by 70%+ already
✗ Missed talent reallocation window - the clerks doing exceptions could be doing supplier work

E-Rechnung, GoBD, and the Compliance Layer

Document processing in Germany sits at the intersection of three regulatory regimes: E-Rechnung (the format mandate), GoBD (the archive and audit-trail rules), and GDPR/EU AI Act (the data and AI governance frame). LLM-based IDP can satisfy all three - but only if it is designed for them from week one.

E-Rechnung: What it actually mandates

The German E-Rechnung mandate took effect on 1 January 2025. From that date, every B2B receiver in Germany must be able to accept structured electronic invoices. The sending obligation phases in: large taxpayers (turnover above EUR 800,000) from 1 January 2027, and all firms by 1 January 2028^5,6.

Acceptable formats - XRechnung (pure XML, fully structured) and ZUGFeRD 2.1+ (PDF/A-3 with embedded XML). Both comply with EN 16931²¹.
What is NOT acceptable - Plain PDF, Word, image scans. These count as “sonstige Rechnungen” and after 2027/2028 stop satisfying the mandate for the sending firm
Transition reality - Until 2028, you will receive a mix of XRechnung, ZUGFeRD, and unstructured PDFs from non-compliant suppliers. IDP handles the bridge between them
Receiving infrastructure - Most ERP systems already support XRechnung receipt; the harder part is the routing, validation, and exception handling layer
Post-audit model - Germany did not adopt real-time clearance. Invoices flow B2B directly between trading partners. This is why IDP at the receiver remains essential²¹

GoBD: The Audit Trail Rule

The GoBD update from July 2025 explicitly accommodates AI-based document processing^7,22. The rules are not new - they have always required immutability, traceability, and Verfahrensdokumentation. What changed is how the BMF describes their application to AI workflows.

Original preservation - The original document (PDF, XML, photo) must be archived unchanged for 10 years. The AI agent works on a copy
Extraction logging - Every extraction must be reproducible. Log the document hash, the model version, the prompt version, the extracted fields, and the confidence scores
Verfahrensdokumentation - Document the process. Who reviews exceptions, how disagreements are resolved, what triggers escalation, who has access
Change tracking - If a clerk corrects an extracted field, the change must be timestamped, attributed, and tied to the original. Most ERP audit logs already do this
Period of retention - Booking-relevant documents and the related extraction artefacts: 10 years

The GoBD-Friendly Architecture

The agent does not replace your archive. It sits in front of it. The original document goes to the GoBD-compliant archive (DocuWare, ELO, OpenText, M-Files, or comparable). The extraction artefact - prompt, model version, fields, confidence - goes to a structured log linked by document hash. The DATEV export carries only the structured fields. Three systems, one audit chain.

GDPR and EU AI Act

Most IDP is “minimal risk” - Invoice processing, delivery notes, expense receipts fall outside the EU AI Act’s high-risk Annex III categories¹⁹
HR document processing is more complex - CV screening for hiring is high-risk; expense receipt extraction is not. Classify each use case individually
Data residency is a design choice, not a default - Anthropic Claude (EU), Azure OpenAI (Germany / France), Mistral (FR), and on-prem open models all keep data in the EU
AI literacy - Article 4 of the EU AI Act requires AI literacy training for staff who interact with AI systems, including AP clerks who review IDP exceptions
Personal data handling - Vendor invoices contain personal data (signatures, names of contact persons). Standard DPA, GDPR Art. 28, and Schrems II considerations apply to the LLM provider

Regulation	Applies When	What IDP Must Do
E-Rechnung	B2B invoices from 1 Jan 2025 (receive), 2027/2028 (send)^5,6	Accept XRechnung and ZUGFeRD 2.1+ structurally; IDP for everything else
GoBD (July 2025 update)	Booking-relevant documents	Original archive, extraction log, Verfahrensdokumentation, change tracking
GDPR	Documents containing personal data	Lawful basis, EU data residency or DPA, retention policy
EU AI Act (Aug 2026)	HR / safety / credit AI = high-risk; AP / delivery = minimal¹⁹	Classify, document, train staff (Art. 4 literacy)
BetrVG (Betriebsrat)	HR-relevant or performance-monitoring use	Co-determination consultation under Section 87

“Artificial intelligence has achieved its breakthrough in the German economy. Companies have not only recognised the possibilities of AI, they are deploying AI and investing.”

- Dr. Ralf Wintergerst, President of Bitkom²⁰

Vendor Landscape: ABBYY, Tungsten, Hyperscience, and Custom

Gartner’s first Magic Quadrant for IDP (September 2025) named ABBYY, Hyperscience, and Tungsten Automation as Leaders, alongside Infrrd^2,8,9,10. The market has more than 100 vendors offering full or partial IDP capabilities. For a Mittelstand buyer, the realistic short list is much smaller. Here is the honest comparison.

The four real options

Stay on legacy templated OCR - ABBYY FineReader, Tungsten KTM (former Kofax), OpenText. Mature, GoBD-friendly, expensive on extension. The default for most Mittelstand AP teams today
Upgrade to legacy IDP suites - ABBYY Vantage, Tungsten TotalAgility, Hyperscience. These have added LLM capabilities to their existing platforms. Best for companies who want one vendor and accept platform lock-in
Cloud-native LLM IDP - Rossum (with proprietary LLM Aurora), Extend, Reducto, Klippa. Newer entrants built on LLM-first architectures. Faster to deploy, cleaner for new document types, less mature on enterprise compliance
Custom LLM-based agent - Build the layer 3 understanding agent yourself (or with a partner) on top of Anthropic Claude, OpenAI, Mistral, or open models. Maximum flexibility, lowest licence cost, requires the most internal capability to run

Feature comparison

Capability	Legacy OCR	Legacy IDP Suite	Cloud LLM IDP	Custom Agent
Time to first value	Already running	3-9 months	4-12 weeks	8-12 weeks
Annual cost (1K invoices/mo)	EUR 30-80K	EUR 80-200K	EUR 36-120K	EUR 24-90K + build
Exception handling	Fails on layout shift	Improving	Strong	Strongest (custom validators)
New doc type onboarding	2-5 days/template	1-3 days	Hours	Hours
GoBD readiness	Excellent	Excellent	Good - needs check	Designed in
EU data residency	Native	Native	Configurable	Designed in
Vendor lock-in	Medium	High	Medium	Low
Best for	Stable high volumes	One-vendor-fits-all buyers	Fast pilots, mid volumes	Strategic, long-term

Legacy IDP Suite

✓ Single vendor - one contract, one support line
✓ Mature compliance - GoBD, audit, archive built in
✓ Established support patterns - certified partners, Steuerberater familiarity
✗ Highest annual cost - EUR 80-200K typical
✗ Slowest to extend - new doc types follow vendor roadmap
✗ Platform lock-in - exit cost rises every year

Custom Agent

✓ Lowest run cost - typically 50-70% under legacy IDP
✓ Fastest extension - new doc types in hours
✓ Model-agnostic - swap LLM provider as the market moves
✓ Full control - own the validation logic, prompts, and audit trail
✗ Requires partner or in-house capability - not self-serve
✗ Compliance design from scratch - GoBD, audit, archive must be built

How Superkind Fits

Superkind builds custom AI agents for SMEs and enterprises. For document processing, that means an LLM-based agent layer that sits on top of your existing OCR, ERP, and DATEV stack - not a replacement platform. The starting point is always your real document flow, including the messy 30 percent.

Process-first discovery - We sit with your AP team for two days, watch the exception queue in real time, and document why each disagreement happens. Templates and slide decks come second
Sits on top of your stack - The agent connects to your existing OCR, ERP (SAP, Microsoft Dynamics, custom), DATEV, and document archive (DocuWare, ELO, M-Files, OpenText). No rip-and-replace
EU-first architecture - LLM provider chosen for EU residency. Data flows through Frankfurt, Dublin, or Paris. Schrems II-clean by default. Open-model on-premise option for sensitive contracts
GoBD designed in - Original archive, extraction log, Verfahrensdokumentation template, audit trail - all delivered as part of the build, not a follow-on project
Live in 90 days - One document class, week-by-week from audit to production, with parallel running against your existing OCR for the entire build phase
Outcomes, not licences - No large upfront licensing. Pricing is per use case, tied to the exception-rate KPI agreed in the assessment phase
Continuous improvement - The agent gets sharper every week from the corrections your AP team makes. Quarterly model refresh keeps you on the current generation without a re-platforming project
Cross-document scalability - Once invoices are live, delivery notes and contracts re-use the same integration layer. Each new document type takes 4-6 weeks instead of 90 days

Approach	Legacy IDP Vendor	Superkind
Discovery	Slide-based workshops	On-site shadowing of your AP exceptions
Architecture	Replace your OCR / IDP platform	Add LLM layer on top of existing OCR
Delivery	6-12 month implementation	90 days to first production
Pricing	Seat licences + page fees	Per use case, tied to exception-rate KPI
EU residency	Configurable, with caveats	Designed in (Frankfurt / Dublin / Paris / on-prem)
Compliance	Vendor-template Verfahrensdokumentation	Custom Verfahrensdokumentation, Steuerberater-aligned
After launch	Support contract	Continuous tuning and document-class expansion

Pros

✓ Adds, does not replace - your existing OCR keeps doing what it does well
✓ Fast time-to-value - first document class in production in 90 days
✓ EU-first by design - data residency is the default, not an upsell
✓ Outcome-based pricing - tied to the exception-rate KPI
✓ Compliance-ready - GoBD architecture and Verfahrensdokumentation built in

Cons

✗ Not a self-serve platform - requires engagement with our team
✗ Capacity-limited - we work with a focused number of clients at a time
✗ Not for fewer than 500 documents/month - the math does not work below that
✗ Requires document access - we need to see the real exception queue, not synthetic samples

Decision Framework: Is It Time to Migrate?

Not every Mittelstand company should migrate to LLM-based IDP today. Here is a concrete framework based on document volume, exception cost, and current vendor situation.

Signal	What It Means	Action
You process 500+ inbound documents per month	Strong candidate - the volume math works	Start with a 90-day pilot on the highest-exception document class
Your exception rate is 20%+ on legacy OCR	Hot money - manual review is the biggest cost line	Quantify the exception cost; build the business case from that number
Your supplier mix changes more than once a quarter	Templated systems are losing the race against format churn	Move to a context-based system before the template count hits 1,000
Your IDP licence is up for renewal in 12 months	Best negotiation window - pilot the alternative now	Run the LLM pilot in parallel; let the data inform the renewal decision
Your AP team is stuck on hiring backfills	The exception queue is now a hiring problem	Automate the routine exceptions to free clerk time for higher-value work
You have fewer than 200 documents per month and stable suppliers	Migration cost likely exceeds the savings right now	Hold; revisit when volume crosses 500 or supplier churn picks up

Acting Now

✓ Exception savings start in 90 days - typically 60-80% reduction in manual review
✓ EU AI Act readiness - design for August 2026 deadline now, not under pressure later
✓ Better licence renewal terms - having a working alternative shifts the conversation with the legacy vendor
✓ Clerk time reallocation - the labour shortage means there is more useful work than people

Waiting

✗ Exception tax keeps compounding - typically EUR 30-90K per year per document class
✗ Talent drain accelerates - clerks leave faster when the work is exception cleanup
✗ Format churn keeps adding templates - each one a future maintenance liability
✗ EU AI Act under pressure - August 2026 deadline arrives whether you started in time or not

Frequently Asked Questions

OCR converts pixels into characters. IDP reads, understands, and extracts structured data from documents - then takes action on it. Modern IDP combines OCR with large language models (LLMs) to handle layout variation, multi-language text, handwritten notes, and exception cases that legacy OCR cannot. The output is not a text dump, it is structured fields that flow into your ERP, DATEV, or workflow tools.

No. Most successful Mittelstand migrations keep the legacy OCR core for high-volume printed documents (where it already runs at 99 percent accuracy) and add an LLM-based agent for the messy 30 percent: handwritten attachments, exception loops, multi-page contracts, and non-standard formats. The two systems run in parallel for 6-12 months before any decision to consolidate.

Yes, if you keep the original document, log every extraction, and store the audit trail in a GoBD-conform archive. The GoBD update from July 2025 explicitly accommodates AI-based processing as long as the Verfahrensdokumentation describes the process, the original is preserved unaltered, and changes are traceable. The AI agent does not replace your archive - it sits in front of it.

E-Rechnung does not eliminate IDP - it shifts it. From 1 January 2025, all German B2B receivers must accept structured invoices (XRechnung, ZUGFeRD 2.1+). But for years to come, you will still receive PDFs from non-compliant suppliers, plus delivery notes, contracts, claims, and inbound mail that fall outside the mandate. IDP becomes the bridge: structured invoices flow straight through, everything else still needs intelligent reading.

For printed text, modern LLMs match legacy OCR at 95-99 percent. For handwriting, GPT-5 reaches around 95 percent in clean conditions, while ABBYY measures 95.2 percent on handwritten print and 91.7 percent on cursive. The bigger advantage is exception handling: where legacy OCR fails on layout variation and routes the document to manual review, an LLM can read context and continue.

A focused 90-day deployment is realistic for one document type and one downstream system. Phase 1 (weeks 1-4) is process and data audit. Phase 2 (weeks 5-8) is build and parallel testing alongside the current OCR setup. Phase 3 (weeks 9-12) is production rollout for one document class. Adding more document types later takes 4-6 weeks each, because the integration layer and governance are already in place.

For a Mittelstand pilot covering one document class (typically incoming invoices), expect EUR 60,000 to EUR 150,000 for build and integration, plus monthly running costs of EUR 2,000 to EUR 8,000 depending on volume. That compares to EUR 50,000-EUR 200,000 in annual licence and maintenance fees for legacy IDP suites. ROI on a 90-day pilot is typically 30-200 percent in year one.

Yes. Modern LLMs were trained on multilingual corpora that include German tax forms, delivery notes, and DATEV exports. The accuracy gap between English and German extraction is now under 2 percentage points for most document types. The remaining work is prompt engineering, schema design, and the validation loop - not the underlying language model.

They do not have to. Most enterprise LLM offerings are available in EU regions (Frankfurt, Dublin, Paris). Anthropic Claude, OpenAI Enterprise, Mistral, and Microsoft Azure all offer EU-only deployments with no data leaving the region. For sensitive contracts, you can run smaller open models (Llama, Mistral, Qwen) entirely on-premise. The architecture decision matters more than the model choice.

This is exactly where LLM-based IDP outperforms legacy OCR. Where templated systems break the moment a supplier moves the VAT field, an LLM extracts based on context and meaning, not coordinates. New supplier formats become a one-line schema update instead of a new template. The ROI on this single capability is often enough to justify the migration on its own.

Bring them in during week 2 of the assessment phase, not at the end. Your Steuerberater needs to validate the audit trail, the Verfahrensdokumentation, and the DATEV export format. Most German tax advisors are now familiar with AI-based IDP, but they will want to see how exceptions are flagged, who signs off on edge cases, and how the original document is preserved. Their early sign-off saves weeks later.

Yes, if the AI processes employee documents (HR, expenses, sick notes) or changes how clerks work. The Betriebsrat has co-determination rights under Section 87 BetrVG when IT systems monitor performance. Most IDP deployments are not in scope, but it is faster to brief the works council early and document that the AI replaces task time, not headcount, than to discover the issue at go-live.

They become exception handlers and quality reviewers. With 70-80 percent of documents flowing straight through, the remaining 20-30 percent are the complex cases that need human judgement. Most companies redeploy clerks to higher-value work like supplier negotiation, dispute resolution, or master data quality - not redundancy. The Mittelstand labour shortage means there is more useful work to do than there are people to do it.

Sources

Henri Jung

Co-founder of Superkind, where he helps SMEs and enterprises deploy custom AI agents that actually fit how their teams work. Henri is passionate about closing the gap between what AI can do and the value it creates in real companies. Before Superkind, he spent years working with mid-sized businesses on digital transformation and saw first-hand how many AI projects fail because they start with technology instead of process. He believes the Mittelstand has everything it needs to lead in AI - it just needs the right approach.

Ready to migrate one document class?

Book a 30-minute call with Henri. We will identify your highest-exception document type and outline a 90-day migration plan - no commitment, no sales pitch.