Back to Blog

Vibe Coding for the Mittelstand: When Your Finance Team Suddenly Ships Software

Henri Jung, Co-founder at Superkind
Henri Jung

Co-founder at Superkind

Industrial control panel with one orange-ringed button representing citizen developers shipping software

Last quarter your controller built a forecasting tool in Lovable. A salesperson wired up a quote calculator in Cursor over a weekend. Someone in QM used Microsoft Copilot to generate a Power App that now schedules audits. None of these people can read the code that actually runs. None of them filed a ticket with IT. All of them are happier than they have been in years.

Welcome to vibe coding. The term entered the lexicon in February 2025 when AI researcher Andrej Karpathy described a new way of working: describe what you want, accept what the model produces, ship before you understand it1. Twelve months later, 92 percent of US developers reported using some form of vibe coding in their workflow24. Gartner forecasts that by 2026, 80 percent of low-code and AI-assisted development tool users will sit outside formal IT departments9.

For the German Mittelstand this is the most disruptive shift in internal software since SAP. It is not a question of whether your business teams will start building their own software - they already are. The question is whether your IT department leads the wave or gets buried by it. This guide is the operating model for the second option turning into the first.

TL;DR

Vibe coding is real and already in your company - 80 percent of low-code users will be non-IT by 2026, and tools like Cursor, Lovable, Replit Agent, and Power Apps with Copilot are inside Mittelstand offices today.

The risk is not technology, it is governance - Gartner forecasts a 2,500 percent increase in software defects from prompt-to-app citizen development by 2028 if left ungoverned, and 40 to 62 percent of AI-generated code samples already contain security vulnerabilities.

The right answer is the three-lane model - sandbox for experiments, production lane with a quality gate, critical lane for professional engineering.

90 days is enough to establish the policy, the sandbox tenant, the quality gate, and the first cohort of trained citizen developers.

The Mittelstand opportunity is asymmetric - your IT teams are too small to build everything, your business teams know the processes best, and the labour shortage is not slowing down. Vibe coding is how the gap closes.

The Vibe Coding Wave Has Arrived in the Mittelstand

Most German IT leaders still talk about citizen development as something on the horizon. The data says otherwise. The shift has already happened, mostly without IT noticing.

  • 80 percent of low-code users non-IT - Gartner predicts that by 2026, developers outside formal IT departments will account for at least 80 percent of the user base for low-code development tools, up from 60 percent in 20219.
  • Citizen developers outnumber professional ones - At large enterprises, citizen developers will outnumber professional software developers by a 4 to 1 ratio in 2026 according to Gartner9.
  • 70 percent of new enterprise apps built outside IT - The same Gartner research projects that 70 percent of new enterprise applications by 2026 will be built by citizen developers rather than traditional IT teams9.
  • The market is exploding - Gartner expects the low-code development technologies market to reach 44.5 billion US dollars by 20268, with vibe-coding-native tools (Cursor, Lovable, Replit) growing the fastest within that.
  • 92 percent of developers vibe-code - Early 2026 surveys find 92 percent of US-based professional developers have adopted some form of vibe coding in their workflows24. This is no longer a niche practice.
  • 25 percent of Y Combinator W25 founders ship 95 percent AI-generated code - Inside the Y Combinator Winter 2025 cohort, a quarter of the startups had codebases that were 95 percent AI-generated3. The economic pattern of new software is already vibe-coded.
  • Karpathy himself moved on - In early 2026 the inventor of the term declared vibe coding passé in favour of agentic engineering, where senior engineers orchestrate multiple coding agents under oversight3. The Mittelstand is mostly still catching up to vibe coding while the leading edge already shifted.

Key Data Point

Gartner forecasts that prompt-to-app approaches adopted by citizen developers will increase software defects by 2,500 percent by 2028, triggering a software quality and reliability crisis5. The vibe coding wave is real. The defect wave is coming if no one builds the dam.

And the Mittelstand context makes the wave land harder than in larger companies. German SMEs already report extreme IT capacity constraints, the labour shortage is structural rather than cyclical, and the business teams have a long history of solving their own problems with Excel and Access. Vibe coding is the next step in that tradition - whether IT participates or not.

Indicator2026 RealitySource
Low-code users outside IT80% (up from 60% in 2021)Gartner9
Citizen-to-pro developer ratio4:1 in large enterprisesGartner9
Low-code market sizeUSD 44.5 billionGartner / InfoWorld8
Developers using vibe coding92% (US, early 2026)Industry survey24
YC W25 startups with 95% AI code25% of cohortThe New Stack3
Predicted software defect increase2,500% by 2028Gartner5
Companies hit by Shadow AI incidents40% by 2030Gartner7

“There’s a new kind of coding I call vibe coding, where you fully give in to the vibes, embrace exponentials, and forget that the code even exists.”

- Andrej Karpathy, AI researcher and former Director of AI at Tesla, on X, 2 February 20251

What Vibe Coding Actually Is (and What It Is Not)

Vibe coding is not low-code, not no-code, and not the AI-pair-programming most engineers already use. It sits in its own quadrant. The boundary matters because the right governance for vibe coding is different from the right governance for low-code platforms.

The four quadrants of build

  • Pro code - Engineers writing code in an IDE, using AI as autocomplete or pair programmer (GitHub Copilot, Cursor in tab-complete mode). The engineer reads, reviews, and ships every line.
  • Agentic engineering - Engineers orchestrating multiple AI coding agents (Claude Code, Codex, Cursor in agent mode), reviewing and integrating their output. Karpathy’s 2026 evolution of vibe coding for production work3.
  • Low-code / no-code - Business users (or developers) building apps through visual canvases backed by predefined components (Power Apps, Mendix, OutSystems, Bubble, Airtable). Constrained by the platform, but the platform takes care of most of the engineering.
  • Vibe coding - Users describing what they want in natural language to an AI tool that generates real code, with the user often not reading the result (Lovable, Bolt.new, Replit Agent, v0, Power Apps with Copilot in chat mode). Unconstrained by predefined components, accelerated by the model, accountable to no one in the original definition.
ApproachWho buildsConstraintRisk profile
Pro code with AI assistEngineersNone - real codeStandard SDLC
Agentic engineeringSenior engineersSpec quality + review capacityStandard SDLC, faster
Low-code / no-codeBusiness + ITPlatform componentsPlatform-bounded
Vibe codingAnyoneNone - real codeUnbounded without governance

What vibe coding feels like in practice

Concrete example. A controller at a Mittelstand machinery firm wants a tool that pulls daily sales orders from SAP, projects revenue against the budget, and emails the leadership team a one-pager every morning. Six months ago this was a quarter-long IT project. Today the controller opens Lovable, types a four-paragraph description, watches the tool generate a full-stack web app with a database and email integration, connects it to a sandbox SAP export, and ships it Monday morning. Total time: two evenings.

The controller does not know what programming language was used. The controller cannot tell whether the SQL is parameterised or vulnerable to injection. The controller’s manager is delighted by the new dashboard. IT learns about it three weeks later when someone in security notices an unfamiliar app calling the SAP REST gateway.

The defining feature

Vibe coding is defined by what the user does not do: read the code. The user accepts the AI output, runs it, and ships if it works. Karpathy himself wrote in the original tweet: “I ‘Accept All’ always, I don’t read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it.”1

Why the Mittelstand Cannot Ignore This

For the Mittelstand specifically, three structural pressures make vibe coding less optional than for larger firms.

  1. The IT bottleneck is structural, not cyclical - Most Mittelstand IT teams operate at 1 to 2 percent of headcount, compared to 4 to 6 percent at larger firms. Internal app requests pile up while the small IT team triages keep-the-lights-on work. Vibe coding redirects the long tail of small requests to the people who actually need them.
  2. The skilled labour shortage is permanent - The DIHK reports Germany needs 300,000 skilled foreign workers per year just to maintain current staffing19, and the OECD projects the working-age population will shrink by 3.9 million by 203020. The ifo Institute reports 28.3 percent of German companies cannot find enough qualified workers18. There is no version of the next decade where IT teams grow fast enough to build everything the business asks for.
  3. Process knowledge sits in the business - The Mittelstand’s strength is deep operational expertise. The controller knows controlling, the production planner knows production, the sales engineer knows the deal flow. Forcing them to translate that knowledge through a ticket system into IT is exactly where most internal software projects die. Vibe coding lets the domain expert build directly.
  4. Existing low-code investments are already there - 88 percent of German Mittelstand firms have Microsoft 365 licences10. Power Apps with Copilot, Power Automate with Copilot, and Copilot Studio are available the moment a company turns them on. Most do not even know.
  5. Competitive pressure from agile firms - The hidden champions that compete on operational excellence will lose to firms that ship 5 internal tools a quarter when they ship 5 a year. The compounding gap closes faster than any other digital transformation lever.

The Mittelstand asymmetry

Large enterprises will adopt vibe coding cautiously because they have the IT capacity to build the long tail anyway. The Mittelstand has no such capacity. The asymmetry: vibe coding done well is more valuable to a 200-person firm than to a 20,000-person one - and the cost of not adopting is higher.

What changes for the Geschäftsführer

The deepest change is not technological, it is organisational. The Mittelstand has historically separated those-who-think-up-software (the business) from those-who-build-it (IT). Vibe coding collapses the distance to a few hours. That has three downstream effects.

  • The IT roadmap stops being the bottleneck - Departmental tools that used to be deferred for two years now ship in two days. IT moves up the value chain to integration, security, and the agents that need to be reliable.
  • Spec design becomes the strategic skill - The new bottleneck is the quality of the description. Senior people who can write a clear three-page spec become 10 times more productive. People who cannot articulate what they want struggle even with vibe coding tools.
  • Quality bar becomes a leadership question - The Geschäftsführer cannot delegate the question of which apps run the business and how they are tested. The wrong answer is to ban vibe coding. The right answer is to define the bar at which a tool moves from sandbox to production.

Want a quality gate for your citizen-built apps?

We help Mittelstand IT teams design the three-lane model and the agentic-engineering layer that catches what vibe coding cannot.

Book a Demo →

5 Use Cases That Actually Work

Not every internal tool benefits from vibe coding. The pattern that works is small, departmental, low-stakes, fast-moving. The pattern that fails is anything customer-facing, finance-critical, or data-sensitive without a hardening pass. Here are five use cases that consistently land in the Mittelstand.

Use case 1: Departmental dashboards and reports

Daily operations dashboards, weekly KPI reports, ad-hoc analyses that the BI team would take six months to deliver. The data lives in SAP, DATEV, HubSpot, or a SharePoint export. The user wants a focused view, not a generic BI tool.

  • Typical builders - Controllers, sales operations, plant managers, HR business partners.
  • Typical tools - Lovable, v0, Cursor, Power Apps with Copilot, Hex.
  • Typical timeline - 2 to 8 hours from idea to working prototype.
  • Where governance matters - Read-only access to production data, no writebacks, no PII unless reviewed.
  • Realistic ROI - 4 to 8 hours per week saved per dashboard, often replacing manual Excel work.

Use case 2: Internal request and approval flows

Holiday requests, capex approvals, supplier onboarding intake, IT access requests, training requests. Most run today through email threads or generic ticketing. A vibe-coded form-plus-workflow ships in days and integrates with Teams or Slack for approvals.

  • Typical builders - HR coordinators, office managers, IT helpdesk staff, internal communications.
  • Typical tools - Power Apps with Copilot, Bubble, Glide, custom Lovable apps.
  • Typical timeline - 1 to 3 days for the first working version.
  • Where governance matters - Approval chains touching financial limits or HR data must be reviewed by IT or compliance.
  • Realistic ROI - 30 to 60 percent reduction in cycle time for the workflow, plus a clean audit trail.

Use case 3: Quality and operations checklists

Shift handover forms, quality inspection lists, tool calibration logs, machine setup checklists. Most are still on paper or in PDF templates. Vibe-coded mobile apps replace them with timestamped digital records that integrate with the QMS.

  • Typical builders - QM officers, shift leads, plant engineers.
  • Typical tools - Power Apps with Copilot, AppSheet, Glide, custom Lovable apps.
  • Typical timeline - 1 week including pilot on the shop floor.
  • Where governance matters - Records that feed ISO 9001 or IATF 16949 audits must be tamper-evident and exportable.
  • Realistic ROI - 60 to 80 percent time saving on data capture, plus dramatically better audit-readiness.

Use case 4: Customer-data lookup and prep tools

Sales reps before a call need a one-page customer brief. Account managers preparing a QBR want a quick scorecard. Service techs en route to a job want the install history. Vibe-coded read-only apps over a sandbox CRM export deliver this in days.

  • Typical builders - Sales operations, service operations, key account managers.
  • Typical tools - Cursor, Lovable, v0 over a CSV or REST endpoint.
  • Typical timeline - 3 to 7 days for the first working version.
  • Where governance matters - Read-only by default, refreshed nightly, pulled from a sanctioned data source. Writebacks require IT involvement.
  • Realistic ROI - 1 to 2 hours per rep per week, better customer interactions, fewer copy-paste errors.

Use case 5: Throwaway analyses and prototypes

The single most underrated use case. Someone wants to test whether a new pricing model works. Someone wants to see if a German labelling regulation breaks the existing product taxonomy. Someone wants to mock up a partner integration before a kickoff meeting. Vibe coding for one-week-throwaways is almost pure upside.

  • Typical builders - Product managers, strategy team, internal consulting, the CEO’s office.
  • Typical tools - Lovable, Replit Agent, v0, Cursor.
  • Typical timeline - 4 hours to 3 days, usually thrown away after.
  • Where governance matters - Synthetic data only, sandbox tenant, automatic 30-day expiry. Almost no governance friction needed.
  • Realistic ROI - The team learns 3 to 5 things they would not have learned without the prototype, often saving a much larger engineering investment later.
Use caseBuilderTime to shipLane
Departmental dashboardsControllers, ops2-8 hoursSandbox → Production
Approval flowsHR, IT, office1-3 daysProduction with gate
QM checklistsQM, plant leads~1 weekProduction with gate
Customer lookup toolsSales, service ops3-7 daysProduction with gate
Throwaway analysesStrategy, PM, CEO4 hours - 3 daysSandbox
Three stacked dark metal cubes ascending diagonally with one orange ring representing the three-lane governance model

5 Failure Modes to Avoid

The use cases above land when governance is in place. Without governance, the same tools cause five predictable failure modes. Anticipate them before the first prototype ships.

Failure mode 1: The vulnerable app in production

The most consequential failure. A vibe-coded internal app handles employee performance data, customer information, or supplier financials. Independent code-security studies consistently find that 40 to 62 percent of AI-generated code samples contain security vulnerabilities, and that AI-written code produces flaws at 2.74 times the rate of human-written code15. Without a quality gate, the chance of a vulnerable app reaching production is uncomfortably high.

  • Symptoms - SQL injection, exposed API keys in client-side code, missing authentication, broken access control, hardcoded credentials.
  • Mitigation - Production gate with automated SAST scan, secret detection, and an actual human review for anything that touches sensitive data.
  • Critical rule - No vibe-coded app touches PII, financial data, or production write paths until it has passed the gate.

Failure mode 2: The ghost app graveyard

The team builds 40 prototypes in six months. 32 are abandoned within weeks. 5 are used by exactly one person. 3 become real dependencies for parts of the business. No one tracks which is which. The team that built them is now busy elsewhere. When something breaks, no one knows whether it matters.

  • Symptoms - Apps with unclear ownership, unmaintained dependencies, unknown user counts, dead URLs, unanswered support requests.
  • Mitigation - 90-day expiry on every sandbox app, mandatory named owner for every production app, quarterly inventory with usage telemetry, automatic archival below a usage threshold.
  • Critical rule - Build the gravedigger from day one. The hardest cultural shift is making it OK to delete things people built.

Failure mode 3: Shadow AI everywhere

Most worrying for compliance. Business teams sign up for Lovable, Replit, Cursor, and ChatGPT on personal accounts because IT was too slow to provide them. Customer data flows to vendor APIs that have no DPA. Gartner predicts 40 percent of organisations will suffer security or compliance incidents from shadow AI by 20307.

  • Symptoms - Employees mentioning tools you have not procured, payment receipts on personal cards, customer data appearing in third-party logs, GDPR exposure no one mapped.
  • Mitigation - Provide sanctioned tools fast. The fastest path is a small but well-resourced sandbox with Copilot for Microsoft 365, Cursor for IT champions, and a pilot Lovable or Power Apps tenant for citizen developers, all on company SSO.
  • Critical rule - You do not stop shadow AI by banning AI. You stop it by being faster than the personal-account alternative.

Failure mode 4: The integration debt avalanche

Every vibe-coded app integrates with SAP, DATEV, or another core system in its own way. Some hit REST endpoints, some scrape exports, some embed credentials, some go through Power Automate, some go through homemade Python scripts. Six months in, no one understands how data flows between systems.

  • Symptoms - Multiple apps making the same SAP calls with different logic, inconsistent customer or product master data across tools, integration bugs no one can trace.
  • Mitigation - IT publishes a small, well-documented set of read-only data products that any app can connect to. Citizen developers consume those instead of integrating directly.
  • Critical rule - The integration layer is non-negotiable IT territory. Citizens build on top of it, not around it.

Failure mode 5: The skills two-class society

Subtle but corrosive. The vibe coding programme empowers business teams who happen to be comfortable describing what they want. The teams that struggle - older workers, technical roles with weaker writing skills, non-native German speakers - get left behind. Within a year, the company has a clear two-class structure of those who ship and those who do not.

  • Symptoms - The same five departments produce most of the apps, others stop trying, complaints about IT favouritism, fairness questions from the Betriebsrat.
  • Mitigation - Active enablement, not just access. Office hours, paired sessions, written guides, multilingual training. Treat citizen development like an apprenticeship system, which the Mittelstand already understands.
  • Critical rule - The fairness question is real. Plan for it in the rollout, not after the first internal complaint.

Vibe Coding Without Governance vs With Governance

With governance

  • 5x to 20x faster delivery of departmental tools
  • Security risk contained by quality gate before production
  • Shadow AI displaced by sanctioned, faster alternatives
  • IT capacity reallocated to integration and agents
  • Audit trail intact for ISO, GDPR, EU AI Act

Without governance

  • 40 to 62% of apps ship with vulnerabilities15
  • 2,500% defect increase projected by 20285
  • Ghost-app graveyard growing each quarter
  • Compliance exposure on PII, GoBD, GDPR
  • Two-class workforce on building access

Vibe Coding vs Agentic Engineering: The Quality Bar Question

The most useful framing for the Mittelstand IT leader is the distinction between vibe coding and agentic engineering. Karpathy, who coined vibe coding, declared the term passé in 2026 and now prefers agentic engineering3. The distinction is the difference between raising the floor and raising the ceiling.

  • Vibe coding raises the floor - Anyone can ship something. The bar for the first prototype drops from weeks to hours. The risk is that the bar for what reaches production drops with it. Use vibe coding for sandbox, prototype, and throwaway work.
  • Agentic engineering raises the ceiling - A senior engineer orchestrates 3 to 10 AI coding agents under explicit oversight, reviewing diffs, designing specs, and accountable for what ships. Production code lands 5 to 10 times faster than traditional development without the floor dropping. Use agentic engineering for production, integration, and anything that needs to be reliable.
  • Both are real, both are needed - The Mittelstand needs both lanes. Vibe coding handles the long tail of departmental tools. Agentic engineering handles the spine of integration, security, and the agents that the business actually depends on.

“As developers, we can’t assume that the generated code is secure by default.”

- Janet Worthington, Senior Analyst at Forrester2

What changes in the engineering workflow

Agentic engineering changes how IT teams operate. The senior engineer becomes a director of multiple agents rather than a single author of code. Reviewing diffs and writing specs becomes 60 to 80 percent of the day. Junior tasks largely disappear. The role of the team lead shifts from work-breakdown to spec design and quality gates.

DimensionVibe codingAgentic engineering
BuilderAnyoneSenior engineer
Code reviewed?Often notAlways
Tests written?SometimesAlways, often agent-generated then reviewed
Production-ready?RarelyYes
Time to first prototypeHours1-3 days
Time to productionNeeds hardening pass1-3 weeks
Best forDepartmental tools, prototypesIntegration, security, agents
Risk profileBounded by sandboxSame as traditional engineering

The two-track Mittelstand IT team

The successful Mittelstand IT team in 2026 runs two tracks. Track 1 enables vibe coding for the business. Track 2 practises agentic engineering for the spine of the company. The same team, different lanes, different bars. Trying to apply the vibe coding bar to production work is how 40 to 62 percent vulnerability rates happen.

The Three-Lane Governance Model

The simplest, most defensible governance model for vibe coding in the Mittelstand has three lanes. It maps cleanly to existing IT processes, the Betriebsrat usually accepts it, and IT leaders can explain it to the Geschäftsführer in one slide.

Lane 1: The Sandbox

  • Who can ship - Any trained employee with a sandbox account.
  • What runs there - Prototypes, dashboards on synthetic or sandbox data, throwaway analyses, first versions of departmental tools.
  • Data access - Sandbox copies of production data only, refreshed nightly, no PII unless explicitly approved, no write-back to production systems.
  • Tooling - Lovable, Cursor, Replit Agent, Power Apps with Copilot, all on a single tenant with SSO.
  • Quality gate - None. Speed is the point.
  • Lifetime - 90 days. After 90 days the app expires unless promoted.
  • Owner - The builder.

Lane 2: Production

  • Who can ship - Trained citizen developers + IT champions, after passing the quality gate.
  • What runs there - Departmental tools that actually run the business: dashboards on live data, approval flows, QM checklists, customer lookup tools.
  • Data access - Live data through sanctioned data products published by IT. No direct database connections. Writeback only with explicit IT approval.
  • Tooling - Same as sandbox plus IT-managed deployment (Vercel, Azure App Service, Power Apps managed environments).
  • Quality gate - Automated SAST + secret scan, dependency review, manual security review for anything touching sensitive data, basic test coverage required.
  • Lifetime - Unlimited but reviewed quarterly. Apps below a usage threshold (e.g. fewer than 5 active users for 60 days) get archived.
  • Owner - Named individual + named team. Documented in the citizen-app inventory.

Lane 3: Critical Systems

  • Who can ship - Professional engineers using agentic engineering practices. No citizen development here.
  • What runs there - Integration layer, agents that act on production systems, anything customer-facing, anything affecting financial reporting, anything classified as high-risk under the EU AI Act, anything regulated under GoBD.
  • Data access - Full, with proper IAM and audit trails.
  • Tooling - Cursor in agent mode, Claude Code, Codex, internal AI agent platforms. Code review by a human in every PR, full SDLC, full SecOps integration.
  • Quality gate - Standard enterprise SDLC: design review, security review, testing, staging, change approval.
  • Lifetime - Maintained as core IT.
  • Owner - IT.
LaneBuilderDataGateLifetime
1. SandboxAny trained userSandbox / syntheticNone90 days
2. ProductionTrained citizen + championLive via data productsSAST + manual reviewQuarterly review
3. CriticalPro engineerFullStandard SDLCMaintained

The critical rule

Apps move up a lane through promotion, never sideways. A sandbox prototype that the team likes does not become production by being used a lot. It becomes production by passing the quality gate. The most common Mittelstand failure is to skip this step because “it already works”. It works until it does not.

The 90-Day Rollout Playbook

The full programme described above takes 6 to 9 months to mature. The 90-day version is the smallest meaningful unit of work that gets a Mittelstand company from no policy to a working three-lane model with first production apps.

Phase 1: Days 1-30 - Discovery and policy

  1. Inventory shadow AI usage - One-week survey across all departments. Which AI coding tools are already in use, on which accounts, with what data? Often a long list. The result becomes the legitimisation argument with the Geschäftsführung.
  2. Identify the 5 highest-value use cases - Not the 50, the 5. Top candidates are usually departmental dashboards, an approval flow, a QM checklist, a sales lookup tool, and one strategic prototype.
  3. Draft the policy - One page, three lanes, named lifecycle for each lane. Plain German. Reviewed by Datenschutz, Betriebsrat, and Steuerberater. Approved by the Geschäftsführer.
  4. Pick the tooling - One sandbox tool (typically Lovable or Power Apps with Copilot), one IT tool (typically Cursor + Claude Code or Codex), one production deployment target. Resist the urge to evaluate everything.
  5. Run the first AI literacy training - Mandatory under EU AI Act Article 421. 2-hour course tailored to citizen developers, covering vibe coding hygiene, security pitfalls, what data goes where.

Phase 2: Days 31-60 - Sandbox and first prototypes

  1. Stand up the sandbox tenant - SSO, sandbox copies of the relevant data, network policy, expiry policy. 2 weeks of IT work for one engineer.
  2. Onboard the first cohort - 8 to 15 people across the 5 use-case departments. Office hours twice a week, a #vibe-coding channel in Teams or Slack, a written getting-started guide.
  3. Build the first 5 prototypes - Each in its own lane. Time-boxed to 5 days each. The sandbox apps run on sandbox data. Most of them work. Some are surprisingly good. Some flop. Document all of it.
  4. Set up the quality gate pipeline - SAST tool, secret scanner, basic dependency check, the manual review checklist. Tooling: GitHub Advanced Security, Snyk, Semgrep, plus an actual reviewer.
  5. Establish the inventory - One simple table or app listing every citizen-built tool, owner, lane, last-used date, lifecycle status. Update at least monthly.

Phase 3: Days 61-90 - First production apps and feedback loop

  1. Promote the 1 to 3 prototypes that proved valuable - Run them through the gate. Most need 1 to 3 days of hardening - input validation, auth, data minimisation, basic tests. Some need IT to build the integration layer. None should be promoted without explicit owner sign-off.
  2. Archive the prototypes that did not prove out - Not all 5 will. Document what was learned, archive the code, free the sandbox space. This is the cultural muscle that keeps the programme honest.
  3. Run the first quarterly review - Citizen-app inventory, usage telemetry, gate pass-rate, security findings. Adjust policy if needed.
  4. Train the next cohort - Double the size. By now the early adopters can teach.
  5. Decide on agentic engineering for IT - Pick one IT-led project to ship using agentic engineering practices. The decision is whether to scale this to the whole IT team next quarter or stay piloted.

90-day completion checklist

  • Shadow AI inventory completed and reviewed with leadership
  • Policy signed off by Datenschutz, Betriebsrat, Steuerberater, Geschäftsführer
  • Sandbox tenant live with SSO and expiry policy
  • First cohort of 8 to 15 citizen developers trained (EU AI Act Article 4)
  • 5 prototypes built and evaluated
  • Quality gate pipeline live with SAST and secret scan
  • Citizen-app inventory in place with monthly review
  • 1 to 3 production apps promoted through the gate
  • Failed prototypes documented and archived
  • First quarterly governance review run
  • Agentic engineering pilot kicked off in IT

Tool Landscape: What to Pick When

The vibe coding tool space changes monthly. The categories below are stable; the leaders shift. As of mid-2026 these are the realistic picks for a German Mittelstand company.

Sandbox tools for citizen developers

  • Microsoft Power Apps with Copilot - The default choice if you are already on Microsoft 365. Release Wave 1 2026 added autonomous AI agents to Power Apps and Copilot Studio13. Strong governance built in via Power Platform admin. Best for forms, approval flows, internal apps.
  • Lovable - Web app generator with one of the strongest natural-language interfaces. Generates real React code. Self-host or cloud. Good for dashboards, internal tools, and prototypes.
  • Bolt.new (StackBlitz) - Similar to Lovable, slightly more developer-leaning. Good when the citizen developer has some HTML or SQL background.
  • Replit Agent - Strong full-stack agent that works inside the Replit IDE. Better for builders who want to learn over time.
  • v0 by Vercel - Best at UI generation. Less complete as a full-stack tool but excellent for component-driven prototypes.
  • Glide and AppSheet - Mobile-first, spreadsheet-driven. Excellent for QM checklists and field operations apps.

Tools for IT and agentic engineering

  • Cursor - The market default for AI-native IDEs. Tab-complete, agent mode, multi-file edits. Solid pricing for teams.
  • Claude Code - Anthropic’s CLI agent. Strong on long-running engineering tasks, refactors, and large codebases.
  • OpenAI Codex (GA 2026) - The OpenAI CLI agent. Strong on JavaScript and Python ecosystems.
  • GitHub Copilot Agent Mode - The default if you are already on GitHub Enterprise. Works well for incremental engineering work, less for greenfield generation.
  • Self-hosted alternatives - Continue.dev, Aider, Cline. Useful when sovereignty or air-gapped environments matter.

Quality gate and governance tooling

  • SAST - Snyk, Semgrep, GitHub Advanced Security, Checkmarx. Pick one, run it on every promotion request.
  • Secret detection - GitGuardian, TruffleHog, GitHub native scanning.
  • Dependency review - Snyk, Dependabot, FOSSA.
  • Power Platform CoE Starter Kit - For Power Apps governance specifically. Free, built by Microsoft, mandatory if you scale Power Platform.
  • Citizen-app inventory - Often a Power App or Notion page maintained by IT. Off-the-shelf solutions exist (Make, Pomerium) but a simple, owned table works fine.
PickIf youAvoid if
Power Apps + CopilotAlready on Microsoft 365, want strong governance, focus on forms/flowsNeed pixel-perfect custom UI or want to leave Microsoft stack
LovableNeed real React apps fast, comfortable with cloud-firstNeed on-premise / sovereign hosting
Glide / AppSheetBuilding mobile-first shop floor or field appsNeed complex backend logic
Cursor + Claude CodeRunning an agentic engineering team in ITHave no professional developers
Self-hosted Continue.devSovereignty and IP confidentiality are non-negotiableHave no platform team to maintain it

EU AI Act, GDPR, and Betriebsrat: The Compliance Layer

Vibe coding does not get a compliance pass because it is fast. If anything, it raises the bar - more apps, more builders, more places where data ends up. Three frameworks matter for the Mittelstand.

EU AI Act

  • Article 4 (AI literacy) - In force since February 2025. Every company using AI must ensure adequate AI literacy among the people deploying or using AI tools. For citizen developers using vibe coding tools, that means a documented training programme21.
  • Provider obligations - The AI coding tool itself (Lovable, Cursor) is the provider for AI Act purposes. Your company is the deployer of the tool, with deployer obligations. Most of those flow naturally from the three-lane model.
  • Risk classification depends on output - The vibe coded app itself is rarely a regulated AI system. But if it makes high-risk decisions (employment, credit, safety) the high-risk classification applies and the standard high-risk SDLC kicks in. This is where citizen development must hand off to the Critical lane.
  • Penalties - Up to 35 million EUR or 7 percent of global turnover for prohibited practices, up to 15 million EUR or 3 percent for other major violations22. SMEs get lower caps and access to regulatory sandboxes.

GDPR (Datenschutz-Grundverordnung)

  • Data minimisation in the sandbox - The sandbox tenant should never see raw production data unless the use case demands it. Synthetic data, anonymised exports, or sampled production exports are the default.
  • DPA for every tool - Lovable, Cursor, Replit, OpenAI, Anthropic - every tool needs a data-processing agreement on file. Your Datenschutzbeauftragter approves the list. Tools without a DPA do not enter the sandbox tenant.
  • Right to erasure - Every production app must support deletion of personal data records on request. This is a checkbox in the quality gate.
  • Cross-border transfers - Most leading vibe coding tools are US-based. Standard Contractual Clauses + a transfer impact assessment are standard. Some Mittelstand companies prefer EU-only or self-hosted alternatives for sensitive use cases.

Betriebsrat

  • Co-determination is real - The Betriebsrat has co-determination rights under § 87 BetrVG when AI tools handle employee performance data or behaviour. A blanket Betriebsvereinbarung covering the sandbox programme is faster than negotiating each app.
  • Transparency wins - Most German Betriebsräte will sign off on a vibe coding programme that includes a public inventory, training, and clear rules on what data citizen apps can touch.
  • HR-data apps need extra care - Anything handling performance reviews, attendance, productivity metrics needs a specific Betriebsvereinbarung even within the sandbox programme.

GoBD (German tax archival)

  • Anything touching invoices, ledgers, or tax data - Stays out of citizen development. GoBD requires unalterable archives, full audit trails, and Steuerberater alignment. Citizen apps in the production lane can read GoBD-archived data, but the archive itself stays in the Critical lane.
  • Reporting and analytics on top of GoBD data - Allowed, useful, and a great citizen-development use case. The original records are unchanged; the citizen tool sits on a downstream copy.

Compliance as enabler, not blocker

The most common compliance failure is treating it as a no-list. The right framing for the Mittelstand IT leader: compliance defines the lanes, then the lanes accelerate everything inside them. A signed Betriebsvereinbarung is what unlocks the sandbox, not what constrains it.

How Superkind Fits

Superkind builds custom AI agents for German SMEs and enterprises. We do not sell a vibe coding tool. We come in where vibe coding stops being the right answer and agentic engineering takes over - the Critical lane in the three-lane model. Here is what we typically do for a Mittelstand company adopting vibe coding seriously.

What Superkind does

  • Three-lane model design - We help IT leaders draft the policy, pick the tooling, and align with Datenschutz, Betriebsrat, and Steuerberater. A standard engagement runs 4 to 6 weeks.
  • Quality gate pipeline - We set up the SAST + secret-scan + manual-review pipeline that turns sandbox prototypes into production apps. Often integrated with existing GitHub or Azure DevOps.
  • Agentic engineering enablement - We help IT teams adopt Cursor, Claude Code, and Codex with the workflows that actually scale. Pair coaching for 4 to 8 weeks per team.
  • Production-grade agents on top of citizen apps - When a vibe-coded prototype proves valuable but needs SAP or DATEV integration, sovereignty, or autonomous action, we replace the prototype with a production agent that meets the same need at production reliability.
  • Integration layer as a product - We build the read-only data products that citizen apps consume, so IT does not have to expose raw SAP or ERP credentials.
  • EU AI Act readiness - Article 4 literacy programme tailored to citizen developers, technical documentation for any high-risk apps, audit trails that hold up to BNetzA scrutiny.
  • Sovereignty options - For Mittelstand firms that need EU-only or self-hosted models, we deploy on Aleph Alpha, Mistral, or self-hosted open-weights.
  • Sustained partnership - We stay engaged on a retainer, run quarterly reviews, and absorb the agents into the operating model.

Where we deliberately do not compete

  • Selling vibe coding licences - That is the job of Lovable, Cursor, and Microsoft. Superkind helps you use them well.
  • Training every employee in vibe coding - Internal HR or a specialised training partner usually does this better.
  • Replacing your IT department - We make IT 5 to 10 times more effective. We do not become IT.

Superkind: Honest Pros and Cons

Strengths

  • Mittelstand DNA - we work the way German SMEs work
  • Process-first approach - we map the workflow before we build anything
  • Outcomes, not licences - we are paid to deliver impact, not seats
  • SAP, DATEV, legacy ERP fluency - real integrations, not toy demos
  • EU AI Act and GDPR by design - audit-ready from day one

Honest cons

  • Not a fit below 50 employees - small firms often do not need the full programme
  • Not a quick licence sale - we engage for outcomes, which takes time
  • We do not run vibe coding training at scale - we focus on the IT-side counterpart
  • We require executive sponsorship - bottom-up engagements rarely succeed at this scope

Decision Framework: Should You Start Now?

Six signals that tell you whether vibe coding is the right move for your company in the next quarter, and what kind of move it should be.

SignalWhat it meansAction
Your IT backlog has 50+ small departmental requestsClassic case for vibe coding - the backlog is mostly never going to be built by ITRun the 90-day playbook with departmental dashboards as the first cohort
You found shadow AI accounts in your last security reviewCitizens are already vibe coding without youMove fast - sanction the tools that are already in use, set the lanes, and bring usage above board
Your IT team is <1.5% of headcountStructurally undersized for the demandVibe coding plus agentic engineering is the only realistic path to clearing the backlog
You already pay for Microsoft 365 with CopilotYou have most of the platform alreadyStart with Power Apps + Copilot before adding new vendors
You operate in a heavily regulated vertical (medtech, finance, automotive Tier 1)The Critical lane is large and the gate mattersLead with the gate and the policy, not with shipping. Build credibility before scaling
You have <30 employees and simple processesVibe coding programme is overkillSkip the programme; let people use tools as needed under a one-page policy

Acting Now vs Waiting

Acting Now

  • You catch shadow AI before it scales
  • The backlog clears in 6 to 12 months
  • EU AI Act readiness is in place before August 2026
  • Your Mittelstand peers are still figuring this out

Waiting 6 months

  • Shadow AI exposure compounds quietly
  • Your best people get bored and consider leaving
  • Cleanup gets harder once 40+ shadow apps exist
  • Compliance pressure rises before you have controls

Frequently Asked Questions

Vibe coding is the practice of building software by describing what you want in natural language and letting an AI tool generate, run, and debug the code for you. Andrej Karpathy coined the term in February 2025. The user often does not read the resulting code in detail. Tools that enable it include Cursor, Lovable, Bolt.new, Replit Agent, v0, and Microsoft Power Apps with Copilot.

Low-code and no-code platforms (Microsoft Power Apps, Mendix, OutSystems, Airtable) build apps from pre-defined visual components. Vibe coding generates real code from a free-text prompt, with no fixed component library. The line is blurring fast - Power Apps with Copilot now mixes both, and most low-code vendors are racing to add AI generation. The governance question is identical for both.

Yes, but in a controlled lane. Most Mittelstand IT teams are already losing this battle - business teams are using Cursor, ChatGPT, and Power Apps regardless. The right answer is the three-lane model: a sandbox where citizens can ship freely, a production lane with a quality gate, and a critical lane reserved for professional engineering. Banning vibe coding outright drives it underground.

Real but manageable. Independent studies consistently find that 40 to 62 percent of AI-generated code samples contain security vulnerabilities, and Gartner predicts prompt-to-app citizen development will increase software defects by 2,500 percent by 2028 if left ungoverned. The mitigation is a quality gate before any vibe-coded tool touches production data, plus DevSecOps practices for the few apps that need to scale.

A controlled vibe coding programme typically runs EUR 30,000 to 80,000 in year one. That covers a sandbox tenant, governance tooling, a quality-gate pipeline, training for citizen developers and IT champions, and a small budget to harden the 5 to 10 prototypes per year that prove valuable. Tooling licences (Cursor, Lovable, Power Platform) usually add EUR 30 to 60 per active user per month.

No. Vibe coding shifts what IT does, not whether IT exists. Citizen developers will build the long tail of internal tools that were never going to make the IT roadmap anyway. Professional engineers spend less time on CRUD apps and more time on integration, governance, security, and the agents that need real reliability. Most Mittelstand IT teams report capacity increases of 30 to 50 percent on strategic work after the shift.

Agentic engineering is what comes after vibe coding for production work. Andrej Karpathy himself moved on from vibe coding in 2026, preferring the term agentic engineering: a senior engineer orchestrates multiple AI coding agents, reviews their output, and is accountable for the result. Vibe coding raises the floor (anyone can ship something). Agentic engineering raises the ceiling (small teams ship production-grade software 10 times faster).

In most cases the AI Act applies indirectly. The vibe-coded app itself is rarely a regulated AI system, but the AI coding tool that generated it falls under provider obligations, and Article 4 of the EU AI Act obliges your company to ensure adequate AI literacy for everyone using AI tools - including citizen developers. If the resulting app makes high-risk decisions (hiring, credit, safety) the high-risk classification applies regardless of how it was built.

GDPR applies to the data, not the code. A vibe-coded HR tool that processes employee data needs the same data-processing agreements, deletion logic, and Betriebsrat involvement as any other tool. Most German Betriebsräte want to be informed early when citizen-built tools touch employee performance data. The fastest path is a written sandbox policy that the Betriebsrat signs off once and applies to every prototype.

Ghost apps are the number-one operational risk after security. Three rules work: every sandbox app expires after 90 days unless it gets renewed, production apps must have a named owner and a clear retirement path, and IT runs a quarterly inventory of citizen-built tools with usage data. Apps below a usage threshold get archived. Yes, this means deleting things people built. Do it anyway.

Yes, but the integration layer is exactly where IT must intervene. Most vibe coding tools generate REST API calls or SQL queries that look fine in isolation but break when they hit a real SAP system, especially in batch mode. The pattern that works: the citizen developer prototypes against a sandbox dataset, then IT or an agent partner like Superkind wraps the prototype with proper SAP/DATEV connectors before it goes to production.

Three roles, none new. The Citizen Development Lead sets policy, runs training, and operates the sandbox. The Quality Gate Engineer reviews apps before they enter production - usually a senior dev who likes mentoring. The Agentic Engineering Lead handles the apps that need to scale, owning the agent layer that production tools depend on. In a 200-person Mittelstand company these are typically 1.5 to 2 FTE in total.

Three numbers. First, number of internal tools shipped per quarter (baseline is usually 2 to 5; with vibe coding done well it lands at 15 to 40). Second, IT request-to-delivery time for departmental apps (target: 30 days down to 5 days). Third, the percentage of citizen-built tools still in active use after 6 months (target: above 40 percent - if it is lower, the gate is too loose). McKinsey reports that companies empowering citizen developers score 33 percent higher on innovation metrics.

Related Articles

Sources

  1. Wikipedia - Vibe Coding (with Andrej Karpathy original tweet, 2 February 2025)
  2. Forrester - Secure Vibe Coding: A Paradigm, Not A Paradox (Janet Worthington, Senior Analyst)
  3. The New Stack - Vibe Coding Is Passé. Karpathy Has a New Name for the Future of Software
  4. Dealroom - Vibe Coding Was Just the Warmup. Andrej Karpathy on the Dawn of Software 3.0
  5. Gartner - Predicts 2026: AI Potential and Risks Emerge in Software Engineering Technologies
  6. Gartner - Top Cybersecurity Trends 2026 (Shadow AI Predictions)
  7. Gartner - 40% of Firms Hit by Shadow AI Security Incidents
  8. Gartner / InfoWorld - Low-Code Development Technologies Forecast to $44.5 Billion by 2026 (Jason Wong, Gartner)
  9. Kissflow - Citizen Development Statistics & Trends 2026 (Gartner Synthesis)
  10. Bitkom - Künstliche Intelligenz in Deutschland Studienbericht 2026
  11. Bitkom - Digitale Souveränität Studienbericht 2025
  12. Bitkom - IT-Mittelstandsbericht
  13. Microsoft - Power Platform Release Wave 1 2026 (Autonomous AI Agents)
  14. McKinsey - Citizen Development and Innovation Performance
  15. Checkmarx - Vibe Coding Security: Risks, Vulnerabilities, and Secure AI Coding
  16. Dark Reading - Security Concerns Shadow Vibe Coding Adoption
  17. Modall - Vibe Coding Security Risks: What Founders Need to Know (2026)
  18. ifo Institute - Skilled Worker Shortage in Germany (2025)
  19. DIHK - Skilled Labour Report 2025/2026
  20. OECD Economic Surveys: Germany 2025
  21. EU AI Act - Article 4: AI Literacy
  22. EU AI Act - Implementation Timeline
  23. EU AI Act - Small Businesses Guide
  24. NxCode - What Is Vibe Coding? Complete Guide to AI-Assisted Development (2026)
  25. DEV Community - What Is Vibe Coding in 2026? One Year From Karpathys Tweet
  26. Microsoft - What is Low-Code Governance
  27. Synalis - Governance bei Power Platform
  28. Google Cloud - What is Vibe Coding? Tools and Guides
Henri Jung, Co-founder at Superkind
Henri Jung

Co-founder of Superkind, where he helps SMEs and enterprises deploy custom AI agents that actually fit how their teams work. Henri is passionate about closing the gap between what AI can do and the value it creates in real companies. He believes the Mittelstand has everything it needs to lead in AI - it just needs the right approach.

Ready to ship the three-lane model in your company?

Book a 30-minute call with Henri. We will sketch the policy, the gate, and the first cohort plan for your team - no commitment, no sales pitch.

Book a Demo →