Last quarter your controller built a forecasting tool in Lovable. A salesperson wired up a quote calculator in Cursor over a weekend. Someone in QM used Microsoft Copilot to generate a Power App that now schedules audits. None of these people can read the code that actually runs. None of them filed a ticket with IT. All of them are happier than they have been in years.
Welcome to vibe coding. The term entered the lexicon in February 2025 when AI researcher Andrej Karpathy described a new way of working: describe what you want, accept what the model produces, ship before you understand it1. Twelve months later, 92 percent of US developers reported using some form of vibe coding in their workflow24. Gartner forecasts that by 2026, 80 percent of low-code and AI-assisted development tool users will sit outside formal IT departments9.
For the German Mittelstand this is the most disruptive shift in internal software since SAP. It is not a question of whether your business teams will start building their own software - they already are. The question is whether your IT department leads the wave or gets buried by it. This guide is the operating model for the second option turning into the first.
TL;DR
Vibe coding is real and already in your company - 80 percent of low-code users will be non-IT by 2026, and tools like Cursor, Lovable, Replit Agent, and Power Apps with Copilot are inside Mittelstand offices today.
The risk is not technology, it is governance - Gartner forecasts a 2,500 percent increase in software defects from prompt-to-app citizen development by 2028 if left ungoverned, and 40 to 62 percent of AI-generated code samples already contain security vulnerabilities.
The right answer is the three-lane model - sandbox for experiments, production lane with a quality gate, critical lane for professional engineering.
90 days is enough to establish the policy, the sandbox tenant, the quality gate, and the first cohort of trained citizen developers.
The Mittelstand opportunity is asymmetric - your IT teams are too small to build everything, your business teams know the processes best, and the labour shortage is not slowing down. Vibe coding is how the gap closes.
The Vibe Coding Wave Has Arrived in the Mittelstand
Most German IT leaders still talk about citizen development as something on the horizon. The data says otherwise. The shift has already happened, mostly without IT noticing.
- 80 percent of low-code users non-IT - Gartner predicts that by 2026, developers outside formal IT departments will account for at least 80 percent of the user base for low-code development tools, up from 60 percent in 20219.
- Citizen developers outnumber professional ones - At large enterprises, citizen developers will outnumber professional software developers by a 4 to 1 ratio in 2026 according to Gartner9.
- 70 percent of new enterprise apps built outside IT - The same Gartner research projects that 70 percent of new enterprise applications by 2026 will be built by citizen developers rather than traditional IT teams9.
- The market is exploding - Gartner expects the low-code development technologies market to reach 44.5 billion US dollars by 20268, with vibe-coding-native tools (Cursor, Lovable, Replit) growing the fastest within that.
- 92 percent of developers vibe-code - Early 2026 surveys find 92 percent of US-based professional developers have adopted some form of vibe coding in their workflows24. This is no longer a niche practice.
- 25 percent of Y Combinator W25 founders ship 95 percent AI-generated code - Inside the Y Combinator Winter 2025 cohort, a quarter of the startups had codebases that were 95 percent AI-generated3. The economic pattern of new software is already vibe-coded.
- Karpathy himself moved on - In early 2026 the inventor of the term declared vibe coding passé in favour of agentic engineering, where senior engineers orchestrate multiple coding agents under oversight3. The Mittelstand is mostly still catching up to vibe coding while the leading edge already shifted.
Key Data Point
Gartner forecasts that prompt-to-app approaches adopted by citizen developers will increase software defects by 2,500 percent by 2028, triggering a software quality and reliability crisis5. The vibe coding wave is real. The defect wave is coming if no one builds the dam.
And the Mittelstand context makes the wave land harder than in larger companies. German SMEs already report extreme IT capacity constraints, the labour shortage is structural rather than cyclical, and the business teams have a long history of solving their own problems with Excel and Access. Vibe coding is the next step in that tradition - whether IT participates or not.
| Indicator | 2026 Reality | Source |
|---|---|---|
| Low-code users outside IT | 80% (up from 60% in 2021) | Gartner9 |
| Citizen-to-pro developer ratio | 4:1 in large enterprises | Gartner9 |
| Low-code market size | USD 44.5 billion | Gartner / InfoWorld8 |
| Developers using vibe coding | 92% (US, early 2026) | Industry survey24 |
| YC W25 startups with 95% AI code | 25% of cohort | The New Stack3 |
| Predicted software defect increase | 2,500% by 2028 | Gartner5 |
| Companies hit by Shadow AI incidents | 40% by 2030 | Gartner7 |
“There’s a new kind of coding I call vibe coding, where you fully give in to the vibes, embrace exponentials, and forget that the code even exists.”
- Andrej Karpathy, AI researcher and former Director of AI at Tesla, on X, 2 February 20251
What Vibe Coding Actually Is (and What It Is Not)
Vibe coding is not low-code, not no-code, and not the AI-pair-programming most engineers already use. It sits in its own quadrant. The boundary matters because the right governance for vibe coding is different from the right governance for low-code platforms.
The four quadrants of build
- Pro code - Engineers writing code in an IDE, using AI as autocomplete or pair programmer (GitHub Copilot, Cursor in tab-complete mode). The engineer reads, reviews, and ships every line.
- Agentic engineering - Engineers orchestrating multiple AI coding agents (Claude Code, Codex, Cursor in agent mode), reviewing and integrating their output. Karpathy’s 2026 evolution of vibe coding for production work3.
- Low-code / no-code - Business users (or developers) building apps through visual canvases backed by predefined components (Power Apps, Mendix, OutSystems, Bubble, Airtable). Constrained by the platform, but the platform takes care of most of the engineering.
- Vibe coding - Users describing what they want in natural language to an AI tool that generates real code, with the user often not reading the result (Lovable, Bolt.new, Replit Agent, v0, Power Apps with Copilot in chat mode). Unconstrained by predefined components, accelerated by the model, accountable to no one in the original definition.
| Approach | Who builds | Constraint | Risk profile |
|---|---|---|---|
| Pro code with AI assist | Engineers | None - real code | Standard SDLC |
| Agentic engineering | Senior engineers | Spec quality + review capacity | Standard SDLC, faster |
| Low-code / no-code | Business + IT | Platform components | Platform-bounded |
| Vibe coding | Anyone | None - real code | Unbounded without governance |
What vibe coding feels like in practice
Concrete example. A controller at a Mittelstand machinery firm wants a tool that pulls daily sales orders from SAP, projects revenue against the budget, and emails the leadership team a one-pager every morning. Six months ago this was a quarter-long IT project. Today the controller opens Lovable, types a four-paragraph description, watches the tool generate a full-stack web app with a database and email integration, connects it to a sandbox SAP export, and ships it Monday morning. Total time: two evenings.
The controller does not know what programming language was used. The controller cannot tell whether the SQL is parameterised or vulnerable to injection. The controller’s manager is delighted by the new dashboard. IT learns about it three weeks later when someone in security notices an unfamiliar app calling the SAP REST gateway.
The defining feature
Vibe coding is defined by what the user does not do: read the code. The user accepts the AI output, runs it, and ships if it works. Karpathy himself wrote in the original tweet: “I ‘Accept All’ always, I don’t read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it.”1
Why the Mittelstand Cannot Ignore This
For the Mittelstand specifically, three structural pressures make vibe coding less optional than for larger firms.
- The IT bottleneck is structural, not cyclical - Most Mittelstand IT teams operate at 1 to 2 percent of headcount, compared to 4 to 6 percent at larger firms. Internal app requests pile up while the small IT team triages keep-the-lights-on work. Vibe coding redirects the long tail of small requests to the people who actually need them.
- The skilled labour shortage is permanent - The DIHK reports Germany needs 300,000 skilled foreign workers per year just to maintain current staffing19, and the OECD projects the working-age population will shrink by 3.9 million by 203020. The ifo Institute reports 28.3 percent of German companies cannot find enough qualified workers18. There is no version of the next decade where IT teams grow fast enough to build everything the business asks for.
- Process knowledge sits in the business - The Mittelstand’s strength is deep operational expertise. The controller knows controlling, the production planner knows production, the sales engineer knows the deal flow. Forcing them to translate that knowledge through a ticket system into IT is exactly where most internal software projects die. Vibe coding lets the domain expert build directly.
- Existing low-code investments are already there - 88 percent of German Mittelstand firms have Microsoft 365 licences10. Power Apps with Copilot, Power Automate with Copilot, and Copilot Studio are available the moment a company turns them on. Most do not even know.
- Competitive pressure from agile firms - The hidden champions that compete on operational excellence will lose to firms that ship 5 internal tools a quarter when they ship 5 a year. The compounding gap closes faster than any other digital transformation lever.
The Mittelstand asymmetry
Large enterprises will adopt vibe coding cautiously because they have the IT capacity to build the long tail anyway. The Mittelstand has no such capacity. The asymmetry: vibe coding done well is more valuable to a 200-person firm than to a 20,000-person one - and the cost of not adopting is higher.
What changes for the Geschäftsführer
The deepest change is not technological, it is organisational. The Mittelstand has historically separated those-who-think-up-software (the business) from those-who-build-it (IT). Vibe coding collapses the distance to a few hours. That has three downstream effects.
- The IT roadmap stops being the bottleneck - Departmental tools that used to be deferred for two years now ship in two days. IT moves up the value chain to integration, security, and the agents that need to be reliable.
- Spec design becomes the strategic skill - The new bottleneck is the quality of the description. Senior people who can write a clear three-page spec become 10 times more productive. People who cannot articulate what they want struggle even with vibe coding tools.
- Quality bar becomes a leadership question - The Geschäftsführer cannot delegate the question of which apps run the business and how they are tested. The wrong answer is to ban vibe coding. The right answer is to define the bar at which a tool moves from sandbox to production.
Want a quality gate for your citizen-built apps?
We help Mittelstand IT teams design the three-lane model and the agentic-engineering layer that catches what vibe coding cannot.
5 Use Cases That Actually Work
Not every internal tool benefits from vibe coding. The pattern that works is small, departmental, low-stakes, fast-moving. The pattern that fails is anything customer-facing, finance-critical, or data-sensitive without a hardening pass. Here are five use cases that consistently land in the Mittelstand.
Use case 1: Departmental dashboards and reports
Daily operations dashboards, weekly KPI reports, ad-hoc analyses that the BI team would take six months to deliver. The data lives in SAP, DATEV, HubSpot, or a SharePoint export. The user wants a focused view, not a generic BI tool.
- Typical builders - Controllers, sales operations, plant managers, HR business partners.
- Typical tools - Lovable, v0, Cursor, Power Apps with Copilot, Hex.
- Typical timeline - 2 to 8 hours from idea to working prototype.
- Where governance matters - Read-only access to production data, no writebacks, no PII unless reviewed.
- Realistic ROI - 4 to 8 hours per week saved per dashboard, often replacing manual Excel work.
Use case 2: Internal request and approval flows
Holiday requests, capex approvals, supplier onboarding intake, IT access requests, training requests. Most run today through email threads or generic ticketing. A vibe-coded form-plus-workflow ships in days and integrates with Teams or Slack for approvals.
- Typical builders - HR coordinators, office managers, IT helpdesk staff, internal communications.
- Typical tools - Power Apps with Copilot, Bubble, Glide, custom Lovable apps.
- Typical timeline - 1 to 3 days for the first working version.
- Where governance matters - Approval chains touching financial limits or HR data must be reviewed by IT or compliance.
- Realistic ROI - 30 to 60 percent reduction in cycle time for the workflow, plus a clean audit trail.
Use case 3: Quality and operations checklists
Shift handover forms, quality inspection lists, tool calibration logs, machine setup checklists. Most are still on paper or in PDF templates. Vibe-coded mobile apps replace them with timestamped digital records that integrate with the QMS.
- Typical builders - QM officers, shift leads, plant engineers.
- Typical tools - Power Apps with Copilot, AppSheet, Glide, custom Lovable apps.
- Typical timeline - 1 week including pilot on the shop floor.
- Where governance matters - Records that feed ISO 9001 or IATF 16949 audits must be tamper-evident and exportable.
- Realistic ROI - 60 to 80 percent time saving on data capture, plus dramatically better audit-readiness.
Use case 4: Customer-data lookup and prep tools
Sales reps before a call need a one-page customer brief. Account managers preparing a QBR want a quick scorecard. Service techs en route to a job want the install history. Vibe-coded read-only apps over a sandbox CRM export deliver this in days.
- Typical builders - Sales operations, service operations, key account managers.
- Typical tools - Cursor, Lovable, v0 over a CSV or REST endpoint.
- Typical timeline - 3 to 7 days for the first working version.
- Where governance matters - Read-only by default, refreshed nightly, pulled from a sanctioned data source. Writebacks require IT involvement.
- Realistic ROI - 1 to 2 hours per rep per week, better customer interactions, fewer copy-paste errors.
Use case 5: Throwaway analyses and prototypes
The single most underrated use case. Someone wants to test whether a new pricing model works. Someone wants to see if a German labelling regulation breaks the existing product taxonomy. Someone wants to mock up a partner integration before a kickoff meeting. Vibe coding for one-week-throwaways is almost pure upside.
- Typical builders - Product managers, strategy team, internal consulting, the CEO’s office.
- Typical tools - Lovable, Replit Agent, v0, Cursor.
- Typical timeline - 4 hours to 3 days, usually thrown away after.
- Where governance matters - Synthetic data only, sandbox tenant, automatic 30-day expiry. Almost no governance friction needed.
- Realistic ROI - The team learns 3 to 5 things they would not have learned without the prototype, often saving a much larger engineering investment later.
| Use case | Builder | Time to ship | Lane |
|---|---|---|---|
| Departmental dashboards | Controllers, ops | 2-8 hours | Sandbox → Production |
| Approval flows | HR, IT, office | 1-3 days | Production with gate |
| QM checklists | QM, plant leads | ~1 week | Production with gate |
| Customer lookup tools | Sales, service ops | 3-7 days | Production with gate |
| Throwaway analyses | Strategy, PM, CEO | 4 hours - 3 days | Sandbox |

5 Failure Modes to Avoid
The use cases above land when governance is in place. Without governance, the same tools cause five predictable failure modes. Anticipate them before the first prototype ships.
Failure mode 1: The vulnerable app in production
The most consequential failure. A vibe-coded internal app handles employee performance data, customer information, or supplier financials. Independent code-security studies consistently find that 40 to 62 percent of AI-generated code samples contain security vulnerabilities, and that AI-written code produces flaws at 2.74 times the rate of human-written code15. Without a quality gate, the chance of a vulnerable app reaching production is uncomfortably high.
- Symptoms - SQL injection, exposed API keys in client-side code, missing authentication, broken access control, hardcoded credentials.
- Mitigation - Production gate with automated SAST scan, secret detection, and an actual human review for anything that touches sensitive data.
- Critical rule - No vibe-coded app touches PII, financial data, or production write paths until it has passed the gate.
Failure mode 2: The ghost app graveyard
The team builds 40 prototypes in six months. 32 are abandoned within weeks. 5 are used by exactly one person. 3 become real dependencies for parts of the business. No one tracks which is which. The team that built them is now busy elsewhere. When something breaks, no one knows whether it matters.
- Symptoms - Apps with unclear ownership, unmaintained dependencies, unknown user counts, dead URLs, unanswered support requests.
- Mitigation - 90-day expiry on every sandbox app, mandatory named owner for every production app, quarterly inventory with usage telemetry, automatic archival below a usage threshold.
- Critical rule - Build the gravedigger from day one. The hardest cultural shift is making it OK to delete things people built.
Failure mode 3: Shadow AI everywhere
Most worrying for compliance. Business teams sign up for Lovable, Replit, Cursor, and ChatGPT on personal accounts because IT was too slow to provide them. Customer data flows to vendor APIs that have no DPA. Gartner predicts 40 percent of organisations will suffer security or compliance incidents from shadow AI by 20307.
- Symptoms - Employees mentioning tools you have not procured, payment receipts on personal cards, customer data appearing in third-party logs, GDPR exposure no one mapped.
- Mitigation - Provide sanctioned tools fast. The fastest path is a small but well-resourced sandbox with Copilot for Microsoft 365, Cursor for IT champions, and a pilot Lovable or Power Apps tenant for citizen developers, all on company SSO.
- Critical rule - You do not stop shadow AI by banning AI. You stop it by being faster than the personal-account alternative.
Failure mode 4: The integration debt avalanche
Every vibe-coded app integrates with SAP, DATEV, or another core system in its own way. Some hit REST endpoints, some scrape exports, some embed credentials, some go through Power Automate, some go through homemade Python scripts. Six months in, no one understands how data flows between systems.
- Symptoms - Multiple apps making the same SAP calls with different logic, inconsistent customer or product master data across tools, integration bugs no one can trace.
- Mitigation - IT publishes a small, well-documented set of read-only data products that any app can connect to. Citizen developers consume those instead of integrating directly.
- Critical rule - The integration layer is non-negotiable IT territory. Citizens build on top of it, not around it.
Failure mode 5: The skills two-class society
Subtle but corrosive. The vibe coding programme empowers business teams who happen to be comfortable describing what they want. The teams that struggle - older workers, technical roles with weaker writing skills, non-native German speakers - get left behind. Within a year, the company has a clear two-class structure of those who ship and those who do not.
- Symptoms - The same five departments produce most of the apps, others stop trying, complaints about IT favouritism, fairness questions from the Betriebsrat.
- Mitigation - Active enablement, not just access. Office hours, paired sessions, written guides, multilingual training. Treat citizen development like an apprenticeship system, which the Mittelstand already understands.
- Critical rule - The fairness question is real. Plan for it in the rollout, not after the first internal complaint.
Vibe Coding Without Governance vs With Governance
With governance
- ✓ 5x to 20x faster delivery of departmental tools
- ✓ Security risk contained by quality gate before production
- ✓ Shadow AI displaced by sanctioned, faster alternatives
- ✓ IT capacity reallocated to integration and agents
- ✓ Audit trail intact for ISO, GDPR, EU AI Act
Without governance
- ✗ 40 to 62% of apps ship with vulnerabilities15
- ✗ 2,500% defect increase projected by 20285
- ✗ Ghost-app graveyard growing each quarter
- ✗ Compliance exposure on PII, GoBD, GDPR
- ✗ Two-class workforce on building access
Vibe Coding vs Agentic Engineering: The Quality Bar Question
The most useful framing for the Mittelstand IT leader is the distinction between vibe coding and agentic engineering. Karpathy, who coined vibe coding, declared the term passé in 2026 and now prefers agentic engineering3. The distinction is the difference between raising the floor and raising the ceiling.
- Vibe coding raises the floor - Anyone can ship something. The bar for the first prototype drops from weeks to hours. The risk is that the bar for what reaches production drops with it. Use vibe coding for sandbox, prototype, and throwaway work.
- Agentic engineering raises the ceiling - A senior engineer orchestrates 3 to 10 AI coding agents under explicit oversight, reviewing diffs, designing specs, and accountable for what ships. Production code lands 5 to 10 times faster than traditional development without the floor dropping. Use agentic engineering for production, integration, and anything that needs to be reliable.
- Both are real, both are needed - The Mittelstand needs both lanes. Vibe coding handles the long tail of departmental tools. Agentic engineering handles the spine of integration, security, and the agents that the business actually depends on.
“As developers, we can’t assume that the generated code is secure by default.”
- Janet Worthington, Senior Analyst at Forrester2
What changes in the engineering workflow
Agentic engineering changes how IT teams operate. The senior engineer becomes a director of multiple agents rather than a single author of code. Reviewing diffs and writing specs becomes 60 to 80 percent of the day. Junior tasks largely disappear. The role of the team lead shifts from work-breakdown to spec design and quality gates.
| Dimension | Vibe coding | Agentic engineering |
|---|---|---|
| Builder | Anyone | Senior engineer |
| Code reviewed? | Often not | Always |
| Tests written? | Sometimes | Always, often agent-generated then reviewed |
| Production-ready? | Rarely | Yes |
| Time to first prototype | Hours | 1-3 days |
| Time to production | Needs hardening pass | 1-3 weeks |
| Best for | Departmental tools, prototypes | Integration, security, agents |
| Risk profile | Bounded by sandbox | Same as traditional engineering |
The two-track Mittelstand IT team
The successful Mittelstand IT team in 2026 runs two tracks. Track 1 enables vibe coding for the business. Track 2 practises agentic engineering for the spine of the company. The same team, different lanes, different bars. Trying to apply the vibe coding bar to production work is how 40 to 62 percent vulnerability rates happen.
The Three-Lane Governance Model
The simplest, most defensible governance model for vibe coding in the Mittelstand has three lanes. It maps cleanly to existing IT processes, the Betriebsrat usually accepts it, and IT leaders can explain it to the Geschäftsführer in one slide.
Lane 1: The Sandbox
- Who can ship - Any trained employee with a sandbox account.
- What runs there - Prototypes, dashboards on synthetic or sandbox data, throwaway analyses, first versions of departmental tools.
- Data access - Sandbox copies of production data only, refreshed nightly, no PII unless explicitly approved, no write-back to production systems.
- Tooling - Lovable, Cursor, Replit Agent, Power Apps with Copilot, all on a single tenant with SSO.
- Quality gate - None. Speed is the point.
- Lifetime - 90 days. After 90 days the app expires unless promoted.
- Owner - The builder.
Lane 2: Production
- Who can ship - Trained citizen developers + IT champions, after passing the quality gate.
- What runs there - Departmental tools that actually run the business: dashboards on live data, approval flows, QM checklists, customer lookup tools.
- Data access - Live data through sanctioned data products published by IT. No direct database connections. Writeback only with explicit IT approval.
- Tooling - Same as sandbox plus IT-managed deployment (Vercel, Azure App Service, Power Apps managed environments).
- Quality gate - Automated SAST + secret scan, dependency review, manual security review for anything touching sensitive data, basic test coverage required.
- Lifetime - Unlimited but reviewed quarterly. Apps below a usage threshold (e.g. fewer than 5 active users for 60 days) get archived.
- Owner - Named individual + named team. Documented in the citizen-app inventory.
Lane 3: Critical Systems
- Who can ship - Professional engineers using agentic engineering practices. No citizen development here.
- What runs there - Integration layer, agents that act on production systems, anything customer-facing, anything affecting financial reporting, anything classified as high-risk under the EU AI Act, anything regulated under GoBD.
- Data access - Full, with proper IAM and audit trails.
- Tooling - Cursor in agent mode, Claude Code, Codex, internal AI agent platforms. Code review by a human in every PR, full SDLC, full SecOps integration.
- Quality gate - Standard enterprise SDLC: design review, security review, testing, staging, change approval.
- Lifetime - Maintained as core IT.
- Owner - IT.
| Lane | Builder | Data | Gate | Lifetime |
|---|---|---|---|---|
| 1. Sandbox | Any trained user | Sandbox / synthetic | None | 90 days |
| 2. Production | Trained citizen + champion | Live via data products | SAST + manual review | Quarterly review |
| 3. Critical | Pro engineer | Full | Standard SDLC | Maintained |
The critical rule
Apps move up a lane through promotion, never sideways. A sandbox prototype that the team likes does not become production by being used a lot. It becomes production by passing the quality gate. The most common Mittelstand failure is to skip this step because “it already works”. It works until it does not.
The 90-Day Rollout Playbook
The full programme described above takes 6 to 9 months to mature. The 90-day version is the smallest meaningful unit of work that gets a Mittelstand company from no policy to a working three-lane model with first production apps.
Phase 1: Days 1-30 - Discovery and policy
- Inventory shadow AI usage - One-week survey across all departments. Which AI coding tools are already in use, on which accounts, with what data? Often a long list. The result becomes the legitimisation argument with the Geschäftsführung.
- Identify the 5 highest-value use cases - Not the 50, the 5. Top candidates are usually departmental dashboards, an approval flow, a QM checklist, a sales lookup tool, and one strategic prototype.
- Draft the policy - One page, three lanes, named lifecycle for each lane. Plain German. Reviewed by Datenschutz, Betriebsrat, and Steuerberater. Approved by the Geschäftsführer.
- Pick the tooling - One sandbox tool (typically Lovable or Power Apps with Copilot), one IT tool (typically Cursor + Claude Code or Codex), one production deployment target. Resist the urge to evaluate everything.
- Run the first AI literacy training - Mandatory under EU AI Act Article 421. 2-hour course tailored to citizen developers, covering vibe coding hygiene, security pitfalls, what data goes where.
Phase 2: Days 31-60 - Sandbox and first prototypes
- Stand up the sandbox tenant - SSO, sandbox copies of the relevant data, network policy, expiry policy. 2 weeks of IT work for one engineer.
- Onboard the first cohort - 8 to 15 people across the 5 use-case departments. Office hours twice a week, a #vibe-coding channel in Teams or Slack, a written getting-started guide.
- Build the first 5 prototypes - Each in its own lane. Time-boxed to 5 days each. The sandbox apps run on sandbox data. Most of them work. Some are surprisingly good. Some flop. Document all of it.
- Set up the quality gate pipeline - SAST tool, secret scanner, basic dependency check, the manual review checklist. Tooling: GitHub Advanced Security, Snyk, Semgrep, plus an actual reviewer.
- Establish the inventory - One simple table or app listing every citizen-built tool, owner, lane, last-used date, lifecycle status. Update at least monthly.
Phase 3: Days 61-90 - First production apps and feedback loop
- Promote the 1 to 3 prototypes that proved valuable - Run them through the gate. Most need 1 to 3 days of hardening - input validation, auth, data minimisation, basic tests. Some need IT to build the integration layer. None should be promoted without explicit owner sign-off.
- Archive the prototypes that did not prove out - Not all 5 will. Document what was learned, archive the code, free the sandbox space. This is the cultural muscle that keeps the programme honest.
- Run the first quarterly review - Citizen-app inventory, usage telemetry, gate pass-rate, security findings. Adjust policy if needed.
- Train the next cohort - Double the size. By now the early adopters can teach.
- Decide on agentic engineering for IT - Pick one IT-led project to ship using agentic engineering practices. The decision is whether to scale this to the whole IT team next quarter or stay piloted.
90-day completion checklist
- Shadow AI inventory completed and reviewed with leadership
- Policy signed off by Datenschutz, Betriebsrat, Steuerberater, Geschäftsführer
- Sandbox tenant live with SSO and expiry policy
- First cohort of 8 to 15 citizen developers trained (EU AI Act Article 4)
- 5 prototypes built and evaluated
- Quality gate pipeline live with SAST and secret scan
- Citizen-app inventory in place with monthly review
- 1 to 3 production apps promoted through the gate
- Failed prototypes documented and archived
- First quarterly governance review run
- Agentic engineering pilot kicked off in IT
Tool Landscape: What to Pick When
The vibe coding tool space changes monthly. The categories below are stable; the leaders shift. As of mid-2026 these are the realistic picks for a German Mittelstand company.
Sandbox tools for citizen developers
- Microsoft Power Apps with Copilot - The default choice if you are already on Microsoft 365. Release Wave 1 2026 added autonomous AI agents to Power Apps and Copilot Studio13. Strong governance built in via Power Platform admin. Best for forms, approval flows, internal apps.
- Lovable - Web app generator with one of the strongest natural-language interfaces. Generates real React code. Self-host or cloud. Good for dashboards, internal tools, and prototypes.
- Bolt.new (StackBlitz) - Similar to Lovable, slightly more developer-leaning. Good when the citizen developer has some HTML or SQL background.
- Replit Agent - Strong full-stack agent that works inside the Replit IDE. Better for builders who want to learn over time.
- v0 by Vercel - Best at UI generation. Less complete as a full-stack tool but excellent for component-driven prototypes.
- Glide and AppSheet - Mobile-first, spreadsheet-driven. Excellent for QM checklists and field operations apps.
Tools for IT and agentic engineering
- Cursor - The market default for AI-native IDEs. Tab-complete, agent mode, multi-file edits. Solid pricing for teams.
- Claude Code - Anthropic’s CLI agent. Strong on long-running engineering tasks, refactors, and large codebases.
- OpenAI Codex (GA 2026) - The OpenAI CLI agent. Strong on JavaScript and Python ecosystems.
- GitHub Copilot Agent Mode - The default if you are already on GitHub Enterprise. Works well for incremental engineering work, less for greenfield generation.
- Self-hosted alternatives - Continue.dev, Aider, Cline. Useful when sovereignty or air-gapped environments matter.
Quality gate and governance tooling
- SAST - Snyk, Semgrep, GitHub Advanced Security, Checkmarx. Pick one, run it on every promotion request.
- Secret detection - GitGuardian, TruffleHog, GitHub native scanning.
- Dependency review - Snyk, Dependabot, FOSSA.
- Power Platform CoE Starter Kit - For Power Apps governance specifically. Free, built by Microsoft, mandatory if you scale Power Platform.
- Citizen-app inventory - Often a Power App or Notion page maintained by IT. Off-the-shelf solutions exist (Make, Pomerium) but a simple, owned table works fine.
| Pick | If you | Avoid if |
|---|---|---|
| Power Apps + Copilot | Already on Microsoft 365, want strong governance, focus on forms/flows | Need pixel-perfect custom UI or want to leave Microsoft stack |
| Lovable | Need real React apps fast, comfortable with cloud-first | Need on-premise / sovereign hosting |
| Glide / AppSheet | Building mobile-first shop floor or field apps | Need complex backend logic |
| Cursor + Claude Code | Running an agentic engineering team in IT | Have no professional developers |
| Self-hosted Continue.dev | Sovereignty and IP confidentiality are non-negotiable | Have no platform team to maintain it |
EU AI Act, GDPR, and Betriebsrat: The Compliance Layer
Vibe coding does not get a compliance pass because it is fast. If anything, it raises the bar - more apps, more builders, more places where data ends up. Three frameworks matter for the Mittelstand.
EU AI Act
- Article 4 (AI literacy) - In force since February 2025. Every company using AI must ensure adequate AI literacy among the people deploying or using AI tools. For citizen developers using vibe coding tools, that means a documented training programme21.
- Provider obligations - The AI coding tool itself (Lovable, Cursor) is the provider for AI Act purposes. Your company is the deployer of the tool, with deployer obligations. Most of those flow naturally from the three-lane model.
- Risk classification depends on output - The vibe coded app itself is rarely a regulated AI system. But if it makes high-risk decisions (employment, credit, safety) the high-risk classification applies and the standard high-risk SDLC kicks in. This is where citizen development must hand off to the Critical lane.
- Penalties - Up to 35 million EUR or 7 percent of global turnover for prohibited practices, up to 15 million EUR or 3 percent for other major violations22. SMEs get lower caps and access to regulatory sandboxes.
GDPR (Datenschutz-Grundverordnung)
- Data minimisation in the sandbox - The sandbox tenant should never see raw production data unless the use case demands it. Synthetic data, anonymised exports, or sampled production exports are the default.
- DPA for every tool - Lovable, Cursor, Replit, OpenAI, Anthropic - every tool needs a data-processing agreement on file. Your Datenschutzbeauftragter approves the list. Tools without a DPA do not enter the sandbox tenant.
- Right to erasure - Every production app must support deletion of personal data records on request. This is a checkbox in the quality gate.
- Cross-border transfers - Most leading vibe coding tools are US-based. Standard Contractual Clauses + a transfer impact assessment are standard. Some Mittelstand companies prefer EU-only or self-hosted alternatives for sensitive use cases.
Betriebsrat
- Co-determination is real - The Betriebsrat has co-determination rights under § 87 BetrVG when AI tools handle employee performance data or behaviour. A blanket Betriebsvereinbarung covering the sandbox programme is faster than negotiating each app.
- Transparency wins - Most German Betriebsräte will sign off on a vibe coding programme that includes a public inventory, training, and clear rules on what data citizen apps can touch.
- HR-data apps need extra care - Anything handling performance reviews, attendance, productivity metrics needs a specific Betriebsvereinbarung even within the sandbox programme.
GoBD (German tax archival)
- Anything touching invoices, ledgers, or tax data - Stays out of citizen development. GoBD requires unalterable archives, full audit trails, and Steuerberater alignment. Citizen apps in the production lane can read GoBD-archived data, but the archive itself stays in the Critical lane.
- Reporting and analytics on top of GoBD data - Allowed, useful, and a great citizen-development use case. The original records are unchanged; the citizen tool sits on a downstream copy.
Compliance as enabler, not blocker
The most common compliance failure is treating it as a no-list. The right framing for the Mittelstand IT leader: compliance defines the lanes, then the lanes accelerate everything inside them. A signed Betriebsvereinbarung is what unlocks the sandbox, not what constrains it.
How Superkind Fits
Superkind builds custom AI agents for German SMEs and enterprises. We do not sell a vibe coding tool. We come in where vibe coding stops being the right answer and agentic engineering takes over - the Critical lane in the three-lane model. Here is what we typically do for a Mittelstand company adopting vibe coding seriously.
What Superkind does
- Three-lane model design - We help IT leaders draft the policy, pick the tooling, and align with Datenschutz, Betriebsrat, and Steuerberater. A standard engagement runs 4 to 6 weeks.
- Quality gate pipeline - We set up the SAST + secret-scan + manual-review pipeline that turns sandbox prototypes into production apps. Often integrated with existing GitHub or Azure DevOps.
- Agentic engineering enablement - We help IT teams adopt Cursor, Claude Code, and Codex with the workflows that actually scale. Pair coaching for 4 to 8 weeks per team.
- Production-grade agents on top of citizen apps - When a vibe-coded prototype proves valuable but needs SAP or DATEV integration, sovereignty, or autonomous action, we replace the prototype with a production agent that meets the same need at production reliability.
- Integration layer as a product - We build the read-only data products that citizen apps consume, so IT does not have to expose raw SAP or ERP credentials.
- EU AI Act readiness - Article 4 literacy programme tailored to citizen developers, technical documentation for any high-risk apps, audit trails that hold up to BNetzA scrutiny.
- Sovereignty options - For Mittelstand firms that need EU-only or self-hosted models, we deploy on Aleph Alpha, Mistral, or self-hosted open-weights.
- Sustained partnership - We stay engaged on a retainer, run quarterly reviews, and absorb the agents into the operating model.
Where we deliberately do not compete
- Selling vibe coding licences - That is the job of Lovable, Cursor, and Microsoft. Superkind helps you use them well.
- Training every employee in vibe coding - Internal HR or a specialised training partner usually does this better.
- Replacing your IT department - We make IT 5 to 10 times more effective. We do not become IT.
Superkind: Honest Pros and Cons
Strengths
- ✓ Mittelstand DNA - we work the way German SMEs work
- ✓ Process-first approach - we map the workflow before we build anything
- ✓ Outcomes, not licences - we are paid to deliver impact, not seats
- ✓ SAP, DATEV, legacy ERP fluency - real integrations, not toy demos
- ✓ EU AI Act and GDPR by design - audit-ready from day one
Honest cons
- ✗ Not a fit below 50 employees - small firms often do not need the full programme
- ✗ Not a quick licence sale - we engage for outcomes, which takes time
- ✗ We do not run vibe coding training at scale - we focus on the IT-side counterpart
- ✗ We require executive sponsorship - bottom-up engagements rarely succeed at this scope
Decision Framework: Should You Start Now?
Six signals that tell you whether vibe coding is the right move for your company in the next quarter, and what kind of move it should be.
| Signal | What it means | Action |
|---|---|---|
| Your IT backlog has 50+ small departmental requests | Classic case for vibe coding - the backlog is mostly never going to be built by IT | Run the 90-day playbook with departmental dashboards as the first cohort |
| You found shadow AI accounts in your last security review | Citizens are already vibe coding without you | Move fast - sanction the tools that are already in use, set the lanes, and bring usage above board |
| Your IT team is <1.5% of headcount | Structurally undersized for the demand | Vibe coding plus agentic engineering is the only realistic path to clearing the backlog |
| You already pay for Microsoft 365 with Copilot | You have most of the platform already | Start with Power Apps + Copilot before adding new vendors |
| You operate in a heavily regulated vertical (medtech, finance, automotive Tier 1) | The Critical lane is large and the gate matters | Lead with the gate and the policy, not with shipping. Build credibility before scaling |
| You have <30 employees and simple processes | Vibe coding programme is overkill | Skip the programme; let people use tools as needed under a one-page policy |
Acting Now vs Waiting
Acting Now
- ✓ You catch shadow AI before it scales
- ✓ The backlog clears in 6 to 12 months
- ✓ EU AI Act readiness is in place before August 2026
- ✓ Your Mittelstand peers are still figuring this out
Waiting 6 months
- ✗ Shadow AI exposure compounds quietly
- ✗ Your best people get bored and consider leaving
- ✗ Cleanup gets harder once 40+ shadow apps exist
- ✗ Compliance pressure rises before you have controls
Frequently Asked Questions
Vibe coding is the practice of building software by describing what you want in natural language and letting an AI tool generate, run, and debug the code for you. Andrej Karpathy coined the term in February 2025. The user often does not read the resulting code in detail. Tools that enable it include Cursor, Lovable, Bolt.new, Replit Agent, v0, and Microsoft Power Apps with Copilot.
Low-code and no-code platforms (Microsoft Power Apps, Mendix, OutSystems, Airtable) build apps from pre-defined visual components. Vibe coding generates real code from a free-text prompt, with no fixed component library. The line is blurring fast - Power Apps with Copilot now mixes both, and most low-code vendors are racing to add AI generation. The governance question is identical for both.
Yes, but in a controlled lane. Most Mittelstand IT teams are already losing this battle - business teams are using Cursor, ChatGPT, and Power Apps regardless. The right answer is the three-lane model: a sandbox where citizens can ship freely, a production lane with a quality gate, and a critical lane reserved for professional engineering. Banning vibe coding outright drives it underground.
Real but manageable. Independent studies consistently find that 40 to 62 percent of AI-generated code samples contain security vulnerabilities, and Gartner predicts prompt-to-app citizen development will increase software defects by 2,500 percent by 2028 if left ungoverned. The mitigation is a quality gate before any vibe-coded tool touches production data, plus DevSecOps practices for the few apps that need to scale.
A controlled vibe coding programme typically runs EUR 30,000 to 80,000 in year one. That covers a sandbox tenant, governance tooling, a quality-gate pipeline, training for citizen developers and IT champions, and a small budget to harden the 5 to 10 prototypes per year that prove valuable. Tooling licences (Cursor, Lovable, Power Platform) usually add EUR 30 to 60 per active user per month.
No. Vibe coding shifts what IT does, not whether IT exists. Citizen developers will build the long tail of internal tools that were never going to make the IT roadmap anyway. Professional engineers spend less time on CRUD apps and more time on integration, governance, security, and the agents that need real reliability. Most Mittelstand IT teams report capacity increases of 30 to 50 percent on strategic work after the shift.
Agentic engineering is what comes after vibe coding for production work. Andrej Karpathy himself moved on from vibe coding in 2026, preferring the term agentic engineering: a senior engineer orchestrates multiple AI coding agents, reviews their output, and is accountable for the result. Vibe coding raises the floor (anyone can ship something). Agentic engineering raises the ceiling (small teams ship production-grade software 10 times faster).
In most cases the AI Act applies indirectly. The vibe-coded app itself is rarely a regulated AI system, but the AI coding tool that generated it falls under provider obligations, and Article 4 of the EU AI Act obliges your company to ensure adequate AI literacy for everyone using AI tools - including citizen developers. If the resulting app makes high-risk decisions (hiring, credit, safety) the high-risk classification applies regardless of how it was built.
GDPR applies to the data, not the code. A vibe-coded HR tool that processes employee data needs the same data-processing agreements, deletion logic, and Betriebsrat involvement as any other tool. Most German Betriebsräte want to be informed early when citizen-built tools touch employee performance data. The fastest path is a written sandbox policy that the Betriebsrat signs off once and applies to every prototype.
Ghost apps are the number-one operational risk after security. Three rules work: every sandbox app expires after 90 days unless it gets renewed, production apps must have a named owner and a clear retirement path, and IT runs a quarterly inventory of citizen-built tools with usage data. Apps below a usage threshold get archived. Yes, this means deleting things people built. Do it anyway.
Yes, but the integration layer is exactly where IT must intervene. Most vibe coding tools generate REST API calls or SQL queries that look fine in isolation but break when they hit a real SAP system, especially in batch mode. The pattern that works: the citizen developer prototypes against a sandbox dataset, then IT or an agent partner like Superkind wraps the prototype with proper SAP/DATEV connectors before it goes to production.
Three roles, none new. The Citizen Development Lead sets policy, runs training, and operates the sandbox. The Quality Gate Engineer reviews apps before they enter production - usually a senior dev who likes mentoring. The Agentic Engineering Lead handles the apps that need to scale, owning the agent layer that production tools depend on. In a 200-person Mittelstand company these are typically 1.5 to 2 FTE in total.
Three numbers. First, number of internal tools shipped per quarter (baseline is usually 2 to 5; with vibe coding done well it lands at 15 to 40). Second, IT request-to-delivery time for departmental apps (target: 30 days down to 5 days). Third, the percentage of citizen-built tools still in active use after 6 months (target: above 40 percent - if it is lower, the gate is too loose). McKinsey reports that companies empowering citizen developers score 33 percent higher on innovation metrics.
Related Articles
- Shadow AI in the Mittelstand: The Governance Playbook
- AI Literacy for the Mittelstand: How to Implement Article 4 of the EU AI Act in Practice
- AI Agents vs Microsoft Copilot: When Custom Is Worth the Premium for the Mittelstand
- Which LLM Should the Mittelstand Choose? GPT, Claude, Gemini and Mistral Compared
- AI Agent Security: Prompt Injection, Data Leakage, and the OWASP LLM Top 10 for the Mittelstand
- AI Agents on Top of Legacy: How the Mittelstand Modernises Without Ripping Out the ERP
- Fix Your Processes Before You Add AI: Why AI Cannot Save a Broken Workflow
Sources
- Wikipedia - Vibe Coding (with Andrej Karpathy original tweet, 2 February 2025)
- Forrester - Secure Vibe Coding: A Paradigm, Not A Paradox (Janet Worthington, Senior Analyst)
- The New Stack - Vibe Coding Is Passé. Karpathy Has a New Name for the Future of Software
- Dealroom - Vibe Coding Was Just the Warmup. Andrej Karpathy on the Dawn of Software 3.0
- Gartner - Predicts 2026: AI Potential and Risks Emerge in Software Engineering Technologies
- Gartner - Top Cybersecurity Trends 2026 (Shadow AI Predictions)
- Gartner - 40% of Firms Hit by Shadow AI Security Incidents
- Gartner / InfoWorld - Low-Code Development Technologies Forecast to $44.5 Billion by 2026 (Jason Wong, Gartner)
- Kissflow - Citizen Development Statistics & Trends 2026 (Gartner Synthesis)
- Bitkom - Künstliche Intelligenz in Deutschland Studienbericht 2026
- Bitkom - Digitale Souveränität Studienbericht 2025
- Bitkom - IT-Mittelstandsbericht
- Microsoft - Power Platform Release Wave 1 2026 (Autonomous AI Agents)
- McKinsey - Citizen Development and Innovation Performance
- Checkmarx - Vibe Coding Security: Risks, Vulnerabilities, and Secure AI Coding
- Dark Reading - Security Concerns Shadow Vibe Coding Adoption
- Modall - Vibe Coding Security Risks: What Founders Need to Know (2026)
- ifo Institute - Skilled Worker Shortage in Germany (2025)
- DIHK - Skilled Labour Report 2025/2026
- OECD Economic Surveys: Germany 2025
- EU AI Act - Article 4: AI Literacy
- EU AI Act - Implementation Timeline
- EU AI Act - Small Businesses Guide
- NxCode - What Is Vibe Coding? Complete Guide to AI-Assisted Development (2026)
- DEV Community - What Is Vibe Coding in 2026? One Year From Karpathys Tweet
- Microsoft - What is Low-Code Governance
- Synalis - Governance bei Power Platform
- Google Cloud - What is Vibe Coding? Tools and Guides
Ready to ship the three-lane model in your company?
Book a 30-minute call with Henri. We will sketch the policy, the gate, and the first cohort plan for your team - no commitment, no sales pitch.
Book a Demo →
