If you happen to’ve spent any time working with AI coding brokers, you recognize the routine. You describe what you need. The agent generates code that appears proper. You run it. It breaks — or worse, it really works however solves the unsuitable downside.
This irritating sample has earned a reputation: Vibe coding. You give the AI a imprecise thought and hope it guesses accurately. For fast prototypes, that’s fantastic. For manufacturing software program, it’s an actual downside.
GitHub’s reply is Spec Equipment — a brand new open-source toolkit for spec-driven growth that gives a structured course of to carry spec-driven growth to your coding agent workflows with instruments together with GitHub Copilot, Claude Code, and Gemini CLI.
The core thought is straightforward: Write the spec first.
Specs because the Supply of Fact
For many years, code has been king. Specs served code — they have been the scaffolding we constructed after which discarded as soon as the “actual work” of coding started. We wrote PRDs to information growth, created design docs to tell implementation, and drew diagrams to visualise structure. However these have been all the time subordinate to the code itself.
Spec-Pushed Growth (SDD) flips that. As an alternative of coding first and writing docs later, you begin with a spec. This contract defines how your code ought to behave and serves because the supply of fact in your instruments and AI brokers to generate, check, and validate code. The result’s much less guesswork, fewer surprises, and higher-quality code.
Spec Equipment packages templates, a CLI, and prompts to middle work round a specification first, then a technical plan, then a set of small, testable duties that AI brokers implement.
“GitHub’s Spec Equipment alerts AI-assisted coding is shifting from prompts to sturdy, versioned specs. Distributors are competing to personal the artifact that governs intent throughout Copilot, Claude Code, and Gemini CLI,” based on Mitch Ashley, VP and follow lead for software program lifecycle engineering at The Futurum Group.
“For engineering leaders, the specification turns into the unit of governance throughout brokers and contributors. Groups treating spec-driven growth as non-obligatory will accumulate AI-generated technical debt no agent functionality can refactor away, and verification at every checkpoint can’t be deferred to the agent producing it.”
The way it Works
On the coronary heart of Spec Equipment is what GitHub calls a “structure” — a doc that captures the non-negotiable rules for a undertaking. Consider it as a everlasting guidelines file that each subsequent command references. From there, the workflow follows 4 phases: Specify, plan, duties, and implement.
Every of Spec Equipment’s seven slash instructions represents a key stage within the spec-driven workflow — from defining your undertaking’s core rules to producing the ultimate implementation. Builders work together with these instructions instantly inside their coding agent of selection.
Spec Equipment is distributed as a CLI that may create workspace setups for a variety of widespread coding assistants. As soon as that construction is about up, you work together with Spec Equipment by way of slash instructions in your coding assistant. As a result of all of its artifacts are put proper into your workspace, it’s probably the most customizable method.
Your main position is to steer; the coding agent does the majority of the writing. However steering is an energetic job. GitHub is specific about this: your job isn’t simply to immediate and approve. It’s to confirm at every checkpoint earlier than shifting to the following part.
The place it Makes the Most Sense
GitHub highlights three situations the place Spec Equipment is most useful: Greenfield tasks, function work in present techniques, and legacy modernization. In every case, the spec captures the steady “what,” whereas the plan and duties drive the versatile “how,” decreasing rework and making modifications predictable.
As an alternative of ranging from scratch with every immediate, Spec Equipment maintains a persistent understanding of your undertaking. Each AI interplay adheres to the identical structure, specs, and technical plans, making certain constant output aligned together with your undertaking’s targets. A number of builders can work with the identical AI assistant utilizing the identical undertaking context. New crew members can rapidly perceive the undertaking by studying the spec recordsdata.
That final level issues particularly for DevOps groups. Consistency throughout a shared codebase — particularly when a number of contributors are utilizing totally different AI instruments — is among the hardest issues to keep up. Spec Equipment provides everybody a standard reference level.
Nonetheless Early, However Value Watching
Not everyone seems to be bought. Gojko Adzic, a guide and creator of a number of books overlaying software program supply and specification practices, frames spec-driven growth as each a logical evolution and a possible overcorrection, warning that its construction might reintroduce a number of the rigidity agile strategies sought to flee. It’s a good concern. Heavyweight spec processes have been tried earlier than.
However Spec Equipment isn’t asking you to write down 80-page necessities paperwork. The spec is supposed to be lean and residing — one thing you replace because the undertaking evolves, not one thing you waterfall your method by means of earlier than anybody writes a line of code.
GitHub open-sourced the Spec Equipment as a result of this method is larger than anybody device or firm. The true innovation is the method.
It’s nonetheless early days for Spec Equipment, and it received’t repair the challenges of AI-assisted coding in a single day. Nevertheless it factors to the place the following wave of instruments is perhaps headed: techniques that don’t simply generate code but in addition perceive what that code is supposed to attain.
For DevOps groups already wrestling with how one can govern AI-assisted growth at scale, that’s a dialog value having now — earlier than the vibe-coded technical debt begins piling up.









