Wednesday, February 4, 2026

Why your AI coding agent wants greater than a plan: Classes from the trenches

Shifting into AI-first improvement is a journey, and we’re all studying collectively. I need to share some bittersweet classes from my latest expertise which may prevent from hitting the identical partitions I did.

The “Secret” Everybody Is aware of

Let’s tackle the elephant within the room. By now, there are most likely one million YouTube movies titled “A Tremendous Secret Trick To Make Your Coding Agent 20x Higher.” You understand the trick, I do know the trick: create an in depth plan in a markdown file and direct the agent to execute it step-by-step.

Armed with this data, my trusted military of brokers and I have been blissful campers for a number of days of continuous AI coding. In AI phrases, that’s vital—numerous tokens, kilowatts of electrical energy, and more and more succesful brokers working in concord. It was an idyll with me being the conductor of the agentic orchestra, or if you need a hotter metaphor, my brokers being trusty golden retrievers fortunately bringing the ball again again and again.

The venture grew to 158 supply code information (not counting checks, documentation, or construct scripts). Whereas some have been tailored from a permissively licensed open supply SDK, most have been new or substantial rewrites. For a prototype, it was a substantial codebase.

When Issues Go South

Every little thing was easy crusing whereas the codebase remained small. I wasn’t meticulously reviewing each line (“I’m a skilled skilled – don’t try this at residence”, or extra appropriately, “don’t try this at work”), however the plan was strong, and the app did what it wanted to do.

However because the codebase grew, my agent hit a wall like a check automobile in a crash check. Effectively, no less than that’s the way it felt when, regardless of quite a few makes an attempt to re-prompt round or by that wall, the agent was getting nowhere. Certain, I may have dug by the code myself, however I used to be too lazy to learn and debug a bunch of “not mine” code written on frameworks I’d by no means labored in, particularly after the agent had made a number of “off-plan” modifications making an attempt to resolve the issue.

The Arduous-Received Classes

From this failure (and my previous successes), I’ve extracted helpful insights that can basically change how I method AI-driven improvement. “In it to win it.”

1. Structure-First Method

Previous method: Plan → Execute

New method: Excessive-level plan → For every module:

  • Develop module_architecture.md (defining key knowledge constructions, interfaces, management movement, and design patterns)
  • Create module_execution_plan.md
  • Execute the module plan step-by-step
  • Transfer to the following module

The important thing perception? I by no means really “mentioned” the structure with my agent. With out that shared understanding, I couldn’t absolutely belief the inspiration—a a lot larger drawback than doubting a single perform. Subsequent time, I’ll co-own each the plan and the structure doc, so I might really feel that it’s my app, even when a whole lot of the code isn’t mine.

2. Testing Requirements from Day One

I might outline my testing requirements up entrance and power the agent to observe them. EVERY STEP would require constructing new regression checks and executing the complete set of regression checks. With out it, the agent was creating random checks to debug random issues and both auto-cleaning these checks or leaving them in inconsistent locations.

3. Complete Logging Technique

I might outline my logging requirements upfront, together with verbosity ranges and a few decorators to auto-log a whole lot of stuff with out bloating the code with debug messaging. That may hold the code readable and the logs detailed.

The Payoff

With this method, I’m assured a number of good issues will occur:

  • Increased functionality ceiling: My agent would be capable to clear up the gnarly subject that received it operating in circles. With well-organized checks and logs, it’s a lot simpler to determine and clear up complicated points.
  • Higher human intervention factors: Once I have to step in, I’ll know precisely the place to look.
  • Fewer architectural issues: Having good structure would assist keep away from essentially the most vital issues. Small stuff is small by definition.

And naturally, with regards to manufacturing, there’s going to be a safety evaluation, code evaluation, and extra thorough testing.

The Funding

This isn’t a light-weight elevate; it takes effort. In conventional improvement, correct structure for important elements can simply take ⅓ of the venture timeline. It’s high-skill, high-value work – your principal architect probably earns (and is price) no less than 5 of your juniors (and that’s earlier than you begin counting the fairness…). So this isn’t free cheese.

However right here’s the important thing: this method front-loads the strategic work, performed collaboratively between you and AI, leaving the extra mundane backlog to AI alone.

Redefining Collaboration

Once I say “co-own structure,” I don’t imply you want a decade of “architecturing” expertise. I’m an engineer by coaching, a product man by coronary heart, and a enterprise man by commerce. I’m fairly rusty with regards to coding, however I’ve a eager thoughts and infinite curiosity.

When engaged on structure, I’m not alone. Every time I’ve a query, whether or not it’s about some choices to resolve the issue, or our codebase, or open-source comparables, my trusted brokers are there to run some background analysis and queries for me. This is without doubt one of the best issues to parallelise and multitask, which suggests you might be getting the largest leverage from AI.

We’re basically redefining the division of labor: people give attention to structure, requirements, and strategic selections whereas AI handles the implementation particulars inside these well-defined boundaries. That is the place we envision AI and people sooner or later – we wish AI to create jobs and assist multiply human capabilities/velocity/productiveness.

What’s Subsequent

In Half 2 (when my busy work permits for an additional deep dive session), I’ll share particular examples of how this architecture-first method solved actual issues, together with the precise templates and prompts that made the distinction. Keep tuned.

 

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles