

Transferring into AI-first improvement is a journey, and we’re all studying collectively. I need to share some bittersweet classes from my current expertise which may prevent from hitting the identical partitions I did.
The “Secret” Everybody Is aware of
Let’s tackle the elephant within the room. By now, there are in all probability one million YouTube movies titled “A Tremendous Secret Trick To Make Your Coding Agent 20x Higher.” You recognize the trick, I do know the trick: create an in depth plan in a markdown file and direct the agent to execute it step-by-step.
Armed with this information, my trusted military of brokers and I had been completely happy campers for a number of days of continuous AI coding. In AI phrases, that’s vital—numerous tokens, kilowatts of electrical energy, and more and more succesful brokers working in concord. It was an idyll with me being the conductor of the agentic orchestra, or in order for you a hotter metaphor, my brokers being trusty golden retrievers fortunately bringing the ball again again and again.
The mission grew to 158 supply code information (not counting checks, documentation, or construct scripts). Whereas some had been tailored from a permissively licensed open supply SDK, most had been new or substantial rewrites. For a prototype, it was a substantial codebase.
When Issues Go South
All the things was clean crusing whereas the codebase remained small. I wasn’t meticulously reviewing each line (“I’m a skilled skilled – don’t try this at house”, or extra appropriately, “don’t try this at work”), however the plan was strong, and the app did what it wanted to do.
However because the codebase grew, my agent hit a wall like a check automotive in a crash check. Effectively, at the least that’s the way it felt when, regardless of quite a few makes an attempt to re-prompt round or by that wall, the agent was getting nowhere. Certain, I might have dug by the code myself, however I used to be too lazy to learn and debug a bunch of “not mine” code written on frameworks I’d by no means labored in, particularly after the agent had made a number of “off-plan” modifications attempting to unravel the issue.
The Laborious-Received Classes
From this failure (and my previous successes), I’ve extracted useful insights that may essentially change how I method AI-driven improvement. “In it to win it.”
1. Structure-First Method
Outdated manner: Plan → Execute
New manner: Excessive-level plan → For every module:
- Develop module_architecture.md (defining key information constructions, interfaces, management movement, and design patterns)
- Create module_execution_plan.md
- Execute the module plan step-by-step
- Transfer to the following module
The important thing perception? I by no means really “mentioned” the structure with my agent. With out that shared understanding, I couldn’t totally belief the inspiration—a a lot greater drawback than doubting a single operate. Subsequent time, I’ll co-own each the plan and the structure doc, so I’d really feel that it’s my app, even when a whole lot of the code isn’t mine.
2. Testing Requirements from Day One
I’d outline my testing requirements up entrance and power the agent to observe them. EVERY STEP would require constructing new regression checks and executing the total set of regression checks. With out it, the agent was creating random checks to debug random issues and both auto-cleaning these checks or leaving them in inconsistent locations.
3. Complete Logging Technique
I’d outline my logging requirements upfront, together with verbosity ranges and a few decorators to auto-log a whole lot of stuff with out bloating the code with debug messaging. That may hold the code readable and the logs detailed.
The Payoff
With this method, I’m assured a number of good issues will occur:
- Greater functionality ceiling: My agent would be capable to clear up the gnarly concern that obtained it operating in circles. With well-organized checks and logs, it’s a lot simpler to determine and clear up complicated points.
- Higher human intervention factors: After I have to step in, I’ll know precisely the place to look.
- Fewer architectural issues: Having good structure would assist keep away from probably the most vital issues. Small stuff is small by definition.
And naturally, in terms of manufacturing, there’s going to be a safety evaluation, code evaluation, and extra thorough testing.
The Funding
This isn’t a light-weight elevate; it takes effort. In conventional improvement, correct structure for essential elements can simply take ⅓ of the mission timeline. It’s high-skill, high-value work – your principal architect doubtless earns (and is price) at the least 5 of your juniors (and that’s earlier than you begin counting the fairness…). So this isn’t free cheese.
However right here’s the important thing: this method front-loads the strategic work, carried out collaboratively between you and AI, leaving the extra mundane backlog to AI alone.
Redefining Collaboration
After I say “co-own structure,” I don’t imply you want a decade of “architecturing” expertise. I’m an engineer by coaching, a product man by coronary heart, and a enterprise man by commerce. I’m fairly rusty in terms of coding, however I’ve a eager thoughts and limitless curiosity.
When engaged on structure, I’m not alone. Each time I’ve a query, whether or not it’s about some choices to unravel the issue, or our codebase, or open-source comparables, my trusted brokers are there to run some background analysis and queries for me. This is likely one of the best issues to parallelise and multitask, which implies you might be getting the largest leverage from AI.
We’re basically redefining the division of labor: people concentrate on structure, requirements, and strategic selections whereas AI handles the implementation particulars inside these well-defined boundaries. That is the place we envision AI and people sooner or later – we wish AI to create jobs and assist multiply human capabilities/velocity/productiveness.
What’s Subsequent
In Half 2 (when my busy work permits for one more deep dive session), I’ll share particular examples of how this architecture-first method solved actual issues, together with the precise templates and prompts that made the distinction. Keep tuned.
