The 24-Hour Sprint | Chris Boyd

TL;DR: Agentic coding is compressing the dev cycle into 24 hours. Here's how it looks from the inside.

I've been running engineering orgs long enough to remember when "continuous delivery" was the radical idea. Ship daily instead of quarterly. Automate the build. Trust the pipeline. It felt fast at the time.

Now I'm watching the entire sprint cycle compress into 24 hours, and honestly, the teams pulling this off look nothing like the ones I managed five years ago.

I'm one of them. I'm running 24/7 automation loops on my own projects right now - agents that work through the night, open pull requests, and have them queued for my review by morning. It's not a demo environment. It's how I'm actually shipping. The first time I sat down with a fresh batch of agent-generated PRs over coffee, the shift felt less like a productivity upgrade and more like a change in what my job actually is.

The shift nobody's CI/CD pipeline prepared you for

Here's the dirty secret about our industry's automation story so far: CI/CD only ever touched about 30% of the actual effort in software delivery. Testing, building, deploying - yes, we automated that. But everything upstream - gathering requirements, designing solutions, writing the code itself - stayed manual. Humans in meetings. Humans in Jira. Humans staring at IDEs.

Agentic coding is going after the other 70%.

McKinsey's recent piece on rewiring software delivery for the agentic era describes a shift toward a daily sprint model - humans and agents operating in a continuous 24-hour cycle that compresses what used to be two-week sprints. The productivity gains are real, but they come with a caveat worth taking seriously: this only works in organizations that already have architectural discipline, standardized workflows, and clear product vision. It's not what happens when you hand Copilot to a team with a legacy monolith and no documentation. But the directionality is real, and the teams achieving it are operating in a fundamentally different rhythm.

What the 24-hour cycle actually looks like

The model that's emerging is this: humans review, prioritize, and steer during the day. Agents execute overnight.

During working hours, engineers are reading pull requests generated by agents, validating architectural decisions, refining requirements, and setting the next batch of work in motion. When the team goes home, agents pick up the queue - writing code, running tests, generating documentation, flagging blockers for morning review.

It's not a 2-week sprint anymore. It's a daily cycle. Ship, review, steer, repeat.

This sounds elegant on a slide. In practice, it only works if the agents have enough context to make reasonable decisions autonomously. Which brings me to the part most people skip over.

Knowledge graphs as institutional memory

The biggest unlock isn't the AI model itself - it's the context layer around it.

Agents can't ask the senior engineer in Slack why that service is architected the way it is. They can't absorb tribal knowledge from a whiteboard session. They need machine-readable artifacts: architecture decision records, dependency maps, API contracts, coding standards - all structured, all current, all queryable.

The organizations making this work are building knowledge graphs as their institutional memory. Not wikis that go stale. Not Confluence pages nobody updates. Actual graph-structured representations of how their systems work, what decisions were made and why, and what constraints apply. McKinsey describes these as an AI memory layer across the SDLC - connecting customer feedback, architecture decisions, design documents, tickets, GitHub activity, and incident reports into a semantically linked system that agents can actually reason over. Questions that once required weeks of SME interviews can be answered in minutes. Every decision becomes traceable.

This becomes the agent's context window - the difference between an AI that generates plausible code and one that generates correct code for your specific system.

If you don't have this, you don't have agentic coding. You have an expensive autocomplete that creates tech debt faster than your team can review it.

The real prerequisite: architectural discipline

Let me be blunt about something. This doesn't work if your codebase is a mess.

Agentic pipelines require standardized workflows, clean interfaces, well-documented contracts, and modular architectures where agents can operate on bounded contexts without cascading side effects. McKinsey is explicit about this: the path from requirements to code must follow a standard structure so agents can reliably interpret inputs and produce predictable outputs. Requirements, standards, architectural specs, and user stories living across disconnected documents and tools isn't a minor inconvenience - it's where friction accumulates and value plateaus. The agentic model removes that friction by structuring artifacts for machine-to-machine handoffs. When it works, the pipeline runs end to end in hours, with humans intervening only at defined review gates.

The teams seeing the biggest gains already had this discipline. The AI amplified their existing rigor. It didn't create it.

If your system is a tangle of implicit dependencies, undocumented conventions, and "just ask Dave" institutional knowledge, the first step isn't adopting agentic tooling. It's doing the architectural work that makes agentic tooling possible.

The role shift that's already happening

Team structures are changing. McKinsey describes larger teams of 8-12 FTEs giving way to smaller pods of highly skilled professionals supervising agent-driven execution - compressed timelines, lower costs, or increased capacity depending on how the freed capacity gets redeployed.

This isn't "engineers replaced by AI." It's engineers operating at a higher level of abstraction. Less time writing implementation code. More time on system design, constraint definition, quality review, and strategic decisions about what to build and why.

The skillset that matters is shifting: architectural thinking, clear specification writing, the ability to review AI-generated code critically, and the judgment to know when the agent got it wrong. Production debugging doesn't go away. But the ratio of thinking to typing changes dramatically.

Where this lands

Some teams could flip to the 24-hour sprint tomorrow. Not as a moonshot - as a practical decision. If your architecture is clean, your context layer is machine-readable, and you have engineers who can steer rather than just produce, the tooling is ready. The gap is organizational will, not technical capability.

But that combination is rarer than the hype suggests. Most engineering leaders need to be honest about a few questions:

Do you have a clear product vision your team can use to assess agent-generated requirements for quality and alignment?
Is your technology environment standard and consistent enough for solutions to scale and components to be reused?
Does the path from requirements to code follow a standard structure agents can reliably interpret?
Are your core stakeholders engaged across the full value stream - or are you creating misalignment and rework at every handoff?

If the answer to most of those is no, the productivity gains everyone's talking about aren't available to you yet. The good news is that the work to get there - cleaner architecture, better documentation, standardized workflows - makes your team better regardless of whether an agent ever touches the code.

Start there.