| Episode | Status |
|---|---|
| Episode | Status |
|---|---|
The future has a way of showing up early to some places. In software engineering, one of those places is Cognition—the startup that made headlines in early 2024 with Devin, the world’s first autonomou...
Scott Wu, CEO of Cognition (makers of Devin), discusses the transformation of software engineering through AI agents. He argues that AGI capabilities are already here in many practical senses, and explores how programming is shifting from hands-on coding to orchestrating autonomous agents. The conversation covers the technical architecture decisions behind Devin, the trade-offs between synchronous IDE experiences versus asynchronous agents, and why reinforcement learning environments are crucial for post-training coding models.
Wu argues that by historical standards, AGI has effectively arrived—models can pass the Turing test, solve IMO/IOI problems, and interact with the real world. He challenges the notion that AGI requires automating 80% of knowledge work, pointing out that humans always focus on the remaining unautomated tasks, making it an impossible moving target.
Discussion of a child development-based AGI definition: measuring how long AI can operate autonomously without supervision. The threshold is when it becomes economically profitable to never turn AI off—similar to how children gradually require less supervision until full independence.
Wu shares his journey from running Lunch Club to founding Cognition, emphasizing the 'leave it all on the field' mentality. He describes how Cognition started as friends exploring AI coding ideas rather than deliberately building a company, and how the decision to commit came from wanting to avoid wondering 'what if.'
Wu draws parallels to calculator protests by teachers, arguing that AI won't eliminate programming but will shift it toward higher-level thinking. Engineers will focus on logical fundamentals, problem decomposition, and architecture rather than debugging Kubernetes or memorizing syntax.
Discussion of how AI removes the traditional 'hazing' period in professions where juniors spend years on boring work before graduating to interesting tasks. With AI handling boilerplate, new engineers can immediately work on meaningful problems—like an 'officer's school' approach.
Wu maps out the landscape of AI coding tools as a spectrum from synchronous (tab complete) to fully asynchronous (autonomous agents). He predicts this spectrum will exist for ~3 years before everything becomes dictation to agents, and explains how different tasks require different points on this spectrum.
Analysis of Claude Code's rapid adoption as the first mainstream tool to go 'full send' on agentic engineering. Wu argues the CLI form factor matters less than the paradigm shift of handing full control to AI, and discusses the tight coupling between model capabilities and correct interface design.
Wu explains Devin's differentiation: a persistent agent with its own cloud computer that can be onboarded like a teammate. This enables unique capabilities like autonomous testing and learning company-specific workflows, though it requires more upfront onboarding than synchronous tools.
Wu defends Cognition's decision to focus on post-training rather than pre-training base models. He argues that teaching models real-world software engineering nuances (like Datadog debugging workflows) fits naturally into post-training and represents their core competitive advantage.
Deep dive into how Cognition designs RL environments to be generalizable rather than brittle. The key is training agents to interact with the real world (Google docs, read logs, understand errors) rather than memorize specific package versions or configurations.
Wu expresses surprise that mass-market personal agents haven't emerged yet despite capabilities being ready. He reveals Cognition uses Devin internally for Amazon ordering, suggesting the agent infrastructure exists but the right consumer form factor hasn't been built.
Cognition’s CEO on What Comes After Code
Ask me anything about this podcast episode...
Try asking: