Episode	Podcast	Published	Duration	Status

Latent Space: The AI Engineer Podcast

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE

December 26, 2025•8,363 words

Description

Note: Steve and Gene’s talk on Vibe Coding and the post IDE world was one of the top talks of AIE CODE: https://www.youtube.com/watch?v=7Dtu2bilcFs&t=1019s&pp=0gcJCU0KAYcqIYzv From building legenda...

We are here live at AI Engineer Summit with Steve Yeghi, the legendary Steve Yeghi of Stevie's Tech Talks, Stevie's Platforms brands, and most recently, Source Graph and AMP. Welcome. And most recently, Coding,

I should That's right. The Vibe Coding book.

So this is the big Vibe Coding discussion. In the pre chat, we were discussing the intersection of Vibe Coding and AI engineering. So we got the kind of movement leaders of both sides here. How do you see it?

It's absolutely a movement. Right? You gotta get people behind it. I mean, there's a I said at the end of my talk today that there's a huge backlash, and the backlash is only just brewing now. So you and I are pushing forward, right, on on these waves of, you know, AI engineering is about is about building AI enabled applications and being being in AI.

And vibe coding is about abandoning the old ways of producing software and embracing the new ways. Right? And both of these are making people pretty mad. Right?

I don't I think they're mad if their identity is tied to the way that they work today with no changes no room for changes.

Yeah. So I'll start with my first hot take.

Okay. Let's go.

There is a demographic that is the most affected by that. Their identity is the most tied up with the way that they work. Okay. It's not junior engineers. It's not non engineers.

They're all vibe coding. It's senior engineers, senior leaders, people who have so, basically, you can you can narrow it down to twelve to fifteen years of experience. They hate vibe coding, and they hate AI, and they're online going, my fifteen years is better than that AI. K? That you saw I don't know if you saw Jordan Hubbard's post from NVIDIA where he just laid out some really nice advice on how to get the most out of agents as you're coding.

And this guy posted and he's like, yeah, you know, no. You you stick with your doing your director stuff and leave the programming to programmers. Right? When you have fifteen years of experience like me, then Then you're qualified to talk. Right?

Right. So I said something to him like, I think you need to learn to read a clock. And he's like, until you have fifteen years of experience, and I'm like

Well, you got more experience than him.

Or I have forty five. So should I like go to 60 before I can talk to you, or should I like cut out thirty years of experience so I can be as dumb as you? Right? Those are my options. And so I don't know.

I guess I'll see him in fifteen years.

I I okay. I think there's one element that I'm trying to figure out of while these people have to coexist, Right? And most companies are gonna have a mix. Even OpenAI, by the way, like, we talked about this last night at dinner. Guys, OpenAI has people who don't use AI to code.

They have people who don't use codex. They probably are using cursor or something. Okay. But they're not using the agentic loops. Right?

Yeah.

Yeah. And yeah. It's so, you know, we talked to, you know, Andrew Glover there, you know, the director of DevProd. And from what he was saying, they've been planning on going public with this once they have more data about it. Yeah.

Anecdotally, they're sharing that The performance. Yeah. Difference is like 10 x by any way that you measure it. So lines of code, commits, business impact, whatever. And it's it's so stark and pronounced that the people who aren't adopting it are now 10 times less productive at performance review time.

Two people, same title, same job, and all of sudden, one of them is 10 times as productive than the other one. What do you do? And the answer is you panic. You actually go to HR and you go to legal, and you're like, what are our options here? Because the time is coming.

K? Here's another hot take. Alright? If you're still using an IDE to develop code by January 1, you're a bad engineer. Mhmm.

There's a there's a hot take for you. Right? Now you still have a, what, five, six weeks to to still be an okay engineer while you're using your IDE, but this is the time that you need to drop it and learn how agents code. Okay? Because it's a skill set.

I mean, it's so complicated. We wrote this book about it, me and Jean Kim. Because we were, you know, we were playing with it ourselves last year, and we're blogging about it and talking about it. And every blog post was 30 pages. It's like, are gonna do with 30 page blog posts?

That's long even for me. Right?

Yeah.

And at some point, I was just, man, like the skills that you gotta learn in order to get the AI to do the things that everyone's mad because it's doing them. Right? Because everybody's everybody's like, well, I tried it. I spent two hours with it, and all it produced was garbage. And the answer is, actually, you have to spend two hundred hours with it.

You have to spend two thousand hours with it. And that's not actually an exaggeration. Gene just pulled up a study that showed that you actually have to spend a year or two thousand hours with AI before you trust it. And what does trust mean? Trust in this case specifically means before you, as a user, can predict what it's going to do.

And if it's unpredictable, of course, you're gonna be mad. But as soon as you've worked at it with it for a full year to where you fully understand its capabilities and its drawbacks, which haven't really fundamentally changed, it's gotten more capable, but the edges are always the same. It hallucinates. It gets lost. It gets amnesia, dementia.

It lies to you, whatever. Right? Those skills, we've been building them for years now. Everybody who's been trying to write code with AI, we've been trying. It hasn't really worked, but it's been working better and better and better and better.

And now it's reached the point where it's working a lot better than all of the other options. Yeah? Yeah. And if you haven't tried it in two months, you're way out of date. The models are much better than two months ago.

If you haven't tried it in a year, you're a dinosaur. It's just unbelievable how bad you are. And and, you know, you may be look. I have friends who are much better engineers than I am. Okay?

I mean, world class, maybe some of the best in the whole world, okay, have built technologies that you've heard of, and they're not using AI yet, except the occasional I'll ask cursor a chat question like Wikipedia or whatever. Okay? Those people are gonna be the interns in a year.

You really think so? Yeah. With all their experience that you have.

I've had this hypothesis that has not been really confirmed with any any anecdotal evidence at all until today when I met somebody at your conference that told me about how he had been in this position. Twelve years of experience, didn't want anything to do with AI. And he met these two PhD students from somewhere in Europe. I forget where. And they were both just super hardcore vibe coders, you know, with with the the agents.

Right? And he was watching them work, and they were super junior. And they kinda didn't know what they're doing, but they just had no fear and all the ambition. And all they did was they just kept hammering on the thing going, okay. Well, why did you do it that way?

Explain it to me. Okay. Well, let's let's let's look at other options. And they would just be kind of the perfect engineer with no context. The perfect no context engineer, what questions are they gonna ask?

Have you thought about scaling? Have you thought about security? How is your test coverage? Right? I mean, engineers are gonna all ask the same questions.

Right? And he realized that engineer in a box is not too far off from knowing the right questions to ask an LLM. And that these two students were so productive with it. He was blown away that he was like, oh, no. Like, that's when the ray the light bulb went on.

He said, I have to learn this. And now he's been doing it ever since. Right? But it ain't easy. I I you're not gonna pick up Claude Coe, and you're not you're not gonna just try it and be like, it's just gonna work for it might you might get lucky.

But eventually, if you don't have the right mindset, if you don't have the right attitude going in now even with the right attitude, how often have you swear sworn at your agents in the last two days with the actual f word

or like right? I'm pretty polite. I say thank you and please.

I say thank you and please.

And then you go, what

the did you do that? Right? And it's it's because it's because Gene and I realized this after we published the book. You had this helper. They're very human like.

They come in. You have to tell them a lot of stuff, and they they they need a lot of guidance. But over time, they need less guidance. Your prompts get shorter. Things get streamlined.

They seem to get it. They're working. Now if this were a human being, you would draw the conclusion it's because they understand you and they get you and they're finally part of the freaking team. Do not make that mistake with LLMs. Never make the mistake of anthropomorphizing an LLM like Larry Ellison.

Right? The LLM at any moment can stab you in the back. Okay? It can just be like, yeah. We took care of that really hard problem.

Now I'm gonna delete your database. And you're just like, no. Right? And and it's because of that, it's weak. We call it the hot hand.

You you sorta like you're like, it's going, man. I'm feeling good. I'm feel this thing gets me. I'm gonna make it do a production change. And that's that's how I found out about this.

And it's like I was like, my script can't access prod. And so it chose to do it in the worst imaginable way. What it did was lock out the entire rest of the universe and including my live game and everything else and only allowed my scripts to access prod. And it was changing password. It changed the password.

And I was like, why did you change my password? Right? Yeah. And it's like, oh, I'm so sorry. I I definitely shouldn't have done that.

What what Noted in? K. And I'm just right? This is what will happen to you if you just you just try to do agentic coding. K?

Bad things will happen. This is what our book is about, really. Right?

Well, I mean, that's not the best ad because then what? Like,

you learn, and then eventually, you learn how the speed bumps and the corners and everything. It's like driving. Right? It's like driving. Like, you you you you're you wanna become like a NASCAR driver.

Like, this is high performance stuff. You're coding with 12 agents at a time, and you're you're more ambitious than you've ever been. I was talking to a guy today who's got got way more projects going than I'm I've got. I don't know where he gets all the time from, but he's probably doing 10 or 12, like, major projects at the same time right now. And he's just doing it all with with Agenta coating.

You know? So, I mean, like, man, the the the the the ad here is that you will turn into Batman, but you can't just grab the suit and put it on and be like, I'm Batman. You're just a cosplayer. You're cosplaying at vibe coating. You gotta learn how the tool belt works, and that's gonna be pain, suffering, and mistakes and learnings.

Now you can get a lot of it by reading this and all of the other vibe coding books. Read the O'Reilly. Watch the talk. I mean, seriously, like, you should, like, get all of the possible angles at it because it seems to land differently for different people. There'll be some analogy where you finally get it.

You know, I get it. It's like this. And and and it's like a three d printer, and nobody else thought it was like a three d printer, but somehow that was the magic that made it for you. Right?

Yeah. I would say one of the biggest surprises from the dinner yesterday was how many people all have the experience where they no longer write single lines of code. Like, they they're really just kind of prompting and and doing I going to put

that. Single lines of code, you mean they would never write any code at all?

They might edit. But, like, I think when they're writing net new, they always start with the prompts.

No editing. No touch. No editing. It is the edit is very expensive when you're like, that that identifier is misspelled and it's a local, you know. You could just add it, but it's better for you to close your IDE and probably uninstall it.

No. Actually, that's not true. Somebody finally convinced me that IDEs are fantastic. IntelliJ in particular. Keep it open.

If it's a Gradle build. Yeah. And actually, not for the LSP, although you can use it for that. Actually, that's another good way to use the LLM if you get an MCP server. But no, it's that IntelliJ's auto indexing is so much faster and incremental rebuild is so much faster than LSP.

Last night. Yeah. Yeah. So you all all you do is leave IntelliJ running, but you shouldn't look in it. It's a tool for the AI now.

Right?

Amazing. One other thing that is a big part of some of the hot things you're saying, you say Cloud Code is not it.

Cloud Code ain't

it. Explain yourself. Alright. Everyone here loves Cloud Code.

Everyone here loves Cloud Code or or AMP if you use our product, which is is just recently leapfrogged Cloud Code again because of Gemini three. AMP has this cool feature where it goes to another model.

And just to prewarm you, I also wanna talk about just Google in general and how this Gemini revolution has kinda changed Google's image. But let's talk about Cloud Code.

Sure. Cloud Code's been around since March. Cloud Code has been proven to work. And so but yet, probably 80% of the world's 90% of world's programmers are not using it or anything like it. You you get certain companies where it's really taken off, you know, but but most aren't.

The world is stuck on cursor. The word world is stuck in 2024. Last year, we were trying to get people to write with chat. Right? And we were like, we're telling and they were like, no.

Completions. And we were like, oh god. No. But it can generate the code, you just gotta paste it in. You just gotta do all this stuff.

And they were like, that sounds kinda hard. And we're like, but it's faster. And they they wouldn't do it. And then nine months later, it finally percolated in, and now they're all like, I like cursor. And it's like, that's so last year, dude.

Right? Like, wake up. And yet they haven't adopted it. And so you have to, at this point, look at it and say, why haven't they adopted it? Let's go look at the reasons.

And the answer is, it's too hard. It's too hard. You have to be able to read man, most engineers, honestly, like, to them, five paragraphs is an essay. K? And with Cloud Code, you've gotta read waterfalls of not just information, but also code and diffs.

Right? Because if you're gonna put your IDE away, you actually do have to look at the diffs. Now I'm gonna tell you that once you get some expertise at this, you can actually tell from the shape of the diffs and the color of the diffs and the length of the diffs.

The

vibe. You can tell whether it needs a code review, whether they're doing the wrong thing, whether they seem to be rerouting suspiciously too much code for this problem. Right? The diffs alone, just the shape of the diffs can tell you a lot about what's going on without actually reading the code, but you should pay attention to them. Otherwise, you'll have problems that will only crop up later.

Right? But yeah. I mean, like, put the IDE away. Okay. Clog code, and then get Clog code out and try to start using it.

Alright? And you're gonna find that it's look. I've been using Clog code, honestly, ten to twelve hours a day, literally, for for months and months and months and months, and I still curse it out all the time. I just lose my mind. I'm like, how could you have done that when you just said right?

And it's like, it's actually been shown. It's it's starting to be shown that sometimes when you put a little pressure on them, they perform better. You can break through logjams that way. But anyway, look, you're gonna run into problems and and but the thing is, next year, the tools will be better. K?

If Cloud Code's not it, what is it? Well, we gotta get back to something like an IDE. Right? I mean, that's just gonna be it's gotta be natural for people. You gotta be able to look at it and see what's going on, not have to read.

It's gotta have visual indicators. Right? And and yet it's not going to be an IDE because an IDE is very much focused on helping you write code, and that's not what you do anymore. Right? So what it's going to be is it's gonna be your agent orchestration dashboard.

It's you're gonna walk in in the morning and be like, yo. So how does Right? It things Yeah. It's like, oh, that one's still running. That one's running a tool.

That one needs my input. Okay. Right? You just go through the list. And so I'm building one.

You can go look supposed to be a private repo, but it's public, so I've got forks and shit happens. But whatever. You can play with it. It's called VC, VibeCoder. It's my v two of the VibeCoder system.

And what it does is it creates a set of scanned workflows that run the agents for you.

Yeah. I don't know if you saw anti gravity from Google the other day. We shot two days ago. So It's

so fun how much stuff people are inventing that are all your age of marriage. Yeah. Yeah. No. So look.

I called this I don't know. I called it in March. With Revenge of the Junior developer, I did that chart and everything and, like, Dario quotes it in all his customer advisory boards and everything. Right?

Really?

Yeah. Yeah. No. It was it was really pretty impactful. And and I called that what's gonna happen is the that agents I even back in March, I knew they were too hard.

I was like, what's gonna happen is they're they're you can run them programmatically, and 90% of the crap that you do with them could be handled by a model, often a cheaper model. Right? If it's just like if it's asking you, which of these two things should I do next that are equally important? Like, just have Haiku say either one. Right?

So, like, I called the orchestrators are coming, and it's taken close until, like, the end of the year to get there, which is roughly where I predicted them coming. Replit, Agent three, there's a bunch. There's there's Conductor. There's a VMAD came out open source. They're all different, you know, takes on it.

Right? And, but there's there would be more coming. I guess, Google's as well.

Right? Yes. I like this analogy that they have. It's still pretty new. So who knows what the eventual vision is?

Is that you just get notifications from your agents as they're working?

Exactly. Yeah. So in mine, in VC, there's an activity feed that was one of the first features I added, which is like, I want to go work, and I just wanna get notifications periodically of interesting stuff.

Interesting. I wonder if it'll have, like, social networks of agents. Well, so the agents each other, following each other.

Well, so I just had three hour coffee with Jeffrey Emanuel, who's who's he did the MCP agent mail. He's one of the smartest people I've ever met in my life. He's the one that wrote the article that crashed the stark market about NVIDIA. Oh. That Jeffrey Emanuel, the one that an incredibly well written article that said this is why it's a bubble, the whole market went, and Karpathy started following.

It's back up. He wrote what you just said. He said it is back up. But he wrote agent mail, which is he was just tired of having to copy stuff between his agents. Like, you tell me what to tell this agent.

And so he made a little little, like, I don't know, HTTP server that's like an inbox for them, a messaging, and they talk to each other now. And now he goes, coordinate amongst yourselves to paralyze this task, this epic that I just put together or whatever, and they'll do it. Some people are coming at it top down and trying to build orchestrators that do it all for you. But interestingly, beads, right, which is the issue tracker session thing that I that I made,

plus his Purely vibe coded, by will. Yes.

Purely vibe coded. Yes. So, I mean, like, I get PRs every day for horrible problems that I introduced, but nobody seems to mind because we've got stable versions now. So Beads is like living proof that you never actually have to look at the code as long as you and other people are asking the right questions and having the AI look at the code. I get PRs from people all the time where it's obvious that the AI did all of the analysis and all of the coding, and I look at it.

And and sometimes I'll just be like, so my AI, what do you think of their AI's PR? Right? And you saw summarization.

I mean, isn't that bad?

Don't you want It's bad if your code if look. It's all about the outcome. VEEDS is working, and it's got tens of thousands of very happy people use it. So, obviously, it's not bad.

I I may wanna If you do

this to your your company's production website and bring it down, then, yeah, it's bad.

But still, Beads is kind of a database, you know, and database is one of the harder things to make.

You know, Beads is really weird. The architecture is really weird. And the only reason it works is because it wouldn't have worked in the old days. It would have been just too hard to manage and and and not programmatically. But what you do is you tell the AI, go fix it all up.

And whenever it's corrupted or there's a merge conflict or just just fix it. And it's funny because Jeffrey Emanuel, who did the mail, basically did the same thing. He has all his agents running the same directory, and they do file reservations. They're like, I need that file. Man, I used to do that Accenture in the nineties.

Right? I'd, like, run over to a dude's cubicle and be like, I need that file. Their revision control was so bad. So, like, he's got a file reservation system going, but but it all what happened was as soon as he put it in place, his his agents just started working. And now he's got this little village of agents.

Right? And that's that's where we're headed. So the orchestrators are gonna be about not keeping the agent on the rails, but keeping all of your agents on the rails and communicating with each other.

Yeah.

And then you hit the wall. Boom. Does anybody know what the wall is once you get past all this? Merge. Merging is the it's the wall that everyone is hitting right now.

Yeah. I think the company that's best poised to solve it is Graphite. I was gonna go talk to him about it.

The code They'd be happy to talk to you. Yeah.

Yeah. I think everybody needs to solve it. And if you're at an enterprise, like, what we hear because Gene Kim and I talk we talk to companies all the I'm a SaaS seller solely in the Sourcegraph. So we get to hear the the inside story from all these big companies. Right?

And they're saying, yeah. As soon as you get to the point where, like, every developer is 10 times as productive, merging their code becomes this incredibly complicated problem because I you and I work at the same time for two or three hours. We make, you know, 30,000 line change each. Mine makes it in first, and it and it gets merged. And then you come along, and I have literally changed our logging system and our, like, you know, our architecture here and APIs that you are using.

Yeah. And so it's not gonna be as simple it's not as simple, let's let's fix the merge conflicts. It's like you're gonna have to re envision and reimagine and reimplement your change on my change.

Or rip yours out.

Or rip mine out and make me do it. But ultimately, ours are just the AIs doing it. Right? Right. But the the important thing is that they have to be serialized.

It is a queue. And that when they go in there, they have to actually, like, basically redo what they were doing on top of the new thing. This is Nobody has solved this, and it is a huge obstacle right now. You know what one company did? Sorry.

Last thing. Yeah. One company said, here's our solution. One engineer per repo.

Not making that up. It's a solution. It's a solution for now. The the classic solution for this is stack diffs. Right?

Merge queues, stack diffs that

I don't know about stack diffs, so I guess I'm dumb.

It's a it's like a Facebook concept that they're trying to bring into the wider world. GitHub is working adding it. I didn't just talk to Jared Palmer there. Basically, I I'm hearing no solution yet, but you should be aware of it and design around it. Yeah.

I mean, it's the old fashioned way of just hammering through it really hard and

Well, also, you know, you could just talk to the other guy and say like, hey, I'm doing this, you know, pretty deep architectural change. Let me go first and let's let's agree on the overall pattern first.

So yeah. I mean, I've run into this situation a few times where I've actually tried to give this agent the heads up that this one's making a change that affects this one. Yeah. With the mail thing that Jeffrey did, I think once I get it wired up because he doesn't use WorkTrees, and I'm going to

Yeah.

This. But once once they can actually talk to each other, I think it's gonna be as simple as just keep in mind that that agent's working on something that affects you. You might wanna go talk to them about it.

Yeah. And agree on the overall, like, fundamental Ifrah.

And they're quite good at it. I they just Yeah. What? It's because they have no ego. They're not like, oh, it's gotta be me.

Right. So just whoever's first gets to be a leader.

Great. What do you and him disagree on? Me and who? Jeffrey.

Emmanuel, the guy that I just met? Well, we so we foundationally, fundamentally disagree that having 12 agents work in a single repo clone is a good idea.

So you're on the pro side?

I'm on the pro lots of like, either Git work trees with lots of branches or separate repo clones.

I would imagine he He's

them sandboxed. He's in favor. He's got them all in the same they're all Oh, I they're literally they're using the same Git, the same build. So one of them will be, like, doing a build believe, like, need to run a test.

Yeah. That's much churn.

Yeah. But he has a file reservation system. So the funny thing is, k, I was like, this is insanity. And he's talking me into at least acknowledging that it probably works pretty well if you're a solo dev and you're new using no no more than a dozen or 20 agents because it is actually working for him. And he uses the same principle that Beads does, which is it wouldn't have worked in the old days.

It doesn't make any sense to a real engineer. And yet you tell the AI, if anything gets messed up, just fix it, and they will. And so that's right? That's why his thing works because every once in a while, the file reservation gets screwed up, and they're like, hey. We need to resolve this, and they figure it out.

Interesting. Yeah. Interesting. I I some people have proposed that the theme of this conference next year is on multi agents.

Oh, yeah. I mean, yeah, of course. Yeah. Yeah. I mean, AI will be about multi agent.

Look, we're in this phase still where we're we're cutting down corn with scythes with our hands. That's what a a real programmer does these days. We're moving next year. It's very clear. We're moving to, you know, these these machines that churn, you know, these giant just like those ones that you see on the farms today, factory farms.

We're gonna be factory farming coat. K? And that absolutely, like, a lot of people are just so dead set against that philosophically, morally, ethically, whatever. They're just like

They're so used to subsistence agriculture that we're not we're not used to, like, the big They wanna be

John Deere. But we are we are actually moving into the John Deere era of coding. That's amazing. Yeah.

But the funny thing analogy, actually.

And I just thought of it too. We'll we'll have to reuse it. Yeah. But it's been it's been it's been growing on me. It's the it's the whole it's this idea that Claude Code and AMP and Codex, you know, Klein, we love them all equally.

They're all equally bad. I said in my talk today, they're like they're like a a power saw or a power drill. A a skilled craftsman can do a lot of good with them, and then you can also cut your foot off with them. The same thing's true with Claude Code. But imagine a big machine, a big farming machine that knows how to run Cloud Code and scrub it.

Right? Almost it's like it's like, okay. You plan. You implement. You review.

You test. Right? And you split it all up, and now you got yourself factory farming. Right? It works.

People are building it. It's gonna happen. And what it's gonna do is it's gonna it's already started to unlock programming for nonprogrammers, and this is completely turning companies upside down. They're starting to realize that maybe the ideal time team size is, like, two or three. Yep.

And I mean, like, right? The whole way that companies are run, the whole governance structure is gonna change because now coding is no longer the bottleneck. The business needs to get immediately involved. This feedback loops get faster, and it's really exciting times. But it's too much for a lot of people, and they just they're they're they're, like, checking out or they're they're revolting online.

And I predict that as these capabilities improve and as we get closer and closer to the factory farming of code, we will see a massive backlash from the Luddites. You

are the one of the few people I can ask this as a I know a lot a lot of people in our audience are critical of going the full hog with this.

Yes.

So a lot like, they're like, fine for front end, fine for application code, but don't touch my cloud infra. Don't touch my my back end, my distributed microservices.

Definitely don't touch anything production. Only touch code. Only use these things when Git is your backstop for starters. K? So keep prod out.

It's gonna be real tempting to write, but don't. If you have git as your backstop, why should you be worried?

True. Except, I guess, people have the perception that it is less good at back end code.

Oh, this is the problem where everybody's bad at math.

Yeah.

Okay. So how good was Chad GPT 3.5 at systems code? Pretty bad. How long ago was that? K.

Two years ago. People people think the honestly, I believe that the misunderstanding here is rooted in a fundamental belief that the models are done getting smarter. Right. And the funny thing is they could be done getting smarter. They're not, but they could be, and we would still be over the hump where we've discovered electricity and now we need to harness it.

Yeah. We will still get to factory farming code with today's models capabilities, and it will get there fast. We'll get there by summer. But the models are getting smarter so fast. You know, it's really there's there's this interesting tension of, you know, like, you're building tools for capabilities that the models will eventually have built into their brains.

Yeah. And so you won't need that capability in the tool anymore. And so there's this constant arms race and decay of your tool filling gaps for the model until the model's good enough to fill it itself, and then your tool moves on. Yeah. That's what I mean.

Becoming all code and all tools are becoming throwaway. Yeah. Like Which is

great because they're easier to build too.

Yeah. By the way, yes. So remember Joel Spolsky, one of the greatest, you know, of our of our our generation, our time, one of greatest writers and thinkers. He gave the best tech talk I've ever seen, and I wanna get him to come and revive it. He gave it at Amazon twenty years ago.

It's still relevant. He's invited here?

Great. So Joel Spolsky, a long time ago, wrote something that was timeless until today. So it was twenty years timeless, which was

Never rewrite your code.

Never rewrite your code. And now we've discovered that it is for a larger and larger and larger class of piece of bodies of code, it is better to just start over and rewrite it from scratch than it is to try to fix it. The LLM will do a better job. I first noticed this when I was trying to port all of my unit tests from one architecture to another. And eventually, was just, oh, just the iteration because they're trying to fix.

So there's a lot to keep in but instead, if you say, throw all the tests out and make them again, it just goes, and you're done. Right? And so it's like, well, what about this library I gotta refactor? And so it's creeping up, but we're moving into a world where the fastest thing to do is just build new code that does a better job of what the old code was trying to do. Yeah.

I mean, it's like we're unlearning everything. I feel like enough upside down land, but this is it's like we've entered quantum mechanics, but you have to you have to embrace this new world.

I love the energy and the credibility that you bring because a young kid could say what you're saying and not be as believable. But you're coming from the perspective of you've been a huge

I've been doing this a long You've been

a game programmer. You've been everything.

Yeah. I've done I've done assembly language for five years, you know? Yeah. Operating systems in assembly language. And it was eighty eighty eighty eighty eighty six, not even f 80 x 86.

God, we had eight bit registers. I've done it all. And, you know, the game program game programming teaches you everything. Yeah. And then, of course, I've done platforms and Google and ads and this and that.

You know, the agentic loop and the game programming loops share a lot in common. They do. Resource sharing.

They do.

Operating system loops as well.

I feel like I'm building the same systems over and over again now. Yeah.

It is it's it's there's only, we're cursed to reinvent the same designs in every new domain. It's a privilege too. You know? One thing I wanted to get you to comment on is Google.

Just Google. One of

my favorite memories, which is, like, just before you retired was talking about how Google still doesn't get it, Google Cloud in particular, how they shut

down The deprecation policy.

The deprecation policy.

I'm so mad about that. You gotta get me pretty mad to write a blog.

They seem have they turned it around?

No. I talked to some people there, and a lot of them were like, yeah. That's not a thing for Google. And it's funny because, you know, Amazon Amazon not on the platform, not on the deprecation stuff, not on the important stuff. Google has turned it around on on execution.

Yeah. They finally did the thing that they should have done, you know, fifteen years ago, which is hold hold people accountable, and it's not just engineers do whatever they want all the time, which is what it was for twenty years. And it actually worked pretty well because they had a monopoly on ads, and they could afford to subsidize Google engineers doing whatever they wanted for all. But, you know, ultimately, they had to do the right thing and grow up and mature as an organization. It was painful, and they lost some Google culture, and it's not as fun anymore.

But they now execute well, and they did the right thing for the company. And now with Gemini, you can see now they've they've been shifting their focus gradually towards more AI, AI, and now it's starting to pay off for them. Yeah. And maybe they're gonna be the big big winners.

Do you have observations of a similar kind with all the other labs? You know, I I'm just kinda curious in your takes on one of my favorite charts is is that old chart where you had Microsoft, like, all pointing guns at each other.

Yeah.

Facebook, everyone's a thing.

Person to ask me this. I remember that chart. That was funny.

Yeah. It's just feel like someone could do that for OpenAI.

They could. They could. You know, it's an interesting question. All three of those companies, Google, Anthropic, and OpenAI are an unbelievably chaotic internally right now.

Yeah.

Chaos. K? Anthropic hides it really well.

They seem

they seem They seem like they've got their ass together. So what what what that means is their product managers formed a wall around that chaos. And bravo, Anthropic product managers. But it is and it's not because Anthropic's screwing up. It's because it's an inevitable function of growing that fast.

They're hiring, like, a 100 plus people for Cloud Code in the next, I don't know, month. I mean, like, they're they're going wild, and that's just Cloud Code. You're not gonna I mean, I was at Google and Amazon when they were in the big, big, fast, phases, and you're just gonna have chaos. You're gonna have churn. Nobody knows who to talk to what, and everything's crazy.

Eventually, it starts to smooth out, settle out, and they'll get there. Right? OpenAI is chaotic more like in a well, they had a lot of exits. Right? You know?

I don't know if it was chaotic as, say, GitHub, lost most of their senior leadership and was just complete turmoil for years, but they're pretty chaotic at OpenAI. Right? And then Google, you know, I we were just talking to somebody today that was saying it was just still too hard to, like, get consensus across groups to with the Jules Jules team. Yeah. They can't get it rolled out internally because Google is so siloed.

It's a a billion monoliths. Right? Little little little apps that don't talk to each other that it's hard to roll anything out across Google. So all three of them have execution problems right now. I think Anthropic's probably executing a little bit better than the other two, but it's real close race.

And, yeah, it'll be interesting to see and and see see if Oracle or Facebook or any of the others can catch up. Right? Meta?

Facebook will be the most interesting thing. I mean, they'll have to do something huge next year.

Next year could be the year of open source models.

Yeah.

If well So look. As soon as open source models get to the point where they're as good as CloudSonnet three seven was, then you turn on Klein or something, and you've got something that as good as Cloud Code was in March, which wasn't as good as today, and it's not good. But it's good enough, And you're running it for free. Free. Free.

Free on your local m four or whatever. Right? So, yeah, I have and and from what I've heard, they they're seven months behind, and that that gap is gradually narrowing the frontier models, which means LSS models will be as good as Gemini three next summer.

Right.

So, yeah, next year could very much be the year. That means the tools are gonna have to get much, much better at decomposing the task and assigning them to the right model, the right size of model for cost optimization.

I'll represent the critical side, which is that the reason they're converging is because they're saturating. Right? There there's only there's you can only ever hit 100, and the closer you get to 100, proportionally, it'll just get harder and harder. Right? So, obviously, the rate of change when you're lower down is is higher as compared to when you're already saturating.

But that's a minor technical point.

Well, no. I mean, it's not it's not minor at all. It's actually a foundational question, which is, is the line of AI intelligence gonna go straight, or is it going exponentially, or is it actually starting to peak? Asymptotic. Yeah.

Yeah. And, you know, from what we've heard from people who are very, very close to the research, we know that AI has been getting, what is it, four times smarter every 18 months for the last, I don't know, thirty years because of Moore's Law. And they think that there's enough data left, training data, for two more cycles of that before they don't know what happens. Yeah. Maybe it goes up more or maybe it goes We don't know.

Human history ends.

But two more cycles means they're gonna be 16 times smarter in three years. Right? I can see. So well, I don't even know what that means. Well, I've spent a long time trying to figure out what it means.

But what it means is they're gonna be really, really, really smart, and it's gonna change the world probably in a lot of good ways and a lot of bad ways. And yeah. I don't know if you

have this version of this conversation. People ask me if their kids should learn a code. Because Kids should learn to vibe code. You like, you have the escape hatch of you can read the code if you want to. You don't just don't need to most of the time.

But you can, and it's a good guard.

Right. But I don't because Yeah. You don't have to.

Well, I think my take is whatever it is, you'll be better off if you do also know how to code because you can prompt better. Right? You can because you can tell you can communicate Oh. More precise terms.

Look. When you when I say you say you know how to code, not the syntax and stuff, but you have to know, like, in a language neutral way, what the capabilities of languages are. Functions and classes and objects and, I don't know, monads, whatever it is, the whole superset. You should be aware of them. And then from there up so you've you've you've cut off all the syntax.

You don't care how to write it anymore, but you care how it works. So you've sort of reached the level of how a product manager thinks about things architecturally. Right? And you need to be that product manager, and now you're starting to move your concerns up up, and you need to know all the engineering stuff. And like Jeffrey Emanuel, like I was talking about, he's a mathematician, self taught engineer.

He doesn't. But he he's learned all of the right concepts. You know you know, Cloudflare does this and and Apache Cassandra does that. Yeah.

That is still technical. Yeah. That doesn't go away.

You still need to learn all that. Right? And so just because you don't have to write code anymore doesn't mean you have to you still have to learn a massive amount of stuff to be an effective engineer in the new world because that's the level that you're interacting with them at.

Amazing. So this has been a a great overview. I don't know if you have any other sort of rants in you that you wanna sort of get out there. I'll I'll I'll leave you the floor.

I feel like the gossip rate has gone up. Like, not gossip, but the the rate of exciting announcements by engineers who have discovered new things about how to be more productive with agents. Like, for example, I just found out today not not this. I was I found out today about it's called CodeMCP or something like that where you Yeah. Instead of calling

Pretty popular project.

The agents can't call MCP very effectively because they don't have any training on tool calls. But they have plenty of training on writing code. So you tell them don't call the tool. Write code to call the tool, and they do way better with it. Right?

So it's like it's all these little learnings that we're finding. Right?

Our It's crazy that Anthropic, the creators of SAP found this. Did they? Yeah. Well, Cloudflare found it first, but then Anthropic was like, yeah. Yeah.

You guys are right.

Yeah. Wow. That's really neat.

So I think that's why I love focusing on the AI engineer because my argument is the AI engineer can uniquely take advantage of LLMs way better than everyone else.

That's true. That's That's

so much more powerful.

You could almost define an AI engineer as somebody who's mastered LLMs.

Yeah. Yeah. Not from training, but from using.

Yeah. Using. Yeah. Yeah. Yeah.

I think it's it's one of these, like, disruptor, like, strategies where, like, it's low status. It's high status to be a researcher. It's high status to train models. You don't get any respect if you're a GPT rapper. But, like, you start like, people are starting to be more productive and, like, actually develop sincere expertise in the same way that I think, like, f one car drivers don't know how to build an f one car, but they know they'll tell you everything about driving it to the

And they may know they in a sense, they know more about operating it than the people who build it, and so they have to have that conversation. Right?

Yeah. Although although if you watch the, I think, the f one movie, you you get a little sense

And they make all the movies. Is that what you said? Yeah. That's good point. It's flip flopped.

Lovely. Well, thanks so much for having for coming on. Cheers. Huge amount of your work. Your energy is very infectious, and I hope you keep doing Stevie's Tech Talks.

I'll start them up again, man. I mean, this energy is because of the AI, and it's because of vibe coding. It's it's addictive and tech. Tech is fun again.

It's just it's tech. Boring for a little bit. I know. I know. Nice.

For a while, it was like, well, Sourcegraph, like, index your your your indexes your code base, like, really, really well. You know? And, again, it's, like, so so fast. And I'm like, well, that's cool. But you know what's Yeah.

Cool. Alright. This has been fun.

Latent Space: The AI Engineer Podcast

Steve Yegge's Vibe Coding Manifesto: Why Claude Code Isn't It & What Comes After the IDE

0:00 / 0:00

View original episode →