How to Build an Agent-native Product | Mike Krieger

Mike Krieger built one of the most consequential consumer apps of the last two decades as cofounder of Instagram. He is now at the frontier of determining what makes a breakout AI-native product as co-lead of Anthropic Labs. Dan Shipper talked with Krieger for Every’s AI & I about how his experience creating Instagram shapes how he thinks about building with AI, including what can be sped up and what remains stubbornly time-intensive. If you found this episode interesting, please like, subscribe, comment, and share! To hear more from Dan Shipper: Subscribe to Every: https://every.to/subscribe Follow him on X: https://twitter.com/danshipper Download Grammarly for FREE at grammarly.com Timestamps Introduction: 00:01:39 What's gotten easier—and what hasn't—about building products in the age of AI: 00:02:33 Why vibe coding creates "indoor trees": 00:05:00 How rewrites have become a normal part of the development process: 00:09:00 What "agent native" product design means: 00:11:39 How Mike's labs team is structured and the cofounder model: 00:24:27 The best signal for a product bet is someone with "break through walls" conviction: 00:29:33 Navigating enterprise customers while keeping pace with rapid AI change: 00:38:51 OpenClaw, personal agents, and the product question defining 2026: 00:40:54 Links to resources mentioned in the episode: Mike Krieger: https://x.com/mikeyk Agent-native architecture: https://every.to/guides/agent-native

Published: Published Mar 25, 2026
Uploaded: Uploaded Jun 12, 2026
File type: Podcast
Queried: 00
Source: share.transistor.fm

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:36

[00:00] The models today are good at adding features. They're not necessarily good about figuring out what to cut out of the product. You can get it to go zero, not just zero to one, but zero to end pretty quickly over the matter of hours. It's made a lot of decisions along the way. And some of the sort of intuitions you build about what are the right things to put in there, I think you build over time. I feel like that is the art and science of software design in 2026. [00:30] you [00:36] Work moves fast, and in the age of AI, the pressure isn't just to move faster. It's to make sure that what you send actually sounds like you. [00:45] From emails to proposals to stakeholder updates, generic and rush just doesn't cut it. If you've ever stared at a blank page knowing exactly what you want to say but not how to start, Grammarly fixes that. Grammarly gives you one place to think, write, and finish your work. Write where you already write. Most AI tools either take over or stay out of the way. Grammarly does neither. It helps you break the blank page, adjust your tone so a message lands right for the specific person reading it, [01:15] than 500,000 apps and sites that you're already using. It's loaded with agents built for every step of your process. And 90% of professionals say it saved them time. 93% say it helps them get more done. This is AI that works with you, not over you. In a world of generic AI, don't sound like everyone else. With Grammarly, you never will. Download Grammarly for free at grammarly.com.

1:36-3:18

[01:36] That's Grammarly.com. Mike, welcome to the show. [01:39] Great to be here. Thanks for having me on. [01:41] Great to have you. I'm super excited. For people who don't know, you are the co-founder of Instagram, and now you are at Anthropic and Anthropic Labs. I've admired your work from afar, both at Anthropic and at Instagram, for a really long time. And you're obviously at the forefront of [02:00] building products and AI. So thank you for coming on. [02:03] Absolutely. [02:04] Where should we start? Like, what we were talking about just now in the pre-production is... [02:11] what has gotten easier and what has gotten harder or stayed maybe stayed the same in product building as we've come as as the underlying substrate or the process by which we build products has changed completely. So like, tell me about your experience now versus, you know, earlier in Anthropik versus Instagram and how you think things are changing. [02:31] Yeah, I was doing the thought exercise a couple of weeks ago of, you know, [02:34] We know the Instagram story. We had another product called bourbon. We worked on that for almost a year. It wasn't working. We pivoted. We basically spent three months building what became Instagram, launched it, and then scaled it. And it's asking the question, like, what is now trivial? And what was actually inherent in that building process that doesn't, [02:51] easier. And that year, we probably could have hit some of the dead ends we had eventually hit sooner, but there was value in getting there too. We overcomplicated the product so that we then had to simplify it. I find even the models today are good at adding features. They're not necessarily good about figuring out what to cut out of the product. And that took a lot of just hitting actual real-world usage. And there was something about the process of incrementally adding things right

3:21-4:58

[03:21] it to go zero, not just zero to one, but zero to end pretty quickly over the matter of hours. But it's made a lot of decisions along the way. And yeah, you can ask it to follow up with you and then do input. But some of the sort of intuitions you build about what [03:35] are the right things to put in there. I think you build over time. And so I've been reflecting, like there haven't been a lot of breakout consumer products, even in the age of accelerated AI building. And I think part of it is because it just still takes time to sort of hone your view about what sort of intervention you want to make on the world and then build from there. Now, the actual building part, once you know what to build is of course so much easier. I had Claude basically rebuild Bourbon. It took about two hours. It was feature complete. It added filters, which Bourbon didn't have. We added those for Instagram, [04:05] and knew the eventual future of the products have decided to build that in. So I think that part feels really different. But I think there's also, you know, I remember there was a week where Kevin went off and built all the filters for Instagram V1. I went off and build like sort of the rest of the app. And, you know, sitting there, I would stay up till 4 a.m. and then sleep till noon. That's like my natural day-night cycle. And like in that process, you're making so many decisions. Like how should location work? And, you know, we've got to find a way of accelerating building while still... [04:34] sort of helping people build intuition of those decisions along the way. Because otherwise, I think you either get just get very generic products that are unlikely to break out or ones that just don't reflect some deeper intuition that you come to about your space or your product. This is great. I love this. It's making me think of two things. [04:51] One is I have this like little thing in my head that if you grow a tree,

4:58-6:39

[04:58] Without it, like with it being indoors, without it being exposed to wind, [05:03] It doesn't get as strong. [05:05] Because as it's growing, it needs all these forces pushing it back and forth in order to make a real tree. And so if you have it indoors without wind, it... [05:15] You're going to grow a tree, but it leans and it's not as strong and it's not the same thing. And I think there's something that you're saying here where because we've – [05:24] accelerated the pace of development so drastically. Um, [05:29] what would normally be this sort of incremental thing where you're doing things one at a time and then you're exposing it to users, you can actually kind of grow an entire tree indoors. And then you have this like whole thing that you're just like, [05:40] it doesn't have the same, um, [05:43] level of intuition and exposure to experience at each step that that creates a great product is that is that that i love that i i i um i love that metaphor too we you know when we were starting instagram we had this we were very into like eric reese and lean startup and the whole like yagney like you ain't gonna need it principle and um i have found and actually even one of the things i was working on in labs recently we way overbuilt for v1 before we even got to early access because [06:13] Why not add this one as well? That's like, that's a PR of work. And if you get a really good flow in cloud code, you know, you're firing things off, you're going to lunch, you're coming back, the thing is done. You're like, great, we added it. And the thing we realized was we'd created this sort of matrix of functionality that was actually quite hard to test and keep up with right before launch, or even to explain to people like they're arriving. The metaphor somebody else gave me, which I really like is the difference between sort of getting episode by episode,

6:43-8:37

[06:43] and you're like, wait, what are all these things? And who are all these people? And I already am expected to have all of this context. I think there's the same kind of feeling around developing something over time. But the tree metaphor, I think, sticks too as well. And so showing somebody the fully formed tree is also kind of a lot all at once. And I think there's definitely something there in how do you build product these days and still keep it simple. And just because you can doesn't necessarily mean that it should be in at least the first version. I'm having the same problem because... [07:12] I was literally up until 4 a.m. debugging and fixing this app that I made on the side at Every called Proof, which is an agent native collaborative marketing editor. So you can share really quick plan docs and stuff with your team or with other agents. And you have a little presence. And it's really fun. And... [07:32] This is like my second or third iteration of the full product end to end, which is really interesting that you can do now. [07:38] But the first couple of iterations, I just found myself because vibe coding is so fun. [07:44] and so addictive. [07:45] I just found myself being like, yeah, like I'll do this and I'll do this. And like, and it just created this. [07:50] monstrosity that [07:52] wasn't that good to use. And I got really inspired by, we have another product called Monologue, which I'm not sure if you've run into or not. [08:00] But I got really inspired by Monologue, which is a really simple speech text app run by Jim Naveen, who... [08:08] He's just so focused on making one simple thing work so well. And I saw how well that works in this age of just like anyone can make a product. It's like something that's super polished and just super good at what it does. And so I just basically threw out the product and started over with this very simple, like, it's just a shareable markdown link. And that then just like started growing virally inside of every, like everyone started using it all the time. And then now we launched it and it just blew up. And yeah.

8:37-9:49

[08:37] So I spent all last night, like not sleeping, trying to fix it. I'm being like, I'm too old for this shit. I can't, I can't be doing this anymore. Because it just reminded me of like being in my 20s or like being in college and like hacking on stuff and whatever, which is fun, but also exhausting. And so, yeah, I've, I've found that I've had to really modify my psychology because so much is possible. How are you dealing with that? [08:59] Yeah. And just as a brief aside on that, I remember with bourbon, our biggest mistake was adding functionality over time rather than deleting it. Right. And because, oh, you know, eight features doesn't make for good product. Maybe the ninth one will. And instead it just made for, you know, something that felt really complicated. I mean, I think a couple of things are also like part of how we're dealing with it is actually being more willing to do rewrites. You know, like classic, you know, Fred Brooks, Mythical Man Month. Like you shouldn't rewrite software because all the things that were imbued in B1, you're going to mess up. [09:29] Yeah, exactly. And that whole second system syndrome. And there are still a lot of truth to that. But one, you know, the models can help you sort of diff and basically see, did you miss anything that was in that first one? But second, it's just, it's no longer, you're not like talking about a year long rewrite that might have killed a company, like, you know, famous, like Netscape, like these are like days, probably, especially off a given source.

9:59-11:17

[09:59] And then like tore it down, done a V2 and then iterate on it from there. So it doesn't surprise me that that's become sort of part of what you've had to do as well. But it doesn't feel as painful. You're not like, oh, a year of building this thing. It's like, oh, that was last week. And then I get to do it this week and I get to cut out a lot of what was there as well. I think functionality wise and how we're dealing with it from a product development standpoint, [10:21] I think we are learning to launch earlier. And it's definitely a balance around, you know, we've grown, we have like a strong enterprise footprint, people have expectations about like what the initial version is, but not assuming that we're going to know what every connector or everything that we need to add to the product is ahead of launch, because people still will absolutely surprise us, right? We're, we have a strong contingent, contingent of, we call them ant fooders, because we're ants at Anthropic. But the only that only gets you so far before you need that, that real world contact, like take co-work. [10:51] for example, we'd been noodling on a product of that shape for a long time. And then once we decided, no, let's get this out, let's actually build the V1 that we think solves the problem in the most minimal way possible and get that out in 10 days was really a good push around, yes, there are a hundred things that V1 should or could have had, but it didn't. And at the same time, it was useful enough to prove something out there. And I'm not sure developing it for another two

11:21-13:14

[11:21] would have been building in a, the indoor tree would have been getting built. And then the second they hit real world use, it's like, actually, nobody wants to do that. They want to do this, this other piece. So I think that piece of that, again, there's like the intuitions of the original lean startup ideas are still here. It's just, they manifest at different timescale and in a different way. [11:38] I'm really curious to hear how you think about [11:41] product design and how products should work because [11:44] The, I've been, anyone at every will tell you the, the, the, the, the phrase that I use the most of the word that I use the most about the software build is it has to be agent native. So agents have to be able to like use it as anything that an agent, a user can do in the, in the app, the agent can do. There's a couple other like little principles of being agent native, but I basically stole that from. [12:08] I think that Cloud Code is the canonical thing that taught me about how that kind of product can work so well, where it's like, it's an agent, it can do anything on your computer that you can do. And it's customizable and flexible and extensible. So it's easy to start, but it can do all sorts of unexpected things that the designers didn't really like. [12:28] think about beforehand. And I think that that's such a good model for [12:33] a product development in AI. And I'm kind of curious, like, [12:37] this is just sort of what I've cribbed from watching what you guys do and then like kind of put my own spin on, but how do you think about it and how do you, [12:45] How do you talk about making products like that? [12:48] Yeah, that's so much in here. And I love the agent native write up you all did. It's like, to me, the canonical exploration of this. So thanks for putting those ideas out in a really clear way. So I think a few threads to pull on this. One is a conversation I had with somebody recently, where they said, you know, like, you all, they're a non-technical person. They're like, you are talking about like agents and all this stuff. Like, they're just like, actually, computers just work now. I always wanted computers to work and they didn't work. And now they work.

13:18-14:57

[13:18] get on the command line and brew install the thing that like nobody is going to do that but now cloud can do it for you and therefore like the computer now feels like a tool that is alongside you and i think that's that that core insight is it's more than even just adding power and functionality to new software it's also just unlocking the functionality that always should have been there or available and just felt like extremely hard um for people so that's like maybe thought number one thought two is actually comparing our products that do this well versus not i think cloud [13:48] a lot. So as an example, I was watching somebody use Claude, and they were in a project, and they had built, I think, an artifact or a new document. And they said, great, can you add this to my project knowledge? And Claude's like, [13:59] yeah, let me tell you the steps to go add it to my project knowledge. It's like, no, that should just be a thing that it can do really natively. And so I think even in that, you see a product that was a 2024 product that has been iterated on and evolved a lot, but still I don't think has been baked in from the very beginning, the idea that every single one of its primitives, it should have knowledge about and the ability to modify. And I think that's essential in products these days. I think Cloud Code is the 2025 vintage of that. And I think there's even further aspects of it [14:29] that folks are experimenting with, where that can actually sort of modify the harness itself, that starts getting to the next maybe level of that, where, you know, it's probably esoteric for most people. But even unlocking that functionality means that you don't have to sit there and be like, oh, I wish it did this a little bit differently. You know, I wish Gmail worked in this slightly different way instead of just asking it to. And I think that feels like the big next step. But even within like Cloud Code, just teaching Cloud Code about Cloud Code was a really valuable experience.

14:59-16:29

[14:59] This is now getting very circular meta, but bear with me. I loved your write-up on agent-native, and I was like, I want this as a skill. So whenever I'm prototyping something, it thinks in an agent-native way. So I had it packaged it up as a skill, and that whole process was, you know, hey, what [15:14] cloud and cloud code. I, you know, can you create a skill for this? Like, sure. I'm looking at my skill skill. I'm going to create a skill about it. I'm going to install it. I'm like, great. Is that available now? Or do I need to reload? It's an [15:24] Right. I think you need to restart it. Let me check. Yep, you do. All right, let's get it. And everything was, it has knowledge about itself and that unlocks so much capability in there as well, which maybe is like the last thread to pull on. I think all of these could be hour long conversations, which is. [15:38] I think, and one of the things that we're really thinking about in labs is how do you imbue the software that Claude builds to be more Claude aware and even just Claude agent native sort of building aware so that it even thinks to build in that way to start with, because it still won't, partially because, uh, [15:53] Decades of software is not that, right? So how do you get new software to have that principle baked in? [15:58] That's the thing I was about to ask you about. So A, I'm super honored that you read the write-up and you made a skill for it. That's amazing. And B, yeah, you're pointing to a real problem that I have found is... [16:11] I think actually cloud models are the best for this. A codex model generally is not as good. [16:17] at building an agent native because their models in general, unless you push them, they think like traditional engineers. [16:26] And that's a whole different set of...

16:29-18:24

[16:29] You know, you want to have guardrails and tests. You want to make sure that there's like one path the user can go down versus we're creating this extensible thing that's super flexible. [16:37] So yeah, how do you... [16:41] How do you architect your product to teach the models and the harness that you teach the models to think and work in this way? [16:49] Yeah, I think there's two parts to it. One is the more sort of mundane part. The second one, I think, is the one that's more sort of interesting and developing. The first one is like even just having good patterns and paradigms available to the model while it builds has been really valuable and finding the right balance of templatized to skillified, right? And like what that right balance is. But having, you know, one of the things that we like have now is a skill about [17:13] the Cloud API, which sounds super obvious. But even just having that is really valuable because you would sometimes find, you know, we'd launch a new model. It wasn't in the model's sort of innate knowledge. And then you'd get into these really funny arguments like, no, I know you made a typo, it's Sonnet 4.5. You're like, no, I know it's 4.6.7. No, no, no. So having that capability, having good templatized examples of that and skills, I think helps. But then the second part is, [17:40] What's also interesting is that class of software is just a different type of test. Like it's much harder to sort of write an end-to-end functional test around an agent-native product because part of it is that unpredictability. And so another idea we've been kicking around a lot in labs is like, [17:54] how do you increase like the sort of fidelity of the verification? The other day I had a agent native iOS app that I was working on and I was, I was having cloud interact with it. And cloud was ended up having a conversation with itself in like a chat feature in the, I was very funny watching cloud talk to cloud because it's like, somebody's pretending to like be what humans are. And this particular one was like, I was doing about like a sort of like work journal reflections. And the cloud was like, yeah, my boss is really rough on me. Like I had a hard day. And then the clouds like, Oh, I'm so sorry to hear that.

18:24-20:03

[18:24] going back and forth, but you wouldn't have written a unit test for this. And, you know, maybe it would have come up with some other emergent idea as well. So yeah, I think you just have to go much more towards, um, you know, setting up harnesses that are actually exercising as much of that agent native capability as possible because you don't exactly know what things are going to do. And things are going to end up in a weird place where Claude's going to try to do something that you wouldn't even think it was going to, um, do. And it might put your app in a, in a new state. So maybe it's circling all the way back to still like what's hard. It's like, [18:52] um, [18:53] Having the underlying architecture still be robust to that is really important, right? It's like it's agent native, but it's also able to flex in a way that you might not have anticipated, but you've got the right primitives, right? I feel like that is the art and science of software design in 2026. That's really interesting. I totally agree with you. Yeah, you wanted to have a playground within a safe environment. [19:14] safe environment. That's the only way you can have playground is if it's safe around the edges. But I think [19:19] Initially, we made the playground like way too small and constrained. [19:23] And now the models have changed. And so we can open it up a lot, but we still haven't figured out exactly like [19:29] At least I have not figured out exactly what the lines are. [19:32] Um, [19:34] And yeah, I think that there's so much here. Like one thing this is making me think of is I have this idea in the back of my head. And I'm wondering if you have a way to put this that is more succinct. It's like the unit of value in products right now is it's... [19:52] It's like proof of work or proof of use where when someone on the team submits a PR to me, I want to see, not necessarily did all the tests pass because I just assumed that it did, but like

20:03-21:36

[20:03] Send me a loom of you using it or your agent using it so I can tell is this good or not. [20:09] You know, um, [20:11] Yeah. How are you, how are you thinking about that? [20:13] Yeah, I think there's probably like three layers to that. The first one was like, Claude, prove to me that you've exercised this in some way. I've started doing that in all my promises. When it's working on a future, I'm like, and by the end, before UPR, prove to yourself and then to me that it works as intended. Find the right way of doing it. Which actually, you have to change your own sort of way you build and scaffold around saying, what is the right way to get Claude able to at least test this change succinctly rather than what it likes to do? It's like, I read the code. It looks good. I'm like, you wrote the code. [20:43] So, you know, you got to really test this thing. And then the second one is that what you described is like, you know, everything having some, you know, sort of proof around like, is it? [20:54] is it working as intended and as you intended to, because cloud is going to make, or any of these models is going to make a lot of decisions for you. And sometimes you'll, you know, I'll have engineers on the team put up a PR and I'm like, Oh, why did you choose to do this versus that? And, [21:08] Many times the answer is they didn't choose. It was just the choice the model made. And maybe it was a reasonable choice. It was probably a reasonable-ish choice, but it was like the optimal choices that fit into the paradigm. I feel like that is the, it's like, it's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through? And I was talking to an engineer yesterday and there, there, there was like, I was really, I knew you were going to ask me a lot of questions about this. So I was reviewing what Claude had done so that I wouldn't be like, I'm not sure, you know, and that's, I don't,

21:38-23:11

[21:38] But when there's one that's like, oh, I'm refactoring this system and there's going to be these new primitives. Great. Let's make sure those are good and that you've thought through how they interrelate. Because it's very easy to end up otherwise with sort of this tower of assumptions that you're not fully aware of. I had literally the same experience today because... [21:57] I [21:59] I made proof. [22:01] Totally vibe coded. [22:03] And it's growing really fast right now, but it's going down a lot. And so I've been spending the last 12 hours like trying to fix it. And so we have a little swap team internally at every that like signed up to help me fix it. And so I had to like onboard them and I was like, [22:18] "Shit, how do I explain how this code base works?" And so I had to like go back and forth with the model a bunch to be like, [22:26] Okay, help me define these terms. Help me figure out how I can explain this so I don't look like a total idiot. Because yeah, I understand some of it, but not all of it. Definitely not enough to the way that I would used to have to know. And it's a whole different... [22:43] thing to be like [22:44] Do I need to know that anymore? Is it like, where's the line now? It's hard to tell. [22:49] Which maybe gets to something else, and I haven't tried to articulate this, so bear with me as I get there. There's products that you use that feel robust underneath, and there's ones that you use that are like, it feels like it's one wrong command or click away from the whole thing, either freezing or being slow. For us at Instagram, we had Instagram direct messaging V1, and that...

23:11-24:48

[23:11] who knows if you send a message, it might or may not arrive to the other person. Like we'd like, I wrote our own like bespoke real time system. It was like, uh, you know, fell over a bunch of just, you would not trust that to send a message that you really needed somebody else to see. It was just a, you know, more of a social thing. And when we built V2, it was really important that we really hammered, like, no, like if you send a message, we're not probably going to get to WhatsApp level of like, you know, you can be in the middle of absolutely nowhere with like one bar of edge and it will probably, you know, try to still go through. Maybe that's not the bar, but [23:41] a bar of when I load messages, it feels robust. When it's sent, it's really sent. I feel like there's like a little check. That's like one small example. But I think that that is a thing that we still need to figure out how to make, you know, feel, you know, [23:55] like an essential part of shipping on anything, not just at, you know, Anthropic, but in general, like you've built this thing, does it feel like it's built on sand or does it feel robust? And the agent native part adds something totally even beyond that, which is, can I push it a little bit and is that going to fall over? Or does it feel like, great, I've got a solid trunk and yeah, you can push me in different ways, but you know, your data is safe and it's underneath here. And it's not just like one deploy away from completely falling over. [24:21] So if you're, if that's, if that's the bar, which I agree, like that's, that's where you definitely want to get to. [24:26] How have you changed... [24:29] who you hire and how your teams are structured as the models have gotten better. Because for us, for example, one of our products, Spiral, we just hired a new GM who's like, I would say he's lightly technical, but he spikes super high on product and writing sense and Spiral is a writing product. And now we can like...

24:48-26:32

[24:48] hire someone like that where a year ago we wouldn't have been able to because the coding models weren't good enough. I'm curious, like, [24:55] But the downside is [24:57] Maybe the product won't feel quite as robust if there's not someone who's like super technical in all the details. So like, how do you think about who builds products right now inside of the labs team, and how that has changed over time and how it will change? [25:11] Yeah, I love that. I think it's actually you get pulled in two directions, but they're both important. There's the sort of primitives and architectural robustness, which I think still need a sort of senior technical force. I was laughing with somebody like, I thought, you know, my skills and distributed systems were like not going to be useful anymore. But actually, those are maybe some of the most useful skills and reasoning about that. And, you know, thinking things through, like I had a long debate with Claude last week around like whether the system that I was building needed Redis or not, or could go away with just Postgres. And, you know, it was a healthy debate. [25:41] only because I was grounded and having used a lot of those technologies before. But then there's the other side of robustness, which is... [25:49] Have you just papered over all the problems with like fixes to your system prompt and additional instructions? Or have you sort of architected the actual like set of tools correctly? And so the latter is as important and probably where this GM can be really valuable and not, okay, like I'm making changes, but just like you wouldn't patch a sort of, [26:11] flakiness in your distributed system by just being like, well, just retry it in five seconds. I'm sure it'll work. Like also not doing the same thing with never, ever, you know, all caps use, you know, marked out or whatever the, the, the thing that you're trying to patch is like, they're both actually symptoms of the same thing, which is, is the underlying piece, um, robust or not. And, um, yeah.

26:32-27:56

[26:32] Claude actually, I'd say this about all the models, but I think Claude could be much better at both. It's like still a place that still needs a lot of human oversight. On the systems part, you know, [26:41] It's now able to debug production systems, which is really valuable, but architecting them in the first place, I feel like we're still benefits from somebody who's really thought these three things through or has experience. And on the prompting side, you know, if you give it a, I've seen people get into this dev loop, even internally here, like, here's the prompt, here's a mistake that the system made, iterate on the prompt, its natural tendency is to just add more things to the prompt. [27:11] They'll be like, I'm just going to remember the last thing you told me. I'm going to like short circuit it. So then rethinking, OK, is these are these actually two different tools is actually two agents that each have a smaller amount of context that then you can break apart. So back to your original question, we're hiring for people with systems expertise, even within labs, which you think of as like more zero to one prototypes. Like it's still really valuable because, again, that robustness matters. [27:33] And also just who's going to be, you know, helpful in sorting through, you know, systems permissions and provisioning and early testing. Like that stuff is still, you know, it's still hard even for cloud when it can't edit the permissions itself, which it can't for good reasons. And then on the on the robustness side, actually, we've had a lot of success pairing our product teams with our applied AI teams. Our applied AI teams are the teams that are in the field every day helping customers iterate on their prompts.

28:03-29:44

[28:03] AI powered. So how do we bring that expertise in there? Because that expertise does not sit with our software engineers today, for example. What about the in-between of like, okay, it's not the underlying architecture. It's not the prompt. It's like the UI and the flow. Who's doing that? That's a great question. Like we have found, you know, some of the people that transferred into labs were the folks like really were focused on polish on the website, but they were interested in doing something new and they bring such a different approach as well around. We had the prototype. It was, [28:31] It looked generically nice versus, oh, this feels like it's branded and it has this. That's part one. Part two is designers. Like we've had our designers move much more into a sort of split designer and builder role. Not all of them, but most of them. And a lot of our, you know, we actually don't have a lot of full-time designers on labs. But the ones that we do, I would say are writing and contributing almost as much code as the engineers on those efforts. Because they can. And again, paired correctly with the right person, we have found this almost sort of co-founder model for some of these labs initiatives. [29:01] who had the original idea maybe and they're pushing on something and then the traditional software engineer that's going to go and, you know, [29:07] make... [29:08] pave the trail sometimes behind the designer to make sure that actually works. Okay. This I want to know about. So tell me about how that team structure works. So you've got a design, is it actually usually a designer or is it just anyone that has a product idea that can kind of execute it on it in some way? [29:25] paired with a real engineer that actually can smooth out the rough edges of the trail they're leaving. [29:33] It sort of varies, but we found the one thing that was most important, sort of our gaining factor in starting up new projects. I'm curious how similar is this to every, is having somebody with extreme conviction about, if not necessarily that idea,

29:44-31:20

[29:44] too much conviction on the exact idea is probably dangerous, but at least in the problem space or the question that they're asking. And that sort of like co-founder or founder level of, I will break through walls until this thing is either proven out or dead, but I want to like go either way. Um, [29:59] When we have labs bets that we've wound down, often in the post-mortem, we're like, nobody on this team actually really thought this was like the thing. They were like, yeah, this seems reasonable. Like that's the death knell for projects, right? So that person can be a designer and a couple of the bets it is. It can also be sort of a product-minded engineer. It's rarely a pure PM. We actually only have one, currently one PM for all of labs. We're hiring more and they're sort of playing, you know, sort of a wide role. [30:29] or like a product-oriented founder. And then what we look for is, well, what skills do we need to complement with that? So, you know, because we're doing, as part of our labs process, it's actually evaluating every project every two weeks and deciding whether we double down or whether we sort of release those folks back into the broader labs pool. At any given point, there's probably somebody who can be pulled onto the project that has that infrastructural expertise or has worked with that particular internal system or has a lot of deep prompting expertise to sort of flow in and out. So I think that's also where the sort of incubator [30:59] Thank you. [30:59] style space helps because nobody's fixed on a project forever. That's really interesting. Yeah, we do it slightly different. There's some overlaps, but we do have a slightly different structure where [31:11] um, [31:11] Yeah, we have GMs or they start as entrepreneurs in residence and they become general managers when they find a product that they want to work on.

31:20-32:53

[31:20] And each product just has one person, like one person that does everything full stack. So, you know, design, engineering, marketing, all that kind of stuff, at least all the basics of that. [31:30] the shape of that GM used to be [31:33] like super technical, like, [31:36] founder background and now I think has shifted towards [31:40] at least some light technical, but like I honestly just care that you can use Claude or Codex or whatever well. [31:46] and really good product sense, really good taste for the subject area or the thing that you're trying to build. [31:55] and evidence that you can build with AI. [31:59] And then what we have is... [32:01] a shared resource layer that sort of works a little bit like an agency where we have designers and we have growth marketers and we have, you know, ops people that you can like pull in and out for various initiatives. And that seems to work pretty well. So it's like we manage all the internal agencies and then each GM is out on the edge and they pull in resources as they need it for different projects. Yeah. [32:27] But it sounds similarly like you need somebody for whom that is like the thing and they are not going to sleep until it is fully working. Yes, exactly. Like, and I've been, I've been thinking about, okay, when, when would you hire someone else to work on a product or when would you add someone else to work on a product? And it's like. [32:44] There's some point at which you can't hold the entire thing in your head. Even if you're the one pushing it forward, you can't hold the entire thing in your head. And that point used to be much smaller.

32:54-34:35

[32:54] Now it's much bigger, but there's a certain point at which like even a small feature turns itself into its own product. You know, when you first make the messaging feature inside of Instagram, it's like, yeah, I can do that in like a week or whatever. [33:07] But... [33:07] at some point that's its own product it almost needs its own team and that uh i think that line is getting uh or the number of things you can do it with one person is getting bigger but it still exists somewhere but i haven't quite figured out like how to [33:21] how to manage that or how to tell. [33:23] I love that because there's actually, I think there's the two parts to that, which is when the idea is still enough to hold into reality. [33:31] your own head or an individual person's head, adding more people actually slows the team down. And that's like a non-obvious finding that we found on labs is scaling the teams too quickly actually is a net negative because they end up spending all this time on coordination. Like, oh, you were, I was going to take, oh, but my cloud could do that. And it just ends up in this sort of piece. And you also have all those alignment conversations. Like it was important in Instagram that it was just two of us. Like it was hard enough to align the two of us like, and go like get two people on the same page. Right. The second startup I did, Artifact, [34:01] I were doing that alone for the first few months, but then we hired a team that was about eight people [34:06] It was really hard because, you know, we hadn't had product market fit yet. And so we were still iterating. And then you end up in these things where we're in a Zoom with eight people talking about what we're doing next. And you really just want to be able to sit in a room and hash it out. So I find with these labs initiatives, there's some there's some similar sort of. [34:23] aspect at play, which is you don't want to pre-scale the team too early, even if the idea is exciting, because then you just end up in this sort of like meta coordination game. But I like your framing of there is some point where either

34:35-36:08

[34:35] you know, two people really will help go on it together and there's enough sort of [34:39] context and scope where they can hold some other complex piece in their head. And then there's also the, if somebody has been spinning on the same idea for two, four weeks, somebody is injecting some other thinking and that urgency can help too. [34:51] Yeah, I think it's especially important to keep it small in AI because... [34:56] One of the things that we deal with all the time, which I'm sure you see too, is [35:00] every three to six months, you have to throw out like half your product. [35:05] And that's really hard. [35:08] to do if you have to coordinate with a lot of people. But if it's one GM who realizes, oh, shit, yeah, I got to just like throw out half of this because the models are so much better. It just makes it much easier to to like pivot in that way. [35:19] Do you see that? And how do you deal with that? How do you think about, yes, I know in three months, the code maybe or even the whole feature set, I'm going to have to really rethink about it. It feels like it changes a lot in how you think about software. [35:33] Yeah, and being willing to delete code. I think that's something the Cloud Code team has done really well, is they have sort of deleting features as a sort of imperative of people on the team. Like if this is not working properly, [35:43] Let's go unship that. And it's often when you've created something else that... [35:47] Even if it doesn't entirely supersede, it does enough of what [35:51] that other aspect does that actually makes sense to deprecate and then remove that first one. It does get harder as we get more and more enterprise focus, even with these tools, because they come to depend on it. I'll never forget, we... [36:03] One of the things I did maybe six months into when I was still chief product officer was we did a big sort of

36:09-37:41

[36:09] redesign of cloud AI and we were so proud and we shipped it and we got a bunch of kudos and then we got this really angry email for somebody who's like I just recorded 20 hours of enablement content for my company to do for cloud enterprise and I have to like redo all of it and we're like okay like you're playing at a different release cadence and of course like shipping twice a year at one of our conferences is not an option so we are going to keep moving quickly but then we've since like learned to maybe moderate how we roll it out to the enterprise side a little bit more but yeah [36:39] I'll use an example. So there's a feature in Cloud IAC called styles. It's not [36:43] widely used, but the people who use it, use it a lot. And we've talked at different points, like, yeah, the style still makes sense in the product. You know, there's other ways of accomplishing the same thing. There's custom instructions and projects now. Um, there's skills now, right. There's so many other ways of accomplishing that. Um, and I don't know how long styles will end up in the product, but I know that the last time we talked about removing it, it ended up being really load bearing for a few companies, like entire use cases, like, oh, we have our house style that the CEO personally authored and gives to every employee and that's [37:13] finding ways of doing that is also really interesting. I would, [37:16] Hope that in the long run, what we can actually do is... [37:19] come up with a system of plugins and skills such that they no longer have to live in the core product. Because I think that it's always the hardest to delete something that is the core thing that you're shipping to everybody if you don't have the story around... [37:30] Great. You still like that feature? Awesome. Like here's how you can keep using it forever in your own and keep iterating on it and make it your own, but it doesn't have to add complexity to every future person that's signing up for the first time.

37:41-39:15

[37:41] I'm curious for labs... [37:44] And then also maybe just in general, what were your thoughts for startup founders? [37:50] Your enterprise point, it brings up something that I've been thinking about a lot, which is [37:54] If you are selling to enterprise right now in AI, [37:58] you [37:59] even if the product you have right now is modern, [38:02] it will be quite outdated quite quickly. But your customers were going to want the outdated version. [38:09] But as a startup, that's like a little bit... [38:11] That feels pretty risky because... [38:14] Yeah, you're susceptible to disruption if you are optimizing for what someone at a gigantic public company will buy right now. [38:26] And I think there's a lot of startups in that category where they maybe started two or three years ago. They have a certain tech stack, they have a certain way of thinking about here's how we do AI. And then the models are so different, but their customer contracts are for this like sort of out. It's like, you know, looking at looking at copilot or whatever. It's that's the sort of vibe that happens. Like, how do you think? How do you think about that yourself and inside of Anthropic? And then how do you think founders should think about that? [38:50] Yeah, no, this is such a good question, especially because then a wave will come like being more agent native, for example. And can you adopt it within your existing paradigm? Does it require to throw everything out? And are you just stuck in that like, oh, we kind of adopted it, we kind of. [39:04] bolted it back on, um, uh, [39:07] I think a couple of things for us, what we've started doing is basically treating like this train is going to keep moving and we'll provide enterprise toggles along the way. But,

39:15-40:25

[39:15] the core of it will continue to evolve. And that's sort of the, the bet and understanding you're taking working with us. And I think that's been well-received because I think companies have also seen that, you know, things are moving so quickly that the only way they even get comfortable with a year long commitment, for example, is to believe that it will continue to evolve along the way, but then we'll provide, you know, uh, [39:34] Coworker is a great example where, you know, from day one, there was like a way to turn it off for your employees if you didn't want it, for example. And that's, I think, a reasonably good paradigm. But the other one is just as we were talking earlier, like you can actually rethink and sort of. [39:50] rewrite a lot of the stack is I think companies should be way more willing to do that. And everything is getting compressed, right? In previous cycles, it was the kind of [40:00] idea of like having to fire some of your customers who might have been, you know, really into your product for a different reason than where you're going sooner. But that was on a multi-year kind of time range thing where it was like, yes, last year's product versus not three months ago's product. It seems crazy, but I actually think that's the kind of way you have to think about it, which is you have to be willing to put out the V3 or the V4 that is a, you know, big rethink of how the existing piece worked. And then maybe have a transition period and cloud can help

40:30-42:26

[40:30] and say like, yes, this is how we think the future of this piece of knowledge work or this, you know, AI powered manufacturing is going to be. [40:38] We got to keep it moving or else, to your point, you're either going to get replaced by the next company that then rethinks it from scratch or yourself replacing it yourself. And again, it's just the same old story, but now compressed to months. [40:52] What's your take on OpenClaw? It has the flavor of something else that I, or just the thing I really like seeing when you get people to see something that was already possible, but it's now in a package where people can actually try it out. And there's some intuition, you know, around how to build on top of that. Like you started seeing that with, [41:10] you could already use these models to write code, but it kind of took some of these breakout, like low code, the replets and lovables and v0s of the world to kind of put that in there. [41:21] And it's kind of almost the purest expression of the, just give the model tools and let it kind of go forward and do it and then go forward and build it. So it was a cool, interesting moment for people to realize both the potential but also pitfalls of this, of like, oh, it did this thing I didn't mean it to, [41:40] funniest one was I'm [41:42] friend was like, I think my wife is jealous of my open claw and like, I'm talking too much to it. And it's like, people start developing like deeper, sort of like very personal relationships by just having a lot of context in these things and access to all these different tools. Um, [41:56] I think there's the open question of how do you then make it [42:00] easy and it actually goes back to our conversation around like what where do you draw that like boundary around the way you like cloud operate right if v1 was hey like these are the three tools you can use only use these tools ever and then most people's interaction with those systems was hey can you do this and you know whatever back and be like no sorry like you're gonna do it yourself to like open claw which is like pretty like the aperture is like wider than i can see oh my god it called me to keep my emails and i didn't even know it could do that yeah exactly and it's

42:30-43:49

[42:30] interesting product question. I won't say for all of 2026, because who knows where we'll be in [42:34] September, but let's call it between now and like the end of August is going to be like [42:39] What product shape exists between that and where we are in most products these days, which is, you know, you can call MCPs, but they're gated and they ask for permissions for good reasons. That is still a useful product. [42:52] without being a, you know, kind of YOLO product. And I think that, you know, [42:56] We're thinking about that question. I'm sure the other labs are as well. I'm sure there's a lot of startups thinking about that as well. I think NVIDIA put out something that was like, they're safe, open, everybody's going after this question. I think it's going to be about figuring out what is... [43:08] What is that? Either shift the paradigm completely so you can be that open, but with a lot of safeguard. That would be one approach. Or figure out some boundary to draw in which it's still powerful and it's still useful. [43:19] but it's not likely to email every single one of your contacts and go haywire. [43:26] Yeah, I think the other interesting part about it is... [43:31] Like you said, the personal nature of it. [43:34] And I know people have personal relationships with Claude, but there's this weird thing where [43:40] if I watch someone else using Claude, [43:42] I'm like, I feel like I thought a stripper liked me or something. You know, it's like, Claude thinks you're smart too or whatever, you know?

43:51-45:35

[43:51] And so there's this thing that happens when you have a claw person [43:57] that like Maya Claus R2C2 [44:00] My girlfriend's call is called Shelly. [44:03] And [44:05] There's a thing that happens where it feels like it's mine. [44:10] Like it's really mine. It has its own name. It has a personality that sort of like mirrors me. [44:15] in this way that Claude feels like it knows me. [44:19] And I like Claude, but it's not mine. [44:21] How do you think about that? [44:23] Yeah, I mean, I was having this conversation with somebody this week around like, [44:27] is the right pattern sort of single point of contact, like named, you know, [44:33] version of that? Or is it the team of agents that you're talking to? I think there's a lot to the single person that is maybe the coordinator or the delegator. And then naturally, because it becomes the agent you interact with the most, you want to imbue it with a name and a bit more personality. It ends up reflecting sometimes your personality in the case of... [44:53] all of a sudden every cliche came out. It was like, you know, the, you know, the Q or the money penny or like, you know, whatever the, or the, you know, Hal or whatever these different sort of, you know, sci-fi characters. But I think you do build that sort of, sort of, [45:08] trust and knowledge. I think there's also that sort of like Ikea effect of like currently open clause, like still pretty hard to set up. So the fact that you went through all of that and it works, you're like, I did that thing. Like I, I, I, I birthed, uh, you know, Shelly for example, and now we can, you know, interact with them as well. But I think that paradigm is really powerful. Like the, um, I think moving away, like even within my cloud code usage now, uh, one of the things I have like strongly prompted in there is like, don't do very much work yourself, like

45:38-47:21

[45:38] Because it means most of the time, the sort of run loop is available for you to talk to. And I think OpenClaw and Pi have like a similar architecture of keep the run loop open. And I think that actually makes it feel much more like somebody that you are talking to versus like a tool that you are delegating to and occasionally gets blocked for five minutes because it's doing some really complex tasks. [45:56] Yeah, I totally agree. And I've had similar debates because we're also building our like everyone, we're building our own little like open claw one click Slack implementation to see if we can we can do when that feels like ours. [46:10] And [46:11] We've had a lot of those debates about, do you want one agent? Do you want many? And one of the patterns that we found, which is kind of cool, is... [46:19] So I have an agent. [46:20] I use the agent for stuff that I do. And then people watch me use the agent for that. And they know what I'm good at. And if I'm using the agent for that stuff, they're going to trust it because they trust me. [46:32] And it's modified itself in response to me. So like I sort of transfer my trust to it and then people in the organization start using it for that. [46:39] And so you get like this almost shadow org chart. [46:43] Where when everyone has a claw, their claw becomes known for and used for the thing that they're specialized at. Their owner is specialized at New York. [46:52] Yeah, I mean, that makes a lot of sense, too. And you could think about, you know, there's a lot of interesting research questions, I think, around that, you know, I think people are experiencing visually for the first time around privacy and like what my agent knows about me versus what it discloses to other people. But I think there's the positive version of that, which is all the things that it has learned from all your interactions and how it actually brings it to bear on other problems versus the generic like, yes, it's just like everybody else's agent, except, you know, it has a name that's attached to Dan and it has like maybe some of Dan's, you know, access below the hood.

47:22-48:28

[47:22] Yeah. Well, Mike, we're out of time. This was a pleasure. I learned a lot. If people want to follow you or your work, where can they find you? [47:31] I think probably easiest is Mikey K on next. [47:34] Fuck yeah. Thanks for joining Mike. [47:37] Great to see you, Dan. [47:46] Oh my gosh, folks. You absolutely, positively have to smash that like button and subscribe to AI and I. Why? Because this show is the epitome of awesomeness. It's like finding a treasure chest in your backyard. But instead of gold, it's filled with pure, unadulterated knowledge bombs about chat GPT. [48:08] on the edge of your seat, craving for more. It's not just a show. It's a journey into the future with Dan Shipper as the captain of the spaceship. [48:17] So do yourself a favor, hit like, smash subscribe, and strap in for the ride of your life. [48:23] And now, without any further ado, let me just say, Dan, I'm absolutely hopelessly in love with you.

Want to learn more?