| Episode | Status |
|---|---|
Join Logan Kilpatrick and Koray Kavukcuoglu, CTO of Google DeepMind and Chief AI Architect of Google, as they discuss Gemini 3 and the state of AI! Their conversation includes the reception of Gemin...
Koray Kavukcuoglu, CTO of Google DeepMind and Chief AI Architect of Google, discusses Gemini 3's positive reception and the philosophy behind building AGI through real-world product integration. He emphasizes that innovation—not just scaling—is critical for continued progress, and that Google is co-building AGI with customers through tight integration between research and products. The conversation covers key focus areas including instruction following, internationalization, tool calling, and agentic coding capabilities, while highlighting DeepMind's evolution from a research lab to an engineering-driven organization shipping state-of-the-art models.
Discussion of Gemini 3's positive reception and the role of benchmarks in measuring progress. Koray explains that benchmarks naturally saturate as technology advances, requiring new frontiers to be defined. Real-world user feedback from products like Gemini app and AI Studio is emphasized as the most important measure of progress beyond static benchmarks.
Koray outlines the key areas where Gemini models are optimized: instruction following, internationalization, function/tool calls, and code generation. He emphasizes that tool calling and coding are force multipliers for intelligence, enabling models to integrate with existing tools and allowing creative people to become builders through vibe coding.
Koray explains his dual role as CTO of DeepMind and Chief AI Architect of Google, emphasizing that building AGI requires tight integration with products and users. Anti-gravity, AI Studio, Search AI mode, and other products serve as critical launch partners providing essential user feedback that drives model improvements.
Discussion of how Gemini 3 represents a massive Team Google effort spanning continents and thousands of contributors. The model shipped simultaneously across multiple products (AI mode, Gemini app, Anti-gravity) on day one, requiring unprecedented coordination across DeepMind and Google product teams globally.
Koray identifies agentic actions and coding as major growth areas where significant room for improvement remains. While Gemini 3 serves 90-95% of coding use cases well, there's still work to be done. The focus has evolved from pure multimodal capabilities to real-world integration through products like Anti-gravity.
Discussion of generative image/video models and their convergence with text models. Koray explains how Nano Banana and Nano Banana Pro demonstrate natural technology convergence, with architectures becoming more unified. The Pro version leverages Gemini 3 Pro architecture for complex document understanding and infographic generation.
Koray discusses the technical challenges of creating truly unified models that handle text, code, and image generation in a single checkpoint. The output space is critical for learning signals, and achieving pixel-perfect image quality while maintaining text/code performance requires significant innovation.
Koray reflects on his journey as the first deep learning researcher at DeepMind in 2012 and the organization's evolution. The team has learned to organize large-scale efforts (from 25-person papers to 2,500+ contributors), maintain research culture while adopting engineering mindset, and balance exploration with execution.
Koray emphasizes that running out of innovation is the biggest risk for Gemini, not execution or scaling. He rejects the notion that the recipe is figured out, stressing that building AGI requires continuous innovation at multiple scales—within Gemini, across DeepMind, and in Google Research.
Koray discusses the importance of team culture, trust, and humility in tackling hard scientific problems. He acknowledges Google was catching up in LLMs 2.5 years ago but has reached the leadership group through innovation. The team's ability to work together across exhausting challenges while maintaining focus on building intelligence the right way is emphasized.
Koray Kavukcuoglu: “This Is How We Are Going to Build AGI”
Ask me anything about this podcast episode...
Try asking: