Home/Google Cloud Next 2026 Developer Livestreams/Generative AI & Media

Topic

Generative AI & Media

Focus on the principles and applications of generative AI, including multimodal models, and their use in creating new content like video, music, and text-to-speech, as well as broader media automation.

17viral moments3quotes37chapters3articles5sessions

Viral Moments

Fixing audio with words

AI Self-Correction in Action

Beyond video: World Models

World Models: The Future of Creative Control

Talk your app into existence!

Yap to App: Voice-Powered Creation

AI voices, truly human

Gemini TTS: 200 Tags for Voice Emotion

Code generation explosion

Dozens of PRs Daily?!

Same dog, new scene

Unbreakable Character Consistency

Interactive AI avatars

Live Avatars: Real-Time Interaction with Gemini

AI writes your prompts!

Tap Tap Tab: AI Completes Your Ideas

Deep dive into Nano Banana

Nano Banana: Unprecedented Artistic Control

Lulu's AI-generated debut

Text to Image Magic

AI music, perfectly timed

Lyria 3 Pro: Music & Sound Effects on Demand

Instant app ideas!

AI's 'I'm Feeling Lucky' Button

LLMs evaluate images

AI Judges AI Art

Video AI, redefined

VEO 3.1 Light: Speed & Savings

AI's surprising data skill!

GenAI Outsmarts Humans in Data

Text-to-speech, but better

Multi-Voice AI Audiobooks: A Game Changer

All Google's Gen Media

Google's Gen Media Stack Explained

Key Quotes

“The amazing thing about this model is that it connects to Google search. So, it can answer you with the live data from Google search.”

Kulani Davaja, Product Marketing Manager for Gen Media

Unleash Your Inner Director: Google's Gen Media Stack Revealed!

“Gemini's really awesome at multimodality, so we're able to kind of analyze those images even with Gemini, and then fact-check a lot of these questions to make sure that everything's aligned.”

Katie Nguyen, Developer Relations Engineer

Unleash AI Creativity: Build Your Own Generative Media Agents!

“I like to have fairly long conversation about my idea, the tech stack, and then ask it to do the research if maybe, you know, my idea is not perfect or try to optimize it.”

Tomek Porozynski, Google Developer Expert

From Idea to App: Build AI with a Google Expert's Secrets!

Chapters

Unleash Your Inner Director: Google's Gen Media Stack Revealed!

Watch full session→

Unleash AI Creativity: Build Your Own Generative Media Agents!

Watch full session→

From Idea to App: Build AI with a Google Expert's Secrets!

Watch full session→

Gemma 4: Run Google's AI on YOUR Phone?! The Future of Local AI is Here!

Watch full session→

Google Cloud Next '26: The Developer Keynote Secrets You Missed!

Watch full session→

The Future of Code: Why You Won't Write It Anymore!

Watch full session→

AI Agents: The Shocking Truth About Your 'Naive' Data Strategy

Watch full session→

AI Agents: Why Tuning the 'Harness' Beats Model Weights!

Watch full session→

AI Studio's Secret Weapon: How "Vibe Coding" Is Changing Everything

Watch full session→

Stop! Are Your AI Agents Ready for the Real World?

Watch full session→

Acquired's Ben & David Reveal Google Cloud's Secret Weapon!

Watch full session→

Articles

Google Unveils Next-Gen AI Media Stack: From Pixel-Perfect Images to Live Avatars

Google is dramatically expanding its generative media capabilities, empowering creators with an integrated suite of AI models that redefine artistic control and efficiency. A recent conference session showcased the full power of Nano Banana, VEO, Lyria, and Gemini, demonstrating how these tools can transform creative workflows.

Google Cloud Engineers Unveil AI Agents That Automate Creative Media Production

At a recent Google Cloud session, Developer Relations Engineer Katie Wynn demonstrated how to build sophisticated Generative Media agents capable of automating complex creative tasks, from character design to full story production, leveraging Google's ADK and MCP frameworks.

From Concept to Cloud: Google Expert Reveals Secrets to Rapid AI App Development

In an insightful session, Google Developer Expert Tomek Wierzchowski demystifies the process of building AI applications, sharing his journey from a multi-voice audiobook concept to a deployable solution. He emphasizes practical tools and a strategic mindset for navigating the fast-evolving AI landscape.