Google Cloud Next 2026 Developer Livestreams

Topic

AI Infrastructure & Performance

Covers the hardware and software infrastructure behind AI, including TPUs, GPUs, and NVIDIA technologies, with a focus on optimizing inference, deploying models, and achieving scalable, high-performance AI.

11 viral moments · 2 quotes · 28 chapters · 2 articles · 2 sessions

Viral Moments

AI Auto-Scaling Magic

Never Worry About Scaling

AI for every device

Gemma Runs Everywhere: Phone to GPU

AI delivering value?

Inference Over Training: AI's True Impact

Best of both worlds

Hybrid Inference: Local AI's Smart Router

Inference is Key

The Real Promise of AI

TensorRT LLM Hack

Boost LLM Performance

The Inference Bible

Unpacking Inference Engineering

Optimize tokens, save capacity

Boost AI Efficiency: New Skills Repository

Video AI, redefined

VEO 3.1 Light: Speed & Savings

Training vs. Inference

TPU V8's Game-Changing Split

Vera Rubin & Blackwell

Next-Gen NVIDIA Hardware

Key Quotes

“To me, what inference means is being able to actually deliver on the promise of AI applications.”

Jason Davenport, Google Cloud

Unleash AI's Full Power: Gemma 4, Baseten, NVIDIA Secrets Revealed!

“Google's the only company that has a chip, a cloud, and a model. All integrated.”

Jason Davenport, Host, Google Cloud

Acquired's Ben & David Reveal Google Cloud's Secret Weapon!

Chapters

Unleash AI's Full Power: Gemma 4, Baseten, NVIDIA Secrets Revealed!

Unleash AI Power: Build & Share No-Code Agents!

AI Security: Stop Shifting Left, Start Shifting DOWN!

Gemma 4: Run Google's AI on YOUR Phone?! The Future of Local AI is Here!

Acquired's Ben & David Reveal Google Cloud's Secret Weapon!

From Idea to App: Build AI with a Google Expert's Secrets!

Stop! Are Your AI Agents Ready for the Real World?

Unleash Your Inner Director: Google's Gen Media Stack Revealed!

AI Agents: The Shocking Truth About Your 'Naive' Data Strategy

Articles

NVIDIA and Baseten Unveil Next-Gen AI Inference Capabilities on Google Cloud

At a recent conference, NVIDIA and Baseten leaders detailed their strategic partnership with Google Cloud, focusing on groundbreaking advancements in AI inference. The collaboration promises to deliver unparalleled speed, reliability, and scalability for AI applications, leveraging next-generation hardware and sophisticated software optimizations.

Acquired's Ben Gilbert & David Rosenthal Unpack Google Cloud's AI Revolution at Next

At Google Cloud Next, Acquired podcast hosts Ben Gilbert and David Rosenthal offered a compelling analysis of Google's AI advancements and the dramatic evolution of its cloud division. Their insights highlighted a pivotal moment for artificial intelligence, marked by significant hardware innovations and strategic enterprise shifts.

Sessions

Unleash AI's Full Power: Gemma 4, Baseten, NVIDIA Secrets Revealed!

Jason Davenport · 18:23

Acquired's Ben & David Reveal Google Cloud's Secret Weapon!

Jason Davenport · 20:25
