Home/Google Cloud Next 2026 Developer Livestreams/AI Infrastructure & Performance
Topic

AI Infrastructure & Performance

Covers the hardware and software infrastructure essential for AI, including TPUs, GPUs, and NVIDIA technologies, focusing on optimizing inference, model deployment, and achieving scalable, high-performance AI.

11viral moments2quotes28chapters2articles2sessions

Viral Moments

11

AI Auto-Scaling Magic

Never Worry About Scaling

AI for every device

Gemma Runs Everywhere: Phone to GPU

AI delivering value?

Inference Over Training: AI's True Impact

Best of both worlds

Hybrid Inference: Local AI's Smart Router

Inference is Key

The Real Promise of AI

TensorRT LLM Hack

Boost LLM Performance

The Inference Bible

Unpacking Inference Engineering

Optimize tokens, save capacity

Boost AI Efficiency: New Skills Repository

Video AI, redefined

VEO 3.1 Light: Speed & Savings

Training vs. Inference

TPU V8's Game-Changing Split

Vera Rubin & Blackwell

Next-Gen NVIDIA Hardware

Key Quotes

2

Chapters

28
Unleash AI's Full Power: Gemma 4, Baseten, NVIDIA Secrets Revealed!
Watch full sessionโ†’
Unleash AI Power: Build & Share No-Code Agents!
Watch full sessionโ†’
AI Security: Stop Shifting Left, Start Shifting DOWN!
Watch full sessionโ†’
Gemma 4: Run Google's AI on YOUR Phone?! The Future of Local AI is Here!
Watch full sessionโ†’
Acquired's Ben & David Reveal Google Cloud's Secret Weapon!
Watch full sessionโ†’
From Idea to App: Build AI with a Google Expert's Secrets!
Watch full sessionโ†’
Stop! Are Your AI Agents Ready for the Real World?
Watch full sessionโ†’
Unleash Your Inner Director: Google's Gen Media Stack Revealed!
Watch full sessionโ†’
AI Agents: The Shocking Truth About Your 'Naive' Data Strategy
Watch full sessionโ†’

Articles

2

Sessions

2

Related Topics