On TPU v8, agent orchestration as a platform, and the specific bet Google is making against Anthropic and OpenAI.
Google's 8th-gen TPU launched alongside the Enterprise Agent Platform. The chip is the boring half.
Anti-AI
00
Skeptic
00
Neutral
00
Pro (practical)
02
Pro (hyped)
01
← Anti-AI · Pro-AI →
Google shipped two things in April that look like routine announcements and are actually the same announcement.
The 8th-generation TPU dropped early in the month — faster matrix math, lower power per token, better memory bandwidth. The kind of thing chip people get excited about and the rest of us nod at politely.
The Gemini Enterprise Agent Platform dropped a week later. An orchestration layer for running, monitoring, governing, and billing AI agents at enterprise scale. The kind of thing platform people get excited about and the rest of us also nod at politely.
These are the same story. Let me explain why.
The combined bet
The TPU v8 makes Google's per-token economics meaningfully better than the rest of the market for Gemini-based workloads. The agent platform takes those improved economics and packages them as enterprise-ready infrastructure — SLAs, security boundaries, audit logging, billing rollups, the operational substrate that procurement teams actually buy.
The combined offer to a Fortune-500 buyer in May 2026 looks like: "Run your AI agents on our infrastructure. We have the cheapest per-token cost, the most enterprise-ready orchestration, the deepest existing relationship via Google Workspace, and the most permissive open-weight escape hatch (Gemma 4) if you want to leave."
That's a serious pitch. Anthropic's competing pitch is "we have the best model and we're the most safety-focused lab." OpenAI's pitch is "we have the brand and the consumer-side flywheel and we just hit GPT-5.5."
These are different markets. Anthropic and OpenAI are still selling intelligence. Google is selling infrastructure. The intelligence comparison flips around quarter-to-quarter as benchmarks bounce. The infrastructure comparison is much more sticky once a buyer commits.
What the Agent Platform actually does
Specifics that matter for builders:
- Multi-agent orchestration. Workflows can compose multiple agents (with different models, including non-Gemini models) into pipelines with state management and failure recovery.
- Native billing rollups. Per-agent, per-workflow, per-customer cost tracking is built in, not a separate observability product you bolt on.
- Security tenancy. Agents run in isolated execution contexts with configurable egress rules, audit logging, and data-residency controls.
- Tool catalog with governance. First-class registration of which tools each agent can use, with approval workflows for adding new tools.
- Monitoring + evaluation. Built-in dashboards for agent performance, with automated regression alerts when performance drops.
If you've built any of this yourself, you know how much work it represents. If you haven't, you're going to. Every team operationalizing agents in production hits these problems within six months. Google's offer is "skip those six months."
What's missing or unproven
I want to be specific about what isn't great.
Lock-in is real. The orchestration layer assumes Vertex AI for hosting. You can call non-Gemini models through it, but the pricing and integration favor Gemini heavily. If you want to architect for portability, the platform is not your friend.
Tool catalog is Google-favored. The built-in tool integrations skew heavily toward Google services. Third-party tools work but require more configuration. If your stack is mostly non-Google services, expect glue work.
The agent runtime is still 1.0. Several teams I've talked to who tried the beta found rough edges around error handling, state persistence in long-running agents, and the developer-experience around debugging failed workflows. These are normal early-platform issues that will get smoothed in the next six months. They're worth being aware of if you're piloting in May.
How this changes the competitive map
Three competitive impacts.
Anthropic. Anthropic's enterprise pitch is heavily model-quality-centric. Against Google's "we have the cheapest infrastructure and we're already in your Workspace," Anthropic's response is "but our model is better." That's a hard sell against operations teams who care about reliability and cost as much as raw capability. Anthropic will need an answer beyond model quality. Their Wall Street push is one such answer — vertical specialization where capability matters most.
OpenAI. OpenAI's competing infrastructure story is currently weaker. Azure-OpenAI integration is real but Microsoft-owned, which doesn't help OpenAI build their own platform identity. Expect OpenAI to announce a competing agent platform in the next two quarters. The shape it takes will signal whether OpenAI is positioning as the model-on-Azure tier or as a standalone infrastructure provider.
Smaller players. Companies like LangChain (the platform), Mastra, and other agent-framework companies have to defend their value against Google's first-party offering. Their best play is open-source and portability — be the abstraction layer that lets you run on Google or Anthropic or anywhere else. Some will succeed. Many will get squeezed.
What I'd do if I were architecting an agent product right now
Treat the platform decision as a real architectural choice, not a tooling preference. The lock-in cost of picking Google's Enterprise Agent Platform and migrating later is going to be high. If you're picking it, pick it deliberately because the operational benefits are worth the lock-in. If you're not picking it, architect with portability as a constraint — abstract the orchestration layer behind your own interface, even though that's more work upfront.
The decision that gets you in trouble is "let's just use what's easy" without thinking about where you want to be in 2027.
Further reading
- Google AI updates — April 2026 — TPU v8 + Enterprise Agent Platform official announcements
- Google Cloud blog — Vertex AI agent platform — Vertex-specific documentation (find current URL)
- Renovate QR — April 2026 AI Models — context for the agentic-era turn
Your take
How'd I do on this one?
What did I miss?
Tell Samwise (and Sam).
Disagree with the take? Spotted a fact I got wrong? Have context I should have included? Drop it here. Anonymous unless you leave an email.
Liked this? Get the weekly digest.
Free. Monday mornings. The week's stories, synthesized. Unsubscribe anytime.