Beyond Nvidia: Google's Custom AI Silicon Revealed
22 Apr
Summary
- Google announced two new custom AI chips for training and inference.
- These chips aim to reduce costs and improve efficiency for AI workloads.
- Google controls its AI stack end-to-end, a key competitive advantage.

In a strategic move to reduce reliance on third-party hardware, Google unveiled its eighth-generation Tensor Processing Units (TPUs) in late 2026. The custom silicon, shipping later in the year, is split into two specialized chips: TPU 8t for frontier model training and TPU 8i for agentic inference and real-time sampling.
The split, decided upon in 2024, predates the industry's broader pivot to reasoning models and agents. Google positions its vertical integration of the AI stack, from energy to services, as a key differentiator, enabling cost-per-token economics that rivals reportedly cannot match.
The TPU 8t delivers a substantial generational leap in training performance, with increased exaFLOPS, memory bandwidth, and networking speeds, and scalability potentially exceeding one million chips per cluster. The TPU 8i introduces architectural innovations, including a network topology optimized for low latency, crucial for agentic workloads, and a claimed five-fold improvement in real-time LLM sampling.
This development reframes cloud evaluations for enterprise buyers in 2026-2027. Teams focused on large-scale training will assess 8t availability and networking, while those serving agents will scrutinize 8i performance and HBM capacity. Google's self-reported benchmarks suggest significant gains, though independent evaluations are anticipated in the coming quarters.
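For teams weighing these options, the comparison ultimately reduces to cost per generated token: an instance's hourly price divided by its sustained throughput. The sketch below illustrates that arithmetic. All instance names, prices, and throughput figures are placeholder assumptions for illustration, not vendor or benchmark data.

```python
# Hypothetical cost-per-token comparison a buyer might run when evaluating
# inference instances. All numbers are illustrative assumptions.

def cost_per_million_tokens(hourly_rate_usd: float, tokens_per_second: float) -> float:
    """Convert an instance's hourly price and sustained decode throughput
    into the cost of generating one million tokens."""
    tokens_per_hour = tokens_per_second * 3600
    return hourly_rate_usd / tokens_per_hour * 1_000_000

# Placeholder instances: names, rates, and throughputs are assumptions,
# with the second row reflecting the claimed ~5x sampling improvement.
instances = {
    "baseline-accelerator": (12.0, 900.0),   # ($/hr, tokens/s)
    "tpu-8i-like":          (10.0, 4500.0),
}

for name, (rate, tps) in instances.items():
    print(f"{name}: ${cost_per_million_tokens(rate, tps):.2f} per 1M tokens")
```

Under these assumed figures, a five-fold throughput gain at a similar hourly rate translates into roughly a six-fold drop in cost per token, which is why independent throughput measurements will matter more than headline specs.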