This is Google’s 7th iteration of a TPU (Tensor Processing Unit).
Ironwood is the first purpose-built specifically for inference — the process of deploying trained AI models to make predictions or generate responses.Ironwood delivers twice the performance per watt compared to Trillium, and is nearly 30 times more power efficient than Google’s first Cloud TPU from 2018.
Last but not least the specs for computing… which is truly mind blowing….
When scaled to 9,216 chips per pod, Ironwood delivers 42.5 exaflops of computing power — dwarfing El Capitan‘s 1.7 exaflops, currently the world’s fastest supercomputer. Each individual Ironwood chip delivers peak compute of 4,614 teraflops.