Companies are hoarding expensive AI GPUs and leaving most of that computing power idle while bills quietly rise.



  • Most AI GPUs run at surprisingly low utilization across production systems
  • Companies pay for twenty times more GPU capacity than necessary
  • Overprovisioning is growing sharply year over year rather than improving

Companies across the tech industry are rushing to buy massive amounts of AI infrastructure, but most of it barely does any useful work.

A report from Cast AI, based on tens of thousands of Kubernetes clusters on AWS, Azure, and GCP, found that average GPU utilization is just 5%.
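
The link between the 5% utilization figure and the "twenty times" claim is simple arithmetic, sketched below. This is an illustration, not the report's methodology: the function name `overprovision_factor` is hypothetical, and it assumes that needed capacity scales directly with average utilization, ignoring any headroom kept for peak load.

```python
def overprovision_factor(avg_utilization: float) -> float:
    """Return how many times more capacity is paid for than is used on average.

    Assumption: required capacity is proportional to average utilization,
    with no allowance for peak-load headroom.
    """
    if not 0 < avg_utilization <= 1:
        raise ValueError("avg_utilization must be in the range (0, 1]")
    return 1.0 / avg_utilization


# 5% average utilization, the figure cited from the Cast AI report
factor = overprovision_factor(0.05)
print(f"Paid capacity is about {factor:.0f}x the capacity in use")
```

In practice, teams keep some spare capacity for traffic spikes, so the truly avoidable share of spending would be smaller than this upper bound suggests.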
