Even tech giants with their own LLMs are having trouble finding enough computing power.
Google was forced to cap Meta’s use of its Gemini AI model after Mark Zuckerberg’s company exceeded its computing capacity, sources familiar with the matter told The Financial Times. The incident reveals that even tech giants with their own LLMs are having trouble finding enough computing power for themselves, let alone their customers.
Meta does not operate its own cloud business and is trying to rapidly expand its own data center build out, having pledged $600 billion in cloud computing investments over the next two years. Google reportedly warned the social media company about its capacity limits in March, in turn forcing Meta to request that employees use tokens more efficiently, the sources said.
Gemini AI is being used by Meta for customer service, advertiser chatbots and coding, alongside processes like harmful content takedowns and scam detection. Meta initially chose Gemini for those things because it outperformed its own Llama open-source models, the sources said. The company also employs other models like Anthropic’s Claude for similar purposes.
Despite the billions spent so far on data centers, big companies are struggling to get enough capacity for their usage needs. Google itself recently agreed to pay SpaceX $920 million a month to use xAI’s data centers, due to the extra computing power required for Gemini Enterprise.
AI power users are benefiting from the boom, but providers like OpenAI aren’t profiting yet, since revenue earned from AI so far is a small percentage of the costs, according to analysts. Recently, token prices have surged, forcing some companies to back off on AI usage — including, it appears, the AI companies themselves.





