Meta to build 2GW data center with over 1.3 million Nvidia AI GPUs — invest $65B in AI in 2025
But there is a catch.
Reuters reports, citing Mark Zuckerberg's post on Facebook, that Meta plans to invest between $60 billion and $65 billion in 2025 to build out its artificial intelligence (AI) and other infrastructure. This marks a significant leap from the company's estimated $38 billion to $40 billion in spending in 2024. Meta aims to solidify its competitive position in the AI industry against rivals like Google, Microsoft, and OpenAI, though its $65 billion budget is still slightly smaller than Microsoft's planned $80 billion.
The report says that Meta will construct a 2GW data center and deploy some 1.3 million Nvidia [presumably H100] GPUs as part of the plan. The spending probably also covers the company's custom data center-grade processors, though this is speculation on our part.
"We are planning to invest $60 billion — $65 billion in CapEx this year while also growing our AI teams significantly, and we have the capital to continue investing in the years ahead," Zuckerberg said in a Facebook post. "This is a massive effort, and over the coming years it will drive our core products and business, unlock historic innovation, and extend American technology leadership."
The announcement comes amid escalating competition in AI investments. Microsoft is allocating $80 billion for data centers in fiscal 2025, Amazon expects to exceed $75 billion in spending for the same year, and OpenAI, SoftBank, and Oracle have committed $500 billion to their Stargate initiative (though it remains to be seen where precisely the Stargate money will come from). According to Reuters, Meta's planned spending far exceeds the analyst estimate of $50.25 billion for its 2025 capital expenditures.
"This will be a defining year for AI. In 2025, I expect Meta AI will be the leading assistant serving more than 1 billion people, Llama 4 will become the leading state of the art model, and we'll build an AI engineer that will start contributing increasing amounts of code to our R&D efforts," Zuckerberg wrote.
Unlike its rivals, Meta offers open access to its Llama AI models to end users in the U.S., reportedly monetizing them through its primary business, Facebook. Meta has also advanced its AI offerings with products like its Ray-Ban smart glasses. In 2025, Meta expects its AI assistant to serve over 1 billion users, up from 600 million monthly active users last year.
Anton Shilov is a contributing writer at Tom's Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers, and from modern process technologies and the latest fab tools to high-tech industry trends.
phead128
DeepSeek V3 and R1 were trained on a roughly $5 million budget and 2,048 H800s, with performance exceeding GPT-4o and o1, which already renders these massive GPU farm clusters obsolete. Same with the Sky-T1 model out of Berkeley, which trained an o1 peer for $450.
bit_user
phead128 said:
DeepSeek V3 and R1 were trained on a roughly $5 million budget and 2,048 H800s, with performance exceeding GPT-4o and o1, which already renders these massive GPU farm clusters obsolete. Same with the Sky-T1 model out of Berkeley, which trained an o1 peer for $450.

Performance according to what metric? I don't see any of those on the OpenLLM Leaderboard:
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/
phead128
bit_user said:
Performance according to what metric? I don't see any of those on the OpenLLM Leaderboard:
https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/

Huggingface only has open-source models on their leaderboard, so no GPT or Claude.

Here is an Elo-style leaderboard that includes OpenAI, Anthropic, Google, and DeepSeek models, where users blind-test AI chat responses:
https://lmarena.ai/?leaderboard
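(For context on how an Elo-style leaderboard works, here is a minimal Python sketch assuming the standard chess-style update rule; it is an illustration, not lmarena's actual implementation. Each blind vote is a "battle" between two anonymous models, and the winner takes rating points from the loser in proportion to how surprising the win was.)

```python
# Minimal Elo-style rating sketch from blind pairwise votes.
# Assumptions: classic chess constants (400 scale, K=32), 1000 starting rating.

def expected_score(r_a: float, r_b: float) -> float:
    """Probability that A beats B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))

def update(ratings: dict, winner: str, loser: str, k: float = 32.0) -> None:
    """Shift ratings after one blind vote; upsets move scores more."""
    e_w = expected_score(ratings[winner], ratings[loser])
    ratings[winner] += k * (1.0 - e_w)
    ratings[loser] -= k * (1.0 - e_w)

# Hypothetical battle log: (winner, loser) pairs from user votes.
battles = [("model_a", "model_b"), ("model_b", "model_c"), ("model_a", "model_c")]
ratings = {m: 1000.0 for m in ("model_a", "model_b", "model_c")}
for w, l in battles:
    update(ratings, w, l)
print(sorted(ratings.items(), key=lambda kv: -kv[1]))
```

Real leaderboards typically refine this with a statistical fit (e.g., a Bradley-Terry model) over all votes rather than raw online updates, but the intuition is the same.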
bit_user
phead128 said:
Huggingface only has open-source models on their leaderboard, so no GPT or Claude.

It's not just GPT and Claude missing. I didn't see any of the models you listed!

phead128 said:
Here is an Elo-style leaderboard that includes OpenAI, Anthropic, Google, and DeepSeek models, where users blind-test AI chat responses.
https://lmarena.ai/?leaderboard

That's just based on votes by the public. They're not ranking it on skills tests, like Huggingface's scores do.

What I'm getting at is that I can believe you can train a specialized model with a lot fewer resources and end up with something competitive. Training one that's as versatile as the leading contenders on Huggingface's LLM leaderboard is a lot harder and probably takes disproportionately more resources.
jp7189
I never thought I'd be a fan of Meta/Facebook, but I am a HUGE fan of Llama specifically because it's open source/open weight and allows people to add their own fine-tune training on top of it.
phead128
bit_user said:
That's just based on votes by the public.

These are ranked by blind votes on the best anonymous AI responses, not based on some popularity contest.

bit_user said:
They're not ranking it on skills tests, like Huggingface's scores do.

For skills-based tests, those are available in DeepSeek's publication:
https://github.com/deepseek-ai/DeepSeek-V3/raw/main/figures/benchmark.png
Also, from the DeepSeek R1 publication:
https://github.com/deepseek-ai/DeepSeek-R1/raw/main/figures/benchmark.jpg

The MMLU Pro and MMLU scores for DeepSeek/GPT/Claude/Google really blow everything out of the water compared to any model on HuggingFace (which peaks at 70 points for the top model).

bit_user said:
What I'm getting at is that I can believe you can train a specialized model with a lot fewer resources and end up with something competitive. Training one that's as versatile as the leading contenders on Huggingface's LLM leaderboard is a lot harder and probably takes disproportionately more resources.

Except as shown by the MMLU Pro scores, DeepSeek/GPT/Claude/Google blow every open-source model on HuggingFace's LLM leaderboard out of the water, and they aren't even listed on HuggingFace's leaderboard. That's likely because submission is voluntary.
bit_user
phead128 said:
These are ranked by blind votes on the best anonymous AI responses, not based on some popularity contest.

But the questions are just a pool submitted by the public and not sorted by skill. So, you can still have one specialized model that's really good at certain prompts winning out over a more general and versatile model.

phead128 said:
For skills-based tests, those are available in DeepSeek's publication:
https://github.com/deepseek-ai/DeepSeek-V3/raw/main/figures/benchmark.png

Why didn't they submit it to Hugging Face's LLM leaderboard, at least in addition to their comparison against GPT and Claude?
phead128
bit_user said:
Why didn't they submit it to Hugging Face's LLM leaderboard, at least in addition to their comparison against GPT and Claude?

Maybe because HuggingFace's LLM leaderboard is not good?

HuggingFace's leaderboards show how truly blind they are, because they are actively hurting the open-source movement by tricking it into creating a bunch of models that are useless for real usage.
https://semianalysis.com/2023/08/28/google-gemini-eats-the-world-gemini/

That explains why no respectable LLM is listed on there: the benchmarks are useless (except maybe MMLU Pro).
zsydeepsky
bit_user said:
Why didn't they submit it to Hugging Face's LLM leaderboard, at least in addition to their comparison against GPT and Claude?

Probably because HuggingFace hasn't completed the tests yet? I just checked the leaderboard; three DeepSeek-R1 models have finished the test and have scores there:
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
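(For anyone who wants to try one of those distills locally, here is a minimal sketch using the Hugging Face transformers library. The model ID comes from the list above; everything else, including the memory figures and the sample prompt, is illustrative, not a recommendation from anyone in this thread.)

```python
# Minimal sketch: run a DeepSeek-R1 distill locally with transformers.
# Assumes transformers, torch, and accelerate are installed; the 14B model
# needs roughly 30 GB of memory in fp16, so adjust the model ID to your hardware.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-14B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # halve memory vs. fp32
    device_map="auto",          # spread layers across available GPUs/CPU
)

# R1-style models emit their chain of thought before the final answer,
# so generations can be long.
messages = [{"role": "user", "content": "What is 17 * 24?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```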
Anyway, there are other tests done by third parties that aim to be more challenging (since the current tests have become too simple and too familiar for new models), like "Humanity's Last Exam," which actually shows DeepSeek can solve hard questions better than GPT-o1 (or any other model).

I personally also compared those two models: I uploaded a code framework and asked them to give opinions and analysis. DeepSeek-R1 provides WAY BETTER insights than GPT-o1.