Oracle's 65,000+ GPU supercluster now generally available
Category:
News
Date:
Nov 21, 2024


Can provide up to 260 Exaflops of FP8 performance
An Oracle Cloud Infrastructure (OCI) Supercluster with Nvidia H200 GPUs is now generally available.
The Supercluster can scale up to 65,536 Nvidia H200 GPUs, and offers up to 260 exaflops of peak FP8 performance. Oracle claims it is the largest AI supercomputer in the cloud.
According to Oracle, each compute instance within the Supercluster has 76 percent more high-bandwidth memory and 40 percent more memory bandwidth than the H100 instance thus improving its LLM inference performance by up to 1.9 times.Supercluster has a custom-designed cluster network using RDMA over Converged Ethernet Version 2 (RoCE v2) on top of Nvidia ConnectX-7 network interface cards (NICs) which can handle up to 400 Gbps GPU to GPU interconnects.It also features an upgraded 200 Gbps front-end network to move large data sets between storage and GPUs more efficiently.The instances are Bare metal and each features eight Nvidia H200s with 141GB HBM3e memory, and two 56-core Intel Sapphire Rapids 8480+ CPUs.Pricing remains $10 per GPU per hour, the same as with H100 instances. The H100 Supercluster can scale to 16,384 GPUs.In September 2024, Oracle revealed that it would build a supercluster with up to 131,072 of the upcoming Nvidia Blackwell GPUs, set to launch during the first half of 2025.
Can provide up to 260 Exaflops of FP8 performance
An Oracle Cloud Infrastructure (OCI) Supercluster with Nvidia H200 GPUs is now generally available.
The Supercluster can scale up to 65,536 Nvidia H200 GPUs, and offers up to 260 exaflops of peak FP8 performance. Oracle claims it is the largest AI supercomputer in the cloud.
According to Oracle, each compute instance within the Supercluster has 76 percent more high-bandwidth memory and 40 percent more memory bandwidth than the H100 instance thus improving its LLM inference performance by up to 1.9 times.Supercluster has a custom-designed cluster network using RDMA over Converged Ethernet Version 2 (RoCE v2) on top of Nvidia ConnectX-7 network interface cards (NICs) which can handle up to 400 Gbps GPU to GPU interconnects.It also features an upgraded 200 Gbps front-end network to move large data sets between storage and GPUs more efficiently.The instances are Bare metal and each features eight Nvidia H200s with 141GB HBM3e memory, and two 56-core Intel Sapphire Rapids 8480+ CPUs.Pricing remains $10 per GPU per hour, the same as with H100 instances. The H100 Supercluster can scale to 16,384 GPUs.In September 2024, Oracle revealed that it would build a supercluster with up to 131,072 of the upcoming Nvidia Blackwell GPUs, set to launch during the first half of 2025.
Can provide up to 260 Exaflops of FP8 performance
An Oracle Cloud Infrastructure (OCI) Supercluster with Nvidia H200 GPUs is now generally available.
The Supercluster can scale up to 65,536 Nvidia H200 GPUs, and offers up to 260 exaflops of peak FP8 performance. Oracle claims it is the largest AI supercomputer in the cloud.
According to Oracle, each compute instance within the Supercluster has 76 percent more high-bandwidth memory and 40 percent more memory bandwidth than the H100 instance thus improving its LLM inference performance by up to 1.9 times.Supercluster has a custom-designed cluster network using RDMA over Converged Ethernet Version 2 (RoCE v2) on top of Nvidia ConnectX-7 network interface cards (NICs) which can handle up to 400 Gbps GPU to GPU interconnects.It also features an upgraded 200 Gbps front-end network to move large data sets between storage and GPUs more efficiently.The instances are Bare metal and each features eight Nvidia H200s with 141GB HBM3e memory, and two 56-core Intel Sapphire Rapids 8480+ CPUs.Pricing remains $10 per GPU per hour, the same as with H100 instances. The H100 Supercluster can scale to 16,384 GPUs.In September 2024, Oracle revealed that it would build a supercluster with up to 131,072 of the upcoming Nvidia Blackwell GPUs, set to launch during the first half of 2025.
READ MORE READ MORE
READ MORE READ MORE
Featured
Jun 17, 2025
News
OpenAI secures $200m US defence contract to develop ‘frontier’ AI

Featured
Jun 17, 2025
News
OpenAI secures $200m US defence contract to develop ‘frontier’ AI

Featured
Jun 13, 2025
News
Consortium plans 500MW of AI compute in Morocco

Featured
Jun 13, 2025
News
Consortium plans 500MW of AI compute in Morocco

Featured
Apr 28, 2025
News
The cost of compute: A $7 trillion race to scale data centers

Featured
Apr 28, 2025
News
The cost of compute: A $7 trillion race to scale data centers

Apr 28, 2025
News
Brazil to offer tax breaks to lure data center investments, sources say

Apr 28, 2025
News
Brazil to offer tax breaks to lure data center investments, sources say

Featured
Apr 26, 2025
News
A new framework for data embassies: Saudi Arabia’s Global AI Hub Law

Featured
Apr 26, 2025
News
A new framework for data embassies: Saudi Arabia’s Global AI Hub Law

Featured
Apr 9, 2025
Press Release
Commission sets course for Europe's AI leadership with an ambitious AI Continent Action Plan

Featured
Apr 9, 2025
Press Release
Commission sets course for Europe's AI leadership with an ambitious AI Continent Action Plan

Featured
Jun 17, 2025
News
OpenAI secures $200m US defence contract to develop ‘frontier’ AI

Featured
Jun 13, 2025
News
Consortium plans 500MW of AI compute in Morocco

Featured
Apr 28, 2025
News
The cost of compute: A $7 trillion race to scale data centers

Apr 28, 2025
News
Brazil to offer tax breaks to lure data center investments, sources say

Featured
Apr 26, 2025
News
A new framework for data embassies: Saudi Arabia’s Global AI Hub Law

Featured
Apr 9, 2025
Press Release
Commission sets course for Europe's AI leadership with an ambitious AI Continent Action Plan

Featured
Mar 17, 2025
News
Tech Giants Expected to Ramp Up AI Spending Spree After DeepSeek

Mar 4, 2025
Article
Public Concerns Over AI Data Centers Grow as Demand Surges – Report

Featured
Jun 17, 2025
News
OpenAI secures $200m US defence contract to develop ‘frontier’ AI

Featured
Jun 13, 2025
News
Consortium plans 500MW of AI compute in Morocco

Featured
Apr 28, 2025
News
The cost of compute: A $7 trillion race to scale data centers

Apr 28, 2025
News
Brazil to offer tax breaks to lure data center investments, sources say
