Cerebras WSE-3 accelerator king single-handedly trained an AI model with 1 trillion parameters

Cerebras Systems, in collaboration with the US Department of Energy (DOE) Sandia National Laboratories (SNL), conducted a successful experiment to train an AI model with 1 trillion parameters using a single CS-3 system with a WSE-3 czar accelerator and 55 TB of MemoryX external memory.

Training models of this scale typically requires thousands of GPU-based accelerators that consume megawatts of power, dozens of experts, and weeks of hardware and software tuning, Cerebras says. However, SNL scientists were able to train the model on a single system without making changes to either the model or the infrastructure software. Moreover, they were able to achieve almost linear scaling – 16 CS-3 systems showed a 15.3-fold increase in learning speed.

Image source: Cerebras

A model of this scale requires terabytes of memory, thousands of times more than is available on a single GPU. In other words, classical clusters of thousands of accelerators must be correctly connected to each other before training begins. Cerebras systems for storing scales use external MemoryX memory based on 1U nodes with the most common DDR5, making it as easy to train a model with a trillion parameters as a small model on a single accelerator, the company says.

Previously, SNL and Cerebras deployed the Kingfisher cluster based on CS-3 systems, which will be used as a test platform for the development of AI technologies for national security.

admin

Share
Published by
admin

Recent Posts

Trump’s new executive order calls for the creation of a US national cryptocurrency reserve

Donald Trump, who during his first term criticized cryptocurrencies as a whole, by the time…

47 minutes ago

Dasung has released a compact 10.3-inch monitor with an electronic ink matrix and an update frequency of 60 Hz

The Chinese company Dasung has released a compact monochrome touchscreen monitor, Paperlike 103, equipped with…

47 minutes ago

Google launches accounts through the print scanner on Android

Google has launched a new security feature for Android 15 that will help protect users'…

1 hour ago

Nvidia has removed Hot Spot monitoring from GeForce RTX 50 series video cards

Nvidia has talked a lot about evolutionary design solutions for its graphics card cooling systems,…

2 hours ago

FitBit will pay a fine of $ 12 million for burns from Ionic smart watch in 78 people

Google-owned Fitbit will pay a $12.25 million fine over problems with its Ionic smartwatch. The…

2 hours ago

A large American retailer announced the date of the start of sales of the AMD series of the Radeon RX 9070 series

One of the most famous American retailers, B&H, announced that it will begin accepting pre-orders…

3 hours ago