AI startup SambaNova Systems, founded in 2017 by people from Sun, Oracle, and Stanford, has announced the world’s fastest DeepSeek-R1 671B deployment. That’s not possible with Nvidia hardware.
Image source: sambanova.ai
SambaNova achieved a DeepSeek-R1 speed of 198 tokens per second on just 16 custom-built accelerators, a feat that would require 40 racks of 320 Nvidia GPUs. “SambaNova’s SN40L RDUs are the fastest platform to run DeepSeek, a 5x increase over the speed of the latest GPU on a single rack, and we’ll be offering 100x the capacity for DeepSeek-R1 by the end of the year,” promised Rodrigo Liang, co-founder and CEO of SambaNova.
While computationally intensive AI workloads have traditionally been powered by Nvidia GPUs, SambaNova claims its configurable dataflow architecture is a more efficient solution. Its hardware runs three times faster and uses five times less power than today’s most powerful GPUs, while still delivering the full computing power of the DeepSeek-R1. The achievement was confirmed by experts at Artificial Analysis, an independent AI assessment firm.
The open source DeepSeek-R1 671B is available on the SambaNova cloud via API. The company is actively increasing its capacity and hopes to reach a total throughput of 20,000 tokens per second in the near future.
Chinese short-video service TikTok is set to shut down its TikTok Notes section on May…
Meta✴'s VP of AI research Joelle Pineau has announced her departure from the company. Her…
Meta✴ is preparing a more expensive version of smart glasses as part of a joint…
On March 28, 2025, Guangdong EHang General Aviation and its two partner air transport operators…
Xiaomi recently launched the Poco F7 Ultra and Poco F7 Pro smartphones, which feature high-performance…
An internal presentation of Project X7, a cancelled ZA/UM spin-off of Disco Elysium led by…