DeepSeek accelerated to record speed on just 16 SambaNova chips – 20 times more Nvidia chips would have been needed

AI startup SambaNova Systems, founded in 2017 by people from Sun, Oracle, and Stanford, has announced the world’s fastest DeepSeek-R1 671B deployment. That’s not possible with Nvidia hardware.

Image source: sambanova.ai

SambaNova achieved a DeepSeek-R1 speed of 198 tokens per second on just 16 custom-built accelerators, a feat that would require 40 racks of 320 Nvidia GPUs. “SambaNova’s SN40L RDUs are the fastest platform to run DeepSeek, a 5x increase over the speed of the latest GPU on a single rack, and we’ll be offering 100x the capacity for DeepSeek-R1 by the end of the year,” promised Rodrigo Liang, co-founder and CEO of SambaNova.

While computationally intensive AI workloads have traditionally been powered by Nvidia GPUs, SambaNova claims its configurable dataflow architecture is a more efficient solution. Its hardware runs three times faster and uses five times less power than today’s most powerful GPUs, while still delivering the full computing power of the DeepSeek-R1. The achievement was confirmed by experts at Artificial Analysis, an independent AI assessment firm.

The open source DeepSeek-R1 671B is available on the SambaNova cloud via API. The company is actively increasing its capacity and hopes to reach a total throughput of 20,000 tokens per second in the near future.

admin

Share
Published by
admin

Recent Posts

TikTok to Shut Down Its Instagram Clone on May 8

Chinese short-video service TikTok is set to shut down its TikTok Notes section on May…

6 hours ago

Meta Loses Head of Fundamental AI Research

Meta✴'s VP of AI research Joelle Pineau has announced her departure from the company. Her…

6 hours ago

Meta to Release Smart Glasses with Display and Price Tag Over $1000 by End of Year

Meta✴ is preparing a more expensive version of smart glasses as part of a joint…

6 hours ago

China Allows EHang Electric Jets to Transport People by Air, but Air Taxi Services Still Banned

On March 28, 2025, Guangdong EHang General Aviation and its two partner air transport operators…

6 hours ago

A World in a Box of Locusts and Single-Player Co-op: Details on Disco Elysium’s Cancelled Kuno and Kunu Spin-Off

An internal presentation of Project X7, a cancelled ZA/UM spin-off of Disco Elysium led by…

6 hours ago