Twitter found an abandoned cluster of once-scarce NVIDIA V100 accelerators

Developer Tim Zaman, who worked at Twitter when the social network was sold to Elon Musk and has since moved to Google DeepMind, has described an unusual find, Tom’s Hardware reports. According to him, a few weeks after the deal, engineers discovered a cluster of 700 idle NVIDIA V100 accelerators. Zaman called the discovery “an honest attempt to build a cluster within the framework of Twitter 1.0.” He was reminded of the episode by news about the xAI AI supercomputer, which consists of 100,000 NVIDIA H100 accelerators.

It is sad to learn that for years Twitter had 700 high-performance accelerators based on the NVIDIA Volta architecture at its disposal that were powered on but idle. The V100 was in short supply when it was released in 2017, and Zaman discovered the dormant cluster only in 2022. It is not surprising that around the same time the decision was made to close some of the social network’s data centers. Notably, the cluster used PCIe cards rather than the SXM2 version of the V100 with NVLink, which is much more efficient for AI workloads.


Zaman also shared his thoughts about the “AI Gigafactory”. He suggested that running 100,000 accelerators within a single network fabric would be an epic challenge: at that scale failures are inevitable, and they must be handled properly to keep the whole system functional. In his opinion, the system should be divided into independent domains, which is how large clusters are designed. Zaman also wondered what the maximum number of accelerators within a single cluster could be. As companies build ever larger AI training systems, they will run into both predictable and unexpected limits on how many accelerators can be combined.
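To put rough numbers on why failures become unavoidable at this scale, here is a minimal Python sketch. It assumes independent failures at a constant rate, and the per-accelerator mean time between failures (`ASSUMED_MTBF_HOURS`) is a hypothetical figure chosen purely for illustration; neither Zaman nor Tom’s Hardware gives a failure rate. The function names are likewise illustrative.

```python
import math

# Hypothetical per-accelerator mean time between failures, in hours.
# This value is an assumption for illustration only, not a figure from the article.
ASSUMED_MTBF_HOURS = 50_000.0


def cluster_mtbf_hours(n_devices: int, mtbf_hours: float) -> float:
    """Mean time between failures anywhere in the cluster.

    Assumes independent devices with a constant failure rate, so the
    cluster-wide failure rate is simply n_devices times the per-device rate.
    """
    return mtbf_hours / n_devices


def prob_any_failure(n_devices: int, hours: float, mtbf_hours: float) -> float:
    """Probability that at least one device fails within `hours`."""
    return 1.0 - math.exp(-n_devices * hours / mtbf_hours)


if __name__ == "__main__":
    # Compare the 700-card cluster with a 100,000-accelerator system.
    for n in (700, 100_000):
        between = cluster_mtbf_hours(n, ASSUMED_MTBF_HOURS)
        daily = prob_any_failure(n, 24.0, ASSUMED_MTBF_HOURS)
        print(f"{n:>7} devices: a failure every {between:.2f} h on average, "
              f"P(at least one failure in 24 h) = {daily:.4f}")
```

Under the assumed figure, a 100,000-accelerator system would see a failure somewhere roughly every half hour on average. That is the kind of arithmetic behind splitting a system into independent domains, so that a single fault does not stall the whole machine.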

