Enthusiasts run the modern AI model Llama on an ancient Pentium II PC with Windows 98

Experts from EXO Labs managed to run a large language model (LLM) based on the Llama architecture on a 26-year-old computer running the Windows 98 operating system. The researchers showed an old PC equipped with a 350 MHz Intel Pentium II processor and 128 MB of RAM starting up, after which the neural network is launched and they interact with it.

Image source: GitHub

To run the LLM, EXO Labs specialists used their own inference engine, Llama98.c, which is based on Llama2.c, an engine written in the C programming language by former OpenAI and Tesla employee Andrej Karpathy. Once loaded, the model was asked to write a story about Sleepy Joe. Surprisingly, the AI model really does work even on such an ancient PC, and the story is generated at a good speed.

The mysterious organization EXO Labs, formed by researchers and engineers from Oxford University, emerged from the shadows in September this year. It reportedly advocates for the openness and accessibility of artificial intelligence technologies. Representatives of the organization believe that advanced AI technologies should not be in the hands of a handful of corporations, as is the case now. Going forward, they hope to “build an open infrastructure for training advanced AI models, allowing anyone to run them anywhere.” In their opinion, demonstrating that an LLM can run on an ancient PC proves that AI algorithms can run on almost any device.

In their blog, the enthusiasts said that they bought an old Windows 98 PC on eBay for the project. After connecting the machine to the network over Ethernet, they were able to transfer the necessary data to it via FTP. Compiling modern code for Windows 98 was probably the harder task, and it was solved with the help of Andrej Karpathy's work published on GitHub. Ultimately, they achieved a text generation speed of 35.9 tokens per second using a 260K-parameter LLM with the Llama architecture, which is quite good considering the modest computing capabilities of the device.
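To put the reported figures in perspective, here is a rough back-of-the-envelope sketch. Only the model size (260K parameters) and the speed (35.9 tokens per second) come from the article. The rule of thumb of about two arithmetic operations per parameter per generated token, the 200-token story length, and the function names are assumptions made for illustration, not numbers or code from EXO Labs.

```python
# Back-of-the-envelope arithmetic for the figures quoted in the article.
# Only PARAMS and TOKENS_PER_SECOND come from the article itself.
PARAMS = 260_000           # "260K" model size reported in the article
TOKENS_PER_SECOND = 35.9   # generation speed reported in the article

# Assumption (not from the article): a forward pass costs roughly
# 2 arithmetic operations per parameter per generated token.
OPS_PER_PARAM = 2


def seconds_for(tokens: int, tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Time in seconds to generate `tokens` tokens at a constant speed."""
    return tokens / tokens_per_second


def effective_mops(params: int = PARAMS,
                   tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Rough sustained throughput in millions of operations per second."""
    return params * OPS_PER_PARAM * tokens_per_second / 1e6


if __name__ == "__main__":
    # 200 tokens is a hypothetical story length chosen for illustration.
    print(f"200-token story: {seconds_for(200):.1f} s")
    print(f"Effective throughput: {effective_mops():.1f} million ops/s")
```

Under these assumptions, a short story takes only a few seconds, and the workload is small enough to fit within what a late-1990s processor can sustain.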
