Categories: Barebone systems, nettops, all-in-one PC, media centersTechnology and IT market. news

State-of-the-art AI model runs on tiny Raspberry Pi Zero computer – impractical, but it works

ahr0chm6ly8zzg5ld3mucnuvyxnzzxrzl2v4dgvybmfsl2lsbhvzdhjhdglvbnmvmjayns8wmi8xoc8xmte4ndyxl3vzyi1sbg0uanbn

Enthusiast Binh Pham has created a USB device based on a Raspberry Pi Zero single-board computer, which locally runs a large artificial intelligence language model that generates literary texts.

Image source: youtube.com/@build_with_binh

The author developed the software part of the project using the llama.cpp library and the llamafile utility, which are designed to output large AI language models. This turned out to be a difficult task, since the Raspberry Pi Zero has only 512 MB of RAM and a processor with an outdated ARMv6 architecture, which prevented the project from compiling in its original version. To get around these limitations, the enthusiast had to convert the ARMv8-optimized instruction set to llama.cpp and remove all notations and optimization mechanisms intended for modern hardware.

Binh Pham placed the Raspberry Pi Zero single-board computer and an expansion board with a USB port for connecting to modern computers in a 3D-printed case. Due to the modest computing resources, it was necessary to limit the context to 64 tokens and use models containing from 15 million to 136 million parameters. The smallest one — Tiny15M — showed the highest speed among all the tested models: 223 ms per token. For the Lamini-T5-Flan-77M, this figure was 2.5 s per token, and for the SmolLM2-136M — 2.2 s per token.

At such a speed, the device can hardly be called practical, but the author of the project decided not to stop there. He considered that controlling AI through the command line interface is not convenient enough, and suggested a more convenient way. To send a request, the user needs to create an empty text file in the specified location, the name of which serves as a request to the model. Having found the file, the system sends a request to the AI and writes its response to the contents of the same file.

With his project, Binh Pham decided to show what interaction with local AI models could be like in the future.

admin

Next Microsoft has rolled back the previously reported automatic sign-in to accounts »

Previous « Affordable Realme P3 Pro and P3x smartphones with capacious batteries and IP69 protection announced

Fake GeForce RTX 4090s with RTX 3090 chips have started popping up in China — craftsmen are even changing the GPU markings

In China, scammers have started selling GeForce RTX 3090 graphics cards, passing them off as…

5 days ago

State-of-the-art AI model runs on tiny Raspberry Pi Zero computer – impractical, but it works

Recent Posts

Nissan Leaf EV to Become NACS-Ported Compact Crossover in Third Generation

OpenAI expects to more than triple its revenue this year and then double it next year.

OpenAI Decides to Hold 4o Image Generation Launch for Free Users

1440p and 240Hz for just $200: Xiaomi updates the 27-inch Redmi G27Q gaming monitor

Beware, Android is shutting down: OS development will cease to be public, but there is no reason to panic

Fake GeForce RTX 4090s with RTX 3090 chips have started popping up in China — craftsmen are even changing the GPU markings