Nvidia Helps Apple Improve the Efficiency of Large AI Language Models

Apple engineers talked about their collaboration with Nvidia, thanks to which they were able to improve the performance of systems when generating text from large artificial intelligence language models.

Image source: developer.nvidia.com

This year, Apple published the source code for its Recurrent Drafter (ReDrafter), a new method for generating text using large language models. It is characterized by high speed, combining two technologies: beam search and dynamic attention tree. Apple’s research project showed compelling results, but the ReDrafter deployment integrated the technology into Nvidia’s TensorRT-LLM system, a tool that allows large language models to run faster on Nvidia accelerators.

Performance measurements showed that when running large language models with tens of billions of parameters using the Nvidia TensorRT-LLM framework and ReDrafter, the speed of token generation increased by 2.7 times. Thus, the technology makes it possible to reduce the delay between the user entering a request and receiving a response from the model – while using fewer accelerators and reducing energy consumption, Apple engineers concluded.

«Large language models are increasingly used in applications, and improving inference efficiency can impact computational costs and reduce latency for users. With ReDrafter’s new approach to speculative execution integrated into the Nvidia TensorRT-LLM framework, developers can now generate tokens faster on Nvidia accelerators for their applications,” Apple added.

admin

Share
Published by
admin

Recent Posts

Microsoft Unveils Redesigned Start Menu in Windows 11 with Automatic Program Grouping

Microsoft has officially confirmed changes to the Windows 11 Start menu regarding the All apps…

5 hours ago

Physicists Doubt Microsoft’s Majorana 1 Quantum Processor’s Performance on Majorana Fermions

There is an opinion among experts that the new topological quantum processor Microsoft Majorana 1…

5 hours ago

Google has begun to disable uBlock Origin en masse in Chrome due to the transition to Manifest V3

Some Chrome users have noticed that the uBlock Origin extension no longer works. The developers…

5 hours ago

Apple CEO Promises Trump to Invest Hundreds of Millions of Dollars in Developing Manufacturing in the U.S.

The directness of the current US President Donald Trump sometimes creates inconvenience for his partners,…

8 hours ago

Apple Confirms It Will Soon Make Vision Pro Headsets More Comfortable and Smarter

Apple has officially confirmed that its generative AI platform, Apple Intelligence, will be coming to…

14 hours ago