Mistral AI and NVIDIA unveil the Mistral NeMo 12B enterprise AI model with “common sense” and “world knowledge”

NVIDIA Corporation and the French company Mistral AI announced the Mistral NeMo 12B large language model (LLM), specially designed to solve various enterprise-level tasks – chatbots, data summarization, working with program code, etc.

Mistral NeMo 12B has 12 billion parameters and uses a context window of 128 thousand tokens. The inference uses the FP8 data format, which is said to reduce memory requirements and speed up deployment without any reduction in response accuracy.

Image Source: Pixabay.com

When training the model, the Megatron-LM library, which is part of the NVIDIA NeMo platform, was used. In this case, 3072 NVIDIA H100 accelerators based on DGX Cloud were used. It is claimed that Mistral NeMo 12B copes well with multi-pass dialogues, mathematical problems, programming, etc. The model has “common sense” and “world knowledge”. Overall, it reports accurate and reliable performance across a wide range of applications.

The model is released under the Apache 2.0 license and is offered as a NIM container. The implementation of LLM, according to the creators, takes a matter of minutes, not days. To run the model, one NVIDIA L40S accelerator, GeForce RTX 4090 or RTX 4500 is enough. Among the key advantages of deployment via NIM are high efficiency, low computational cost, security and privacy.

admin

Share
Published by
admin

Recent Posts

Despelote — goo-o-o-o-o-o-o-o-o-o-ol! Review

One of my first memories (or perhaps the very first one – is it possible…

19 hours ago

Design and specifications of the flagship smartphone Sony Xperia 1 VII leaked online

A few days before the official presentation, details about the new flagship Sony Xperia 1…

19 hours ago

GTA VI Delay to 2026 Causes New Panic Among Game Developers

Bloomberg journalist Jason Schreier reported on the domino effect triggered by the recent delay of…

20 hours ago

Nintendo warns it will block consoles for users who engage in piracy and hacking

Nintendo has updated its user agreement, formalizing the right to remotely disable Switch consoles if…

21 hours ago

Gigabyte Unveils X870 and B850 Aorus Stealth Motherboards with Back-Side Power Connectors

Gigabyte has unveiled the X870 Aorus Stealth and B850 Aorus Stealth motherboards for Ryzen 7000,…

22 hours ago

Alienware Unveils Thin, Affordable Aurora 16 and 16X Gaming Laptops with Understated Designs

Alienware, a subsidiary of Dell known for its futuristic gaming laptops, has released new high-performance…

2 days ago