Nvidia unveils Fugatto AI model that “understands and generates sound like humans do”

Nvidia has unveiled a new experimental generative AI model that the company describes as a “Swiss army knife for audio.” The Fugatto (Foundational Generative Audio Transformer Opus 1) model uses text prompts to generate new music, voices and sounds or to modify existing audio files. Developers from around the world contributed to the model, which strengthened its “multi-accent and multilingual capabilities.”

“We wanted to create a model that understands and generates sound the way humans do,” said Rafael Valle, a project participant and manager of applied audio research at Nvidia. The company has proposed several scenarios in which the Fugatto model may be in demand:

Music producers can quickly create a prototype song that can be easily edited by trying out different styles, voices and instruments.
Fugatto can be used to build language learning tools that offer a choice of the most suitable voice.
Video game developers can use it to create variations of pre-recorded assets to match changes in the game based on player choices and actions.

The researchers claim that the model, with some additional fine-tuning, can also perform tasks that were not part of its prior training. The model can combine separate instructions, for example, generating speech with a certain intonation and accent, or the sound of birds singing during a thunderstorm. The model can also generate sounds that change over time, such as the sound of an approaching rainstorm or a departing train.

Fugatto is not the first generative AI technology that can create sounds from text prompts. Meta previously released a similar open-source AI model. Google offers its own AI text-to-music tool, MusicLM, which can be accessed through the company’s AI Test Kitchen website.

Nvidia has not yet provided public access to Fugatto and has refrained from commenting on this matter.
