Nvidia's new music generation model Fugatto creates 'never before heard sounds'

Nvidia Corp. has developed a generative artificial intelligence model called Fugatto, designed to create new music and audio from natural-language prompts. The model can modify human voices and, according to Nvidia, create sounds that no other model can produce. It can transform a musical segment played on a piano into notes sung by a human voice or played on another instrument, and it can alter the accent and mood of a human voice recording. However, Nvidia has not yet publicly released the model due to safety concerns.

The company claims that Fugatto differs from other audio generation models because it can take in and modify existing sounds, creating original soundscapes by overlaying two distinct audio effects. Nvidia's VP of Applied Deep Learning Research, Bryan Catanzaro, believes generative AI could affect music production in the same way that electronic synthesizers did. Nvidia has no immediate plans to release the model, citing potential risks and copyright issues.

Key takeaways

Nvidia has developed a new generative AI model called Fugatto, designed to create new music and audio from human language prompts.
Fugatto can modify human voices and create novel sounds, transform musical segments into different instruments or voices, and alter the accent and mood of a human voice recording.
The model can create original soundscapes by overlaying two distinct audio effects, a capability that Nvidia says has not been seen before in an audio-generation model.
Nvidia has not publicly released the model due to safety concerns and potential copyright issues, and is still considering how to safely release it to the public.

Nvidia's new music generation model Fugatto creates 'never before heard sounds' - SiliconANGLE
