Nvidia Corp. today joined the likes of Meta Platforms Inc., OpenAI and Runway AI Inc. in releasing a generative artificial intelligence model that’s designed to create “new” music and audio from human language prompts.
According to the chipmaker, the new model, called Fugatto (for Foundational Generative Audio Transformer Opus 1), is uniquely able to modify human voices and create “novel sounds” that no other model can produce.
Nvidia, which is better known for making the powerful graphics processing units that power AI models, has not publicly released the model yet, o account of concerns around safety.
The company said Fugatto is different from other music and audio generation models because it has the ability to absorb and modify existing sounds. For instance, it can listen to a musical segment played on a piano, and transform that sound into notes sung by a human voice, or an alternative instrument like a violin. It can also take a human voice recording and alter …