Summary
TLDR: Stability AI released Stable Audio 2, a new audio and music generator that can produce high-quality tracks up to three minutes long from a single prompt. The model leverages DiT technology and allows for audio-to-audio generation. While Stable Audio 2 shows improvements over its predecessor, Suno 3 remains the leading model in the AI music space with more creative output and smoother transitions between song parts. Stable Audio 2 does offer unique features like audio-to-audio generations but may need further advancements to compete with Suno 3.
Key Points
1. Stability AI released Stable Audio 2, a new audio and music generator, which can produce high-quality, full tracks with coherent musical structure up to three minutes long at 44.1 kHz stereo from a single natural language prompt.
2. Stable Audio 2 leverages diffusion transformer technology (DiT) for audio generation, allowing for the transformation of sound samples uploaded by users through natural language prompts, expanding sound effect generation and style transfer options.
3. Stable Audio 2 falls short in comparison to Suno 3, another leading AI music generator, in terms of the quality and complexity of generated audio tracks, as well as the speed of audio generation. However, Stable Audio 2 offers a unique feature of audio-to-audio generations that Suno 3 does not provide.