Stable Audio 3.0 Showcase
Hey y'all! Stable Audio 3.0 Base and Distilled are available in ComfyUI's templates. Just update ComfyUI and they'll be there. Pretty small models, around 9 GB in size. The encoders use less than 5 GB while running, so it all fits in around 16 GB of memory. It offers full song generation, sectional editing, extending a section into a full song, and straight-up instrument or SFX generation as well.
VERY fast: a 2 minute 40 second song generates in about 60 seconds, or less in some runs. Output is very coherent but VERY limited in seed variation. I noticed that running the same prompt on 3 different seeds gives essentially the same output with a SLIGHTLY different melody. Rhythm and percussion will be pretty much identical. Kind of sad, but changing the prompt slightly can rearrange the output.
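To put the speed in perspective, here is a quick back-of-the-envelope calculation using the numbers from my runs (160 seconds of audio in roughly 60 seconds of generation). The function name `realtime_factor` is just something I made up for illustration, and the 60 second figure is approximate, so treat the result as a rough estimate and not a benchmark.

```python
def realtime_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Seconds of audio produced per second of wall-clock generation time."""
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return audio_seconds / wall_seconds


# Figures from the post: a 2 min 40 s song in about 60 s (approximate).
song_seconds = 2 * 60 + 40  # 160 seconds of audio
rtf = realtime_factor(song_seconds, 60.0)
print(f"{rtf:.2f}x real time")  # prints "2.67x real time"
```

So it generates audio at a bit over 2.5x real time, and faster than that on the runs that finished in under 60 seconds.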
Full YouTube video showcase: