🔉 Introducing SAM Audio, the first unified model that isolates any sound from complex audio mixtures using text, visual, or span prompts. We’re sharing SAM Audio with the community, along with a perception encoder model, benchmarks and research papers, to empower others to explore new forms of expression and build applications that were previously out of reach. 🔗 Learn more:
SAM Audio represents a significant advancement in audio separation technology, outperforming previous models across a wide range of benchmarks and tasks.
1.26K