Meta brings Segment Anything to audio, letting editors pull sounds from video with a click or text prompt

2025-12-26 09:48 GMT · 4 months ago aimagpro.com

Filtering a dog bark from street noise or isolating a sound source with a single click on a video: Meta’s SAM Audio brings the company’s visual segmentation approach to the audio world. The model lets users edit audio using text commands, clicks, or time markers. Code and weights are open source.
The article Meta brings Segment Anything to audio, letting editors pull sounds from video with a click or text prompt appeared first on The Decoder.