Filtering a dog bark from street noise or isolating a sound source with a single click on a video: Meta’s SAM Audio brings the company’s visual segmentation approach to the audio world. The model lets users edit audio using text commands, clicks, or time markers. Code and weights are open source.
The article “Meta brings Segment Anything to audio, letting editors pull sounds from video with a click or text prompt” appeared first on The Decoder.
