Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track

Aug 19, 2024

Feiyu Pan, Hao Fang, Runmin Cong, Wei Zhang, Xiankai Lu

Share this with someone who'll enjoy it:

Abstract:Video Object Segmentation (VOS) task aims to segmenting a particular object instance throughout the entire video sequence given only the object mask of the first frame. Recently, Segment Anything Model 2 (SAM 2) is proposed, which is a foundation model towards solving promptable visual segmentation in images and videos. SAM 2 builds a data engine, which improves model and data via user interaction, to collect the largest video segmentation dataset to date. SAM 2 is a simple transformer architecture with streaming memory for real-time video processing, which trained on the date provides strong performance across a wide range of tasks. In this work, we evaluate the zero-shot performance of SAM 2 on the more challenging VOS datasets MOSE and LVOS. Without fine-tuning on the training set, SAM 2 achieved 75.79 J&F on the test set and ranked 4th place for 6th LSVOS Challenge VOS Track.

* arXiv admin note: substantial text overlap with arXiv:2408.00714

View paper on

Share this with someone who'll enjoy it:

Title:Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track

Paper and Code