Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Miao, B
Bennamoun, M
Gao, Y
Mian, A
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2022
Size
File type(s)
Location

Taipei, Taiwan

License
Abstract

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation. MAMP leverages the frame reconstruction task for training without the need for annotations. During inference, MAMP builds a dynamic memory bank and propagates masks according to our proposed motion-aware spatio-temporal matching module, which is able to handle fast motion and long-term matching scenarios. Evaluation on DAVIS-2017 and YouTube-VOS datasets show that MAMP achieves state-of-the-art performance with stronger generalization ability compared to existing self-supervised methods, i.e., 4.2% higher mean \mathcal{J} & \mathcal{F} on DAVIS-2017 and 4.85% higher mean \mathcal{J} & \mathcal{F} on the unseen categories of YouTube-VOS than the nearest competitor. Moreover, MAMP performs at par with many supervised video object segmentation methods. Our code is available at: https://github.com/bo-miao/MAMP.

Journal Title
Conference Title

2022 IEEE International Conference on Multimedia and Expo (ICME)

Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Item Access Status
Note
Access the data
Related item(s)
Subject

Semi- and unsupervised learning

Computer vision and multimedia computation

Persistent link to this record
Citation

Miao, B; Bennamoun, M; Gao, Y; Mian, A, Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation, 2022 IEEE International Conference on Multimedia and Expo (ICME), 2022