TY - JOUR
T1 - Learning spatio-temporal dependency of local patches for complex motion segmentation
AU - Xu, Jiang
AU - Yuan, Junsong
AU - Wu, Ying
N1 - Funding Information:
We thank the anonymous reviewers for their helpful comments. This work was supported in part by National Science Foundation IIS-0347877 , IIS-0916607 , and US Army Research Laboratory and the US Army Research Office under Grant ARO W911NF-08-1-0504 . This project is partly supported by the Nanyang Assistant Professorship (SUG M58040015 ) to Junsong Yuan.
PY - 2011/3
Y1 - 2011/3
N2 - Segmenting complex motion, such as articulated motion and deformable objects, can be difficult if the prior knowledge of the motion pattern is not available. We present a novel method for motion segmentation by learning the motion priors from exemplar motions to guide the segmentation. Instead of modeling the motion field explicitly, we decompose each video frame into a number of local patches and learn the spatio-temporal contextual relations among them, e.g., if their motion relationships are consistent with that from the training data. Based on a novel motion feature to measure the relative motion of two patches, the SVM classifier learns their pairwise relationship. We convert the motion segmentation problem to a binary labeling problem, and propose an iterative solution to group the local patches whose motions are consistent. Compared with other approaches, such as the graph cut and normalized cut methods, this new method is computationally more efficient and is able to better handle the inaccurate inference of pairwise relationships. Results on both synthesized and real videos show that our method can learn to segment different types of complex motion patterns.
AB - Segmenting complex motion, such as articulated motion and deformable objects, can be difficult if the prior knowledge of the motion pattern is not available. We present a novel method for motion segmentation by learning the motion priors from exemplar motions to guide the segmentation. Instead of modeling the motion field explicitly, we decompose each video frame into a number of local patches and learn the spatio-temporal contextual relations among them, e.g., if their motion relationships are consistent with that from the training data. Based on a novel motion feature to measure the relative motion of two patches, the SVM classifier learns their pairwise relationship. We convert the motion segmentation problem to a binary labeling problem, and propose an iterative solution to group the local patches whose motions are consistent. Compared with other approaches, such as the graph cut and normalized cut methods, this new method is computationally more efficient and is able to better handle the inaccurate inference of pairwise relationships. Results on both synthesized and real videos show that our method can learn to segment different types of complex motion patterns.
KW - Bipolar segmentation
KW - Learning
KW - Motion profile symmetry correlation
KW - Motion segmentation
UR - http://www.scopus.com/inward/record.url?scp=79951661621&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951661621&partnerID=8YFLogxK
U2 - 10.1016/j.cviu.2010.11.010
DO - 10.1016/j.cviu.2010.11.010
M3 - Article
AN - SCOPUS:79951661621
VL - 115
SP - 334
EP - 351
JO - Computer Vision and Image Understanding
JF - Computer Vision and Image Understanding
SN - 1077-3142
IS - 3
ER -