Abstract
One fundamental problem in object retrieval with the bag-of-visual words (BoW) model is its lack of spatial information. Although various approaches are proposed to incorporate spatial constraints into the BoW model, most of them are either too strict or too loose so that they are only effective in limited cases. We propose a new spatially-constrained similarity measure (SCSM) to handle object rotation, scaling, view point change and appearance deformation. The similarity measure can be efficiently calculated by a voting-based method using inverted files. Object retrieval and localization are then simultaneously achieved without post-processing. Furthermore, we introduce a novel and robust re-ranking method with the k-nearest neighbors of the query for automatically refining the initial search results. Extensive performance evaluations on six public datasets show that SCSM significantly outperforms other spatial models, while k-NN re-ranking outperforms most state-of-the-art approaches using query expansion.
Original language | English (US) |
---|---|
Title of host publication | 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012 |
Pages | 3013-3020 |
Number of pages | 8 |
DOIs | |
State | Published - Oct 1 2012 |
Event | 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012 - Providence, RI, United States Duration: Jun 16 2012 → Jun 21 2012 |
Other
Other | 2012 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2012 |
---|---|
Country | United States |
City | Providence, RI |
Period | 6/16/12 → 6/21/12 |
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition