Abstract
Most existing methods of semi-supervised clustering introduce supervision from outside, e.g., manually label some data samples or introduce constrains into clustering results. This paper studies an interesting problem: can the supervision come from inside, i.e., the unsupervised training data themselves? If the data samples are not independent, we can capture the contextual information reflecting the dependency among the data samples, and use it as supervision to improve the clustering. This is called context-aware clustering. The investigation is substantialized on two scenarios of (1) clustering primitive visual features (e.g., SIFT features) with help of spatial contexts, and (2) clustering '0'-'9' hand written digits with help of contextual patterns among different types of features. Our context-aware clustering can be well formulated in a closed-form, where the contextual information serves as a regularization term to balance the data fidelity in original feature space and the influences of contextual patterns. A nested-EM algorithm is proposed to obtain an efficient solution, which proves to converge. By exploring the dependent structure of the data samples, this method is completely unsupervised, as no outside supervision is introduced.
Original language | English (US) |
---|---|
Title of host publication | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR |
DOIs | |
State | Published - 2008 |
Event | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR - Anchorage, AK, United States Duration: Jun 23 2008 → Jun 28 2008 |
Publication series
Name | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR |
---|
Other
Other | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR |
---|---|
Country/Territory | United States |
City | Anchorage, AK |
Period | 6/23/08 → 6/28/08 |
Funding
*The author is indebted to John Campbell, Benjamin Friedman, Jonathan Ingersoll, Edward Kane, Stephen LeRoy, Jeffrey Miron, and J. Huston McCulloch for helpful comments and discussions, and to Jeeman Jung, Sejin Kim, Plutarchos Sakellaris, and James Robinson for research assistance. Research was supported by the National Science Foundation. The research reported here is part of the NBER's research program in Financial Markets and Monetary Economics. Any opinions expressed are those of the authors and not those of the National Bureau of Economic Research.
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Control and Systems Engineering