TY - JOUR
T1 - Efficient, certifiably optimal clustering with applications to latent variable graphical models
AU - Eisenach, Carson
AU - Liu, Han
N1 - Publisher Copyright:
Copyright © 2018, The Authors. All rights reserved.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2018/6/1
Y1 - 2018/6/1
N2 - Motivated by the task of clustering either d variables or d points into K groups, we investigate efficient algorithms to solve the Peng-Wei (P-W) K-means semi-definite programming (SDP) relaxation. The P-W SDP has been shown in the literature to have good statistical properties in a variety of settings, but remains intractable to solve in practice. To this end we propose FORCE, a new algorithm to solve this SDP relaxation. Compared to the naive interior point method, our method reduces the computational complexity of solving the SDP from Oe(d7 log ∊−1) to Oe(d6K−2∊−1) arithmetic operations for an ∊-optimal solution. Our method combines a primal first-order method with a dual optimality certificate search, which when successful, allows for early termination of the primal method. We show for certain variable clustering problems that, with high probability, FORCE is guaranteed to find the optimal solution to the SDP relaxation and provide a certificate of exact optimality. As verified by our numerical experiments, this allows FORCE to solve the P-W SDP with dimensions in the hundreds in only tens of seconds. For a variation of the P-W SDP where K is not known a priori a slight modification of FORCE reduces the computational complexity of solving this problem as well: from Oe(d7 log ∊−1) using a standard SDP solver to Oe(d4∊−1).
AB - Motivated by the task of clustering either d variables or d points into K groups, we investigate efficient algorithms to solve the Peng-Wei (P-W) K-means semi-definite programming (SDP) relaxation. The P-W SDP has been shown in the literature to have good statistical properties in a variety of settings, but remains intractable to solve in practice. To this end we propose FORCE, a new algorithm to solve this SDP relaxation. Compared to the naive interior point method, our method reduces the computational complexity of solving the SDP from Oe(d7 log ∊−1) to Oe(d6K−2∊−1) arithmetic operations for an ∊-optimal solution. Our method combines a primal first-order method with a dual optimality certificate search, which when successful, allows for early termination of the primal method. We show for certain variable clustering problems that, with high probability, FORCE is guaranteed to find the optimal solution to the SDP relaxation and provide a certificate of exact optimality. As verified by our numerical experiments, this allows FORCE to solve the P-W SDP with dimensions in the hundreds in only tens of seconds. For a variation of the P-W SDP where K is not known a priori a slight modification of FORCE reduces the computational complexity of solving this problem as well: from Oe(d7 log ∊−1) using a standard SDP solver to Oe(d4∊−1).
UR - http://www.scopus.com/inward/record.url?scp=85094535325&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094535325&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85094535325
JO - Free Radical Biology and Medicine
JF - Free Radical Biology and Medicine
SN - 0891-5849
ER -