Correlation clustering with asymmetric classification errors

Jafar Jafarov*, Sanchit Kalhan, Konstantin Makarychev, Yury Makarychev

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Scopus citations

Abstract

In the Correlation Clustering problem, we are given a weighted graph G with its edges labelled as "similar"or "dissimilar"by a binary classifier. The goal is to produce a clustering that minimizes the weight of "disagreements": the sum of the weights of "similar"edges across clusters and "dissimilar"edges within clusters. We study the correlation clustering problem under the following assumption: Every "similar"edge e has weight we ϵ [aw, w] and every "dissimilar"edge e has weight we = aw (where a = 1 and w > 0 is a scaling parameter). We give a (3 + 2 loge (1/a)) approximation algorithm for this problem. This assumption captures well the scenario when classification errors are asymmetric. Additionally, we show an asymptotically matching Linear Programming integrality gap of ϵ(log 1/a).

Original languageEnglish (US)
Title of host publication37th International Conference on Machine Learning, ICML 2020
EditorsHal Daume, Aarti Singh
PublisherInternational Machine Learning Society (IMLS)
Pages4591-4600
Number of pages10
ISBN (Electronic)9781713821120
StatePublished - 2020
Event37th International Conference on Machine Learning, ICML 2020 - Virtual, Online
Duration: Jul 13 2020Jul 18 2020

Publication series

Name37th International Conference on Machine Learning, ICML 2020
VolumePartF168147-6

Conference

Conference37th International Conference on Machine Learning, ICML 2020
CityVirtual, Online
Period7/13/207/18/20

Funding

Jafar Jafarov and Yury Makarychev were supported in part by NSF CCF-1718820 and NSF TRIPODS CCF-1934843. Sanchit Kalhan and Konstantin Makarychev were supported in part by NSF TRIPODS CCF-1934931.

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Correlation clustering with asymmetric classification errors'. Together they form a unique fingerprint.

Cite this