CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

Arie Cattan, Tom Hope, Doug Downey, Roy Bar-Haim, Lilach Eden, Yoav Kantor, Ido Dagan

Research output: Contribution to conferencePaperpeer-review

Abstract

Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference resolution, annotating event and subevent relations, etc. To enable efficient annotation of such hierarchical structures, we release CHAMP, an open source tool allowing to incrementally construct both clusters and hierarchy simultaneously over any type of texts. This incremental approach significantly reduces annotation time compared to the common pairwise annotation approach and also guarantees maintaining transitivity at the cluster and hierarchy levels. Furthermore, CHAMP includes a consolidation mode, where an adjudicator can easily compare multiple cluster hierarchy annotations and resolve disagreements.

Original languageEnglish (US)
Pages403-412
Number of pages10
StatePublished - 2023
Event2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023 - Singapore, Singapore
Duration: Dec 6 2023Dec 10 2023

Conference

Conference2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023
Country/TerritorySingapore
CitySingapore
Period12/6/2312/10/23

Funding

This work was supported by the Israel Science Foundation (grant no. 2827/21). Arie Cattan is partially supported by the PBC fellowship for outstanding PhD candidates in data science.

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies'. Together they form a unique fingerprint.

Cite this