Abstract
Arabic writing is typically underspecified for short vowels and other marks, collectively referred to as diacritics. In addition to the lexical ambiguity exhibited in most languages, the lack of diacritics in written Arabic adds another layer of ambiguity that is an artifact of the orthography. In this paper, we present the details of three experimental annotation conditions designed to study the impact of automatic ambiguity detection on annotation speed and quality in a large-scale annotation project.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity (CL4LC) |
Publisher | The COLING 2016 Organizing Committee |
Pages | 127–136 |
State | Published - 2016 |