The importance of annotated corpora for NLP: the cases of anaphora resolution and clause splitting
Mitkov, R., Orasan, C. and Evans, R. (1999): The importance of annotated corpora for NLP: the cases of anaphora resolution and clause splitting in Proceeding of "Corpora and NLP: Reflecting on Methodology Workshop", TALN'99
Abstract
In this paper we present two applications that depend on annotated corpora for their implementation, evaluation and improvement. The first is an automatic anaphora resolution system. After describing the algorithm we discuss the importance of corpora for the tasks of evaluation and automatic scoring and the development of a coreferentially annotated corpus. We go on to look ahead at the role of corpora in optimisation and semi-automatic annotation.
The second task investigates the use of an annotated corpus with a machine learning algorithm for clause splitting. We show that the method minimises the number of hand made rules necessary to achieve a good result.
Electronic version
http://clg.wlv.ac.uk/papers/mitkov-99b.pdf
BibTeX reference
@InProceedings{mitkov-TALN-99,
author = {Ruslan Mitkov and Constantin Orasan and Richard
Evans},
title = {The importance of annotated corpora for NLP: the
cases of anaphora resolution and clause splitting},
booktitle = {Proceeding of "Corpora and NLP: Reflecting on
Methodology Workshop", TALN'99},
pages = {60 -- 69},
year = {1999},
address = {Cargese, Corse},
month = {12 - 17 July},
URL = {http://clg.wlv.ac.uk/papers/mitkov-99b.pdf}
}