The importance of annotated corpora for NLP: the cases of anaphora resolution and clause splitting
Mitkov, R., Orasan, C. and Evans, R. (1999): The importance of annotated corpora for NLP: the cases of anaphora resolution and clause splitting in Proceeding of "Corpora and NLP: Reflecting on Methodology Workshop", TALN'99
Abstract

In this paper we present two applications that depend on annotated corpora for their implementation, evaluation and improvement. The first is an automatic anaphora resolution system. After describing the algorithm we discuss the importance of corpora for the tasks of evaluation and automatic scoring and the development of a coreferentially annotated corpus. We go on to look ahead at the role of corpora in optimisation and semi-automatic annotation.
The second task investigates the use of an annotated corpus with a machine learning algorithm for clause splitting. We show that the method minimises the number of hand made rules necessary to achieve a good result.

Electronic version

http://clg.wlv.ac.uk/papers/mitkov-99b.pdf
BibTeX reference
@InProceedings{mitkov-TALN-99,
  author =	 {Ruslan Mitkov and Constantin Orasan and Richard
                  Evans},
  title =	 {The importance of annotated corpora for NLP: the
                  cases of anaphora resolution and clause splitting},
  booktitle =	 {Proceeding of "Corpora and NLP: Reflecting on
                  Methodology Workshop", TALN'99},
  pages =	 {60 -- 69},
  year =	 {1999},
  address =	 {Cargese, Corse},
  month =	 {12 - 17 July},
  URL =		 {http://clg.wlv.ac.uk/papers/mitkov-99b.pdf}
}

(c) 2006 - 2012 Research Group in Computational Linguistics
Last modified: August 31 2010