Saarland University

COLLATE (UdS)

Computational Linguistics and Language Technology for Real Life Applications

Research

Multilevel Corpus Annotation

We annotate a corpus of German business news at the following levels:

  • named entities
  • coreference
  • domain templates
  • FrameNet

In cooperation with the TiGER project, the same corpus is annotated with parts of speech, morphology, and dependency relations.

The corpus is to be used as a "gold standard" for developing, training, and evaluating information extraction systems.


last change: 18th September 2002 by bering@coli.uni-sb.de