Annotation invovles a methodology for adding information to a document at some level—a word or phrase, paragraph or section or the entire document..... The ultimate goal is to enable interoperability among annotations for different linguistic phenomena for the same language, together with linguistic annotations applied to different languages and modalities.
Inter-annotator agreement
Intra-annotator agreement
How to measure agreement between annotators?
Machine annotation
and the problemsHow good is a given annotation? Is it correct? Is it consistent? How can you check this for thousands of sentences? The annotation manual may easily be 50 or 100 pages long, and annotation takes a lot of time. E.g. SALSA: 20,000 sentences, about 4 years
Is there any way we can speed this up?
Things must be taken into consideration: (Palmer and Xue, 2009)
tagset
: design criteria; granularitytagger
: different algorithms
For advanced use:
Preparing for the quiz. (Antconc, BNC-WEB and WSE)
Shower Presentation Template
Author: Vadim Makeev, Opera Software
Modified: Ramnath Vaidyanthan, for Slidify