Gumar Annotation
The current Gumar Corpus annotations are mostly provided automatically using MADAMIRA. The manual annotations are provided through trained annotators. There will be an indicator showing the source of the annotation next to each word in the search results (coming soon).
Annotation Guidelines
The following is a list with the guidelines used to manually annotate a portion of this corpus::
- Conventional Orthography for Dialectal Arabic (CODA) guidelines
- Part-of-Speech guidelines
Contributors
The following are the names of the people who contributed to the annotations:
- Salam Khalifa
- Sara Hassan
- Fadhl Al Eryani
- Fatema Al Fardan
Obtaining Annotations
Click here to download the Annotated Gumar Corpus.