Five equally experienced annotators provided linguistic annotations of a subset of 212 clauses extracted from Augmentative and Alternative Communication resources from the Aragonese Portal of Augmentative and Alternative Communication (http://www.arasaac.org/index.php). Each clause is annotated with linguistic information (error type: morphological, syntactic, lexicon, grammar, orthographic, target, lemmatiser), the rating concerning the correctness of the clause and other useful information for natural language generation purposes.
Permitted Uses:
1. The information may only be used for research and development of natural-language processing, information-retrieval or knowledge-discovery systems.
2. Summaries, analyses and interpretations of the linguistic properties of the Information may be derived and published, provided it is not possible to reconstruct the Information from these summaries.
3. Small excerpts of the Information may be displayed to others or published in a scientific or technical context, solely for the purpose of describing the research and development and related issues. Any such use shall not infringe on the rights of any third party including, but not limited to, the authors and publishers of the excerpts.
Besides, we provide the English subset regarding manual evaluation. The format is as follows:
Spanish_sentence_1\tSpanish_lemmas_1\s|\sEnglish_sentence_1\tEnglish_lemmas1
Spanish_sentence_2\tSpanish_lemmas_2\s|\sEnglish_sentence_2\tEnglish_lemmas2
Note that when Spanish_lemmas is equals to EMPTY this means it was consider as automatic evaluation in the Spanish version.