... al.,2004). The morphological annotation we useis the “before-file”, which lists the untokenizedwords (as they appear in the Arabic original text)and all possible analyses according to the Buck-walter ... the Levantine Arabic Treebank (LATB) from the Linguistic Data Consortium. However, thereare three major differences: the text is transcribedspeech, the corpus is much smaller, and, since,there ... Levantinecurrently, the before-files are the result of running the MSA Buckwalter analyzer on the Levantine to-ken, with many of the analyses incorrect, and only the analysis chosen for the token in...