... the repository to train a decision list for NE classification. 3. The learned rules are applied to the NE candidates stored in the repository. 4. The proper names tagged in Step 3 and their ... containsDigitAndAlpha, containsDigitAndDash, containsDigitAndSlash, containsDigitAndComma, containsDigitAndPeriod, otherNum, allCaps, capPeriod, initCap, lowerCase, other. 6 Benchmarking and Discussion ... 86.7% To benchmark the quality of the automatically constructed corpus (Table 2), the testing corpus is first processed by our parser and then saved into the repository. The repository level...