... 93–96,
Columbus, Ohio, USA, June 2008.
© 2008 Association for Computational Linguistics
Using Structural Information for Identifying Similar Chinese Characters
Chao-Lin Liu Jen-Hsiang Lin
Department ... possible for us to employ image-based
methods to identify visually similar
characters, the resulting computational costs
can be very high. We propose methods for...
...
1996), incorporating information gained from the
textual context of the candidate term.
2 Context information for terms
The idea of incorporating context information for
term extraction ... product.
Since context carries information about terms it
should be involved in the procedure for their extraction.
We incorporate context information in the
form of weights con...
... Academia Sinica.
Chinese Knowledge Information Processing Group.
1996. A study of Chinese Word Boundaries and
Segmentation Standard for Information processing
(in Chinese). Technical Report, ... Language
Modeling for Chinese, ACM
Transactions on Asian Language Information
Processing, 1(1):3-33.
Gu, H.Y., C.Y. Tseng and L.S. Lee. 1991. Markov
modeling of Mandarin...
... brackets).
For average-size texts (e.g. the Written Questions),
these words account for about 5% of the
total (about 3k words / text). This number varies
according to language similarity. For instance,
on ... intermingled.
Parallel texts (texts that are mutual translations)
are valuable sources of information for
bilingual lexicography. However, they are not of
much use unless...
... language is
available for download (download.wikimedia.org)
in a text format suitable for inclusion in a database.
For the remainder of this paper, we refer to this
format.
1
Within Wikipedia, ... language
article, if available, for additional information.
• A second pass checks for multi-word phrases
that exist as titles of Wikipedia articles.
• We look for certain ty...
... of additional conformations.
Some of those transiently formed conformations
may be perfectly suited for a selected protein-ligand
interaction. The distinguished protein conformation
is then ... with
dynamic information from heteronuclear NOEs and structural insight from
homonuclear NOE-based distance constraints indicated that micelle-associated
VpUcyt retains a substantial degr...
... of pool-based active learning.
Various methods for selecting informative examples
can be combined with this framework.
2.2 Selection Algorithm for Large Margin Classifiers
One of the most accurate ... set.
2. While resources for labeling examples are
available
(a) Apply the current classifier to each unlabeled
example
(b) Find the m examples which are most informative
for the clas...
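The selection steps listed above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the excerpt is truncated before it defines "most informative", so the sketch assumes a linear large-margin classifier and treats the examples closest to the decision boundary as the most informative. The names `decision_value`, `select_most_informative`, `active_learning_loop`, and the `train` and `oracle` callbacks are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Labeled = Tuple[Vector, int]


def decision_value(weights: Vector, bias: float, x: Vector) -> float:
    """Signed score of a linear classifier (step a)."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def select_most_informative(weights: Vector, bias: float,
                            pool: Sequence[Vector], m: int) -> List[int]:
    """Indices of the m pool examples with the smallest |score| (step b).

    ASSUMPTION: "informative" means "closest to the decision boundary".
    """
    scored = [(abs(decision_value(weights, bias, x)), i)
              for i, x in enumerate(pool)]
    scored.sort()
    return [i for _, i in scored[:m]]


def active_learning_loop(labeled: Sequence[Labeled],
                         pool: Sequence[Vector],
                         train: Callable[[List[Labeled]], Tuple[Vector, float]],
                         oracle: Callable[[Vector], int],
                         m: int,
                         budget: int):
    """Repeat select / label / retrain while labeling budget remains."""
    labeled = list(labeled)
    pool = list(pool)
    weights, bias = train(labeled)
    while budget > 0 and pool:
        k = min(m, budget, len(pool))
        chosen = select_most_informative(weights, bias, pool, k)
        # Pop from the highest index down so earlier indices stay valid.
        for i in sorted(chosen, reverse=True):
            x = pool.pop(i)
            labeled.append((x, oracle(x)))
        budget -= k
        weights, bias = train(labeled)
    return weights, bias, labeled
```

Here `train` stands in for whatever large-margin learner is used and `oracle` for the human annotator; both are supplied by the caller, so the sketch makes no claim about how the classifier itself is fitted.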
... hypothesis (Harris, 1954)
states that words that have similar distributions are
semantically similar. We compute f(u, w) as the
pointwise mutual information between a lexical element
u and a feature ... delicious in book reviews.
Therefore, a model that is trained only using book
reviews might not have any weights learnt for delicious
or rust, which would make it difficult for thi...
...
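As an illustration of the pointwise mutual information score f(u, w) mentioned in the distributional-similarity passage above, the following sketch estimates it from raw co-occurrence counts. It assumes maximum-likelihood probability estimates and a base-2 logarithm; the excerpt specifies neither, and the function name `pmi_table` is hypothetical.

```python
import math
from collections import Counter
from typing import Dict, Iterable, Tuple


def pmi_table(pairs: Iterable[Tuple[str, str]]) -> Dict[Tuple[str, str], float]:
    """PMI for every observed (lexical element, feature) pair.

    PMI(u, w) = log2( p(u, w) / (p(u) * p(w)) ), with probabilities
    estimated as relative frequencies over the observed pairs.
    """
    pairs = list(pairs)
    total = len(pairs)
    joint = Counter(pairs)
    elements = Counter(u for u, _ in pairs)
    features = Counter(w for _, w in pairs)
    return {
        (u, w): math.log2(
            (count / total)
            / ((elements[u] / total) * (features[w] / total))
        )
        for (u, w), count in joint.items()
    }
```

Only observed pairs receive a score, which avoids taking the logarithm of zero; smoothing or discounting of rare pairs is left out of this sketch.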
Manning. 2005. Incorporating Non-local Information
into Information Extraction Systems by Gibbs Sampling.
In Proc. 43rd Annual Meeting of the Association
for Computational Linguistics, pages ... “peripheral vision”.
Gupta and Ji (2009) used cross-event information
within ACE extraction, but only for recovering
implicit time information for events.
Liao and Grishman (201...
... debate by
identifying which aspects of language can potentially
be learnt from the input available to a child.
Here we try to identify linguistic properties that
convey information useful for learning ... contextual
information is less important for their acquisition
than, say, syntax.
2 From PCFGs to Adaptor Grammars
This section introduces adaptor grammars as an
extension...