... Vaithyanathan. 2010b. SystemT: an algebraic approach
to declarative information extraction. ACL.
Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao
Li, Frederick Reiss, and Shivakumar Vaithyanathan.
2010c. ... information extrac-
tion program using a language called AQL. AQL is a
declarative relational language similar in syntax to the
database language SQL, which was chosen as a basis for
our language due ... Record,
37(4):14–20.
A. Doan, R. Ramakrishnan, and S. Vaithyanathan. 2006.
Managing Information Extraction: State of the Art and
Research Directions. In SIGMOD.
F. Reiss, S. Raghavan, R. Krishnamurthy, H. Zhu, and
S....
... Approach to Declarative Information Extraction
Laura Chiticariu Rajasekar Krishnamurthy Yunyao Li
Sriram Raghavan Frederick R. Reiss Shivakumar Vaithyanathan
IBM Research – Almaden
San Jose, CA, ... index-based entity annotation. In
InfoScale.
Frederick Reiss, Sriram Raghavan, Rajasekar Kr-
ishnamurthy, Huaiyu Zhu, and Shivakumar
Vaithyanathan. 2008. An algebraic approach to
rule-based information ... policy.
3.2 AQL
Extraction rules in SystemT are written in AQL,
a declarative relational language similar in syn-
tax to the database language SQL. We chose SQL
as a basis for our language due to...
... Information extraction
with HMMs and shrinkage. In Proceedings of the AAAI-99
Workshop on Machine Learning for Information Extraction.
D. Freitag. 1998. Machine learning for information extraction
in ... document.
This approach has the added advantage of allowing
the training procedure to automatically learn good
weightings for these “global” features relative to the
local ones. However, this approach cannot ... relationships between nodes which are the
same class, but may not be similar in any other way.
For instance, in the CMU Seminar Announcements
dataset, we can normalize all entities labeled as a
start...
... from ANNIE in a simi-
lar way to the Bulgarian one, using a tokeniser,
gazetteer and a JAPE semantic grammar. Fig-
ure 3 shows some Romanian text annotated in
GATE.
Romanian is a more flexible language ... gate. Technical Report CS-02-01,
University of Sheffield.
Katerina Pastra, Diana Maynard, Hamish Cun-
ningham, Oana Hamza, and Yorick Wilks.
2002. How feasible is the reuse of grammars
for named ... Bulgarian text
annotated in GATE.
Since the structure of the Bulgarian and
Russian languages is quite similar, we antic-
ipate that converting the Bulgarian system
to Russian will be fairly straightforward,...
... information available to
an average, idealized speaker). A Metalinguistic
Information Database (MID), on the other hand,
compiles the real-time data provided by metalan-
guage analysis of leading-edge ... create special databases to boot-
strap compilation and facilitate update of the
huge and dynamically changing glossaries,
knowledge bases and ontologies that are vital
to modern-day research. ... self-
referential lexical items that are the logical or
grammatical subject of a predication that need
not be a complete grammatical sentence.
At a very basic semiotic level natural language has...
... an
algebraic approach to declarative information
extraction. ACL.
L. Chiticariu, R. Krishnamurthy, Y. Li, F. Reiss, and
S. Vaithyanathan. 2010b. Domain adaptation of rule-
based annotators for named-entity ... in an ap-
plication using a Java API interface. WizIE can also
wrap the executable plan in a pre-packaged applica-
tion that can be run in a map-reduce environment,
then deploy this application ... concrete extrac-
tion tasks are captured by a tree structure called the
extraction plan (e.g. right panel in Fig. 2). Each
leaf node in an extraction plan corresponds to an
atomic extraction task, while...
... Daisuke Kawahara† Yoshikiyo Kato† Tetsuji Nakagawa† Kentaro Inui† Sadao Kurohashi†‡ Yutaka Kidawara†
†National Institute of Information and Communications Technology
‡Graduate School ... from an ordinary dictionary.
5. Extraction of Evaluative Information
The extraction and classification of evaluative
information from texts are important tasks with
many applications and ... page author, and informa-
tion appearance (e.g., contact address, privacy
policy, volume of advertisements, and images)
are automatically analyzed and stored in the
standard format.
4. Extraction...
... [Figure 2. Example of anchor. Recoverable labels: Satellite, span, elaboration; Anchor A_i: Marshal, pos_NNP, list_personWord, Cand_AtArg1, Minipar_obj, Arg2, Spade_Satellite] ... pair A_i and A_j.
5.3 Evaluation of templates
At this stage, we have a set of accepted integral
relation paths between any anchor pair A_i and A_j.
The next task is to merge appropriate ... re-
ported accuracy of Spade is 49% on the RST-DT
corpus. To obtain a clausal path, we map each
anchor A_i to its clause in Spade. If anchors A_i and A_j
belong to the same clause, we assign...
... separate data area is reserved for this purpose.
The separate data area of these database systems means
that they do not need the segment cleaning mechanisms of
the Sprite LFS to reclaim log space. ... under grant CCR-8900029, and in part
by the National Aeronautics and Space Administration and the
Defense Advanced Research Projects Agency under contract
NAG2-591.
This paper will appear in the ... illustrates the fact that a log-structured file
system produces a different form of locality on disk than
traditional file systems. A traditional file system achieves
logical locality by assuming certain...
... signal.
Alarm system interface unit
The alarm system delivered does not function as a complete home alarm system, but
merely illustrates that the home automation system can interface ... interface with a larger
existing alarm system.
The alarm interface unit provides the home alarm system with an arm/disarm signal
and reports back to the master unit the current integrity status of ... very important, as it means that on a noisy powerline circuit, home
automation signals can be sent reliably, as long as the rate of transmission is low
enough. Since a home automation system does...