Package edu.upenn.cis.propbank_shen

Provides an API to the Propbank, modified for the version included in Libin Shen's LTAG treebank in the file prop-all.idx.

See:
          Description

Class Summary
Annotation This class represents an annotation in the propbank.
ArgLabel An Argument Label represents the information ascribed to an argument in a predicate-argument structure.
ArgLoc This class represents a location of an argument in the text.
Argument This class represents an "argument" in a "predicate argument structure".
Example This class represents an example in the lexical guidelines.
FrameSet This is an interface to a frameset as defined in the propbank lexical guidelines.
InflAspect the "aspect" part of inflectional information.
Inflection This class represents inflectional information as is found in the english propbank.
InflForm This represents the overall "form" of the inflection.
InflPerson The "person" part of the inflectional information for a predicate argument structure.
InflTense We represent the future, past, and present tenses with the propbank inflectional information.
InflVoice This class represents the voice part of the inflection of a verb.
ModLabel A class emulating an enumeration of labels representing "modifiers" in the propbank.
PASLoc This class represents the location of a predicate argument structure.
PAStruct A class representing an instance of predicate argument structure.
PBConfig This class represents configuration information -- particular filenames and directories of the propbank data set.
Predicate A class representing a "predicate" in the propbank frames.
Propbank This class encapsulates the actual Propbank annotation file as modified by Libin Shen.
Role This class represents a "role" in the propbank lexical guidelines.
RoleSet A representation of a RoleSet as defined in the propbank lexical guidelines, frameset.dtd.
VNRole This class represents a propbank pointer to verbnet roles.
WordSpan This class represents the span of a propbank argument or of the subtree of a sentence by a pair of integers indicating the first word of the argument and the first word that is outside the argument.
 

Exception Summary
CorruptDataException This exception is thrown for processing corrupt data in propbank formats.
 

Package edu.upenn.cis.propbank_shen Description

Provides an API to the Propbank, modified for the version included in Libin Shen's LTAG treebank in the file prop-all.idx. This is the java propbank programming API. It provides a clean and convenient interface for manipulating various aspects of propbank data in memory. It requires Java version 1.4.

The propbank consists of annotation and a lexicon. The lexicon, here sometimes referred to as "lexical guidelines" is stored in a series of xml files.

The annotation and lexicon are available through the LDC. The annotation is built on top of the Penn Treebank, and in most respects requires access to that data. Consequently, this package is built on top of a treebank API. (Note: For the present release, all dependencies to the treebank API have been removed. Lucas Champollion)

Two system properties should set to use this API "TREEBANKDIR", "FRAMEDIR", and "PROPBANKFILE". The defaults are set to "/usr/local/corpora/wsj", "/usr/local/lexica/frames", and "/usr/local/propbank/prop-all.id". These can be set with -DTREEBANKDIR=, -DFRAMEDIR= and -DPROPBANKFILE= on the command line. See also PBConfig.