Describes the tagset and language-specific characteristics of the English tagger and noun phrase recognizer. For information about how to use the tagging API and general tag information that applies to all languages, see the document LinguistX Tagging that comes with the runtime. For information about the noun phrase recognition service, see LinguistX Noun Phrase Recognition.

English Tagger

The English tagger output contains the following language-specific characteristics:

Proper nouns are identified with three main tags: Prop-Org, Prop-Name, and Prop-Place. Businesses and other organizations are tagged Prop-Org; place names are tagged Prop-Place. People and other things (buildings that aren't considered places, book titles, etc.) are tagged Prop-Name. Titles such as Mr. are tagged Prop-Title.

Hyphenated words, when used as modifiers, are tagged Adj.

Words such as his, her, and its, when used as modifiers, are tagged as possessive determiners (Det-Poss). The possessive pronouns his, hers, and its are tagged as pronouns (Pron).

Past-tense verbs are tagged V-Past; past perfect and passive uses are tagged past participle (V-PaPart); past participles that have become adjectives and are used as modifiers are tagged Adj.

Present participles are tagged V-Prog when used as a verb. They are tagged Nn-Sg when used as a noun, and Adj when used as a modifier.

The following table shows the complete English tag set.

TagDescription Examples
Abbrabbreviation that is not a title i.e.
Abbr-Measabbreviation of measure oz.
Adjadjectivebig
Adj-Compcomparative adjective bigger
Adj-Supsuperlative adjective biggest
Advadverbquickly
Adv-Compcomparative adverb earlier
Adv-Intwh-adverb how, when
Adv-Supsuperlative adverb fastest
Auxauxiliary or modal has, could
Conj-Coordcoordinating conjunction and
Conj-Subsubordinating conjunction if, that
Detinvariant determiner (singular or plural) some, no
Det-Defdefinite determiner the
Det-Indefindefinite determiner a
Det-Intwh-determiner what, which, whose
Det-Plplural determiner these, those
Det-Posspossessive determiner her, his, its
Det-Relrelative determiner whose
Det-Sgsingular determiner this, that
Interjinterjectionoh, hello
Letterlettera, b, c
Markup-SGMLSGML markup <TITLE>
Nninvariant nounsheep
Nn-Plplural nouncomputers
Nn-Sgsingular nountable
Numnumber or numeric expression 40.5
Num-Moneymonetary amount $12.55
Num-Percentpercentage 12%
Num-Romanroman numeral XVII, xvii
Onomonomatopoeiameow
Ordordinal numberfirst, second
Part-Infinfinitive marker to
Part-Negnegative particle not
Part-Posspossessive marker 's, '
Prepprepositionin, on, to
Pronpronounhe
Pron-Intwh-pronoun who
Pron-Reflreflexive pronoun himself
Pron-Relrelative pronoun who, whom, that, which
Prop-Namename of a person or thing Graceland, Aesop
Prop-Name-Famlast name Jones
Prop-Name-Givfirst name Susan, Jacob
Prop-Orgname of an organization Xerox
Prop-Placeplace name Colorado
Prop-TitletitleMr., Gen.
Punctother punctuation - ; /
Punct-Closeclosing punctuation ) ] }
Punct-Commacomma,
Punct-Openopening punctuation ( [ {
Punct-Quotequote' " ''
Punct-Sentsentence-ending punctuation . ! ?
Timetime expression 9:00
V-PaPartverb, past participle understood
V-PaPart-bepast participle of to be been
V-Pastverb, past tense ran
V-Past-Pl-beverb, past tense plural of to be were
V-Past-Sg-beverb, past tense singular of to be was
V-Presverb, present tense or infinitive walk
V-Pres-3-Sgverb, present tense, 3rd person singular runs
V-Pres-Pl-beverb, present tense plural of to be are
V-Pres-Sg-beverb, present tense singular of to be is
V-Progprogressive verb swimming
WordPartpart of a multi-word phrase quo

English Noun Phrase Recognition

The English phrase extractor defines simple noun phrases as:

Prepositions other than of and at are excluded because of ambiguity in English of prepositional phrase binding.

Noun phrases with commas are recognized.

Proper-noun groups are kept together during subphrase finding.


$RCSfile: lxenp.html,v $ $Revision: 1.1 $ $Date: 1997/12/17 03:40:42 $