See: Description
Interface | Description |
---|---|
AdornedWord |
Interface for a word adorned with addition morphological information.
|
AdornedWordFilter |
An Adorned word filter.
|
Class | Description |
---|---|
AdornedWordStopWordFilter |
An Adorned word stop word filter.
|
BaseAdornedWord |
A word adorned with addition morphological information.
|
PorterStopWordFilter |
An Adorned word filter for Martin Porter's stop word list.
|
An AdornedWord
represents a single word spelling,
symbol, or punctuation mark in text. In addition to its original
textual representation, the adorned word provides a corrected spelling
which records a form corrected for transcription errors or "gaps".
This is the form of the word used for subsequent morphological processing,
which can add a standardized spelling,
a lemma, a part of speech, a token type, and possibly other
morphological adornments to each word.
MorphAdorner's part of speech taggers produce a list of adorned words as the output of the tagging process, collected into a list of AdornedWords. MorphAdorner adds other adornments in subsequent processing.
An AdornedWordFilter
provides an interface for filtering the contents of an AdornedWord list.