Interface | Description |
---|---|
Stemmer |
Defines an interface for retrieving the stem of a word.
|
Class | Description |
---|---|
DefaultStemmer |
DefaultStemmer: The default MorphAdorner spelling stemmer.
|
LancasterStemmer |
LancasterStemmer: Implements the Lancaster (Paice/Husk) word stemmer.
|
PorterStemmer |
Stemmer, implementing the Porter Stemming Algorithm
The Stemmer class transforms a word into its root form.
|
StemmerFactory |
Stemmer factory.
|
Stemming offers a simpler alternative to lemmatization. Stemming also attempts to reduce a word to a base form by removing affixes, but the resulting stem is not necessarily a proper lemma. Such stems can be useful in information retrieval applications.
All MorphAdorner stemmers must implement the
Stemmer
interface. The
StemmerFactory
provides the mechanism for instantiating a default or specified instance
of a stemmer implementation.
Stemmers can also be used as pseudo-lemmatizers. See the
edu.northwestern.at.morphadorner.corpuslinguistics.lemmatizer
package.
Two widely used stemmers are included in MorphAdorner.