MorphAdorner Server Services: Lemmatizer

Service name: lemmatizer
Service description: Find lemma form (dictionary headword) for a spelling.
HTTP methods allowed: GET, POST, OPTIONS
POST accepts as input: application/x-www-form-urlencoded
HTTP return codes:
    200: service succeeded
    400: service failed with an error

Query parameters

    corpusConfig: Corpus configuration name. In the standard distribution these are ece (Eighteenth Century English), eme (Early Modern English), and ncf (Nineteenth Century Fiction).
    media: Result format. One of json, xml, html, text.
    spelling: Spelling of the word to lemmatize.
    standardize: Standardize (modernize) the spelling before lemmatizing. Allowed values are true, to request spelling standardization, and false, to disallow it.
    wordClass: Primary word class. One of adjective, adverb, compound, conjunction, infinitive-to, noun, noun-possessive, preposition, pronoun, pronoun-possessive, pronoun-possessive-determiner, verb.
    wordClass2: Secondary word class. One of adjective, adverb, compound, conjunction, infinitive-to, noun, noun-possessive, preposition, pronoun, pronoun-possessive, pronoun-possessive-determiner, verb.
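
As an illustration of how these parameters combine in a GET request, here is a minimal Python sketch that builds the query URL. The base URL is a placeholder assumption (this page does not state the server address), and the helper name build_lemmatizer_url is hypothetical. Empty word classes are sent as empty values, mirroring what the sample form below submits.

```python
from urllib.parse import urlencode

# Placeholder address (an assumption): substitute your own MorphAdorner server.
BASE_URL = "http://localhost:8080/lemmatizer"


def build_lemmatizer_url(spelling, corpus_config="ncf", media="xml",
                         standardize=True, word_class="", word_class2="",
                         base_url=BASE_URL):
    """Return a GET URL for the lemmatizer service (hypothetical helper)."""
    params = {
        "spelling": spelling,
        "corpusConfig": corpus_config,
        "media": media,
        # The service expects the literal strings "true" or "false".
        "standardize": "true" if standardize else "false",
        "wordClass": word_class,
        "wordClass2": word_class2,
    }
    # urlencode percent-encodes each value and joins the pairs with "&".
    return base_url + "?" + urlencode(params)


url = build_lemmatizer_url("strykynge", corpus_config="eme",
                           media="json", word_class="verb")
print(url)
```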

Sample POST form

<form accept-charset="UTF-8" method="post" action="lemmatizer"
      target="_blank"
      name="lemmatizer">
<table cellpadding="0" cellspacing="5">
<tr>
<td><strong>Spelling:</strong></td>
<td><input type="text" name="spelling" size="20" value="" /></td>
</tr>
<tr>
<td>&nbsp;</td>
<td><input type="checkbox" name="standardize" value="true" checked="checked" />Standardize spelling</td>
</tr>
<tr>
<td><strong>Primary word class:</strong></td>
<td>
<select name="wordClass">
<option value="" selected="selected"></option>
<option value="adjective">adjective</option>
<option value="adverb">adverb</option>
<option value="compound">compound</option>
<option value="conjunction">conjunction</option>
<option value="infinitive-to">infinitive-to</option>
<option value="noun">noun</option>
<option value="noun-possessive">noun-possessive</option>
<option value="preposition">preposition</option>
<option value="pronoun">pronoun</option>
<option value="pronoun-possessive">pronoun-possessive</option>
<option value="pronoun-possessive-determiner">pronoun-possessive-determiner</option>
<option value="verb">verb</option>
</select>
</td>
</tr>
<tr>
<td><strong>Secondary word class:</strong></td>
<td>
<select name="wordClass2">
<option value="" selected="selected"></option>
<option value="adjective">adjective</option>
<option value="adverb">adverb</option>
<option value="compound">compound</option>
<option value="conjunction">conjunction</option>
<option value="infinitive-to">infinitive-to</option>
<option value="noun">noun</option>
<option value="noun-possessive">noun-possessive</option>
<option value="preposition">preposition</option>
<option value="pronoun">pronoun</option>
<option value="pronoun-possessive">pronoun-possessive</option>
<option value="pronoun-possessive-determiner">pronoun-possessive-determiner</option>
<option value="verb">verb</option>
</select>
</td>
</tr>
<tr>
<td valign="top">
<strong>
Lexicon:</strong>
</td>
<td>
<input type="radio" name="corpusConfig" value="eme" />Early Modern English<br />
<input type="radio" name="corpusConfig" value="ece" />Eighteenth Century English<br />
<input type="radio" name="corpusConfig" value="ncf" checked="checked" />Nineteenth Century Fiction
</td>
</tr>
<tr>
<td>
&nbsp;
</td>
<td>
&nbsp;
</td>
</tr>
<tr>
<td valign="top">
<strong>Results format:</strong>
</td>
<td>
<input type="radio" name="media" value="json" />JSON format<br />
<input type="radio" name="media" value="xml" checked="checked" />XML format<br />
<input type="radio" name="media" value="html" />HTML format<br />
<input type="radio" name="media" value="text" />Text format
</td>
</tr>
<tr>
<td>
&nbsp;
</td>
<td>
&nbsp;
</td>
</tr>
<tr>
<td>
<input type="submit" name="lemmatize" value="Lemmatize" />
</td>
</tr>
</table>
</form>
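
The same POST can be prepared from a script rather than a browser form. The Python sketch below encodes the fields as application/x-www-form-urlencoded, the input type listed above for POST. The service URL is a placeholder assumption, and the request is only constructed, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder address (an assumption): substitute your own MorphAdorner server.
SERVICE_URL = "http://localhost:8080/lemmatizer"

# Field names match the "name" attributes of the sample form's inputs.
fields = {
    "spelling": "strykynge",
    "standardize": "true",
    "wordClass": "verb",
    "wordClass2": "",
    "corpusConfig": "eme",
    "media": "xml",
}

# Encode the fields as a URL-encoded byte string for the request body.
body = urlencode(fields).encode("utf-8")

request = Request(
    SERVICE_URL,
    data=body,
    headers={"Content-Type": "application/x-www-form-urlencoded"},
    method="POST",
)

print(request.get_method(), request.full_url)
print(body.decode("utf-8"))
```

To send the request, pass it to urllib.request.urlopen. That call raises urllib.error.HTTPError for a 400 response, so a caller would normally wrap it in a try/except block.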

Output

Here is sample output for the spelling strykynge, using the eme (Early Modern English) corpus configuration, with spelling standardization requested and verb supplied as the primary word class.

The JSON and XML formats echo the input spelling, the corpusConfig, the standardize setting, and the primary and secondary word classes (wordClass and wordClass2). The standardized spelling is emitted as standardSpelling and the lemma form as lemma. In addition, the Lancaster stemmer result appears in the lancasterStem output field and the Porter stemmer result in the porterStem output field. The HTML and plain text formats, which are intended for display, do not echo the input query parameters.

JSON output

{
  "LemmatizerResult": {
    "spelling": "strykynge",
    "standardSpelling": "striking",
    "corpusConfig": "eme",
    "wordClass": "verb",
    "wordClass2": "",
    "lemma": "strike",
    "standardize": true,
    "lancasterStem": "strik",
    "porterStem": "strike"
  }
}
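
A client might read the JSON result as in the following Python sketch. It parses the sample output shown above from a string, so it runs without a server. The variable names are illustrative only.

```python
import json

# The sample JSON output shown above, as it might arrive in a response body.
sample = """
{
  "LemmatizerResult": {
    "spelling": "strykynge",
    "standardSpelling": "striking",
    "corpusConfig": "eme",
    "wordClass": "verb",
    "wordClass2": "",
    "lemma": "strike",
    "standardize": true,
    "lancasterStem": "strik",
    "porterStem": "strike"
  }
}
"""

# All fields are nested under the top-level "LemmatizerResult" key.
result = json.loads(sample)["LemmatizerResult"]

print(result["spelling"], "->", result["standardSpelling"], "->", result["lemma"])
# "standardize" is a JSON boolean, so it becomes a Python bool.
print(result["standardize"])
```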

XML output

<LemmatizerResult>
   <spelling>strykynge</spelling>
   <standardSpelling>striking</standardSpelling>
   <corpusConfig>eme</corpusConfig>
   <wordClass>verb</wordClass>
   <wordClass2/>
   <lemma>strike</lemma>
   <standardize>true</standardize>
   <lancasterStem>strik</lancasterStem>
   <porterStem>strike</porterStem>
</LemmatizerResult>
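
The XML result can be read in much the same way. This Python sketch parses the sample output above with the standard library's ElementTree and collects the child elements into a dictionary. Note that every value, including standardize, arrives as text, and that the empty wordClass2 element has no text at all.

```python
import xml.etree.ElementTree as ET

# The sample XML output shown above, as it might arrive in a response body.
sample = """<LemmatizerResult>
   <spelling>strykynge</spelling>
   <standardSpelling>striking</standardSpelling>
   <corpusConfig>eme</corpusConfig>
   <wordClass>verb</wordClass>
   <wordClass2/>
   <lemma>strike</lemma>
   <standardize>true</standardize>
   <lancasterStem>strik</lancasterStem>
   <porterStem>strike</porterStem>
</LemmatizerResult>"""

root = ET.fromstring(sample)

# Map each child tag to its text; an empty element has text None, so use "".
fields = {child.tag: (child.text or "") for child in root}

print(fields["lemma"])
# XML carries no boolean type, so compare against the string "true".
print(fields["standardize"] == "true")
```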

HTML output (source)

<h3>Lemmatizer Results</h3>
<table border="0">
<tr>
<td valign="top" align="left"><strong>Lemma:</strong></td>
<td valign="top" align="left">strike</td>
</tr>
<tr>
<td valign="top" align="left"><strong>Lancaster stem:</strong></td>
<td valign="top" align="left">strik</td>
</tr>
<tr>
<td valign="top" align="left"><strong>Porter stem:</strong></td>
<td valign="top" align="left">strike</td>
</tr>
</table>

HTML output (display)

Lemmatizer Results

Lemma: strike
Lancaster stem: strik
Porter stem: strike

Text output

Lemmatizer Results
Lemma:	strike
Lancaster stem:	strik
Porter stem:	strike
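
Although the text format is meant for display, it can still be split into fields if needed. The Python sketch below assumes, based only on the sample above, that each result line has the form label, colon, tab, value, and that the heading line contains no tab. The helper name parse_text_result is hypothetical.

```python
# The sample text output shown above (label and value separated by a tab).
sample = (
    "Lemmatizer Results\n"
    "Lemma:\tstrike\n"
    "Lancaster stem:\tstrik\n"
    "Porter stem:\tstrike\n"
)


def parse_text_result(text):
    """Split tab-separated 'label:<TAB>value' lines into a dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        label, sep, value = line.partition("\t")
        # Lines without a tab, such as the heading, are skipped.
        if sep:
            fields[label.rstrip(":").strip()] = value.strip()
    return fields


parsed = parse_text_result(sample)
print(parsed)
```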