Poets that lasting marble seek,
Must carve in Latin or in Greek.
We write in sand, our language grows,
And like the tide, our work o'erflows.

-- Edmund Waller



Northwestern
MorphAdorner
    INFORMATION TECHNOLOGY  
    MorphAdorner Site Map  
MorphAdorner > Documentation > Command Line Syntax
 
Home
 
Announcements and News
 
Download MorphAdorner
 
Documentation
 
Licenses
 
Glossary
 
Helpful References
 
Tech Talk
 

Language Recognizer
 
Lemmatizer
 
Lexicon Lookup
 
Name Recognizer
 
Parser
 
Part of Speech Tagger
 
Pluralizer
 
Sentence Splitter
 
Spelling Standardizer
 
Text Segmenter
 
Verb Conjugator
 
Word Tokenizer
 
  Command Line Syntax
 
 

The MorphAdorner command line takes the following form.

java -Xmx640m -Xss1m edu.northwestern.at.morphadorner.MorphAdorner
  -a spellingpairs.tab
  -l lexicon.lex
  -o adornedoutput/
  -p overriding.properties
  -r contextrules.txt
  -s standardspellings.txt
  -t transitionmatrix.mat
  -u suffixlexicon.lex
  -w spellingsbywordclass.txt
  -x lexicalrules.txt
  input1 input2 ...

where

Parameter Definition
a A spelling map file. This file contains two columns separated by a tab. The first column is a variant spelling. The second column is the standard spelling. You may repeat this argument multiple times to specify more than one spelling map.
l A word lexicon file in MorphAdorner format.
o The directory into which adorned output files are written.
p A MorphAdorner configuration properties file. The entries in this file override the default morphadorner.properties file.
r The name of a file providing contextual rules for a rule-based part of speech tagger.
s A text file containing a list of standard spellings, one per line.
t The part of speech tag transition probability matrix used by the probabilistic part of speech taggers.
u A suffix lexicon file in MorphAdorner format. This should be generated from the word lexicon file specified by the l= parameter.
w A spelling map file which breaks down the variant to standard spellings by word class.
x The name of a file providing lexical rules for a rule-based part of speech tagger.
input1
input2 ...
The input files to be adorned.

Settings which appear in the default morphadorner.properties file will be overridden by those specified in the p= properties file. The other command line parameters override the settings in both properties files.

See the batch files and scripts such as adornncf, adorneme, etc. provided in the MorphAdorner release materials for examples of the use of the MorphAdorner command line parameters.

 

Information Technology | Academic Technologies | Scholarly Technologies 2East Resource Center |
Northwestern Home | Calendar: Plan-It Purple | Sites A-Z | Search
Academic Technologies  NU Library 2East  1970 Campus Drive  Evanston, IL 60208
E-mail: pib@northwestern.edu
Last updated Wed Apr 01 23:06:12 2009   World Wide Web Disclaimer and University Policy Statements   © 2007, 2008 Northwestern University