Package edu.northwestern.at.morphadorner.tools.countadornedwords

Counts adorned words by processing XMLToTab output.

Usage:

java edu.northwestern.at.morphadorner.tools.countadornedwords.CountAdornedWords output.tab input1.tab input2.tab ...

output.tab -- output file of tab-separated values, described below.
input*.tab -- one or more tab-separated input files produced by XMLToTab.

The output file is a tab-delimited UTF-8 text file. Each line contains the following fields, in order:

  1. Short work name, formed from input file name by stripping the path and file extension.
  2. The corrected original spelling.
  3. The standard spelling.
  4. The parts of speech.
  5. The lemmata.
  6. The count of the tuple (work name, corrected spelling, standard spelling, parts of speech, lemmata).
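
The six-field layout above can be read back with a few lines of Java. The following is a minimal sketch, not part of the MorphAdorner distribution: the class name CountRowSketch, its field names, and the sample values are hypothetical, and it assumes that only the final extension is stripped when forming the short work name and that the file has no header line.

```java
public class CountRowSketch {

    /** One line of the output file. Field names are illustrative only. */
    static final class CountRow {
        final String workName;
        final String correctedSpelling;
        final String standardSpelling;
        final String partsOfSpeech;
        final String lemmata;
        final int count;

        CountRow(String[] f) {
            workName = f[0];
            correctedSpelling = f[1];
            standardSpelling = f[2];
            partsOfSpeech = f[3];
            lemmata = f[4];
            count = Integer.parseInt(f[5].trim());
        }
    }

    /**
     * Forms a short work name by stripping the path and the file extension.
     * Assumption: only the last extension is removed.
     */
    static String shortWorkName(String inputFileName) {
        int slash = Math.max(inputFileName.lastIndexOf('/'),
                             inputFileName.lastIndexOf('\\'));
        String name = inputFileName.substring(slash + 1);
        int dot = name.lastIndexOf('.');
        return dot > 0 ? name.substring(0, dot) : name;
    }

    /** Splits one tab-delimited line into its six fields. */
    static CountRow parseRow(String line) {
        // Limit -1 keeps trailing empty fields instead of dropping them.
        String[] fields = line.split("\t", -1);
        if (fields.length != 6) {
            throw new IllegalArgumentException(
                "Expected 6 fields but found " + fields.length);
        }
        return new CountRow(fields);
    }

    public static void main(String[] args) {
        // Made-up sample values, for illustration only.
        CountRow row = parseRow("hamlet\tvertue\tvirtue\tn1\tvirtue\t3");
        System.out.println(shortWorkName("/texts/hamlet.tab")); // prints "hamlet"
        System.out.println(row.standardSpelling + " x " + row.count); // prints "virtue x 3"
    }
}
```

To process a real output file, each line could be read as UTF-8 (for example with java.nio.file.Files.readAllLines) and passed to parseRow.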