public class MergeWordLists
extends java.lang.Object
Usage:
java edu.northwestern.at.morphadorner.tools.mergewordlists.MergeWordLists output.txt input.txt input2.txt ...
output.txt -- output merged word list file.
input*.txt -- input text files containing word lists to be merged.
The output file is a utf-8 text file containing the merged word list from the input files. Only one copy of a word is output if it appears multiple times. The merged words appear in ascending alphanumeric order in the output file.
Modifier and Type | Field and Description |
---|---|
protected static java.util.Set<java.lang.String> |
mergedWordSet
Merged word list.
|
Modifier | Constructor and Description |
---|---|
protected |
MergeWordLists()
Allow overrides but not instantiation.
|
Modifier and Type | Method and Description |
---|---|
protected static void |
loadAndMergeWords(java.lang.String inputFileName)
Merge word lists from a file.
|
static void |
main(java.lang.String[] args)
Main program.
|
protected static void |
saveMergedWords(java.lang.String outputFileName)
Save the merged word lists.
|
protected static java.util.Set<java.lang.String> mergedWordSet
public static void main(java.lang.String[] args)
protected static void loadAndMergeWords(java.lang.String inputFileName) throws java.lang.Exception
java.lang.Exception
protected static void saveMergedWords(java.lang.String outputFileName) throws java.lang.Exception
java.lang.Exception