See: Description
Class | Description |
---|---|
CompareStringCounts |
Compare string counts in two files using Dunning's log-likelihood.
|
CompareStringCounts.ReverseScoredString |
ScoredString modified to sort results from highest to lowest.
|
Compare string counts in two files using Dunning's log-likelihood.
Usage:
java edu.northwestern.at.morphadorner.tools.comparestringcounts.CompareStringCounts analysis.tab reference.tab
analysis.tab -- Input tab-separated file of strings and counts
for an analysis text.
reference.tab -- Input tab-separated file of strings and counts
for a reference text.
The analysis.tab and reference.tab files contain strings and counts of those strings compiled from two texts or corpora. Both files contain two tab-separated columns. The first column is a string. The second column contains the count of the number of times that string occurred in the associated text.
The output contains seven tab-separated columns, sorted in descending order by log-likelihood value. One line of output appears for each string in the analysis text.
These results are written to the standard output file which can be redirected to another file. A brief summary of the analysis is written to the standard error file.