public class PunktAbbreviationDetector
extends java.lang.Object
java -Xmx512m edu.northwestern.at.morphadorner.tools.punktabbreviationdetector.PunktAbbreviationDetector isolangcode abbrevs.txt text1.txt text2.txt ...
Modifier and Type | Field and Description |
---|---|
protected static int | INITPARAMS: Number of parameters that precede the input file specifications. |
protected static java.io.PrintStream | printStream: Wrapper print stream that allows UTF-8 output. |
Modifier | Constructor and Description |
---|---|
protected | PunktAbbreviationDetector(): Allow overrides but not instantiation. |
Modifier and Type | Method and Description |
---|---|
static java.util.Locale | languageCodeToLocale(java.lang.String languageCode): Get a Java Locale from an ISO language code. |
static void | main(java.lang.String[] args): Main program. |
static PunktToken | makePunktToken(java.lang.String token): Create Punkt token from a string. |
protected static final int INITPARAMS
protected static java.io.PrintStream printStream
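The printStream field is described only as a wrapper that allows UTF-8 output. A minimal sketch of how such a wrapper can be built with the standard library follows; the class name Utf8PrintStreamSketch and the helper utf8Stream are hypothetical and are not part of MorphAdorner, and the actual field may be initialized differently.

```java
import java.io.OutputStream;
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;

public class Utf8PrintStreamSketch {
    /**
     * Wraps an output stream in a PrintStream that encodes text as UTF-8.
     * Hypothetical helper for illustration; not taken from MorphAdorner.
     */
    static PrintStream utf8Stream(OutputStream out) {
        try {
            // autoflush = true so each println is pushed to the underlying stream
            return new PrintStream(out, true, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is a required charset on every Java platform
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        PrintStream printStream = utf8Stream(System.out);
        printStream.println("z. B.");
    }
}
```

Wrapping System.out this way keeps non-ASCII abbreviations intact regardless of the platform default encoding.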
protected PunktAbbreviationDetector()
public static void main(java.lang.String[] args) throws java.io.IOException
Throws: java.io.IOException
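The usage line lists two leading arguments (isolangcode and abbrevs.txt) followed by any number of text files, and INITPARAMS counts the parameters before the input file specs. The sketch below shows one way that layout could be split; the value 2 for INITPARAMS is an assumption inferred from the usage line, and the class ArgsLayoutSketch and method inputFiles are hypothetical, not part of the documented API.

```java
import java.util.Arrays;
import java.util.List;

public class ArgsLayoutSketch {
    /** Assumed value: the usage line shows two arguments before the text files. */
    static final int INITPARAMS = 2;

    /** Returns the arguments that follow the leading INITPARAMS parameters. */
    static List<String> inputFiles(String[] args) {
        if (args.length <= INITPARAMS) {
            throw new IllegalArgumentException(
                "Expected: isolangcode abbrevs.txt text1.txt text2.txt ...");
        }
        return Arrays.asList(args).subList(INITPARAMS, args.length);
    }

    public static void main(String[] args) {
        String[] sample = {"en", "abbrevs.txt", "text1.txt", "text2.txt"};
        System.out.println("language code: " + sample[0]);
        System.out.println("abbreviations file: " + sample[1]);
        System.out.println("input files: " + inputFiles(sample));
    }
}
```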
public static java.util.Locale languageCodeToLocale(java.lang.String languageCode)
languageCode - The ISO language code.
public static PunktToken makePunktToken(java.lang.String token)
token - The token.
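For languageCodeToLocale, the page states only that it maps an ISO language code to a Java Locale. As a rough illustration of that kind of mapping, the sketch below uses the standard Locale.forLanguageTag call; the class LocaleSketch and method toLocale are hypothetical stand-ins, and the MorphAdorner method may handle codes differently.

```java
import java.util.Locale;

public class LocaleSketch {
    /**
     * Maps an ISO language code such as "en" or "de" to a Locale.
     * Hypothetical stand-in for illustration; not the MorphAdorner implementation.
     */
    static Locale toLocale(String languageCode) {
        if (languageCode == null || languageCode.trim().isEmpty()) {
            // Assumption: fall back to the JVM default when no code is given
            return Locale.getDefault();
        }
        return Locale.forLanguageTag(languageCode.trim());
    }

    public static void main(String[] args) {
        Locale german = toLocale("de");
        System.out.println(german.getLanguage());
    }
}
```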