public class DefaultMorphAdornerXMLWriter extends java.lang.Object implements MorphAdornerXMLWriter
Modifier and Type | Field and Description |
---|---|
protected SortedArrayList<SentenceAndWordNumber> | sortedWords - Sorted list of word IDs and word and sentence number information. |
protected XMLWriter | writer - Output XML writer. |
Constructor and Description |
---|
DefaultMorphAdornerXMLWriter() - Create XML writer. |
Modifier and Type | Method and Description |
---|---|
protected void | getWordAndSentenceNumbers(MorphAdornerSettings settings) - Get word and sentence numbers. |
void | writeXML(java.lang.String inFile, java.lang.String outFile, int maxID, PartOfSpeechTags posTags, java.util.Map<java.lang.Integer,java.lang.Integer> splitWords, int totalWords, int totalPageBreaks, MorphAdorner adorner, boolean tokenizingOnly) - Write XML output. |
protected SortedArrayList<SentenceAndWordNumber> sortedWords
protected XMLWriter writer
public DefaultMorphAdornerXMLWriter()
public void writeXML(java.lang.String inFile, java.lang.String outFile, int maxID, PartOfSpeechTags posTags, java.util.Map<java.lang.Integer,java.lang.Integer> splitWords, int totalWords, int totalPageBreaks, MorphAdorner adorner, boolean tokenizingOnly) throws java.io.IOException, org.xml.sax.SAXException
Specified by: writeXML in interface MorphAdornerXMLWriter
Parameters:
inFile - The XML input file.
outFile - The XML output file.
maxID - The maximum ID value in the input file.
posTags - The part of speech tags.
splitWords - The map of (word ID, # of word parts) for multipart words.
totalWords - Total words.
totalPageBreaks - Total page breaks.
adorner - The adorner.
tokenizingOnly - Only emit tokenization-related attributes.
Throws:
java.io.IOException
org.xml.sax.SAXException
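
The splitWords parameter is the least self-explanatory argument of writeXML, so a small sketch of its shape may help. The code below uses only java.util and does not call MorphAdorner. The class name SplitWordsSketch, the helper methods, and the sample IDs are hypothetical, and treating a word that is absent from the map as a single-part word is an assumption rather than documented behavior.

```java
import java.util.Map;
import java.util.TreeMap;

/** Illustrative stand-in only; not part of the MorphAdorner API. */
public class SplitWordsSketch {

    /** Builds a sample (word ID, number of word parts) map with made-up IDs. */
    public static Map<Integer, Integer> sampleSplitWords() {
        Map<Integer, Integer> splitWords = new TreeMap<>();
        splitWords.put(17, 2); // hypothetical word 17 is split into 2 parts
        splitWords.put(42, 3); // hypothetical word 42 is split into 3 parts
        return splitWords;
    }

    /**
     * Returns the number of parts recorded for a word ID.
     * Assumption: a word missing from the map has exactly one part.
     */
    public static int partsFor(Map<Integer, Integer> splitWords, int wordId) {
        return splitWords.getOrDefault(wordId, 1);
    }

    public static void main(String[] args) {
        Map<Integer, Integer> splitWords = sampleSplitWords();
        for (int id : new int[] {16, 17, 42}) {
            System.out.println("word " + id + ": " + partsFor(splitWords, id) + " part(s)");
        }
    }
}
```

A map built this way has the java.util.Map<java.lang.Integer,java.lang.Integer> type that the splitWords argument declares. How the real map is populated during adornment is not described on this page.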
protected void getWordAndSentenceNumbers(MorphAdornerSettings settings)