public class DefaultMorphAdornerXMLWriter extends java.lang.Object implements MorphAdornerXMLWriter
| Modifier and Type | Field and Description |
|---|---|
protected SortedArrayList<SentenceAndWordNumber> |
sortedWords
Sorted list of word IDs and word and sentence number information.
|
protected XMLWriter |
writer
Output XML writer.
|
| Constructor and Description |
|---|
DefaultMorphAdornerXMLWriter()
Create XML writer.
|
| Modifier and Type | Method and Description |
|---|---|
protected void |
getWordAndSentenceNumbers(MorphAdornerSettings settings)
Get word and sentence numbers.
|
void |
writeXML(java.lang.String inFile,
java.lang.String outFile,
int maxID,
PartOfSpeechTags posTags,
java.util.Map<java.lang.Integer,java.lang.Integer> splitWords,
int totalWords,
int totalPageBreaks,
MorphAdorner adorner,
boolean tokenizingOnly)
Write XML output.
|
protected SortedArrayList<SentenceAndWordNumber> sortedWords
protected XMLWriter writer
public DefaultMorphAdornerXMLWriter()
public void writeXML(java.lang.String inFile,
java.lang.String outFile,
int maxID,
PartOfSpeechTags posTags,
java.util.Map<java.lang.Integer,java.lang.Integer> splitWords,
int totalWords,
int totalPageBreaks,
MorphAdorner adorner,
boolean tokenizingOnly)
throws java.io.IOException,
org.xml.sax.SAXException
writeXML in interface MorphAdornerXMLWriterinFile - The XML input file.outFile - The XML output file.maxID - The maximum ID value in the input file.posTags - The part of speech tags.splitWords - The map of (word ID, # of word parts)
for multipart words.totalWords - Total words.totalPageBreaks - Total page breaks.adorner - The adorner.tokenizingOnly - Only emit tokenization-related attributes.IOException, - SAXExceptionjava.io.IOExceptionorg.xml.sax.SAXExceptionprotected void getWordAndSentenceNumbers(MorphAdornerSettings settings)